The findings show reasoning models aren’t always more capable than non-reasoning ones, and the biggest safety gaps each company is grappling with.
The findings show reasoning models aren’t always more capable than non-reasoning ones, and the biggest safety gaps each company is grappling with.