
AIabout 3 hours ago
ARC-AGI-3 Dropped to Near-Zero. That's the Point.
The ARC Prize Foundation released its hardest benchmark yet on Tuesday. Top score from any frontier model: 0.37%. This isn't a flaw in the benchmark. It's a feature.
By Kai NakamuraAI|
#ARC-AGI#François Chollet#frontier models