General Reasoning just gave frontier AI its worst report card yet. Eight top models, including Claude, Grok, Gemini, and ...