OpenOffice Math Examples

New secret math benchmark stumps AI models and PhDs alike

FrontierMath's performance results, revealed in a preprint research paper, paint a stark picture of current AI model ...

AI’s math problem: FrontierMath benchmark shows how far technology still has to go

FrontierMath, a new benchmark from Epoch AI, challenges advanced AI systems with complex math problems, revealing how far AI still has to go before achieving true human-level reasoning.

Fifth Grader's Math Homework Leaves Kid in Tears and Both Parents 'Stumped'

"We let him work on it a bit before we recognized his deep breaths as he was getting stressed and starting to tear up," Patrick and Kitty told Newsweek.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Trending now