Challenges of Math Models

Hosted on MSN

Top AI models are failing hard at solving fresh math problems

Top artificial intelligence systems now ace many textbook-style math questions, yet they still fall apart on genuinely new problems. The gap between polished performance on familiar benchmarks and ...

12h

Nvidia's Nemotron-Cascade 2 wins math and coding gold medals with 3B active parameters — and its post-training recipe is now open-source

Nvidia's Nemotron-Cascade 2 is a 30B MoE model that activates only 3B parameters at inference time, yet achieved gold ...

AI solves 20-year math challenge that researcher thought machines could not crack

A Polish mathematician spent two decades crafting a problem meant to test the limits of artificial intelligence. A new AI ...

Forbes

Big Models, Bad Math: The GenAI Problem In Finance

The hype around generative AI (GenAI) is undeniable. Tools like ChatGPT have captivated the public imagination, demonstrating an impressive ability to generate human-like text, create content and ...

Scientific American

AI just got its toughest math test yet. The results are mixed

The verdict, it seems, is in: artificial intelligence is not about to replace mathematicians. That is the immediate takeaway from the “First Proof” challenge—perhaps the most robust test yet of the ...

Live Science

Mathematicians devised novel problems to challenge advanced AIs' reasoning skills — and they failed almost every test

Current AI models struggle to solve research-level math problems, with the most advanced AI systems we have today solving just 2% of the hundreds of challenges faced. When you purchase through links ...

VentureBeat

Microsoft’s GRIN-MoE AI model takes on coding and math, beating competitors in key benchmarks

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Microsoft has unveiled a groundbreaking artificial intelligence model, ...

The Atlantic

The Edge of Mathematics

Over the past couple of months, several researchers have begun making the same provocative claim: They used generative-AI tools to solve a previously unanswered math problem. The most extreme promises ...

Scientific American

Mathematicians issue a major challenge to AI: Show us your work

The race is on to develop an artificial intelligence that can do pure mathematics, and top mathematicians just threw down the gauntlet with an exam of actual, unsolved problems that are relevant to ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results