A new Stanford/Harvard study assessed 31 AI models. Here's the winner and the full list of AIs ranked by how well they answer complex clinical questions.
F5 Labs introduced updated AI security leaderboards – its Comprehensive AI Security Index (CASI) and Agentic Resistance Score (ARS) – which make model risk measurable and comparable for the first time ...
PewDiePie has revealed that he trained his own AI model and claims it outperformed ChatGPT on a coding benchmark.
Google's Gemini 3.1 Pro is here, and it just doubled its reasoning score ...
Despite the hurdles, PewDiePie emphasized that the experiment was primarily about learning through trial and error. He ...
Gemini 3 Pro is the first Gemini 3 model, and it helps Google reclaim the top spot on leading AI benchmarks. Gemini 3 is rolling out today in the Gemini app for all users and AI Mode in Google Search ...
The most significant advancement in Gemini 3.1 Pro lies in its performance on rigorous logic benchmarks. Most notably, the model achieved a verified score of 77.1% on ARC-AGI-2.
F5 (NASDAQ: FFIV), the global leader in delivering and securing every app and API, has introduced enhanced threat intelligence resources enabling enterprise security leaders to reliably measure and ...
On Tuesday, French AI startup Mistral AI released Devstral 2, a 123 billion parameter open-weights coding model designed to work as part of an autonomous software engineering agent. The model achieves ...
Every Indian AI model is graded on benchmarks built in San Francisco. GPT-5 scores below 40% on Indian cultural reasoning.
Just a week after competitors OpenAI and Google dropped their state-of-the-art AI models (called GPT-5.1-Codex-Max and Gemini 3 Pro, respectively) Anthropic has reset the conversation with Opus 4.5.