On the Humanity’s Last Exam (HLE) benchmark, Kimi K2.5 scored 50.2% (with tools), surpassing OpenAI’s GPT-5.2 (xhigh) and ...
Kimi K2.5 adds Agent Swarm, with up to 100 parallel helpers, and a 256k context window, so teams can solve complex work faster.
K2 Think V2 (70B) is powered by MBZUAI IFM's latest foundation model, designed from the outset to support reasoning, long ...