Carnegie Mellon University researchers propose a new LLM training technique that gives developers more control over chain-of-thought length.
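The snippet gives no implementation details, but the idea of giving developers control over chain-of-thought length can be illustrated with a minimal sketch: a reward that trades answer correctness against deviation from a requested reasoning-token budget. The function name, `alpha` weight, and reward shape below are illustrative assumptions, not the paper's actual formulation.

```python
# Hypothetical sketch of a length-controlled reward for RL training of
# reasoning models. All names and constants are illustrative assumptions.

def length_controlled_reward(
    is_correct: bool,
    cot_tokens: int,
    target_len: int,
    alpha: float = 0.001,
) -> float:
    """Reward correct answers while penalizing deviation from the
    developer-specified chain-of-thought token budget, so the policy
    learns to respect a requested reasoning length."""
    correctness = 1.0 if is_correct else 0.0
    length_penalty = alpha * abs(cot_tokens - target_len)
    return correctness - length_penalty


# Example: a correct answer that overshoots a 512-token budget by 300 tokens
# scores lower than one that lands on budget.
print(length_controlled_reward(True, 812, 512))  # ~0.7
print(length_controlled_reward(True, 512, 512))  # 1.0
```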
Manus, a newly launched artificial intelligence (AI) agent in China, has surprised the global technology sector by ...
Researchers present a new training method that improves large language model performance across languages without requiring ...
These reasoning models were designed to offer an open-source alternative to the likes of OpenAI's o1 series. QwQ-32B is a 32-billion-parameter model developed by scaling reinforcement learning ...
Chinese tech giant Alibaba Group Holding Ltd's Qwen model offers a low-cost alternative to DeepSeek, as US computer scientists have developed a new reasoning model trained for ...
Aya Vision 8B and 32B demonstrate best-in-class performance relative to their parameter size, outperforming much larger models.
Moreover, LightThinker matches H2O's performance at similar compression rates while reducing inference time by 52% for Qwen and 41% for Llama. In this paper, researchers introduced ...
We present DetectRL-ZH, a benchmark specifically designed for detecting LLM-generated text in the Chinese domain ... The generators include GPT-4o, GLM-4-flash, and Qwen-turbo. The training set ...
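The snippet does not describe DetectRL-ZH's evaluation protocol, but a typical setup for such a detection benchmark scores each text with a detector and reports AUROC over human vs. machine labels. The `score_text` heuristic and the sample data below are hypothetical stand-ins, not the benchmark's actual interface or contents.

```python
# Minimal sketch of evaluating an LLM-text detector on a labeled benchmark.
# `score_text` and the sample data are placeholders for illustration only;
# DetectRL-ZH's real format, detectors, and baselines may differ.
from sklearn.metrics import roc_auc_score

def score_text(text: str) -> float:
    """Placeholder detector returning a 'machine-generated' score in [0, 1].
    A real detector might use perplexity, log-probability curvature, or a
    trained classifier instead of this dummy length heuristic."""
    return min(len(text) / 100.0, 1.0)

# label 1 = LLM-generated (e.g., by GPT-4o, GLM-4-flash, or Qwen-turbo),
# label 0 = human-written
benchmark = [
    ("human-written sample ...", 0),
    ("model-generated sample from GPT-4o ...", 1),
    ("another human sample", 0),
    ("another generated sample, longer and more uniform in style ...", 1),
]

scores = [score_text(text) for text, _ in benchmark]
labels = [label for _, label in benchmark]
print("AUROC:", roc_auc_score(labels, scores))
```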
After that, they gave the OpenAI LLM, along with other models fine-tuned on the same data (including an open-source code-generation model from Alibaba's Qwen AI team), a simple directive: to ...