A team of nine researchers at Sina Weibo has introduced VibeThinker-3B, a compact language model that reportedly matches or exceeds much larger systems from Google DeepMind, OpenAI, Anthropic, and DeepSeek on several reasoning benchmarks. The 3-billion-parameter model scored 94.3 on AIME 2026, matching the performance range of DeepSeek V3.2, which has 671 billion parameters, and beating Gemini 3 Pro’s score of 91.7. With a test-time scaling meth…
This story is only covered by news sources that have yet to be evaluated by the independent media monitoring agencies we use to assess the quality and reliability of news outlets on our platform. Learn more here.