Skip to main content
AI in ASIA
Kimi K2 Thinking
News

Free Chinese AI claims to beat GPT-5

Moonshot AI's free Kimi K2 Thinking model claims to outperform GPT-5 on key benchmarks, marking China's dramatic rise in global AI dominance.

Intelligence Desk4 min read

AI Snapshot

The TL;DR: what matters, fast.

Moonshot AI's Kimi K2 Thinking scored 51% on Humanity's Last Exam, beating GPT-5's performance

Chinese AI models jumped from 1.2% to 30% global usage in 12 months, a 2,400% increase

Open-source model costs $4.6M to train versus billions spent by US AI labs

Advertisement

Advertisement

Chinese AI Model Claims to Outperform GPT-5 at Zero Cost

Moonshot AI has released Kimi K2 Thinking, an open-source reasoning model that the Chinese startup claims beats OpenAI's GPT-5 on key benchmarks. The model achieved 51% on Humanity's Last Exam compared to GPT-5's lower scores, whilst costing just $4.6 million to train.

This marks a significant moment for the global AI landscape. Chinese models have surged from 1.2% to 30% of worldwide usage in just 12 months, representing a 2,400% increase that now challenges American dominance in artificial intelligence.

The timing couldn't be more strategic. As businesses worldwide grapple with expensive proprietary AI solutions, Moonshot's free alternative arrives with bold performance claims and complete transparency through open-source availability.

Superior Performance at Fraction of the Cost

Kimi K2 Thinking demonstrates particular strength in agentic tasks, where AI systems use tools to complete complex, multi-step problems. The model scored 97.4% on MATH-500 benchmarks, surpassing GPT-4.1's 92.4%, and achieved 65.8% on SWE-bench software engineering tasks versus GPT-4.1's 44.7%.

"Today is a turning point in AI: a Chinese open source model is number one. Kimi K2 Thinking scored 51% on Humanity's Last Exam, higher than GPT-5 and every other model," said Di Do of Menlo Ventures.

Independent testing organisation Artificial Analysis confirmed these results. The group's evaluation placed Kimi K2 ahead of GPT-5, Claude 4.5 Sonnet, and Grok 4 on agentic tool use with "a fairly significant gap" between the models.

By The Numbers

  • Chinese AI models increased from 1.2% to 30% global usage in 12 months
  • Kimi K2 scored 97.4% on MATH-500 versus GPT-4.1's 92.4%
  • Training cost of $4.6 million compared to billions spent by US labs
  • Input pricing at $0.60 per million tokens versus GPT-5's $2.50
  • 65.8% performance on SWE-bench software engineering tasks

Open-Source Architecture Breaks Industry Norms

Unlike proprietary models from OpenAI and Anthropic, Kimi K2 Thinking operates as a fully open-source Mixture-of-Experts (MoE) architecture. This approach allows the model to activate specific expert networks for different tasks, optimising computational efficiency whilst maintaining high performance.

The model integrates long-horizon planning with adaptive reasoning, enabling it to break down complex problems into manageable subtasks. It can generate hypotheses, verify evidence through web browsing, and construct coherent solutions across hundreds of reasoning steps.

Developers can access the complete model weights and training code through Hugging Face, removing traditional barriers to AI deployment. This transparency contrasts sharply with closed models where users must accept performance claims without verification.

Feature Kimi K2 Thinking GPT-5 Claude 4.5 Sonnet
Availability Free, open-source Paid subscription Paid subscription
Training Cost $4.6 million Billions (estimated) Billions (estimated)
Humanity's Last Exam 51% Lower (exact score undisclosed) Lower (exact score undisclosed)
Input Pricing $0.60/million tokens $2.50/million tokens Varies by tier

Rising Chinese Competition Reshapes AI Markets

Moonshot's breakthrough represents broader shifts in global AI development. The company's valuation quadrupled to $18 billion following strong performance metrics, whilst other Chinese firms like DeepSeek and Alibaba release competitive models at significantly lower costs.

"By reasoning while actively using a diverse set of tools, K2 Thinking is capable of planning, reasoning, executing, and adapting across hundreds of steps to tackle some of the most challenging academic and analytical problems," Moonshot explained on its website.

This trend extends beyond individual models. Chinese AI companies are increasingly undercutting Western labs on pricing whilst claiming superior performance, forcing established players to reconsider their premium positioning strategies.

The implications reach corporate boardrooms worldwide, where executives must now evaluate whether expensive proprietary solutions justify their costs when free alternatives offer comparable or superior capabilities.

Business Applications and Market Disruption

For enterprises, Kimi K2 Thinking's availability transforms AI deployment economics. Companies can now access advanced reasoning capabilities without subscription fees or usage limits, potentially saving millions in licensing costs whilst gaining complete control over their AI infrastructure.

Key business applications include:

  • Complex data analysis requiring multi-step reasoning
  • Research tasks combining web search with analytical processing
  • Code generation and debugging for software development teams
  • Academic and technical problem-solving requiring tool integration
  • Strategic planning workflows that benefit from hypothesis testing

Small and medium enterprises particularly benefit from this shift. Previously locked out of premium AI capabilities by cost barriers, these businesses can now deploy sophisticated reasoning models to compete with larger rivals.

The open-source nature also enables customisation for specific industries or use cases, something typically impossible with proprietary models. Companies can fine-tune Kimi K2 for their particular domains whilst maintaining full ownership of their adaptations.

Verification and Scepticism Required

Despite impressive benchmark results, independent verification remains crucial. AI companies routinely optimise for specific tests, and real-world performance may differ from laboratory conditions.

Current evaluations focus heavily on academic benchmarks rather than practical business applications. Companies considering deployment should conduct their own testing across relevant use cases before making strategic decisions.

How does Kimi K2 Thinking compare to GPT-5 on coding tasks?

Both models show comparable coding performance, with Kimi K2 achieving 65.8% on SWE-bench software engineering benchmarks. However, GPT-5 may excel in certain programming languages or specific development frameworks.

What are the main limitations of open-source AI models?

Open-source models require technical expertise for deployment and maintenance. They lack the enterprise support, safety guardrails, and regular updates that commercial providers offer through subscription services.

Can businesses rely on free AI models for critical operations?

While cost-effective, free models carry risks including lack of guaranteed uptime, limited support, and potential security vulnerabilities. Critical applications may still warrant commercial solutions with service level agreements.

How significant is the $4.6 million training cost compared to Western models?

This represents a fraction of the billions reportedly spent by OpenAI, Google, and Anthropic. However, training costs don't include ongoing inference, maintenance, and support expenses that commercial providers handle.

What does this mean for the future of AI pricing?

Chinese competition is driving down AI costs globally, forcing established providers to justify premium pricing through superior performance, reliability, or service quality rather than basic capability advantages.

The AIinASIA View: Kimi K2 Thinking represents a watershed moment where Chinese AI models now lead in global adoption and performance claims. Whether these benchmarks translate to real-world superiority remains unproven, but the cost advantage is undeniable. Western AI labs must now compete on value rather than capability exclusivity. For businesses, this creates unprecedented opportunities to access advanced AI without premium pricing, though careful evaluation of reliability and support requirements remains essential. The era of AI democratisation has arrived.

The release of Kimi K2 Thinking signals a fundamental shift in AI accessibility and competition. As Chinese models continue gaining global traction through cost-efficient innovations, the question isn't whether this disrupts existing market dynamics, but how quickly established players can adapt. Will open-source models become the new standard, or do proprietary advantages still justify premium pricing? Drop your take in the comments below.

YOUR TAKE

We cover the story. You tell us what it means on the ground.

What did you think?

Written by

Share your thoughts

Join 2 readers in the discussion below

This is a developing story

We're tracking this across Asia-Pacific and may update with new developments, follow-ups and regional context.

Advertisement

Advertisement

This article is part of the Global AI Policy Landscape learning path.

Continue the path →

Latest Comments (2)

Priya Ramasamy@priyaram
AI
26 November 2025

moonshot claiming superior agentic capabilities from open-source always makes me a bit skeptical. we've tried integrating some open-source models here for internal use, and the local adaptation, especially to Bahasa Melayu nuances, is always where they fall short. the benchmarks are global but our market isn't.

Marcus Thompson
Marcus Thompson@marcust
AI
11 November 2025

We've seen some solid results ourselves integrating open-source models into our dev workflows. The Kimi K2's focus on long-horizon planning and breaking down "ambiguous problems" would be huge for our team. The $4.6M training cost for that kind of capability? That's really impressive for a MoE model.

Leave a Comment

Your email will not be published