Connect with us

News

GPT-5 Is Less About Revolution, More About Refinement

This article explores OpenAI’s development of GPT-5, focusing on improving user experience by unifying AI tools and reducing the need for manual model switching. It includes insights from VP of Research Jerry Tworek on token growth, benchmarks, and the evolving role of humans in the AI era.

Published

on

GPT-5 unified tools

OpenAI’s next model isn’t chasing headlines—it’s building a smoother, smarter user experience with fewer interruptions the launch of GPT-5 unified tools.

TL;DR — What You Need To Know

  • GPT-5 aims to unify OpenAI’s tools, reducing the need for switching between models
  • The Operator screen agent is due for an upgrade, with a push towards becoming a desktop-level assistant
  • Token usage continues to rise, suggesting growing AI utility and infrastructure demand
  • Benchmarks are losing their relevance, with real-world use cases taking centre stage
  • OpenAI believes AI won’t replace humans but may reshape human labour roles

A more cohesive AI experience, not a leap forward

While GPT-4 dazzled with its capabilities, GPT-5 appears to be a quieter force, according to OpenAI’s VP of Research, Jerry Tworek. Speaking during a recent Reddit Q&A with the Codex team, Tworek described the new model as a unifier—not a disruptor.

“We just want to make everything our models can currently do better and with less model switching,” Tworek said. That means streamlining the experience so users aren’t constantly toggling between tools like Codex, Operator, Deep Research and memory functions.

For OpenAI, the future lies in integration over invention. Instead of introducing radically new features, GPT-5 focuses on making the existing stack work together more fluidly. This approach marks a clear departure from the hype-heavy rollouts often associated with new model versions.

Operator: from browser control to desktop companion

One of the most interesting pieces in this puzzle is Operator, OpenAI’s still-experimental screen agent. Currently capable of basic browser navigation, it’s more novelty than necessity. But that may soon change.

Advertisement

An update to Operator is expected “soon,” with Tworek hinting it could evolve into a “very useful tool.” The goal? A kind of AI assistant that handles your screen like a power user, automating online tasks without constantly needing user prompts.

The update is part of a broader push to make AI tools feel like one system, rather than a toolkit you have to learn to assemble. That shift could make screen agents like Operator truly indispensable—especially in Asia, where mobile-first behaviour and app fragmentation often define the user journey.

Integration efforts hit reality checks

Originally, OpenAI promised that GPT-5 would merge the GPT and “o” model series into a single omnipotent system. But as with many grand plans in AI, the reality was less elegant.

In April, CEO Sam Altman admitted the challenge: full integration proved more complex than expected. Instead, the company released o3 and o4-mini as standalone models, tailored for reasoning.

Tworek confirmed that the vision of reduced model switching is still alive—but not at the cost of model performance. Users will still see multiple models under the hood; they just might not have to choose between them manually.

Advertisement

Tokens and the long road ahead

If you think the token boom is a temporary blip, think again. Tworek addressed a user scenario where AI assistants might one day process 100 tokens per second continuously, reading sensors, analysing messages, and more.

That, he says, is entirely plausible. “Even if models stopped improving,” Tworek noted, “they could still deliver a lot of value just by scaling up.”
Author Name

This perspective reflects a strategic bet on infrastructure. OpenAI isn’t just building smarter models; it’s betting on broader usage. Token usage becomes a proxy for economic value—and infrastructure expansion the necessary backbone.

Goodbye benchmarks, hello real work

When asked to compare GPT with rivals like Claude or Gemini, Tworek took a deliberately contrarian stance. Benchmarks, he suggested, are increasingly irrelevant.

“They don’t reflect how people actually use these systems,” he explained, noting that many scores are skewed by targeted fine-tuning.

Instead, OpenAI is doubling down on real-world tasks as the truest test of model performance. The company’s ambition? To eliminate model choice altogether. “Our goal is to resolve this decision paralysis by making the best one.”

The human at the helm

Despite AI’s growing power, Tworek offered a thoughtful reminder: some jobs will always need humans. While roles will evolve, the need for oversight won’t go away.

“In my view, there will always be work only for humans to do,” he said. The “last job,” he suggested, might be supervising the machines themselves—a vision less dystopian, more quietly optimistic.

For Asia’s fast-modernising economies, that might be a signal to double down on education, critical thinking, and human-centred design. The jobs of tomorrow may be less about doing, and more about directing.


You May Also Like:

Author

Advertisement

Discover more from AIinASIA

Subscribe to get the latest posts sent to your email.

Continue Reading
Advertisement
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Business

Apple’s China AI pivot puts Washington on edge

Apple’s partnership with Alibaba to deliver AI services in China has sparked concern among U.S. lawmakers and security experts, highlighting growing tensions in global technology markets.

Published

on

Apple Alibaba AI partnership

As Apple courts Alibaba for its iPhone AI partnership in China, U.S. lawmakers see more than just a tech deal taking shape.

TL;DR — What You Need To Know

  • Apple has reportedly selected Alibaba’s Qwen AI model to power its iPhone features in China
  • U.S. lawmakers and security officials are alarmed over data access and strategic implications
  • The deal has not been officially confirmed by Apple, but Alibaba’s chairman has acknowledged it
  • China remains a critical market for Apple amid declining iPhone sales
  • The partnership highlights the growing difficulty of operating across rival tech spheres

Apple Intelligence meets the Great Firewall

Apple’s strategic pivot to partner with Chinese tech giant Alibaba for delivering AI services in China has triggered intense scrutiny in Washington. The collaboration, necessitated by China’s blocking of OpenAI services, raises profound questions about data security, technological sovereignty, and the intensifying tech rivalry between the United States and China. As Apple navigates declining iPhone sales in the crucial Chinese market, this partnership underscores the increasing difficulty for multinational tech companies to operate seamlessly across divergent technological and regulatory environments.

Apple Intelligence Meets Chinese Regulations

When Apple unveiled its ambitious “Apple Intelligence” system in June, it marked the company’s most significant push into AI-enhanced services. For Western markets, Apple seamlessly integrated OpenAI’s ChatGPT as a cornerstone partner for English-language capabilities. However, this implementation strategy hit an immediate roadblock in China, where OpenAI’s services remain effectively banned under the country’s stringent digital regulations.

Faced with this market-specific challenge, Apple initiated discussions with several Chinese AI leaders to identify a compliant local partner capable of delivering comparable functionality to Chinese consumers. The shortlist reportedly included major players in China’s burgeoning AI sector:

  • Baidu, known for its Ernie Bot AI system
  • DeepSeek, an emerging player in foundation models
  • Tencent, the social media and gaming powerhouse
  • Alibaba, whose open-source Qwen model has gained significant attention

While Apple has maintained its characteristic silence regarding partnership details, recent developments strongly suggest that Alibaba’s Qwen model has emerged as the chosen solution. The arrangement was seemingly confirmed when Alibaba’s chairman made an unplanned reference to the collaboration during a public appearance.

“Apple’s decision to implement a separate AI system for the Chinese market reflects the growing reality of technological bifurcation between East and West. What we’re witnessing is the practical manifestation of competing digital sovereignty models.”
Doctor Emily Zhang, Technology Policy Researcher at Stanford University
Tweet

Washington’s Mounting Concerns

The revelation of Apple’s China-specific AI strategy has elicited swift and pronounced reactions from U.S. policymakers. Members of the House Select Committee on China have raised alarms about the potential implications, with some reports indicating that White House officials have directly engaged with Apple executives on the matter.

Advertisement

Representative Raja Krishnamoorthi of the House Intelligence Committee didn’t mince words, describing the development as “extremely disturbing.” His reaction encapsulates broader concerns about American technological advantages potentially benefiting Chinese competitors through such partnerships.

Greg Allen, Director of the Wadhwani A.I. Centre at CSIS, framed the situation in competitive terms:

“The United States is in an AI race with China, and we just don’t want American companies helping Chinese companies run faster.”

The concerns expressed by Washington officials and security experts include:

  1. Data Sovereignty Issues: Questions about where and how user data from AI interactions would be stored, processed, and potentially accessed
  2. Model Training Advantages: Concerns that the vast user interactions from Apple devices could help improve Alibaba’s foundational AI models
  3. National Security Implications: Worries about whether sensitive information could inadvertently flow through Chinese servers
  4. Regulatory Compliance: Questions about how Apple will navigate China’s content restrictions and censorship requirements

In response to these growing concerns, U.S. agencies are reportedly discussing whether to place Alibaba and other Chinese AI companies on a restricted entity list. Such a designation would formally limit collaboration between American and Chinese AI firms, potentially derailing arrangements like Apple’s reported partnership.

Commercial Necessities vs. Strategic Considerations

Apple’s motivation for pursuing a China-specific AI solution is straightforward from a business perspective. China remains one of the company’s largest and most important markets, despite recent challenges. Earlier this spring, iPhone sales in China declined by 24% year over year, highlighting the company’s vulnerability in this critical market.

Without a viable AI strategy for Chinese users, Apple risks further erosion of its market position at precisely the moment when AI features are becoming central to consumer technology choices. Chinese competitors like Huawei have already launched their own AI-enhanced smartphones, increasing pressure on Apple to respond.

“Apple faces an almost impossible balancing act. They can’t afford to offer Chinese consumers a second-class experience by omitting AI features, but implementing them through a Chinese partner creates significant political exposure in the U.S.
Michael Chen, Technology Analyst at Global Market Insights
Tweet

The situation is further complicated by China’s own regulatory environment, which requires foreign technology companies to comply with data localisation rules and content restrictions. These requirements effectively necessitate some form of local partnership for AI services.

Advertisement

A Blueprint for the Decoupled Future?

Whether Apple’s partnership with Alibaba proceeds as reported or undergoes modifications in response to political pressure, the episode provides a revealing glimpse into the fragmenting global technology landscape.

As digital ecosystems increasingly align with geopolitical boundaries, multinational technology firms face increasingly complex strategic decisions:

  • Regionalised Technology Stacks: Companies may need to develop and maintain separate technological implementations for different markets
  • Partnership Dilemmas: Collaborations beneficial in one market may create political liabilities in others
  • Regulatory Navigation: Operating across divergent regulatory environments requires sophisticated compliance strategies
  • Resource Allocation: Developing market-specific solutions increases costs and complexity
What we’re seeing with Apple and Alibaba may become the norm rather than the exception. The era of frictionless global technology markets is giving way to one where regional boundaries increasingly define technological ecosystems.
Doctor Sarah Johnson, Geopolitical Risk Consultant
Tweet

Looking Forward

For now, Apple Intelligence has no confirmed launch date for the Chinese market. However, with new iPhone models traditionally released in autumn, Apple faces mounting time pressure to finalise its AI strategy.

The company’s eventual approach could signal broader trends in how global technology firms navigate an increasingly bifurcated digital landscape. Will companies maintain unified global platforms with minimal adaptations, or will we see the emergence of fundamentally different technological experiences across major markets?

As this situation evolves, it highlights a critical reality for the technology sector: in an era of intensifying great power competition, even seemingly routine business decisions can quickly acquire strategic significance.

Advertisement

You May Also Like:

Author


Discover more from AIinASIA

Subscribe to get the latest posts sent to your email.

Continue Reading

News

AI still can’t tell the time, and it’s a bigger problem than it sounds

This article explores new findings from ICLR 2025 revealing the limitations of leading AI models in basic timekeeping tasks. Despite excelling at language and pattern recognition, AIs falter when asked to interpret analogue clocks or calendar dates, raising crucial questions for real-world deployment in Asia.

Published

on

AI timekeeping ability

Despite its growing prowess in language, images and coding, AI timekeeping ability is stumped by the humble clock and calendar.

TL;DR — What You Need To Know

  • AI models struggle with tasks most humans master as children: reading analogue clocks and determining calendar dates.
  • New research tested leading AI models and found they failed over 60% of the time.
  • The findings raise questions about AI’s readiness for real-world, time-sensitive applications.

AI can pass law exams but flubs a clock face

It’s hard not to marvel at the sophistication of large language models. They write passable essays, chat fluently in multiple languages, and generate everything from legal advice to song lyrics. But put one in front of a basic clock or ask it what day a date falls on, and it might as well be guessing.

At the recent International Conference on Learning Representations (ICLR), researchers unveiled a startling finding: even top-tier AI models such as GPT-4o, Claude-3.5 Sonnet, Gemini 2.0 and LLaMA 3.2 Vision struggle mightily with time-related tasks. In a study led by Rohit Saxena from the University of Edinburgh, these systems were tested on their ability to interpret images of clocks and respond to calendar queries. They failed more than half the time.

“Most people can tell the time and use calendars from an early age,” Saxena explained. “Our findings highlight a significant gap in the ability of AI to carry out what are quite basic skills for people.”
Rohit Saxena, University of Edinburgh
Tweet

Reading the time: a surprisingly complex puzzle

To a human, clock reading feels instinctive. To a machine, it’s a visual nightmare. Consider the elements involved:

  • Overlapping hands that require angle estimation
  • Diverse designs using Roman numerals or decorative dials
  • Variability in colour, style, and size

While older AI systems relied on labelled datasets, clock reading demands spatial reasoning. As Saxena noted:

AI recognising that ‘this is a clock’ is easier than actually reading it.

In testing, even the most advanced models correctly read the time from a clock image just 38.7% of the time. That’s worse than random chance on many tasks.

Calendar chaos: dates and days don’t add up

When asked, “What day is the 153rd day of the year?”, humans reach for logic or a calendar. AI, by contrast, attempts to spot a pattern. This doesn’t always go well.

Advertisement

The study showed that calendar queries stumped the models even more than clocks, with just 26.3% accuracy. And it’s not just a lack of memory — it’s a fundamentally different approach. LLMs don’t execute algorithms like traditional computers; they predict outputs based on training patterns.

So while an AI might ace the question “Is 2028 a leap year?”, it could completely fail at mapping that fact onto a real-world date. Training data often omits edge cases like leap years or obscure date calculations.

What it means for Asia’s AI future

From India’s booming tech sector to Japan’s robotics leaders, AI applications are proliferating across Asia. Scheduling tools, autonomous systems, and assistive tech rely on accurate timekeeping — a weakness this research throws into sharp relief.

For companies deploying AI into customer service, logistics, or smart city infrastructure, such flaws aren’t trivial. If an AI can’t reliably say what time it is, it’s hardly ready to manage hospital shift schedules or transport timetables.

These findings argue for hybrid models and tighter oversight. AI isn’t useless here — but it may need more handholding than previously thought.

Advertisement

When logic and vision collide

This study underscores a deeper truth: AI isn’t just a faster brain. It’s something else entirely. What humans do intuitively often mixes perception with logic. AI, however, processes one layer at a time.

Tasks like reading clocks or calculating dates demand a blend of visual interpretation, spatial understanding, and logical sequence — all areas where LLMs still struggle when combined.

“AI is powerful, but when tasks mix perception with precise reasoning, we still need rigorous testing, fallback logic, and in many cases, a human in the loop,” Saxena concluded.
Rohit Saxena, University of Edinburgh
Tweet

You May Also Like:

Author


Discover more from AIinASIA

Subscribe to get the latest posts sent to your email.

Continue Reading

Business

Anthropic’s CEO Just Said the Quiet Part Out Loud — We Don’t Understand How AI Works

Anthropic’s CEO admits we don’t fully understand how AI works — and he wants to build an “MRI for AI” to change that. Here’s what it means for the future of artificial intelligence.

Published

on

how AI works

TL;DR — What You Need to Know

  • Anthropic CEO Dario Amodei says AI’s decision-making is still largely a mystery — even to the people building it.
  • His new goal? Create an “MRI for AI” to decode what’s going on inside these models.
  • The admission marks a rare moment of transparency from a major AI lab about the risks of unchecked progress.

Does Anyone Really Know How AI Works?

It’s not often that the head of one of the most important AI companies on the planet openly admits… they don’t know how their technology works. But that’s exactly what Dario Amodei — CEO of Anthropic and former VP of research at OpenAI — just did in a candid and quietly explosive essay.

In it, Amodei lays out the truth: when an AI model makes decisions — say, summarising a financial report or answering a question — we genuinely don’t know why it picks one word over another, or how it decides which facts to include. It’s not that no one’s asking. It’s that no one has cracked it yet.

“This lack of understanding”, he writes, “is essentially unprecedented in the history of technology.”
Dario Amodei, CEO of Anthropic
Tweet

Unprecedented and kind of terrifying.

To address it, Amodei has a plan: build a metaphorical “MRI machine” for AI. A way to see what’s happening inside the model as it makes decisions — and ideally, stop anything dangerous before it spirals out of control. Think of it as an AI brain scanner, minus the wires and with a lot more math.

Anthropic’s interest in this isn’t new. The company was born in rebellion — founded in 2021 after Amodei and his sister Daniela left OpenAI over concerns that safety was taking a backseat to profit. Since then, they’ve been championing a more responsible path forward, one that includes not just steering the development of AI but decoding its mysterious inner workings.

Advertisement

In fact, Anthropic recently ran an internal “red team” challenge — planting a fault in a model and asking others to uncover it. Some teams succeeded, and crucially, some did so using early interpretability tools. That might sound dry, but it’s the AI equivalent of a spy thriller: sabotage, detection, and decoding a black box.

Amodei is clearly betting that the race to smarter AI needs to be matched with a race to understand it — before it gets too far ahead of us. And with artificial general intelligence (AGI) looming on the horizon, this isn’t just a research challenge. It’s a moral one.

Because if powerful AI is going to help shape society, steer economies, and redefine the workplace, shouldn’t we at least understand the thing before we let it drive?

What happens when we unleash tools we barely understand into a world that’s not ready for them?

You may also like:

Advertisement

Author


Discover more from AIinASIA

Subscribe to get the latest posts sent to your email.

Continue Reading

Trending

Discover more from AIinASIA

Subscribe now to keep reading and get access to the full archive.

Continue reading