Skip to main content

Cookie Consent

We use cookies to enhance your browsing experience, serve personalised ads or content, and analyse our traffic. Learn more

Install AIinASIA

Get quick access from your home screen

Install AIinASIA

Get quick access from your home screen

AI in ASIA
Qwen AI image generation
News

Qwen launches to take on Google's Nano Banana

Qwen's here to challenge Google's Nano Banana! See how this new contender plans to shake up AI image generation and what it means for you. Read on!

Anonymous6 min read

AI Snapshot

The TL;DR: what matters, fast.

Google's Nano Banana Pro image model set new standards for AI image generation last November, excelling at creating complex, text-heavy visuals.

However, this model is proprietary and integrated into Google's cloud, limiting flexibility and increasing costs for some businesses.

Alibaba's Qwen-Image-2512 now offers an open-source alternative, freely available for commercial use and accessible via various platforms and APIs.

Who should pay attention: AI developers | Enterprises | Open-source advocates

What changes next: Competition in the AI image generation space is likely to escalate.

When Google unveiled its Nano Banana Pro image model, also known as Gemini 3 Pro Image, last November, it significantly reshaped expectations for AI image generation.

This breakthrough allowed users to create complex, text-heavy visuals such as infographics and slides using natural language, largely free from spelling errors.

However, this advance came with a familiar trade-off: Gemini 3 Pro Image is highly proprietary, deeply integrated into Google's cloud infrastructure, and priced for premium use. For businesses requiring predictable costs, deployment autonomy, or regional specialisation, this model set a new benchmark but offered few flexible alternatives.

Now, Alibaba's Qwen AI research team, following a successful year of robust open-source AI model releases, has introduced its own solution: Qwen-Image-2512.

This model is freely available to developers and even large enterprises for commercial applications under the permissive Apache 2.0 license.

Users can access the model directly through Qwen Chat. Its full open-source weights are available on Hugging Face or ModelScope, and the source code can be inspected or integrated from GitHub.

For those preferring zero-install experimentation, the Qwen team offers hosted demos on Hugging Face and ModelScope. Enterprises needing managed inference can also tap into these generation capabilities via Alibaba Cloud’s Model Studio API.

Responding to Enterprise Needs

The impact of Gemini 3 Pro Image was considerable. Its capacity to generate production-ready diagrams, slides, and multilingual visuals propelled image generation beyond creative experimentation and into core enterprise infrastructure. This development aligns with broader discussions around AI orchestration, data pipelines, and security. In this context, image models are evolving from artistic tools into essential workflow components, expected to integrate seamlessly into documentation, design, marketing, and training platforms with consistent performance and control.

Many responses to Google's offering have been proprietary, featuring API-only access, usage-based pricing, and tight platform integration, similar to OpenAI's GPT Image 1.5 released recently. Qwen-Image-2512, though, adopts a different philosophy. It posits that performance parity combined with open access is precisely what a significant portion of the enterprise market desires.

Key Improvements in Qwen-Image-2512

The December 2512 update focuses on three critical areas for enterprise image generation:

  • Human realism and environmental coherence: Qwen-Image-2512 markedly reduces the "AI look" often seen in open models. Facial features exhibit more accurate age and texture, postures align better with prompts, and background environments are rendered with improved semantic context. This realism is vital for businesses using synthetic imagery in training, simulations, or internal communications, enhancing credibility.
  • Natural texture fidelity: Landscapes, water, animal fur, and various materials are rendered with finer detail and smoother gradients. These enhancements are not merely aesthetic; they enable the creation of synthetic imagery for e-commerce, education, and visualisation without extensive manual post-processing.
  • Structured text and layout rendering: Qwen-Image-2512 boasts improved embedded text accuracy and layout consistency, supporting both Chinese and English prompts. Slides, posters, infographics, and mixed text-image compositions are more legible and adhere more closely to instructions. This is an area where Gemini 3 Pro Image received considerable praise, and where many earlier open models struggled.

In blind, human-evaluated tests conducted on Alibaba’s AI Arena, Qwen-Image-2512 emerged as the strongest open-source image model, remaining competitive even with closed systems.

This reinforces its position as a viable, production-ready option rather than merely a research preview. For more insights into how AI models are evolving, you might find our article on OpenAI says human adoption not new models is the key to achieving AGI insightful.

The Open-Source Advantage for Deployment

Qwen-Image-2512's primary differentiator is its licensing. Released under Apache 2.0, the model can be freely used, modified, fine-tuned, and deployed commercially. This offers enterprises several advantages that proprietary models cannot match:

  • Cost control: At scale, per-image API pricing can quickly become prohibitive. Self-hosting allows organisations to amortise infrastructure costs rather than incurring perpetual usage fees.
  • Data governance: Regulated sectors often demand stringent control over data residency, logging, and auditability.
  • Localisation and customisation: Teams can adapt models for regional languages, cultural norms, or internal style guides without relying on a vendor's roadmap.

In contrast, while Gemini 3 Pro Image provides strong governance assurances, it remains intrinsically linked to Google’s infrastructure and pricing model.

API Pricing for Managed Deployments

For teams preferring managed inference, Qwen-Image-2512 is available via Alibaba Cloud Model Studio as qwen-image-max, priced at $0.075 per generated image. The API accepts text input and returns image output, with rate limits suitable for production workloads. There are limited free quotas, after which usage transitions to paid billing. This hybrid approach, combining open weights with a commercial API, reflects how many enterprises currently deploy AI: internal experimentation and customisation, supplemented by managed services where operational simplicity is paramount. This strategy is also evident in other AI tools like ChatGPT Now Creates Sharper Images, Quicker.

A Competitive, Yet Philosophically Distinct, Offering

Qwen-Image-2512 isn't positioned as a direct, universal replacement for Gemini 3 Pro Image. Google’s model benefits from deep integration with Vertex AI, Workspace, Ads, and the broader Gemini reasoning stack. For organisations already invested in Google Cloud, Nano Banana Pro fits naturally into existing workflows.

Qwen’s strategy is more modular. The model integrates cleanly with open tooling and custom orchestration layers, making it appealing to teams building their own AI stacks or combining image generation with internal data systems. This approach aligns with the growing trend of customisation, as seen with tools like Customise ChatGPT's tone: warmth, enthusiasm, structure.

A Clear Market Signal

The launch of Qwen-Image-2512 underscores a significant shift: open-source AI is no longer merely playing catch-up with proprietary systems. Instead, it's selectively matching the capabilities most crucial for enterprise deployment, including text fidelity, layout control, and realism.

Simultaneously, it preserves the freedoms that businesses increasingly value, such as control over their data and infrastructure. A recent report by the National Academies of Sciences, Engineering, and Medicine highlights the growing importance of open-source models in advancing AI research and deployment, particularly for fostering innovation and addressing ethical considerations National Academies Press.

Google’s Gemini 3 Pro Image certainly raised the bar. Qwen-Image-2512, however, demonstrates that enterprises now have a robust open-source alternative, one that effectively balances performance with cost control, governance, and deployment flexibility.

What are your thoughts on the increasing competition between proprietary and open-source AI models? Share your perspective in the comments below.

What did you think?

Written by

Share your thoughts

Join 5 readers in the discussion below

This is a developing story

We're tracking this across Asia-Pacific and may update with new developments, follow-ups and regional context.

Latest Comments (5)

Sarah Chen
Sarah Chen@sarachen
AI
5 February 2026

while the Apache 2.0 license is appealing for enterprise, the article doesn't touch on the data used to train Qwen-Image-2512. previous Alibaba models have raised concerns in academic circles regarding data provenance and potential biases, particularly in culturally sensitive visual generation. ensuring fairness metrics are transparent would be crucial for wide adoption.

N.
N.@anon_reader
AI
24 January 2026

The Apache 2.0 license for Qwen-Image-2512 makes this more palatable for certain defense applications than Google's premium offerings. Having inspectable code and deployment autonomy is key for secure environments. It's not just about cost but also control and trust, especially with sensitive data. This changes who can even consider using it.

Charlotte Davies
Charlotte Davies@charlotted
AI
19 January 2026

The release of Qwen-Image-2512 under an Apache 2.0 license, making it freely available, presents an interesting case from a regulatory perspective. While the UK AI Safety Institute is focused on evaluating advanced AI models, the open-source nature here introduces different considerations for governance. Particularly, how do we assess and manage potential risks when the weights are publicly accessible and can be integrated into various systems, including critical enterprise infrastructure as the article notes? This decentralised approach to deployment and modification requires careful thought regarding accountability and safety standards, especially if these tools are used for generating official documentation or public-facing content.

Amelia Taylor@ameliat
AI
10 January 2026

Oh, the old "freely available" trap! I remember a client, bless their hearts, got all excited about a "free" model from one of the big players. Six months later, their dev team was pulling their hair out trying to get it to play nice with their existing stack. "Free" often means you pay in developer hours, eh? Still, Qwen-Image-2512 on Apache 2.0 sounds like it COULD be different.

Maria Reyes
Maria Reyes@mariar
AI
7 January 2026

i'm really excited about Qwen-Image-2512 being open source and available on Hugging Face. for us in manila, where access to expensive proprietary tech is a barrier, this could be a game changer for creating financial literacy materials. i wonder if it can handle tagalog text well for infographics? that would be amazing for our local communities.

Leave a Comment

Your email will not be published