Qwen3: Alibaba challenges top AI models with its new hybrid family

شعار شركة على بابا بجوار اسم نموذجها الجديد Qwen3 على خلفية هاتف تمثل الدردشة مع الذكاء الاصطناعي

Chinese tech giant Alibaba has announced the launch of its new family of AI models, Qwen3, aiming to compete with industry leaders like OpenAI and Google in the realm of advanced artificial intelligence.

These models are distinguished by their ability to balance deep reasoning with swift responses, a concept Alibaba refers to as "hybrid engineering." This approach enables the models to efficiently handle both complex and straightforward tasks.

The Qwen3 models vary in size, ranging from 0.6 billion to 235 billion parameters, with most available under open-source licenses on platforms like GitHub and Hugging Face.

Alibaba claims that the new models deliver performance on par with, and sometimes surpassing, leading models such as OpenAI's o3-mini and Google's Gemini 2.5 Pro in complex mathematical and programming benchmarks like AIME and BFCL.

Performance of Qwen3 models across various benchmarks compared to other AI models such as OpenAI o3-mini, Grok-3 DeepSeek-R1.

Alibaba's new models outperform leading models.

Released Qwen3 Models

As part of the Qwen3 launch, Alibaba has made several models available for open use.

This collection includes two models based on the "Mixture of Experts" (MoE) architecture:

Qwen3-235B-A22B, a large model with a total of 235 billion parameters (22 billion active).
Qwen3-30B-A3B, a smaller model with 30 billion parameters (3 billion active).

Additionally, six traditional (dense) models have been released: Qwen3-32B, Qwen3-14B, Qwen3-8B, Qwen3-4B, Qwen3-1.7B, and Qwen3-0.6B.

Extensive Training Data

Qwen3 was trained on a massive dataset comprising 36 trillion tokens, the fundamental units used in language model training. This dataset encompasses 119 languages and dialects.

The training data includes educational materials, code snippets, Q&A pairs, and even data generated by other AI models, enhancing its capabilities across various domains.

Some Qwen3 models utilize the "Mixture of Experts" architecture, a technique that divides tasks into sub-tasks handled by specialized models, thereby improving computational efficiency.

Hybrid Model Approach

A standout feature of Qwen3 is its "hybrid modes" for task processing:

√ Reasoning Mode: The model takes additional time to analyze problems step-by-step, suitable for complex issues.

√ Fast Mode: Provides immediate answers to simple questions.

This dual-mode approach allows users to manage the "thinking budget," balancing resource and time allocation for each task to achieve optimal efficiency and quality.

Recently, other companies have adopted similar "hybrid" strategies in their new models, such as Anthropic's Claude 3.7 Sonnet and Google's Gemini 2.5 Pro.

Alibaba states that Qwen3 offers enhanced performance in coding, mathematics, and logical reasoning, demonstrating strong capabilities in following instructions and executing tasks based on precise data formats.

Furthermore, it has been developed to operate efficiently with interactive AI technologies and dynamic environments.

Multi-Stage Development Process

The training of Qwen3 models involved four stages:

Initial training on long chains of logical reasoning.
Enhancements based on reinforcement learning.
Integration of reasoning capabilities with rapid responses.
Overall performance improvements across more than 20 diverse tasks, including instruction following and functioning as autonomous agents.

Accessing Qwen3

Alibaba has made several Qwen3 models accessible to users through its platform, chat.qwen.ai, allowing for direct interaction.

Qwen3 interface on the Qwen Chat platform.

The company also offers Qwen3 models via cloud service providers like Fireworks AI and Hyperbolic.

Developers can experiment with the models or integrate them into their projects using local tools such as Hugging Face and Ollama.

Additionally, Alibaba provides APIs compatible with OpenAI to facilitate deployment and integration across various work environments.

Amid increasing U.S. restrictions on supplying advanced chips to China, Alibaba appears to be betting on openness and open-source strategies to strengthen its global position, a move that industry experts view as a significant challenge to the closed models of American companies.

Explore Alibaba's official blog post on the model