Fyself News

Alibaba announces Qwen3, a family of “hybrid” AI reasoning models

By user, April 28, 2025

Chinese tech company Alibaba on Monday released Qwen3, a family of AI models it claims can match, and in some cases outperform, leading models from Google and OpenAI.

Most of the models can be downloaded under an “open” license from the AI dev platform Hugging Face and from GitHub. They range in size from 0.6 billion to 235 billion parameters. (Parameters roughly correspond to a model’s problem-solving skills, and models with more parameters generally perform better than those with fewer.)

The rise of China-originated model series like Qwen has increased pressure on American labs such as OpenAI to deliver more capable AI technologies. It has also led policymakers to implement restrictions aimed at limiting the ability of Chinese AI companies to obtain the chips needed to train models.

Introducing Qwen3!

We have released and open-weighted Qwen3, our latest large language models, including two MoE models and six dense models ranging from 0.6B to 235B. The flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general… pic.twitter.com/jwzkjehwhc

– Qwen (@alibaba_qwen) April 28, 2025

According to Alibaba, the Qwen3 models are “hybrid” models: they can take time to “reason” through complex problems or respond quickly to simpler requests. Reasoning lets the models effectively fact-check themselves, similar to models such as OpenAI’s o3, at the cost of higher latency.

“We have seamlessly integrated thinking and non-thinking modes, giving users the flexibility to control the thinking budget,” the Qwen team wrote in a blog post. “This design makes it easier for users to configure task-specific budgets.”
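As a rough illustration of what a task-specific thinking budget could look like on the caller's side, the sketch below maps task types to a token budget, with a budget of zero meaning non-thinking mode. Every name here (`ThinkingConfig`, `TASK_BUDGETS`, `config_for`) and every number is hypothetical; none of it is taken from Qwen's actual API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ThinkingConfig:
    """Hypothetical per-request settings; not part of any real Qwen API."""
    thinking: bool       # True means the model reasons before answering
    budget_tokens: int   # upper bound on tokens spent reasoning


# Assumed budgets, chosen only for illustration.
TASK_BUDGETS = {
    "chat": 0,         # simple requests: answer directly
    "summarize": 512,
    "coding": 4096,
    "math": 8192,      # hard problems get the most room to reason
}


def config_for(task: str, default_budget: int = 1024) -> ThinkingConfig:
    """Look up the budget for a task; a budget of 0 disables thinking."""
    budget = TASK_BUDGETS.get(task, default_budget)
    return ThinkingConfig(thinking=budget > 0, budget_tokens=budget)


print(config_for("chat"))
print(config_for("math"))
```

The trade-off described in the article shows up directly: a larger budget allows more reasoning at the cost of higher latency, while a zero budget gives the fast path.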

Some of the models also adopt a mixture of experts (MoE) architecture, which can be more computationally efficient for answering queries. MoE breaks down tasks into subtasks and delegates them to smaller, specialized “expert” models.
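A toy sketch of the routing idea follows. It is a generic top-k gating example, not Qwen3's implementation: in a real MoE layer the gate scores come from a learned router applied to each token, whereas here they are fixed numbers, and the four "experts" are simple stand-in functions. All names are illustrative.

```python
import math


def softmax(scores):
    """Turn raw gate scores into probabilities."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(gate_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)  # renormalize over the chosen experts
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, gate_scores, k=2):
    """Run only the selected experts and blend their outputs by router weight."""
    return sum(w * experts[i](x) for i, w in top_k_route(gate_scores, k))


# Stand-in experts: each just scales its input by a different factor.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
gate_scores = [0.1, 2.0, 0.3, 1.5]  # assumed router output for one input

routes = top_k_route(gate_scores, k=2)
output = moe_forward(10.0, experts, gate_scores, k=2)
print(routes, output)
```

With these scores, experts 1 and 3 are selected and the other two never run. That skipped computation is where the efficiency comes from: only part of the model is active for any given input.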

The Qwen3 models support 119 languages and were trained on a dataset of over 36 trillion tokens, Alibaba said. (Tokens are the raw bits of data that a model processes; 1 million tokens is equivalent to about 750,000 words.) The company said Qwen3 was trained on a combination of textbooks, “question-answer pairs,” code snippets, AI-generated data, and more.
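For a sense of scale, the article's ratio of about 750,000 words per million tokens can be applied to the 36 trillion figure. This is back-of-the-envelope arithmetic only: the ratio is approximate and varies by language and tokenizer, and the constant and function names are illustrative.

```python
WORDS_PER_MILLION_TOKENS = 750_000   # approximate ratio quoted in the article
TRAINING_TOKENS = 36 * 10**12        # "over 36 trillion tokens"


def tokens_to_words(tokens: int) -> int:
    """Estimate word count from token count using the article's ratio."""
    return tokens * WORDS_PER_MILLION_TOKENS // 1_000_000


approx_words = tokens_to_words(TRAINING_TOKENS)
print(f"{approx_words:,} words")  # prints "27,000,000,000,000 words"
```

By this estimate, the training set corresponds to roughly 27 trillion words.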

These improvements, along with others, have significantly boosted Qwen3’s capabilities compared with its predecessor, Qwen2, Alibaba said. None of the Qwen3 models appears to be head and shoulders above the latest top models like OpenAI’s o3 and o4-mini, but they are strong performers nonetheless.

On Codeforces, a platform for programming contests, the largest Qwen3 model, Qwen3-235B-A22B, beats OpenAI’s o3-mini and Google’s Gemini 2.5 Pro. Qwen3-235B-A22B also bests o3-mini on the latest version of AIME, a challenging math benchmark, and on a test that assesses a model’s ability to reason about problems.

However, Qwen3-235B-A22B is not publicly available, at least not yet.

Alibaba’s internal benchmark results for Qwen3. Image credit: Alibaba

The largest public Qwen3 model, Qwen3-32B, remains competitive with a number of proprietary and open AI models, including R1 from Chinese AI lab DeepSeek. Qwen3-32B outperforms OpenAI’s o1 model on several tests, including the coding benchmark LiveCodeBench.

Alibaba said Qwen3 is good at invoking tools, following instructions, and reproducing specific data formats. In addition to the downloadable models, Qwen3 is available from cloud providers such as Fireworks AI and Hyperbolic.

Tuhin Srivastava, co-founder and CEO of AI cloud host Baseten, said Qwen3 is another point in the trend line of open models keeping pace with closed-source systems such as OpenAI’s.

“The U.S. is doubling down on restricting sales of chips to China and purchases from China, but models like Qwen 3 that are cutting-edge and open […],” he told TechCrunch. “It reflects the reality that companies are both building their own tools [as well as] buying off the shelf through closed-model companies such as Anthropic and OpenAI.”



