Deepseek’s distillation new R1AI model can be run on a single GPU

Deepseek’s updated R1 Reasoning AI model may be attracting attention from the AI community this week. However, the Chinese AI Lab has also released a “distilled” version of the new R1, the DeepSeek-R1-0528-QWEN3-8B. This argues that Deepseek breaks models of comparable sizes on certain benchmarks.

The small updated R1, built using the QWEN3-8B model Alibaba, launched as a foundation in May, is better than Google’s Gemini 2.5 Flash On Aieme 2025, which is better than Google’s Gemini 2.5 Flash On Aieme 2025.

The DeepSeek-R1-0528-QWEN3-8B is roughly in line with Microsoft’s recently released Phi 4 Reasoning Plus model, another mathematical skill test, HMMT.

So-called distillation models, such as DeepSeek-R1-0528-QWEN3-8B, are generally less capable than their full-size counterparts. On the positive side, they are much less computationally demanding. According to cloud platform Nodeshift, QWEN3-8B requires a GPU with 40GB-80GB of RAM to run (for example, the NVIDIA H100). The new full-size R1 requires about a dozen 80GB GPUs.

DeepSeek trained DeepSeek-R1-0528-QWEN3-8B by getting the text generated by the updated R1 and using it to fine-tune QWEN3-8B. On a dedicated web page for the AI DEV platform face-hugging model, Deepseek describes Deepseek-R1-0528-QWen3-8B as “for both academic research on inference models and industrial development focusing on small-scale models.”

DeepSeek-R1-0528-QWEN3-8B is available under an acceptable MIT license. This means that it can be used commercially without restrictions. Several hosts, including LM Studio, already offer models via APIs.

Source link

What's Hot

The Whitlams announce return to Rock Island Australia tour

Your daily horoscope: June 20, 2026

Cheers co-creator and Friends director James Burrows dies at 85

Deepseek’s distillation new R1AI model can be run on a single GPU

2026 World Cup: How Adidas’ Trionda ball helped overturn offside decision

Prime Day Early Adult Toy Sale: Shop LELO, Womanizer and more

According to Pornhub, many women watch gay porn

The Whitlams announce return to Rock Island Australia tour

Your daily horoscope: June 20, 2026

Cheers co-creator and Friends director James Burrows dies at 85

Rich bassist Sixpence None dies at age 50

The Whitlams announce return to Rock Island Australia tour

Rich bassist Sixpence None dies at age 50

Adria Arjona’s red Roberto Cavalli dress at the ‘Supergirl’ fan event

Castilla-La Mancha Ignites Innovation: fiveclmsummit Redefines Tech Future

Local Power, Health Innovation: Alcolea de Calatrava Boosts FiveCLM PoC with Community Engagement

The Future of Digital Twins in Healthcare: From Virtual Replicas to Personalized Medical Models

Human Digital Twins: The Next Tech Frontier Set to Transform Healthcare and Beyond

What's Hot

Deepseek’s distillation new R1AI model can be run on a single GPU

Related Posts