Close Menu
  • Home
  • Identity
  • Inventions
  • Future
  • Science
  • Startups
  • Spanish
What's Hot

Researchers have discovered a critical GitHub CVE-2026-3854 RCE flaw that can be exploited via a single Git push

Apple introduces cheaper option for App Store subscriptions

VECT 2.0 ransomware irreversibly destroys files larger than 131KB on Windows, Linux, and ESXi

Facebook X (Twitter) Instagram
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • User-Submitted Posts
Facebook X (Twitter) Instagram
Fyself News
  • Home
  • Identity
  • Inventions
  • Future
  • Science
  • Startups
  • Spanish
Fyself News
Home » DeepSeek releases a “sparse warning” model that cuts API costs by half
Startups

DeepSeek releases a “sparse warning” model that cuts API costs by half

By September 29, 2025No Comments2 Mins Read
Share Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Copy Link
Follow Us
Google News Flipboard
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link

On Monday, researchers at DeepSeek released a new experimental model called V3.2-EXP, designed to dramatically reduce inference costs when used in long context operations. Deepseek announced the model in a post about Face’s hugs and posted an academic paper linked to Github.

The most important feature of the new model is called DeepSeek Sparse Anterest. This is a complex system explained in detail in the diagram below. Essentially, the system uses a module called “Lightning Indencer” to prioritize certain excerpts from the context window. Another system, called the “fine-grained token selection system,” then selects a specific token from within these excerpts and loads it into the module’s limited attention window. In summary, sparse attention models can work so that server loads over long sections of relatively small contexts.

Screenshot

For long-context operations, the advantages of the system are important. A preliminary test by DeepSeek shows that the price of simple API calls can be reduced by half in long context situations. Building a more robust assessment will require further testing, but since the models are openweight and freely available, it will not be long before third-party tests can evaluate claims made in the paper.

Deepseek’s new model is one of the recent breakthroughs tackling the issue of inference costs. Essentially, it is the server cost for manipulating a pre-trained AI model that is different from the cost of training. In Deepseek’s case, researchers were looking for ways to make basic transformer architectures work more efficiently.

China-based Deepseek was a rare figure in the AI ​​boom, especially those who view AI research as a nationalist struggle between the US and China. The company made waves in the R1 model early in the year, and was trained using reinforcement learning, primarily at a much lower cost than its American competitors. However, this model has not triggered a wholesale revolution in AI training, as some have predicted. The company then retreated from the spotlight in those few months.

The new “sparse attention” approach is unlikely to produce the same uproar as R1, but it can teach providers the tricks needed to keep inference costs low.


Source link

#Aceleradoras #CapitalRiesgo #EcosistemaStartup #Emprendimiento #InnovaciónEmpresarial #Startups
Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleOpenai takes on Google, Amazon with new agent shopping system
Next Article AI recruiter Alex raises $17 million to automate your first job interview

Related Posts

Apple introduces cheaper option for App Store subscriptions

April 28, 2026

Snapchat introduces AI-powered conversational ads to its app

April 28, 2026

Red Hat’s OpenClaw maintainers have made enterprise Claw deployments more secure

April 28, 2026
Add A Comment
Leave A Reply Cancel Reply

Latest Posts

Researchers have discovered a critical GitHub CVE-2026-3854 RCE flaw that can be exploited via a single Git push

Apple introduces cheaper option for App Store subscriptions

VECT 2.0 ransomware irreversibly destroys files larger than 131KB on Windows, Linux, and ESXi

Snapchat introduces AI-powered conversational ads to its app

Trending Posts

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Please enable JavaScript in your browser to complete this form.
Loading

Welcome to Fyself News, your go-to platform for the latest in tech, startups, inventions, sustainability, and fintech! We are a passionate team of enthusiasts committed to bringing you timely, insightful, and accurate information on the most pressing developments across these industries. Whether you’re an entrepreneur, investor, or just someone curious about the future of technology and innovation, Fyself News has something for you.

Castilla-La Mancha Ignites Innovation: fiveclmsummit Redefines Tech Future

Local Power, Health Innovation: Alcolea de Calatrava Boosts FiveCLM PoC with Community Engagement

The Future of Digital Twins in Healthcare: From Virtual Replicas to Personalized Medical Models

Human Digital Twins: The Next Tech Frontier Set to Transform Healthcare and Beyond

Facebook X (Twitter) Instagram Pinterest YouTube
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • User-Submitted Posts
© 2026 news.fyself. Designed by by fyself.

Type above and press Enter to search. Press Esc to cancel.