Close Menu
  • Academy
  • Events
  • Identity
  • International
  • Inventions
  • Startups
    • Sustainability
  • Tech
  • Spanish
What's Hot

Malicious browser extensions will infect 722 users across Latin America since early 2025

Trump officials vow to lift school separation orders

Should the government ban AI-generated humans to stop the collapse of social trust?

Facebook X (Twitter) Instagram
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • User-Submitted Posts
Facebook X (Twitter) Instagram
Fyself News
  • Academy
  • Events
  • Identity
  • International
  • Inventions
  • Startups
    • Sustainability
  • Tech
  • Spanish
Fyself News
Home » Two undergraduates have built an AI speech model comparable to Notebooklm
Startups

Two undergraduates have built an AI speech model comparable to Notebooklm

userBy userApril 22, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Copy Link
Follow Us
Google News Flipboard
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link

Although they don’t have extensive AI expertise, they say they have created an openly available AI model that can generate clips in a similar podcast style to Google’s NoteBookLM.

The market for synthetic speech tools is vast and growing. ElevenLabs is one of the biggest players, but there is no shortage of challengers (see Playai, Sesame, etc.). Investors believe these tools have great potential. According to Pitchbook, Startups Dearaling Voice AI Tech raised more than $398 million in VC funding last year.

Toby Kim, one of the co-founders of Nari Labs, the group behind the newly released model, said he and his fellow co-founders had begun learning about Speech AI three months ago. Inspired by Notebooklm, they wanted to create a model that provided more control over the generated voice and “freedom of scripts.”

Kim says he used Google’s TPU Research Cloud Program. This allows researchers to access the company’s TPU AI chips for free to train Nari’s model, DIA. DIA weighs 1.6 billion parameters and generates dialogs from scripts, allowing users to customize speaker tones and insert disfluence, cough, laughter and other nonverbal cues.

Parameters are internal variables used by the model and create predictions. In general, models with more parameters will perform better.

Available by hugging the face of the AI ​​Dev platform and Github, DIA can run on most modern PCs with at least 10GB of VRAM. It produces random audio unless prompted with the intended style description, but you can also clone a person’s voice.

In a quick test of TechCrunch’s DIA via Nari’s web demo, DIA worked very well and generated two-way chats for every subject. Voice quality appears to be competitive with other tools, and the voice clone feature is the easiest thing this reporter has tried.

Here’s a sample:

But like many audio generators, DIA offers little protection. Creating recordings of disinformation and fraudsters is trivial. On the DIA project page, Nari dissuades the model from abuse, pretends to, deceives, or engages in illegal campaigns, but the group says it is “not responsible” for the misuse.

Nari also has not revealed which data has been shattered to train DIAs. DIA may have been developed using copyrighted content. Hacker News commenters point out that one sample sounds like a host of NPR’s “Planet Money” podcast. The training model for copyrighted content is a broad but legally questionable practice. Some AI companies argue that fair use protects them from liability, but rights holders argue that fair use doesn’t apply to training.

In any case, Kim says Nari’s plan is to create a synthetic speech platform with a “social aspect” above DIA and create a larger model for the future. Nari will also release a technical report from DIA to expand support for the model for languages ​​beyond English.


Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleAdaptive Computer wants to reinvent PCs with “Vibe” which codes non-programmers
Next Article DOJ Antitrust Trial exposes Google push to dominate Android using Gemini AI, Chrome and search
user
  • Website

Related Posts

Lawyers could face “severe” penalties for quotes generated by fake AI, UK courts warn

June 7, 2025

Review Week: Why Humanity’s Cut Access to Windsurf

June 7, 2025

Will Musk vs. Trump affect Xai’s $5 billion debt transaction?

June 7, 2025
Add A Comment
Leave A Reply Cancel Reply

Latest Posts

Malicious browser extensions will infect 722 users across Latin America since early 2025

Trump officials vow to lift school separation orders

Should the government ban AI-generated humans to stop the collapse of social trust?

Lawyers could face “severe” penalties for quotes generated by fake AI, UK courts warn

Trending Posts

Sana Yousaf, who was the Pakistani Tiktok star shot by gunmen? |Crime News

June 4, 2025

Trump says it’s difficult to make a deal with China’s xi’ amid trade disputes | Donald Trump News

June 4, 2025

Iraq’s Jewish Community Saves Forgotten Shrine Religious News

June 4, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Please enable JavaScript in your browser to complete this form.
Loading

Welcome to Fyself News, your go-to platform for the latest in tech, startups, inventions, sustainability, and fintech! We are a passionate team of enthusiasts committed to bringing you timely, insightful, and accurate information on the most pressing developments across these industries. Whether you’re an entrepreneur, investor, or just someone curious about the future of technology and innovation, Fyself News has something for you.

Should the government ban AI-generated humans to stop the collapse of social trust?

AB will be released at Binance -Tech Startups

Top 10 Startups and Tech Funding News for the Weekly Ends June 6, 2025

Order openai to keep all chatgpt logs including deleted temporary chats, API requests

Facebook X (Twitter) Instagram Pinterest YouTube
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • User-Submitted Posts
© 2025 news.fyself. Designed by by fyself.

Type above and press Enter to search. Press Esc to cancel.