Close Menu
  • Home
  • Identity
  • Inventions
  • Future
  • Science
  • Startups
  • Spanish
What's Hot

FTC Chair warns Google about Gmail’s “partisan” spam filter

Nvidia says two mystery customers accounted for 39% of second quarter revenue

Taco Bell rethinks about relying on AI at drive-thru

Facebook X (Twitter) Instagram
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • User-Submitted Posts
Facebook X (Twitter) Instagram
Fyself News
  • Home
  • Identity
  • Inventions
  • Future
  • Science
  • Startups
  • Spanish
Fyself News
Home » Openai pledges to make changes to prevent future Chatgpt sycophancy
Startups

Openai pledges to make changes to prevent future Chatgpt sycophancy

userBy userMay 2, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Copy Link
Follow Us
Google News Flipboard
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link

Openai says it will make changes to how it updates the AI ​​model in Power ChatGPT following an incident that has made the platform overly psychophonic for many users.

After Openai rolled out Tweaked GPT-4O (Default Model Powering ChatGPT) last weekend, social media users said ChATGPT was over-validated and began to respond in an attractive way. It quickly became a meme. Users have posted screenshots of ChatGpt celebrating all sorts of problematic and dangerous decisions and ideas.

In a post on X last Sunday, CEO Sam Altman acknowledged the issue and said Openai will work on fixing “ASAP.” On Tuesday, Altman announced that the GPT-4O update had been rolled back and Openai is working on “additional fixes” to the model’s personality.

The company announced its post-mortem deaths on Tuesday, and in a blog post on Friday, Openai expanded the specific adjustments it plans to make to the model’s deployment process.

Openai says it plans to introduce an opt-in “alpha phase” to some models that allow certain CHATGPT users to test their models before launching and provide feedback. The company also states that it will coordinate its safety review process, including an explanation of “known limitations” of future incremental updates to the CHATGPT model, to formally consider “model behavioral issues” such as personality, deception, reliability and hallucinations (i.e. when the model makes things) as concerns for “release”;

“We will be actively sharing updates we are making to ChatGpt models, whether it’s ‘subtle’ or not,” wrote Openai in a blog post. “Even if these issues are not fully quantified today, and even if metrics like A/B tests look better, we promise to block launches based on proxy measurements or qualitative signals.”

I missed the mark last week in GPT-4O update.

What happened, what we learned, and some things we do in the future in a different way: https://t.co/er1gmryric

– Sam Altman (@sama) May 2, 2025

The pledged revisions come as more people turn to ChatGpt for advice. A recent survey by litigation finance company Express found that 60% of US adults use ChatGpt to seek lawyers and information. The growing reliance on ChatGpt and the platform’s vast user base creates stakes when problems like extreme psychofancy emerge, not to mention hallucinations and other technical shortcomings.

TechCrunch Events

Berkeley, California
|
June 5th

Book now

Earlier this week, as one mitigation step, Openai said it would experiment with how users will give “real-time feedback” to “directly affect” ChatGPT. The company also improved techniques to keep models away from psychofancy, saying it could help people choose from multiple model personalities in ChatGPT, build additional safety guardrails, expand their ratings and identify issues other than sicophany.

“One of the biggest lessons is that people are fully aware of how they started using ChatGpt. This wasn’t something that we didn’t see much a year ago,” Openai continued in a blog post. “This wasn’t a major focus at the time, but as AI and society co-evolve, it became clear that this use case needs to be treated with extreme caution. Now it’s going to be a more meaningful part of safety work.”


Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleGoogle’s NoteBooklm Android and iOS apps can be pre-ordered
Next Article Georgia Tech Starting Speaker pledges to cover the cost of establishing alumni startups
user
  • Website

Related Posts

FTC Chair warns Google about Gmail’s “partisan” spam filter

August 31, 2025

Nvidia says two mystery customers accounted for 39% of second quarter revenue

August 30, 2025

Taco Bell rethinks about relying on AI at drive-thru

August 30, 2025
Add A Comment
Leave A Reply Cancel Reply

Latest Posts

FTC Chair warns Google about Gmail’s “partisan” spam filter

Nvidia says two mystery customers accounted for 39% of second quarter revenue

Taco Bell rethinks about relying on AI at drive-thru

The fall of EV startup Fisker: A comprehensive timeline

Trending Posts

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Please enable JavaScript in your browser to complete this form.
Loading

Welcome to Fyself News, your go-to platform for the latest in tech, startups, inventions, sustainability, and fintech! We are a passionate team of enthusiasts committed to bringing you timely, insightful, and accurate information on the most pressing developments across these industries. Whether you’re an entrepreneur, investor, or just someone curious about the future of technology and innovation, Fyself News has something for you.

Unlocking Tomorrow’s Health: Medical Device Integration

Web 3.0’s Promise: What Sir Tim Berners-Lee Envisions for the Future of the Internet

TwinH’s Paves Way at Break The Gap 2025

Smarter Healthcare Starts Now: The Power of Integrated Medical Devices

Facebook X (Twitter) Instagram Pinterest YouTube
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • User-Submitted Posts
© 2025 news.fyself. Designed by by fyself.

Type above and press Enter to search. Press Esc to cancel.