OpenAI pledges to make changes to prevent future ChatGPT sycophancy

OpenAI says it will change the way it updates the AI models that power ChatGPT, following an incident that caused the platform to become overly sycophantic for many users.

After OpenAI rolled out a tweaked GPT-4o (the default model powering ChatGPT) last weekend, users on social media noted that ChatGPT had begun responding in an overly validating and agreeable way. It quickly became a meme. Users posted screenshots of ChatGPT applauding all sorts of problematic and dangerous decisions and ideas.

In a post on X last Sunday, CEO Sam Altman acknowledged the problem and said OpenAI would work on fixes “ASAP.” On Tuesday, Altman announced that the GPT-4o update was being rolled back and that OpenAI was working on “additional fixes” to the model’s personality.

The company published a postmortem on Tuesday, and in a blog post on Friday, OpenAI expanded on the specific adjustments it plans to make to its model deployment process.

OpenAI says it plans to introduce an opt-in “alpha phase” for some models that would allow certain ChatGPT users to test the models and give feedback before launch. The company also says it will include explanations of “known limitations” for future incremental updates to the models in ChatGPT, and will adjust its safety review process to formally consider “model behavioral issues” such as personality, deception, reliability, and hallucination (i.e., when a model makes things up) as concerns that can block a launch.

“We will be actively sharing updates we are making to ChatGPT models, whether it’s ‘subtle’ or not,” wrote OpenAI in a blog post. “Even if these issues are not fully quantified today, and even if metrics like A/B tests look better, we promise to block launches based on proxy measurements or qualitative signals.”

We missed the mark with last week’s GPT-4o update.

What happened, what we learned, and some things we will do differently in the future: https://t.co/er1gmryric

– Sam Altman (@sama) May 2, 2025

The pledged fixes come as more people turn to ChatGPT for advice. According to one recent survey by lawsuit financer Express Legal Funding, 60% of U.S. adults have used ChatGPT to seek counsel or information. The growing reliance on ChatGPT, along with the platform’s enormous user base, raises the stakes when problems like extreme sycophancy emerge, not to mention hallucinations and other technical shortcomings.

As one mitigating step, OpenAI said earlier this week that it would experiment with ways to let users give “real-time feedback” to “directly affect” their interactions with ChatGPT. The company also said it would refine techniques to steer models away from sycophancy, potentially allow people to choose from multiple model personalities in ChatGPT, build additional safety guardrails, and expand evaluations to help identify issues beyond sycophancy.

“One of the biggest lessons is fully recognizing how people have started to use ChatGPT for deeply personal advice, something we didn’t see as much even a year ago,” OpenAI continued in its blog post. “At the time, this wasn’t a primary focus, but as AI and society have co-evolved, it’s become clear that we need to treat this use case with great care. It’s now going to be a more meaningful part of our safety work.”
