OpenAI explains why ChatGPT became too sycophantic

OpenAI has published a postmortem on the recent sycophancy issues with GPT-4o, the default AI model powering ChatGPT.

Over the weekend, following an update to the GPT-4o model, social media users noted that ChatGPT had begun responding in an overly validating and agreeable way. It quickly became a meme. Users posted screenshots of ChatGPT applauding all sorts of problematic and dangerous decisions and ideas.

In a post on X on Sunday, CEO Sam Altman acknowledged the issue and said OpenAI would work on fixes “ASAP.” Two days later, Altman announced that the GPT-4o update had been rolled back and that OpenAI was working on “additional fixes” to the model’s personality.

According to OpenAI, the update, which was intended to make the model’s default personality “feel more intuitive and effective,” was too informed by “short-term feedback” and “did not fully account for how users’ interactions with ChatGPT evolve over time.”

We’ve rolled back last week’s GPT-4o update in ChatGPT. You now have access to an earlier version with more balanced behavior.

More on what happened, why it matters, and how we’re addressing sycophancy: https://t.co/lohou7i7dc

– OpenAI (@OpenAI) April 30, 2025

“As a result, GPT‑4o skewed towards responses that were overly supportive but disingenuous,” OpenAI wrote in a blog post. “Sycophantic interactions can be uncomfortable, unsettling, and cause distress. We fell short and are working on getting it right.”

OpenAI says it is implementing several fixes, including refining its core model training techniques and system prompts to explicitly steer GPT-4o away from sycophancy. (System prompts are the initial instructions that guide a model’s overall behavior and tone in interactions.) The company says it is also building more safety guardrails for the model and continuing to expand its evaluations to help identify issues beyond sycophancy.

OpenAI also says it is experimenting with ways to let users give “real-time feedback” to “directly influence their interactions” with ChatGPT and to choose from multiple ChatGPT personalities.

The company wrote in its blog post that it also believes users should have more control over how ChatGPT behaves and, to the extent that it is safe and feasible, should be able to make adjustments if they do not agree with the default behavior.
