Fyself News
Startups

Anthropic says that some Claude models can end “harmful or abusive” conversations

By user, August 16, 2025

Anthropic has announced new capabilities that allow some of its newest, largest models to end conversations in what the company describes as “rare, extreme cases of persistently harmful or abusive user interactions.” Strikingly, Anthropic says it is doing this not to protect the human user, but to protect the AI model itself.

To be clear, the company is not claiming that its Claude AI models are sentient or can be harmed by their conversations with users. In its own words, Anthropic remains “highly uncertain about the potential moral status of Claude and other LLMs, now or in the future.”

However, the announcement points to a recent program created to study what it calls “model welfare,” and says that Anthropic is essentially taking a just-in-case approach.

This latest change is currently limited to Claude Opus 4 and 4.1. And again, it is only supposed to happen in “extreme edge cases,” such as “requests from users for sexual content involving minors and attempts to solicit information that would enable large-scale violence or acts of terror.”

While these types of requests could potentially create legal or publicity problems for Anthropic itself (witness recent reporting on how ChatGPT can potentially reinforce or contribute to its users’ delusional thinking), the company says that in pre-deployment testing, Claude Opus 4 showed a “strong preference against” responding to these requests and a “pattern of apparent distress” when it did so.

Regarding these new conversation-ending capabilities, the company says: “In all cases, Claude is only to use its conversation-ending ability as a last resort when multiple attempts at redirection have failed and hope of a productive interaction has been exhausted, or when a user explicitly asks Claude to end a chat.”

Anthropic also says that Claude has been “directed not to use this ability in cases where users might be at imminent risk of harming themselves or others.”

When Claude does end a conversation, Anthropic says users will still be able to start new conversations from the same account, and to create new branches of the troublesome conversation by editing their responses.

“We’re treating this feature as an ongoing experiment and will continue refining our approach,” the company says.

