Openai’s ChatGpt agent can control your PC and perform tasks on your behalf, but how does it work and what’s the point?

Openai has launched the ChatGPT Agent, an upgrade to the flagship artificial intelligence (AI) model that is equipped with virtual computers and integrated toolkits.

These new tools allow agents to perform complex, multi-step tasks that do not allow previous iterations of ChatGPT. Control your computer and complete the tasks.

This powerful version still relies heavily on human opinion and supervision, but arrived just before Mark Zuckerberg announced that Meta researchers had observed a unique AI model showing signs of independent self-improvement. It was also released just before Openai released the GPT-5. This is the latest version of Openai’s chatbot.

The risk of advancing AI

Openai itself acknowledges the dangers of new agents and the increased autonomy. Company representatives say ChatGpt agents are “highly biological and chemically capable” and claim they may assist in the creation of chemical or biological weapons.

Compared to existing resources such as chemistry labs and textbooks, AI agents represent what biosecurity experts call the “competence escalation pathway.” AI can use countless resources to instantly integrate that data, merge knowledge across science fields, provide iterative troubleshooting such as expert mentoring, navigate supplier websites, fill out order forms, and bypass basic validation checks.

Virtual computers also allow agents to interact autonomously with files, websites and online tools, and can do much more potential harm if misused. Opportunities for data breaches or data manipulation, and opportunities for false behavior, such as financial fraud, are amplified in the case of rapid injection attacks or hijacking.

As Nyarko pointed out, these risks add to the implicit risks of traditional AI models and LLM.

“There are wider concerns across AI agents, such as how the way agents behave autonomously amplifies errors, introduces biases from public data, complicates the framework of responsibility, and unintentionally promotes psychological dependence,” he said.

In response to the new threat posed by the more matured models, Openai engineers are strengthening many safeguards, a company representative said in a statement.

These include threat modeling, dual use refusal training (models are taught to reject harmful requests for data that may analyze weaknesses by attacking the system themselves). However, a risk management assessment conducted in July 2025 by Saferai, a safety-focused nonprofit organization, called Openai’s Risk Management Policy, awarded a score of 33% out of 100%. Openai also recorded only C grades in the AI Safety Index, compiled by the leading AI Safety Company, Future of Life Institute.

Source link

What's Hot

The man who bet everything on AI and Bill Belichick

Making earth observation data useful to people

MSS claims NSA used 42 cyber tools in multi-stage attack on Beijing Time System

Openai’s ChatGpt agent can control your PC and perform tasks on your behalf, but how does it work and what’s the point?

Era of reionization: Astronomers look for signals from ‘one of the most unexplored epochs in the universe’

Iran’s volcano appears to have woken up – 700,000 years after its last eruption

Black eyes, orbital fractures, and retinal detachments: Pickleball-related eye injuries are on the rise in the U.S.

The man who bet everything on AI and Bill Belichick

Making earth observation data useful to people

MSS claims NSA used 42 cyber tools in multi-stage attack on Beijing Time System

OpenAI’s “Embarrassing” Mathematics | Tech Crunch

Immortality is No Longer Science Fiction: TwinH’s AI Breakthrough Could Change Everything

The AI Revolution: Beyond Superintelligence – TwinH Leads the Charge in Personalized, Secure Digital Identities

Revolutionize Your Workflow: TwinH Automates Tasks Without Your Presence

FySelf’s TwinH Unlocks 6 Vertical Ecosystems: Your Smart Digital Double for Every Aspect of Life

What's Hot

Openai’s ChatGpt agent can control your PC and perform tasks on your behalf, but how does it work and what’s the point?

The risk of advancing AI

Related Posts