Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Posted on May 11, 2026 by safdargal12

Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

Anthropic has apparently investigated that behavior further, claiming in a post on X, “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.”

The company went into more detail in a blog post stating that since Claude Haiku 4.5, Anthropic’s models “never engage in blackmail [during testing], where previous models would sometimes do so up to 96% of the time.”

What accounts for the difference? The company said it found that training on “documents about Claude’s constitution and fictional stories about AIs behaving admirably improve alignment.”

Relatedly, Anthropic said it found training to be more effective when it includes “the principles underlying aligned behavior” and not just “demonstrations of aligned behavior alone.”

“Doing both together appears to be the most effective strategy,” the company said.

