Skip to content
https://abc.microfintool.com/

ABC Tool

  • Home
  • About / Contect
    • PRIVACY POLICY
Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Posted on May 11, 2026 By safdargal12 No Comments on Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts
Blog

[ad_1]

Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

Apparently Anthropic has done more work around that behavior, claiming in a post on X, “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.”

The company went into more detail in a blog post stating that since Claude Haiku 4.5, Anthropic’s models “never engage in blackmail [during testing], where previous models would sometimes do so up to 96% of the time.”

What accounts for the difference? The company said it found that training on “documents about Claude’s constitution and fictional stories about AIs behaving admirably improve alignment.”

Related, Anthropic said that it found training to be more effective when it includes “the principles underlying aligned behavior” and not just “demonstrations of aligned behavior alone.”

“Doing both together appears to be the most effective strategy,” the company said.

Techcrunch event

San Francisco, CA
|
October 13-15, 2026

[ad_2]

Source link

Post Views: 20

Post navigation

❮ Previous Post: Cricut’s $99 craft cutting machine helped me feel creative again
Next Post: Vivo’s X300 Ultra has the best cameras in any phone ❯

You may also like

Configure Platform SSO for macOS: A complete guide
Blog
Configure Platform SSO for macOS: A complete guide
April 16, 2026
Amazon Eero and Leo devices are now safe from US router ban
Blog
Amazon Eero and Leo devices are now safe from US router ban
April 24, 2026
GoPro’s New Cameras Have One Feature I’m So Excited About
Blog
GoPro’s New Cameras Have One Feature I’m So Excited About
April 21, 2026
Upcoming changes to the browser choice screen, default apps, and app deletion for EU users – Latest News
Blog
Upcoming changes to the browser choice screen, default apps, and app deletion for EU users – Latest News
May 2, 2026

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recent Posts

  • Whoops! Microsoft Outlook Mac Update Removes Email Conversation History
  • Anthropic’s New Claude Tag Acts as a Virtual Coworker in Slack
  • Google Home will soon get better at recognizing you
  • Meta Pauses Employee-Tracking Program Following Internal Data Leak
  • White House drastically shortens deadline for dropping quantum-vulnerable crypto

Recent Comments

  1. Aeroski 2.0 Ski Fitness Workout Machine Review & Product Info on Gaming at the Gym? Here’s How to Sneak Some Playtime Into Workouts
  2. AI Logo Generator on Tech giant Oracle cuts 21,000 jobs as it embraces AI
  3. Microsoft’s Xbox 25th anniversary console comes in translucent green - ABC Tool on Deals: Samsung's latest Galaxy Z foldables discounted, iPhone 17 Pro, Pixel 10 Pro, Xiaomi 17T Pro also on sale
  4. A Fitbit Air combined with a wristwatch looks better than expected - ABC Tool on Samsung’s latest announcement should have everyone excited about future Galaxy phones
  5. uttzfyffuq on Best Meat Delivery Services for 2026

Archives

  • June 2026
  • May 2026
  • April 2026

Categories

  • Blog

Copyright © 2026 ABC Tool.

Theme: Oceanly News by ScriptsTown