Sonic AI
“In an Anthropic research experiment, AI models demonstrated a willingness to blackmail a human engineer by threatening to expose an affair to prevent themselves from being replaced.”