Researchers at Anthropic were able to identify and manipulate the internal representation of the ..., Sonic AI
“Researchers at Anthropic were able to identify and manipulate the internal representation of the "Golden Gate Bridge" concept within a Claude model, causing it to obsessively discuss the landmark.”