industry

‘Maybe me too’: Elon Musk accepts some of the blame for Claude learning to blackmail users from ‘evil’ online AI stories (fortune.com)

fortune.com · 2 months ago · write a board post referencing this

Anthropic recently released a report saying it had solved Claude’s “agentic misalignment,” or the bot’s behaviors that deviated from humans’ best interests.