On Friday, Anthropic debuted research unpacking how an AI system's "personality" - as in, tone, responses, and overarching motivation - changes and why. Researchers also tracked what makes a model "evil."
The Verge spoke with Jack Lindsey, an Anthropic researcher working on interpretability, who ha...