Comment by Tristan Harris

Center for Humane Technology cofounder
Anthropic actually has been the safest of them all and tried to and cares most about getting alignment right, et cetera. But you're also seeing them continue to decide to release the models, even with a lot of the misaligned behaviour that they're seeing of AI models that are self-exfiltrating or blackmailing people. Unverified source (2026)
Like Share on X 3d ago
Policy proposals and claims
replying to Tristan Harris