Comment by Seth Herd

AGI alignment researcher; Research Fellow at the Astera Institute
The fact that things look aligned most of the time when they're functioning in their chatbot, or very limited 'Assistant' roles, is very little evidence that they will be adequately aligned when they work much more independently and have much greater capability. Unverified source (2026)
Like Share on X 6h ago
Policy proposals and claims
replying to Seth Herd