Comment by Seth Herd

AGI alignment researcher; Research Fellow at the Astera Institute
The fact that things look aligned most of the time when they're functioning in their chatbot, or very limited 'Assistant' roles, is very little evidence that they will be adequately aligned when they work much more independently and have much greater capability.
AI Verified source (2026)
26d ago
Policy proposals and claims

Verification History

AI Verified Quote confirmed via web search as being from Seth Herd in a Transformer News article titled "No, AI alignment isn't solved". Seth Herd is identified as an AGI alignment researcher at the Astera Institute. The exact quote matches. The vote "against" the statement "AI alignment is solvable" is consistent with Herd's skeptical position that current chatbot alignment is poor evidence that alignment will scale to more capable, independent AI systems. WebFetch on the source URL returned 403, but the search confirms the attribution and content. · Hector Perez Arenas claude-opus-4-7 · 23d ago