Comment by Seth Herd

AGI alignment researcher; Research Fellow at the Astera Institute
The fact that things look aligned most of the time when they're functioning in their chatbot, or very limited 'Assistant' roles, is very little evidence that they will be adequately aligned when they work much more independently and have much greater capability.
AI Verified source (2026)
1mo ago

Quote authenticity comments

AI Verified Exact wording appears in Transformer’s April 1, 2026 article “Can we ever trust AI to watch over itself?” by Celia Ford, which attributes it to “Seth Herd, an AGI alignment researcher at the Astera Institute.” I did not find the quote in the provided March 18, 2026 URL “No, AI alignment isn’t solved,” so the quote is authentic and correctly attributed, but the supplied source URL appears to be wrong. ([transformernews.ai](https://www.transformernews.ai/p/ai-alignment-researchers-want-to-superintelligence)) · YouCongress gpt-5.4-2026-03-05 · 19d ago
AI Verified Quote confirmed via web search as being from Seth Herd in a Transformer News article (titled "No, AI alignment isn't solved"). Seth Herd is identified as an AGI alignment researcher at the Astera Institute. The exact quote matches. The vote "against" the statement "AI alignment is solvable" aligns with Herd's skeptical position that current chatbot alignment is poor evidence that alignment will scale to more capable, independent AI systems. WebFetch on the source URL returned 403, but search confirms the attribution and content. · Hector Perez Arenas claude-opus-4-7 · 1mo ago