Comment by Jan Leike

Former head of alignment at OpenAI; now VP of safety at Anthropic
"Alignment is not solved, but it increasingly looks solvable. [...] Since I first wrote about it in 2022, pretraining has continued improving and reinforcement learning has become much more significant — and our techniques are keeping pace." (2026)
Policy proposals and claims
replying to Jan Leike