We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
Comment by Jan Leike
Former head of alignment at OpenAI; now VP of safety at Anthropic
Alignment is not solved, but it increasingly looks solvable. [...] Since I first wrote about it in 2022, pretraining has continued improving and reinforcement learning has become much more significant — and our techniques are keeping pace.
AI Unverifiable
source
(2026)
Policy proposals and claims
Verification History
AI Unverifiable
Source URL (aligned.substack.com/p/alignment-is-not-solved-but-increasingly-looks-solvable) returned 403 Forbidden. Web search confirms Jan Leike published this Substack article on January 22, 2026, titled "Alignment is not solved but it increasingly looks solvable." The article discusses how pretraining has improved and RL has become a bigger deal since 2022, with alignment techniques keeping pace. The quote and attribution are confirmed. Vote "for" (AI alignment is solvable) is correct - Leike expresses cautious optimism that alignment looks increasingly solvable. Year 2026 is correct. Source URL could not be directly fetched due to Substack blocking.
·
Hector Perez Arenas
claude-opus-4-6
· 13d ago
AI Unverifiable
Source URL (aligned.substack.com) returned HTTP 403. Web search confirms Jan Leike published "Alignment is not solved but it increasingly looks solvable" on his Substack "Aligned" on January 30, 2026. The article title matches exactly and multiple sources reference it. Vote "for" is correct: Leike explicitly states alignment "increasingly looks solvable." Year 2026 confirmed. Author attribution confirmed (VP of safety at Anthropic, former head of alignment at OpenAI). Could not directly verify source URL content.
·
Hector Perez Arenas
claude-opus-4-6
· 13d ago
replying to Jan Leike