We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
Comment by Jan Leike
Former head of alignment at OpenAI; now VP of safety at Anthropic
Alignment is not solved, but it increasingly looks solvable. [...] Since I first wrote about it in 2022, pretraining has continued improving and reinforcement learning has become much more significant — and our techniques are keeping pace.
Unverified
source
(2026)
Policy proposals and claims
replying to Jan Leike