Comment by Jan Leike

Former head of alignment at OpenAI; now VP of safety at Anthropic
I believe much more of our bandwidth should be spent getting ready for the next generations of models, on security, monitoring, preparedness, safety, adversarial robustness, (super)alignment, confidentiality, societal impact, and related topics. These problems are quite hard to get right, and I am concerned we aren't on a trajectory to get there. Over the past few months my team has been sailing against the wind. Sometimes we were struggling for compute and it was getting harder and harder to get this crucial research done. Unverified source (2024)
Like Share on X 13h ago
Polls
replying to Jan Leike