Comment by Dario Amodei

Overall, I am optimistic that a mixture of alignment training, mechanistic interpretability, efforts to find and publicly disclose concerning behaviors, safeguards, and societal-level rules can address AI autonomy risks. Unverified source (2026)
Like Share on X 4h ago
Policy proposals and claims
replying to Dario Amodei