Comment by Smitha Milli

I definitely don't think we should align it to a specific philosophical theory that isn't robust in the real world. But getting public input on many topics is very difficult, because the public has not had the time to think about many of them. There's a tension between the large scale of data needed to align AI and the intense effort needed to get quality signals about the public's true values. [...] We can't completely get rid of explicit specification: values change over time, and explicit specification is what preserves human agency. If somebody's values change after some time, they should have the agency to explicitly specify that change and steer the model. Unverified source (2026)
Policy proposals and claims
replying to Smitha Milli