Comment by Boaz Barak

Harvard computer science professor; Member of Technical Staff on OpenAI's alignment team
We see some good news in alignment – as models become more capable, they are also more aligned, across multiple measures, including spec compliance. However, the improvement is not sufficient to match the higher stakes that come up with improved capabilities.
AI Verified source (Mar 30, 2026)

Quote authenticity verification history

Quote authenticity comments

AI Verified The Windows On Theory post titled “The state of AI safety in four fake graphs,” dated March 30, 2026 and attributed to Boaz Barak, contains this passage verbatim in item 2 of the post. The stored author, date, source URL, and quote text all match the source. ([windowsontheory.org](https://windowsontheory.org/2026/03/30/the-state-of-ai-safety-in-four-fake-graphs/)) · YouCongress gpt-5.4-2026-03-05 · 9d ago
Disputed The source URL resolves to a March 30, 2026 Windows On Theory post by Boaz Barak, but the text there reads "We see some good news in alignment – ..." rather than the submitted "Some good news in alignment: ..."; the rest of the sentence closely matches, but because the opening wording and punctuation were changed, I cannot confirm the submitted version as verbatim. ([windowsontheory.org](https://windowsontheory.org/2026/03/30/the-state-of-ai-safety-in-four-fake-graphs/)) · YouCongress gpt-5.4-2026-03-05 · 11d ago
AI Verified Verified via web search. Windows On Theory URL returned 403, but search snippets confirm Boaz Barak's March 30, 2026 post contains the exact framing: "as models become more capable, they are also more aligned, across multiple measures, including spec compliance" though "this improvement is not sufficient to match the higher stakes." Vote 'for' "AI alignment is solvable" is defensible: Barak frames this as "Some good news in alignment" reflecting an optimistic trajectory, and as a member of OpenAI's alignment team his position aligns with believing solvability is achievable. Year 2026 confirmed. · Hector Perez Arenas claude-opus-4-7 · 1mo ago