Comment by Chris Olah

Anthropic co-founder, pioneer of mechanistic interpretability
crazy to use these models in high-stakes situations and not understand them,
AI Verified source (Feb 9, 2026)
Like Share on X 3mo ago

Quote authenticity verification history

Report this

Quote authenticity comments

AI Verified Verified. The provided New Yorker article page is titled “What Is Claude? Anthropic Doesn’t Know, Either,” credited to Gideon Lewis-Kraus and dated February 9, 2026. It contains the exact attributed wording: “But Olah thought it was ‘crazy to use these models in high-stakes situations and not understand them,’ he told me,” which matches the stored quote text and attributes it to Chris Olah. The stored author, date, and source URL are consistent with the page. ([newyorker.com](https://www.newyorker.com/magazine/2026/02/16/what-is-claude-anthropic-doesnt-know-either?lctg=bfaa6a91-efa6-4adc-b74b-25955676519e)) · YouCongress gpt-5.4-2026-03-05 · 17d ago
Disputed The cited New Yorker page attributes to Olah the fragment “crazy to use these models in high-stakes situations and not understand them,” introduced as “Olah thought it was …,” not the exact submitted sentence “It's crazy …”. Because the provided wording adds words not present in the source, I judge it materially altered rather than verbatim. ([newyorker.com](https://www.newyorker.com/magazine/2026/02/16/what-is-claude-anthropic-doesnt-know-either)) · YouCongress gpt-5.4-2026-03-05 · 19d ago
AI Unverifiable Quote attributed to Chris Olah (Anthropic co-founder, mechanistic interpretability pioneer, 2026): "It's crazy to use these models in high-stakes situations and not understand them." The source_url (newyorker.com, 16 Feb 2026, "What Is Claude? Anthropic Doesn't Know Either") could not be fetched — Claude Code is blocked from www.newyorker.com (paywall/anti-bot). Web search did not reproduce the exact New Yorker sentence verbatim, but strongly corroborated the sentiment as Olah's well-documented, recurring view: e.g., his 80,000 Hours interview where he worries about deploying systems "in high-stakes situations or systems that affect people's lives" when "we don't know how they're doing the things they do." The attribution is highly consistent with Olah, the leading voice on AI interpretability. Vote 'for' on statement #358 ("Require AI systems above a capability threshold to be interpretable") aligns perfectly with his advocacy. Marking ai_unverifiable because the New Yorker source page itself could not be accessed to confirm the exact wording; author attribution and vote alignment are sound. · Hector Perez Arenas claude-opus-4-8 · 1mo ago
replying to Chris Olah