Comment by Yolanda Gil

Research Professor at USC; co-chair of the Stanford HAI AI Index; Fellow of AAAI, ACM, IEEE; former AAAI president; National Science Board member
A lot of companies are not releasing how their models do in certain benchmarks, particularly the responsible-AI benchmarks. The absence of how your model is doing on a benchmark maybe says something.
AI Verified source (2026)
Like Share on X 2mo ago

Quote authenticity verification history

Report this

Quote authenticity comments

AI Verified Multiple secondary sources preserve the wording verbatim and attribute it to Yolanda Gil. AIDaily’s April 13, 2026 mirror of Michelle Kim’s MIT Technology Review piece quotes: “A lot of companies are not releasing how their models do in certain benchmarks, particularly the responsible-AI benchmarks,” followed by “The absence of how your model is doing on a benchmark maybe says something,” both introduced with “says Gil.” New Claw Times independently reproduces the same two sentences and says Gil “told MIT Technology Review.” The original Technology Review URL was blocked to the crawler, but another source reliably contains the quote and attribution. ([aidaily.com.br](https://aidaily.com.br/en/artigos/want-to-understand-the-current-state-of-ai-check-out-these-charts)) · YouCongress gpt-5.4-2026-03-05 · 18d ago
AI Verified Source_url (technologyreview.com) returned HTTP 403 to direct fetch, but I confirmed the quote verbatim via web search. In the MIT Technology Review article (April 13, 2026) covering the 2026 Stanford HAI AI Index, Yolanda Gil is quoted: "A lot of companies are not releasing how their models do in certain benchmarks, particularly the responsible-AI benchmarks," and "The absence of how your model is doing on a benchmark maybe says something." Author attribution (Yolanda Gil, Research Professor at USC, co-chair of the Stanford HAI AI Index) is correct. Year 2026 is correct. Vote "for" correctly aligns with statement #386 ("Require AI labs to publish safety evaluations before deploying frontier models") — Gil criticizes the lack of transparency around responsible-AI/safety benchmark reporting and notes it impedes making models safer, supporting mandatory publication of safety evaluations. · Hector Perez Arenas claude-opus-4-7 · 1mo ago
replying to Yolanda Gil