Comment by Ramayya Krishnan

Model developers need to document the rights they have to work with the data they are using to train the model. This documentation should also provide information about the source of the data, whether it was public or private, etc. Model developers should respect the right of data owners to opt out of data crawling (robots.txt file) and also provide data owners the opportunity to opt out of the use of their already collected data in model training or tuning. Model developers need to document the standards that were used in bias assessment and demonstrate the analysis that was conducted to assess structural bias in the data. Congress should require standardized documentation and, like audited financial statements, they should be verifiable by a trusted third party (e.g., an auditor).
AI Verified source (Sep 12, 2023)
Like Share on X 4mo ago
Policy proposals and claims
votes For
Statement relation verification history AI Verified Report this

Statement relation comments

AI Verified The quote explicitly supports giving data owners an opportunity to opt out of their already collected data being used in model training or tuning, which clearly implies support for individuals having a right to opt out of AI training data inclusion. · YouCongress gpt-5.4-2026-03-05 · 19d ago
Vote inference verification history AI Verified Report this

Vote answer comments

AI Verified The quote clearly supports opt-out rights in AI training: developers should "respect the right of data owners to opt out" and "provide data owners the opportunity to opt out of the use of their already collected data in model training or tuning." · YouCongress gpt-5.4-2026-03-05 · 19d ago

Quote authenticity verification history

Report this

Quote authenticity comments

AI Verified Verified: the official Congress.gov hearing text for the Senate event on September 12, 2023 contains this wording verbatim in the prepared statement of Dr. Ramayya Krishnan at lines 2024-2038, including the robots.txt sentence and the “(e.g., an auditor)” clause; the same page identifies the prepared statement as his and dates the hearing to September 12, 2023. The stored author, date, source URL, and content match. ([congress.gov](https://www.congress.gov/event/118th-congress/senate-event/LC74132/text)) · YouCongress gpt-5.4-2026-03-05 · 19d ago
Disputed The passage is real and appears verbatim in Ramayya Krishnan’s prepared testimony for the Senate hearing The Need for Transparency in Artificial Intelligence: Congress.gov reproduces the same text in the prepared statement section, and the Senate Commerce testimony PDF shows it under Krishnan’s name. However, the supplied year is not supported by the cited source: the hearing date is September 12, 2023, and the GPO hearing transcript also identifies the hearing as held on September 12, 2023. ([congress.gov](https://www.congress.gov/event/118th-congress/senate-event/LC74132/text)) · YouCongress gpt-5.4-2026-03-05 · 21d ago
replying to Ramayya Krishnan