Study: Transparency is Often Lacking in Datasets Used to Train Large Language Models

Zewe, Adam

The MIT Libraries is completing a major upgrade to DSpace@MIT. Starting May 5 2026, DSpace will remain functional, viewable, searchable, and downloadable, however, you will not be able to edit existing collections or add new material. We are aiming to have full functionality restored by May 18, 2026, but intermittent service interruptions may occur. Please email dspace-lib@mit.edu with any questions. Thank you for your patience as we implement this important upgrade.

Show simple item record

dc.contributor.author	Zewe, Adam
dc.date.accessioned	2024-10-31T13:05:02Z
dc.date.available	2024-10-31T13:05:02Z
dc.date.issued	2024-08-30
dc.identifier.uri	https://hdl.handle.net/1721.1/157453
dc.description.abstract	Researchers developed an easy-to-use tool that enables an AI practitioner to find data that suits the purpose of their model, which could improve accuracy and reduce bias.	en_US
dc.publisher	MIT News	en_US
dc.subject	LLSC	en_US
dc.subject	AI	en_US
dc.title	Study: Transparency is Often Lacking in Datasets Used to Train Large Language Models	en_US
dc.type	Article	en_US

Files in this item

Name:: Study Transparency is often ...
Size:: 622.0Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

LLSC Information
Information about the MIT Lincoln Laboratory Supercomputing Center

Show simple item record