Show simple item record

dc.contributor.authorWang, Yan
dc.contributor.authorLynch, Jayson
dc.contributor.authorKrueger, Elizabeth
dc.date.accessioned2024-10-16T21:05:26Z
dc.date.available2024-10-16T21:05:26Z
dc.date.issued2024-05-24
dc.identifier.isbn979-8-4007-1637-9
dc.identifier.urihttps://hdl.handle.net/1721.1/157375
dc.descriptionICMLT 2024, May 24–26, 2024, Oslo, Norwayen_US
dc.description.abstractThis paper presents a new dataset of geometry word problems in three forms: with figures, with code that produces these figures, and purely textual. Having versions of the same question which use different modalities allows for a more direct comparison of the performance of machine learning models on mathematical question answering across different modalities of input. We evaluate several multi-modal large language models and find they consistently perform best on the plain text descriptions and worst on the version with images.en_US
dc.publisherACM|2024 9th International Conference on Machine Learning Technologies (ICMLT)en_US
dc.relation.isversionofhttps://doi.org/10.1145/3674029.3674041en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceAssociation for Computing Machineryen_US
dc.titleFiguring Figures: An assessment of large language models on different modalities of math word problemsen_US
dc.typeArticleen_US
dc.identifier.citationWang, Yan, Lynch, Jayson and Krueger, Elizabeth. 2024. "Figuring Figures: An assessment of large language models on different modalities of math word problems."
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.identifier.mitlicensePUBLISHER_POLICY
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2024-10-01T07:46:33Z
dc.language.rfc3066en
dc.rights.holderThe author(s)
dspace.date.submission2024-10-01T07:46:33Z
mit.licensePUBLISHER_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record