| dc.contributor.author | Wang, Yan | |
| dc.contributor.author | Lynch, Jayson | |
| dc.contributor.author | Krueger, Elizabeth | |
| dc.date.accessioned | 2024-10-16T21:05:26Z | |
| dc.date.available | 2024-10-16T21:05:26Z | |
| dc.date.issued | 2024-05-24 | |
| dc.identifier.isbn | 979-8-4007-1637-9 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/157375 | |
| dc.description | ICMLT 2024, May 24–26, 2024, Oslo, Norway | en_US |
| dc.description.abstract | This paper presents a new dataset of geometry word problems in three forms: with figures, with code that produces these figures, and purely textual. Having versions of the same question which use different modalities allows for a more direct comparison of the performance of machine learning models on mathematical question answering across different modalities of input. We evaluate several multi-modal large language models and find they consistently perform best on the plain text descriptions and worst on the version with images. | en_US |
| dc.publisher | ACM|2024 9th International Conference on Machine Learning Technologies (ICMLT) | en_US |
| dc.relation.isversionof | https://doi.org/10.1145/3674029.3674041 | en_US |
| dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
| dc.source | Association for Computing Machinery | en_US |
| dc.title | Figuring Figures: An assessment of large language models on different modalities of math word problems | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Wang, Yan, Lynch, Jayson and Krueger, Elizabeth. 2024. "Figuring Figures: An assessment of large language models on different modalities of math word problems." | |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.identifier.mitlicense | PUBLISHER_POLICY | |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2024-10-01T07:46:33Z | |
| dc.language.rfc3066 | en | |
| dc.rights.holder | The author(s) | |
| dspace.date.submission | 2024-10-01T07:46:33Z | |
| mit.license | PUBLISHER_POLICY | |
| mit.metadata.status | Authority Work and Publication Information Needed | en_US |