Figuring Figures: An assessment of large language models on different modalities of math word problems

Wang, Yan; Lynch, Jayson; Krueger, Elizabeth

Author(s)

Wang, Yan; Lynch, Jayson; Krueger, Elizabeth

Download3674029.3674041.pdf (480.2Kb)

Publisher Policy

Terms of use

Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.

Metadata

Show full item record

Abstract

This paper presents a new dataset of geometry word problems in three forms: with figures, with code that produces these figures, and purely textual. Having versions of the same question which use different modalities allows for a more direct comparison of the performance of machine learning models on mathematical question answering across different modalities of input. We evaluate several multi-modal large language models and find they consistently perform best on the plain text descriptions and worst on the version with images.

Description

ICMLT 2024, May 24–26, 2024, Oslo, Norway

Date issued

2024-05-24

URI

https://hdl.handle.net/1721.1/157375

Department

Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory

Publisher

ACM|2024 9th International Conference on Machine Learning Technologies (ICMLT)

Citation

Wang, Yan, Lynch, Jayson and Krueger, Elizabeth. 2024. "Figuring Figures: An assessment of large language models on different modalities of math word problems."

Version: Final published version

ISBN

979-8-4007-1637-9

Collections

MIT Open Access Articles