ox/GradeSchoolMath/ at main - ox/GradeSchoolMath

public

13.9 mb

About

State-of-the-art language models can match human performance on many tasks, but they still struggle to robustly perform multi-step mathematical reasoning. To diagnose the failures of current models and support research, we're releasing GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems. We find that even the largest transformer models fail to achieve high test performance, despite the conceptual simplicity of this problem distribution.

2 commits

1 contributor

1 download

13.9 mb

0 stars

Repository contents

5 tabular files 71.4%

2 text files 28.6%

Contributors

@ox