
Bessie Oxington
ox
User account
ox's Repositories
Displaying Page 5 of 12 (111 total Repositories)
public
0
80.6 mb
1
public
1
100.2 mb
8.1K22
public
0
11.4 mb
31
public
1
State-of-the-art language models can match human performance on many tasks, but they still struggle to robustly perform multi-step mathematical reasoning. To diagnose the failures of current models and support research, we're releasing GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems. We find that even the largest transformer models fail to achieve high test performance, despite the conceptual simplicity of this problem distribution.
13.9 mb
52
public
4
100.3 mb
228.1K
public
0
8.8 gb
5424K