Add MathArena evaluation result for hmmt/hmmt_feb_2026
#20
by
JasperDekoninck - opened
This PR adds a new MathArena evaluation result so it can be indexed on the model leaderboard page.
Model: nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16
Competition dataset id: MathArena/hmmt_feb_2026
Score: 84.85
Result file: .eval_results/MathArena--hmmt_feb_2026.yaml
The results are the same as the ones displayed on our webpage.
Note: this is an experimental feature, we are currently trying to make this work as smooth as possible.