Text Generation
Transformers
Safetensors
PyTorch
nemotron_h
nvidia
nemotron-3
latent-moe
mtp
conversational
custom_code
Eval Results
JasperDekoninck commited on
Commit
be0ac34
·
verified ·
1 Parent(s): 49ad1f4

Add MathArena evaluation result for aime/aime_2026

Browse files

This PR adds a new MathArena evaluation result so it can be indexed on the model leaderboard page.

Model: nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16
Competition dataset id: MathArena/aime_2026
Score: 90.00
Result file: .eval_results/MathArena--aime_2026.yaml

The results are the same as the ones displayed on [our webpage](https://matharena.ai/?view=problem&comp=aime--aime_2026).

Note: this is an experimental feature, we are currently trying to make this work as smooth as possible.

.eval_results/MathArena--aime_2026.yaml ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ - dataset:
2
+ id: MathArena/aime_2026
3
+ task_id: MathArena/aime_2026
4
+ value: 90.0
5
+ date: '2026-03-17'
6
+ source:
7
+ url: https://matharena.ai/?comp=aime--aime_2026
8
+ name: Official MathArena Evaluation