Add MathArena evaluation result for aime/aime_2026

#19
by JasperDekoninck - opened

This PR adds a new MathArena evaluation result so it can be indexed on the model leaderboard page.

Model: nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16
Competition dataset id: MathArena/aime_2026
Score: 90.00
Result file: .eval_results/MathArena--aime_2026.yaml
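
In this PR the result file name appears to mirror the dataset id, with the `/` replaced by `--`. The sketch below only illustrates that apparent relationship. The helper name `result_file_for` is hypothetical, and the naming rule is an assumption inferred from this single example, not a documented convention.

```python
from pathlib import PurePosixPath


def result_file_for(dataset_id: str, root: str = ".eval_results") -> str:
    """Build a result-file path from a dataset id.

    Assumption: "/" in the dataset id becomes "--" and the file gets a
    ".yaml" suffix, as seen in this PR. This is a hypothetical helper,
    not part of any library.
    """
    file_name = dataset_id.replace("/", "--") + ".yaml"
    return str(PurePosixPath(root) / file_name)


print(result_file_for("MathArena/aime_2026"))
# prints ".eval_results/MathArena--aime_2026.yaml"
```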

The results are the same as the ones displayed on our webpage.

Note: this is an experimental feature, and we are still working to make it run as smoothly as possible.

