Add MathArena evaluation result for hmmt/hmmt_feb_2026
#20 opened about 19 hours ago
by
JasperDekoninck
Add MathArena evaluation result for aime/aime_2026
#19 opened about 19 hours ago
by
JasperDekoninck
Add SWE-Bench Verified evaluation results
#18 opened 2 days ago
by
nielsr
I tested Nemotron 3 Super in an Agentic Workflow [Video + Report]
#17 opened 2 days ago
by
zacksiri
Disobedient and "rude"
#16 opened 3 days ago
by
JoeSmith245
how to run on A100?
#15 opened 4 days ago
by
mark2000
AWQ? Autoround? Any ~int4 for vllm?
➕ 1
4
#12 opened 6 days ago
by
JoeSmith245
Add Terminal Bench 2.0 evaluation result
#11 opened 6 days ago
by
SaylorTwift
Add GPQA with tools evaluation result
#10 opened 6 days ago
by
SaylorTwift
Add GPQA evaluation result
#9 opened 6 days ago
by
SaylorTwift
Add MMLU-Pro evaluation result
#8 opened 6 days ago
by
SaylorTwift
Add HLE with tools evaluation result
#7 opened 6 days ago
by
SaylorTwift
Add HLE evaluation result
#6 opened 6 days ago
by
SaylorTwift
Video of Step-by-Step Review and Testing
👍 2
2
#5 opened 6 days ago
by
fahdmirzac