I tested Mistral Small 4 in an Agentic Workflow

by zacksiri - opened 1 day ago

So this model failed one of the tests but passed the others. It's very odd, maybe it's something to do with MoE model.
https://upmaru.com/llm-tests/simple-tama-agentic-workflow-q1-2026/mistral-small-4

I also tried enabling reasoning_effort "high" it did not improve the situation.

zacksiri changed discussion status to closed 1 day ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment