The model looks quite a bit better in the benchmarks so unless they overfit the ...

WiSaGaN · 2025-01-31T20:40:41 1738356041

My vibe question checking suggests otherwise. Even o3-mini-high is not as good as r1, even though it's faster than r1. Considering o3-mini is more expensive per token. It's not clear o3-mini-high is cheaper than r1 either even r1 probably consumes more token per answer.

kandesbunzler · 2025-01-31T21:36:39 1738359399

well in my anecdotal tests, o3 mini (free) performed better than r1

WiSaGaN · 2025-02-01T00:49:33 1738370973

I did math tests. Probably you did coding.

GaggiX · 2025-02-01T00:39:56 1738370396

Also in my coding testing o3 mini (free) is better than r1.