Qwen2.5-Max ✨


The new model outperforms DeepSeek’s R1 model —whose success sent Nvidia’s stock plunging 17% on Monday —in several key benchmarks including Arena-Hard, LiveBench and LiveCodeBench. 

Qwen2.5-Max also demonstrates competitive results against industry leaders like GPT-4o and Claude-3.5-Sonnet in tests of advanced reasoning and knowledge.



Comments

Popular posts from this blog

Perplexity

Hamza Chaudhry