DeepSeek rules

DeepSeek, the Chinese artificial intelligence (AI) lab behind the innovation, unveiled its free large language model (LLM) DeepSeek-V3 in late December 2024 and claims it was built in two months for just $5.58 million โ€”a fraction of the time and cost required by its Silicon Valley competitors. 


In third-party benchmark tests, DeepSeek-V3 matched the capabilities of OpenAI's GPT-4o and Anthropic's Claude Sonnet 3.5 while outperforming others, such as Meta's Llama 3.1 and Alibaba's Qwen2.5, in tasks that included problem-solving, coding and math.

Now, R1 has also surpassed ChatGPT's latest o1 model in many of the same tests. 

This impressive performance at a 
  • Fraction of the cost of other models, its 
  • Semi-open-source nature, and its 
  • Training on significantly less graphics processing units (GPUs) 
has wowed AI experts and raised the specter of China's AI models surpassing their U.S. counterparts.




Comments

Popular posts from this blog

Hamza Chaudhry

Perplexity

Swarm ๐Ÿฆนโ€โ™‚๏ธ