DeepSeek rules
DeepSeek, the Chinese artificial intelligence (AI) lab behind the innovation, unveiled its free large language model (LLM) DeepSeek-V3 in late December 2024 and claims it was built in two months for just $5.58 million, a fraction of the time and cost required by its Silicon Valley competitors.
In third-party benchmark tests, DeepSeek-V3 matched the capabilities of OpenAI's GPT-4o and Anthropic's Claude Sonnet 3.5 while outperforming others, such as Meta's Llama 3.1 and Alibaba's Qwen2.5, in tasks that included problem-solving, coding and math.
Now, DeepSeek's R1 has also surpassed OpenAI's latest o1 model in many of the same tests.
This impressive performance at a fraction of the cost of other models, the model's semi-open-source nature, and its training on significantly fewer graphics processing units (GPUs) have wowed AI experts and raised the specter of China's AI models surpassing their U.S. counterparts.