DeepSeek rules

January 26, 2025

DeepSeek, a Chinese artificial intelligence (AI) lab, unveiled its free large language model (LLM) DeepSeek-V3 in late December 2024 and claims it was built in two months for just $5.58 million, a fraction of the time and cost required by its Silicon Valley competitors.

Following hot on its heels is an even newer model called DeepSeek-R1, released Monday (Jan. 20).

In third-party benchmark tests, DeepSeek-V3 matched the capabilities of OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet while outperforming others, such as Meta's Llama 3.1 and Alibaba's Qwen2.5, in tasks that included problem-solving, coding and math.

Now, R1 has also surpassed OpenAI's latest o1 model in many of the same tests.

This impressive performance at a fraction of the cost of other models, its semi-open-source nature, and its training on significantly fewer graphics processing units (GPUs) have wowed AI experts and raised the specter of China's AI models surpassing their U.S. counterparts.
