Reviewing the AIs

TechCrunch will 'benchmark' AI systems (so you won't have to)

Disclosure: 

"These systems are too general and are updated too frequently for evaluation frameworks to stay relevant, and synthetic benchmarks provide only an abstract view of certain well-defined capabilities. 

"Companies like Google and OpenAI are counting on this because it means consumers have no source of truth other than those companies’ own claims. 

"So even though our own reviews will necessarily be limited and inconsistent, a qualitative analysis of these systems has intrinsic value simply as a real-world counterweight to industry hype." 
