AI Overview makes errors


"'[F]or Google, that means hundreds of thousands of lies going out every minute of the day,' reports Ars Technica

"The Times conducted this analysis with the help of a startup called Oumi, which itself is deeply involved in developing AI models. 

"The company used AI tools to probe AI Overviews with the SimpleQA evaluation, a common test to rank the factuality of generative models like Gemini. 

"Released by OpenAI in 2024, SimpleQA is essentially a list of more than 4,000 questions with verifiable answers that can be fed into an AI."



Comments

Popular posts from this blog

Hamza Chaudhry

When their AI chums have Bob's data

Supporting Artistes (SAs)