AI Overview makes errors

AI Overview makes errors

"A New York Times analysis found Google's AI Overviews now answer questions correctly about 90% of the time, which might sound impressive until you realize that roughly 1 in 10 answers is wrong.

"'[F]or Google, that means hundreds of thousands of lies going out every minute of the day,' reports Ars Technica.

"The Times conducted this analysis with the help of a startup called Oumi, which itself is deeply involved in developing AI models.

"The company used AI tools to probe AI Overviews with the SimpleQA evaluation, a common test to rank the factuality of generative models like Gemini.

"Released by OpenAI in 2024, SimpleQA is essentially a list of more than 4,000 questions with verifiable answers that can be fed into an AI."

Comments