AI Overview makes errors
"'[F]or Google, that means hundreds of thousands of lies going out every minute of the day,' reports Ars Technica.
"The Times conducted this analysis with the help of a startup called Oumi, which itself is deeply involved in developing AI models.
"The company used AI tools to probe AI Overviews with the SimpleQA evaluation, a common test to rank the factuality of generative models like Gemini.
"Released by OpenAI in 2024, SimpleQA is essentially a list of more than 4,000 questions with verifiable answers that can be fed into an AI."
Comments
Post a Comment
Empathy recommended