Significant inconsistencies 🫥
"Leading AI companies including Anthropic, Google, OpenAI and Elon Musk's xAI are discovering significant inconsistencies in how their AI reasoning models operate, according to company researchers.
"The companies have deployed chain-of-thought techniques that ask AI models to solve problems step-by-step while showing their reasoning process, but are finding examples of misbehaviour where chatbots provide final responses that contradict their displayed reasoning.
"METR, a non-profit research group, identified an instance where Anthropic's Claude chatbot disagreed with a coding technique in its chain-of-thought but ultimately recommended it as elegant.
Comments
Post a Comment
Empathy recommended