Here comes the judge
In theory, the process allows the two agents to poke holes in each other’s arguments until the judge has enough information to discern the truth.
The approach was first proposed six years ago, but two sets of findings released earlier this year, one in February from the AI startup Anthropic and the second in July from Google DeepMind, offer the first empirical evidence that debate between two LLMs helps a judge (human or machine) recognize the truth.
“These works have been very important in what they’ve set out and contributed,” [Julian] Michael said.
They also offer new avenues to explore. To take one example, Michael and his group reported in September that training AI debaters to win, and not just to converse as in the past two studies, further increased the ability of non-expert judges to recognize the truth.