Emergent Convergence
"Large language models (LLMs) are increasingly deployed in collaborative settings, yet
little is known about how they coordinate
when treated as black-box agents. [pdf]
"We simulate 7,500 multi-agent, multi-round discussions
in an inductive coding task, generating over
125,000 utterances that capture both final annotations and their interactional histories.
"We
introduce process-level metrics —code stability, semantic self-consistency, and lexical confidence —alongside sentiment and convergence
measures, to track coordination dynamics.
"To
probe deeper alignment signals, we analyze
the evolving geometry of output embeddings,
showing that intrinsic dimensionality declines
over rounds, suggesting semantic compression.
"The results reveal that LLM groups converge
lexically and semantically, develop asymmetric
influence patterns, and exhibit negotiation-like
behaviors despite the absence of explicit role
prompting.
"This work demonstrates how black-box interaction analysis can surface emergent
coordination strategies, offering a scalable complement to internal probe-based interpretability
methods."
Comments
Post a Comment
Empathy recommended