Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games.

Dekun Wu,Haochen Shi, Zhiyuan Sun,Bang Liu

Findings of the Association for Computational Linguistics ACL 2024(2024)

引用 0|浏览13
暂无评分
摘要
In this study, we explore the application of Large Language Models (LLMs) inJubensha, a Chinese detective role-playing game and a novel area inArtificial Intelligence (AI) driven gaming. We introduce the first datasetspecifically for Jubensha, including character scripts and game rules, tofoster AI agent development in this complex narrative environment. Our workalso presents a unique multi-agent interaction framework using LLMs, allowingAI agents to autonomously engage in this game. To evaluate the gamingperformance of these AI agents, we developed novel methods measuring theirmastery of case information and reasoning skills. Furthermore, we incorporatedthe latest advancements in in-context learning to improve the agents'performance in information gathering, murderer identification, and logicalreasoning. The experimental results validate the effectiveness of our proposedmethods. This work aims to offer a novel perspective on understanding LLMcapabilities and establish a new benchmark for evaluating large languagemodel-based agents.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要