Abstract
Modern Large Language Models (LLMs) exhibit impressive zero-shot and few-shot generalization capabilities across complex natural language tasks, enabling their widespread use as virtual assistants for diverse applications such as translation and summarization. Despite being trained solely on large corpora of text without explicit supervision on author intent, LLMs appear to infer the underlying meaning of textual interactions. This raises a fundamental question: can LLMs model and reason about the intentions of others, i.e., do they possess a form of theory of mind? Understanding others' intentions is crucial for effective collaboration, which underpins human societal success and is essential for cooperative interactions among multiple agents, including humans and autonomous systems. In this work, we investigate theory of mind in LLMs through the lens of cooperative multi-agent reinforcement learning (MARL), where agents learn to collaborate via repeated interactions, mirroring human social reasoning. Our approach aims to enhance artificial agents' ability to adapt and cooperate with both artificial and human partners. By leveraging LLM-based agents capable of natural language interaction, we move towards creating hybrid human-AI systems that can foster seamless collaboration, with broad implications for the future of human-artificial interaction.