Abstract
Understanding and reasoning over tables is a critical capability for manyreal-world applications. Large language models (LLMs) have shown promise onthis task, but current approaches remain limited. Fine-tuning based methodsstrengthen language reasoning; yet they are prone to arithmetic errors andhallucination. In contrast, tool-based methods enable precise tablemanipulation but rely on rigid schemas and lack semantic understanding. Thesecomplementary drawbacks highlight the need for approaches that integrate robustreasoning with reliable table processing. In this work, we proposeMixture-of-Minds, a multi-agent framework that decomposes table reasoning intothree specialized roles: planning, coding, and answering. This design enableseach agent to focus on a specific aspect of the task while leveraging codeexecution for precise table manipulation. Building on this workflow, weintroduce a self-improvement training framework that employs Monte Carlo TreeSearch (MCTS) rollouts to generate pseudo-gold trajectories and optimize agentswith reinforcement learning (RL). Extensive experiments show thatMixture-of-Minds delivers substantial gains, reaching 62.13% on TableBench andsurpassing OpenAI-o4-mini-high. These results demonstrate the promise ofcombining structured multi-agent workflows with RL to advance tableunderstanding.