Abstract
The utilization of artificial intelligence (AI) in card games has been awell-explored subject within AI research for an extensive period. Recentadvancements have propelled AI programs to showcase expertise in intricate cardgames such as Mahjong, DouDizhu, and Texas Hold'em. In this work, we aim todevelop an AI program for an exceptionally complex and popular card game calledGuanDan. This game involves four players engaging in both competitive andcooperative play throughout a long process to upgrade their level, posing greatchallenges for AI due to its expansive state and action space, long episodelength, and complex rules. Employing reinforcement learning techniques,specifically Deep Monte Carlo (DMC), and a distributed training framework, wefirst put forward an AI program named DanZero for this game. Evaluation againstbaseline AI programs based on heuristic rules highlights the outstandingperformance of our bot. Besides, in order to further enhance the AI'scapabilities, we apply policy-based reinforcement learning algorithm toGuanDan. To address the challenges arising from the huge action space, whichwill significantly impact the performance of policy-based algorithms, we adoptthe pre-trained model to facilitate the training process and the achieved AIprogram manages to achieve a superior performance.