Reinforcement learning combined with deep neural networks has recently performed remarkably well in many game genres. It has surpassed human-level performance in fixed game environments and turn-based two-player board games. However, to the best of our knowledge, current research has yet to produce a result that surpasses human-level performance in modern, complex fighting games. This is due to the inherent difficulties of real-time fighting games, including vast action spaces, action dependencies, and imperfect information. We overcame these challenges and built 1v1 battle AI agents for the commercial game "Blade & Soul". The trained agents competed against five professional gamers and achieved a win rate of 62%. This paper presents a practical reinforcement learning method that includes a novel self-play curriculum and data skipping techniques. Through the curriculum, three agents with different styles were created by reward shaping and trained against each other. Additionally, this paper proposes data skipping techniques that could increase data efficiency and facilitate exploration in vast spaces. Since our method can be applied generally to all two-player competitive games with vast action spaces, we anticipate its application to game development, including level design and automated balancing.