Abstract
We apply Reinforcement Learning algorithms to solve the classic quantitative finance Market Making problem, in which an agent provides liquidity to the market by placing buy and sell orders while maximizing a utility function. The optimal agent has to find a delicate balance between the price risk of her inventory and the profits obtained by capturing the bid-ask spread. We design an environment with a reward function that determines an order relation between policies equivalent to the original utility function. When comparing our agents with the optimal solution and a benchmark symmetric agent, we find that the Deep Q-Learning algorithm manages to recover the optimal agent.