Distributional Reinforcement Learning on Path-dependent Options

Abstract

We reinterpret and propose a framework for pricing path-dependent financial derivatives by estimating the full distribution of payoffs using Distributional Reinforcement Learning (DistRL). Unlike traditional methods that focus on expected option value, our approach models the entire conditional distribution of payoffs, allowing for risk-aware pricing, tail-risk estimation, and enhanced uncertainty quantification. We demonstrate the efficacy of this method on Asian options, using quantile-based value function approximators.
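
To make the distributional view concrete, the following minimal sketch (an illustration only, not the method developed in the paper) simulates discounted payoffs of an arithmetic-average Asian call and summarizes them by quantiles rather than by the mean alone. The geometric Brownian motion dynamics, all parameter values, and the helper names `simulate_asian_payoffs`, `pinball_loss`, and `expected_shortfall` are assumptions introduced here. The pinball loss shown is the standard quantile-regression objective that quantile-based approximators typically minimize; here it is only evaluated at empirical quantiles, with no learned value function.

```python
import numpy as np


def simulate_asian_payoffs(s0=100.0, strike=100.0, r=0.05, sigma=0.2,
                           maturity=1.0, n_steps=50, n_paths=20000, seed=0):
    """Discounted payoffs of an arithmetic-average Asian call.

    Assumes geometric Brownian motion under the risk-neutral measure;
    every parameter value here is illustrative.
    """
    rng = np.random.default_rng(seed)
    dt = maturity / n_steps
    z = rng.standard_normal((n_paths, n_steps))
    log_increments = (r - 0.5 * sigma ** 2) * dt + sigma * np.sqrt(dt) * z
    paths = s0 * np.exp(np.cumsum(log_increments, axis=1))
    average_price = paths.mean(axis=1)  # arithmetic average over monitoring dates
    return np.exp(-r * maturity) * np.maximum(average_price - strike, 0.0)


def pinball_loss(theta, samples, tau):
    """Mean quantile (pinball) loss of the scalar estimate theta at level tau."""
    diff = samples - theta
    return float(np.mean(np.maximum(tau * diff, (tau - 1.0) * diff)))


def expected_shortfall(samples, level):
    """Mean of the samples at or above the empirical `level` quantile."""
    threshold = np.quantile(samples, level)
    return float(samples[samples >= threshold].mean())


payoffs = simulate_asian_payoffs()
taus = np.array([0.1, 0.5, 0.9, 0.95])
quantiles = np.quantile(payoffs, taus)

print("Monte Carlo price (mean payoff):", payoffs.mean())
for tau, q in zip(taus, quantiles):
    print(f"tau={tau:.2f}  quantile={q:.4f}  pinball loss={pinball_loss(q, payoffs, tau):.4f}")
print("Mean payoff beyond the 95% quantile:", expected_shortfall(payoffs, 0.95))
```

The mean recovers the usual Monte Carlo price, while the quantiles and the upper-tail average expose the payoff-distribution information that an expectation-only method discards.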
