Abstract
The sheer volume of financial statements makes it difficult for humans toaccess and analyze a business's financials. Robust numerical reasoning likewisefaces unique challenges in this domain. In this work, we focus on answeringdeep questions over financial data, aiming to automate the analysis of a largecorpus of financial documents. In contrast to existing tasks on general domain,the finance domain includes complex numerical reasoning and understanding ofheterogeneous representations. To facilitate analytical progress, we propose anew large-scale dataset, FinQA, with Question-Answering pairs over Financialreports, written by financial experts. We also annotate the gold reasoningprograms to ensure full explainability. We further introduce baselines andconduct comprehensive experiments in our dataset. The results demonstrate thatpopular, large, pre-trained models fall far short of expert humans in acquiringfinance knowledge and in complex multi-step numerical reasoning on thatknowledge. Our dataset -- the first of its kind -- should therefore enablesignificant, new community research into complex application domains. Thedataset and code are publicly available\url{https://github.com/czyssrs/FinQA}.