Abstract
We present olmOCR 2, the latest in our family of powerful OCR systems forconverting digitized print documents, like PDFs, into clean, naturally orderedplain text. olmOCR 2 is powered by olmOCR-2-7B-1025, a specialized, 7B visionlanguage model (VLM) trained using reinforcement learning with verifiablerewards (RLVR), where our rewards are a diverse set of binary unit tests. Toscale unit test creation, we develop a pipeline for generating syntheticdocuments with diverse and challenging layouts, known ground-truth HTML sourcecode, and extracted test cases. We show that RL training on these test casesresults in state-of-the-art performance on olmOCR-Bench, our English-languageOCR benchmark, with the largest improvements in math formula conversion, tableparsing, and multi-column layouts compared to previous versions. We release ourmodel, data and code under permissive open licenses.