Docling Technical Report

  • 2024-12-09 09:20:54
  • Christoph Auer, Maksym Lysak, Ahmed Nassar, Michele Dolfi, Nikolaos Livathinos, Panos Vagenas, Cesar Berrospi Ramis, Matteo Omenetti, Fabian Lindlbauer, Kasper Dinkla, Lokesh Mishra, Yusik Kim, Shubham Gupta, Rafael Teixeira de Lima, Valery Weber, Lucas Morin, Ingmar Meijer, Viktor Kuropiatnyk, Peter W. J. Staar
  • 0

Abstract

This technical report introduces Docling, an easy to use, self-contained,MIT-licensed open-source package for PDF document conversion. It is powered bystate-of-the-art specialized AI models for layout analysis (DocLayNet) andtable structure recognition (TableFormer), and runs efficiently on commodityhardware in a small resource budget. The code interface allows for easyextensibility and addition of new features and models.

 

Quick Read (beta)

loading the full paper ...