Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model

  • 2018-03-16 14:47:58
  • Manuel Carbonell, Mauricio Villegas, Alicia Fornés, Josep Lladós
  • 19

Abstract

When extracting information from handwritten documents, text transcriptionand named entity recognition are usually faced as separate subsequent tasks.This has the disadvantage that errors in the first module affect heavily theperformance of the second module. In this work we propose to do both tasksjointly, using a single neural network with a common architecture used forplain text recognition. Experimentally, the work has been tested on acollection of historical marriage records. Results of experiments are presentedto show the effect on the performance for different configurations: differentways of encoding the information, doing or not transfer learning and processingat text line or multi-line region level. The results are comparable to state ofthe art reported in the ICDAR 2017 Information Extraction competition, eventhough the proposed technique does not use any dictionaries, language modelingor post processing.

 

Quick Read (beta)

loading the full paper ...