NILE: Fast Natural Language Processing for Electronic Health Records

  • 2019-07-16 14:12:22
  • Sheng Yu, Tianrun Cai, Tianxi Cai
  • 0

Abstract

Objective: Narrative text in Electronic health records (EHR) contain richinformation for medical and data science studies. This paper introduces thedesign and performance of Narrative Information Linear Extraction (NILE), anatural language processing (NLP) package for EHR analysis that we share withthe medical informatics community. Methods: NILE uses a modified prefix-treesearch algorithm for named entity recognition, which can detect prefix andsuffix sharing. The semantic analyses are implemented as rule-based finitestate machines. Analyses include negation, location, modification, familyhistory, and ignoring. Result: The processing speed of NILE is hundreds tothousands times faster than existing NLP software for medical text. Theaccuracy of presence analysis of NILE is on par with the best performing modelson the 2010 i2b2/VA NLP challenge data. Conclusion: The speed, accuracy, andbeing able to operate via API make NILE a valuable addition to the NLP softwarefor medical informatics and data science.

 

Quick Read (beta)

loading the full paper ...