HDLTex: Hierarchical Deep Learning for Text Classification

  • 2017-10-06 18:16:31
  • Kamran Kowsari, Donald E. Brown, Mojtaba Heidarysafa, Kiana Jafari Meimandi, Matthew S. Gerber, Laura E. Barnes

Abstract

The continually increasing number of documents produced each year necessitates ever improving information processing methods for searching, retrieving, and organizing text. Central to these information processing methods is document classification, which has become an important application for supervised learning. Recently the performance of these traditional classifiers has degraded as the number of documents has increased. This is because along with this growth in the number of documents has come an increase in the number of categories. This paper approaches this problem differently from current document classification methods that view the problem as multi-class classification. Instead we perform hierarchical classification using an approach we call Hierarchical Deep Learning for Text classification (HDLTex). HDLTex employs stacks of deep learning architectures to provide specialized understanding at each level of the document hierarchy.
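
As a rough illustration of the hierarchical idea described in the abstract, the sketch below trains one model on top-level labels and a separate model per top-level category on its sub-labels, then routes each document through both levels. It is not the authors' implementation: `WordCountClassifier` is a hypothetical toy stand-in for a deep network, and the class names, labels, and example texts are assumptions made purely for illustration.

```python
from collections import Counter, defaultdict


class WordCountClassifier:
    """Toy stand-in for a deep network (assumption): scores labels by word overlap."""

    def fit(self, texts, labels):
        self.counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.counts[label].update(text.lower().split())
        return self

    def predict(self, text):
        words = text.lower().split()
        # Choose the label whose training word counts overlap most with the text.
        return max(
            sorted(self.counts),
            key=lambda label: sum(self.counts[label][w] for w in words),
        )


class HierarchicalClassifier:
    """One parent-level model plus one child-level model per parent category."""

    def fit(self, texts, parents, children):
        self.parent_model = WordCountClassifier().fit(texts, parents)
        # Group the training documents by parent label.
        grouped = defaultdict(lambda: ([], []))
        for text, parent, child in zip(texts, parents, children):
            grouped[parent][0].append(text)
            grouped[parent][1].append(child)
        # Each child model only sees documents from its own parent category.
        self.child_models = {
            parent: WordCountClassifier().fit(t, c)
            for parent, (t, c) in grouped.items()
        }
        return self

    def predict(self, text):
        parent = self.parent_model.predict(text)
        child = self.child_models[parent].predict(text)
        return parent, child


# Hypothetical two-level training data.
texts = [
    "neural network training gradient",
    "operating system kernel scheduling",
    "protein gene expression cell",
    "tumor cancer therapy patient",
]
parents = ["computer science", "computer science", "biomedicine", "biomedicine"]
children = ["machine learning", "operating systems", "genetics", "oncology"]

model = HierarchicalClassifier().fit(texts, parents, children)
print(model.predict("gradient descent for network training"))
# prints ('computer science', 'machine learning')
```

Because each second-level model is trained only on documents from its own parent category, it has to separate far fewer classes than a single flat multi-class model would.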

 
