Chinese Word Segmentation: Another Decade Review (2007-2017)

  • 2019-01-18 04:16:56
  • Hai Zhao, Deng Cai, Changning Huang, Chunyu Kit
  • 4

Abstract

This paper reviews the development of Chinese word segmentation (CWS) in themost recent decade, 2007-2017. Special attention was paid to the deep learningtechnologies that has already permeated into most areas of natural languageprocessing (NLP). The basic view we have arrived at is that compared totraditional supervised learning methods, neural network based methods have notshown any superior performance. The most critical challenge still lies onbalancing of recognition of in-vocabulary (IV) and out-of-vocabulary (OOV)words. However, as neural models have potentials to capture the essentiallinguistic structure of natural language, we are optimistic about significantprogresses may arrive in the near future.

 

Quick Read (beta)

loading the full paper ...