Discovering Power Laws in Entity Length

  • 2018-11-16 14:23:31
  • Xiaoshi Zhong, Erik Cambria, Jagath C. Rajapakse

Abstract

This paper presents a discovery that the length of the entities follows a family of scale-free power law distributions. The concept of entity here broadly includes the named entity, entity mention, time expression, and domain-specific entity that are well investigated in natural language processing and related areas. The power law distributions in entity length possess the scale-free property and have well-defined means and finite variances. We explain the phenomenon of power laws in entity length by the principle of least effort in communication and the preferential mechanism.

 
