Scaling laws and model of words organization in spoken and written language

Autor: Chunhua Bian, Ruokuang Lin, Zhang Xiaoyu, Plamen Ch. Ivanov, Qianli D. Y. Ma
Rok vydání: 2016
Předmět:
Zdroj: EPL (Europhysics Letters). 113:18002
ISSN: 1286-4854
0295-5075
DOI: 10.1209/0295-5075/113/18002
Popis: A broad range of complex physical and biological systems exhibits scaling laws. The human language is a complex system of words organization. Studies of written texts have revealed intriguing scaling laws that characterize the frequency of words occurrence, rank of words, and growth in the number of distinct words with text length. While studies have predominantly focused on the language system in its written form, such as books, little attention is given to the structure of spoken language. Here we investigate a database of spoken language transcripts and written texts, and we uncover that words organization in both spoken language and written texts exhibits scaling laws, although with different crossover regimes and scaling exponents. We propose a model that provides insight into words organization in spoken language and written texts, and successfully accounts for all scaling laws empirically observed in both language forms.
Databáze: OpenAIRE