Model generation for word length frequencies in texts with the application of Zipf's order approach
Autor: | Hemlata Pande, H. S. Dhami |
---|---|
Rok vydání: | 2012 |
Předmět: |
Linguistics and Language
Word lists by frequency Relation (database) Basis (linear algebra) Zipf's law Parametric model Order (group theory) Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing) Arithmetic Power law Language and Linguistics Word (computer architecture) Mathematics |
Zdroj: | Journal of Quantitative Linguistics. 19:249-261 |
ISSN: | 1744-5035 0929-6174 |
DOI: | 10.1080/09296174.2012.714531 |
Popis: | In the present paper we attempted to generate a parametric model for word frequencies. In order to make this relation applicable, we arranged word lengths in accordance with their normalized frequencies. The pattern of occurrence of words containing different numbers of letters has been investigated on the basis of their Zipf's order and by applying power law for Zipf's order and frequencies. The applicability of the generated mathematical model for word length frequencies was verified for different texts. We also resolved the problem of establishing a relationship between word frequencies of higher Zipf's order with text length. |
Databáze: | OpenAIRE |
Externí odkaz: |