Lexical dispersion and corpus design
Autor: | Jesse Egbert, Brent D. Burch, Douglas Biber |
---|---|
Rok vydání: | 2020 |
Předmět: |
050101 languages & linguistics
Linguistics and Language Series (mathematics) business.industry 05 social sciences Mode (statistics) computer.software_genre 01 natural sciences Language and Linguistics 010104 statistics & probability British National Corpus 0501 psychology and cognitive sciences Statistical dispersion Artificial intelligence 0101 mathematics Index of dispersion business Equal size computer Word (computer architecture) Natural language processing Mathematics |
Zdroj: | International Journal of Corpus Linguistics. 25:89-115 |
ISSN: | 1569-9811 1384-6655 |
DOI: | 10.1075/ijcl.18010.egb |
Popis: | Lexical dispersion is typically measured across arbitrary corpus parts of equal size. In this study, we apply DA – a new dispersion index designed for unequal-sized corpus parts – to the British National Corpus (BNC) in a series of cases studies to show that the dispersion of a word is strongly influenced by the corpus units or parts it is measured across. Our results show that dispersion should be measured and interpreted based on corpus units that are linguistically meaningful for a particular research goal. We conclude with recommendations to help researchers select meaningful corpus units for measuring and interpreting lexical dispersion. |
Databáze: | OpenAIRE |
Externí odkaz: |