Zobrazeno 1 - 10
of 64
pro vyhledávání: '"Jonathan Dunn"'
Autor:
Jonathan Dunn
Corpus analysis can be expanded and scaled up by incorporating computational methods from natural language processing. This Element shows how text classification and text similarity models can extend our ability to undertake corpus linguistics across
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::6f9457de6fe70d999f984d8904e95e1e
https://doi.org/10.1017/9781009070447
https://doi.org/10.1017/9781009070447
Autor:
Jonathan Dunn
This paper uses computational experiments to explore the role of exposure in the emergence of construction grammars. While usage-based grammars are hypothesized to depend on a learner’s exposure to actual language use, the mechanisms of such exposu
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c5fa2aed27eb7fbd995e6987e8fa3b58
This paper measures the stability of cross-linguistic register variation. A register is a variety of a language that is associated with extra-linguistic context. The relationship between a register and its context is functional: the linguistic featur
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5471449103c175532bfd0e82e04a5519
Autor:
Haipeng Li, Jonathan Dunn
Publikováno v:
Lingua. 275:103377
This paper experiments with frequency-based corpus similarity measures across 39 languages using a register prediction task. The goal is to quantify (i) the distance between different corpora from the same language and (ii) the homogeneity of individ
Autor:
Jonathan Dunn
This paper develops a construction-based dialectometry capable of identifying previously unknown constructions and measuring the degree to which a given construction is subject to regional variation. The central idea is to learn a grammar of construc
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5eaac48a0c41eed404553a65646d589a
http://arxiv.org/abs/2104.01299
http://arxiv.org/abs/2104.01299
Autor:
Jonathan Dunn
This paper describes a web-based corpus of global language use with a focus on how this corpus can be used for data-driven language mapping. First, the corpus provides a representation of where national varieties of major languages are used (e.g., En
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6bc71781f013a0f832dde678211578dd
http://arxiv.org/abs/2004.00798
http://arxiv.org/abs/2004.00798
Autor:
Jonathan Dunn
Publikováno v:
Script-Based Semantics ISBN: 9781501511707
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::d57a159460cab2f8e5198b3dba5f28c8
https://doi.org/10.1515/9781501511707-004
https://doi.org/10.1515/9781501511707-004
Publikováno v:
Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science.
Computational measures of linguistic diversity help us understand the linguistic landscape using digital language data. The contribution of this paper is to calibrate measures of linguistic diversity using restrictions on international travel resulti
Autor:
Jonathan Dunn
Publikováno v:
International Journal of Corpus Linguistics. 23:183-215
This paper formulates and evaluates a series of multi-unit measures of directional association, building on the pairwiseΔPmeasure, that are able to quantify association in sequences of varying length and type of representation. Multi-unit measures f