A corpus-based study of the automatic extraction and validation ofV-NItalian oral academic collocations
Autor: | Diana Peppoloni |
---|---|
Rok vydání: | 2018 |
Předmět: |
050101 languages & linguistics
Linguistics and Language Computer science business.industry 05 social sciences 050301 education Context (language use) computer.software_genre Corpus based 0501 psychology and cognitive sciences Degree of association Artificial intelligence business 0503 education computer Natural language processing |
Zdroj: | Lingvisticæ Investigationes. International Journal of Linguistics and Language Resources. 41:240-268 |
ISSN: | 1569-9927 0378-4169 |
DOI: | 10.1075/li.00022.pep |
Popis: | This study describes the outcomes of a POS-based method for the automatic extraction ofV-NItalian oral academic collocations from an annotated corpus. A frequency statistical measure is applied to automatically extract the collocations from the POS-tagged corpus. The results reveal that frequency alone is not sufficient to measure the degree of association that connects the two elements of a word pair. In order to detect the real-attested Italian collocations, the data has been further evaluated by 50 Italian native speakers. The results indicate that these combinations are tightly linked to their context of usage. Thus, native speakers should be exposed to these phrasal contexts to activate their mechanisms of explicit reflection and assess the degree of collocativity of these combinations. |
Databáze: | OpenAIRE |
Externí odkaz: |