A corpus-based study of the automatic extraction and validation ofV-NItalian oral academic collocations

Autor:	Diana Peppoloni
Rok vydání:	2018
Předmět:	050101 languages & linguistics Linguistics and Language Computer science business.industry 05 social sciences 050301 education Context (language use) computer.software_genre Corpus based 0501 psychology and cognitive sciences Degree of association Artificial intelligence business 0503 education computer Natural language processing
Zdroj:	Lingvisticæ Investigationes. International Journal of Linguistics and Language Resources. 41:240-268
ISSN:	1569-9927 0378-4169
DOI:	10.1075/li.00022.pep
Popis:	This study describes the outcomes of a POS-based method for the automatic extraction ofV-NItalian oral academic collocations from an annotated corpus. A frequency statistical measure is applied to automatically extract the collocations from the POS-tagged corpus. The results reveal that frequency alone is not sufficient to measure the degree of association that connects the two elements of a word pair. In order to detect the real-attested Italian collocations, the data has been further evaluated by 50 Italian native speakers. The results indicate that these combinations are tightly linked to their context of usage. Thus, native speakers should be exposed to these phrasal contexts to activate their mechanisms of explicit reflection and assess the degree of collocativity of these combinations.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::385f784fe34aa80465dd08770ea5920c https://doi.org/10.1075/li.00022.pep Zobrazit plný text záznamu