Dealing with specialised co-text in text mining: Verbal terminological collocations
Author: | Margarida Ramos, Rute Costa, Christophe Roche |
---|---|
Contributors: | Centro de Linguística da Universidade Nova de Lisboa (CLUNL), Faculty of Social and Human Sciences (NOVAFCSH), Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade Nova de Lisboa = NOVA University Lisbon (NOVA), Laboratoire d'Informatique, Systèmes, Traitement de l'Information et de la Connaissance (LISTIC), Université Savoie Mont Blanc (USMB [Université de Savoie] [Université de Chambéry]), Research financed by Portuguese National Funding through the FCT – Fundação para a Ciência e Tecnologia as part of the project Centro de Linguística da Universidade NOVA de Lisboa – UID/LIN/03213/2020 and the PhD program in Linguistics KRUse - Knowledge, Representation & Use, CLUNL - Faculty of Social Sciences and Humanities, Universidade NOVA de Lisboa - PB/BD/113972/2015., Costa, Rute, Humbley, John, Kockaert, Hendrik, Roche, Christophe, Brun, C., Roche, Ramos, Margarida, Brun, C., Roche, C. |
Language: | English |
Year of publication: | 2019 |
Subject: |
Text mining
domain-specific corpus; Verbal terminological collocations; Terminology; [SHS.INFO] Humanities and Social Sciences/Library and information sciences; [SHS.LANGUE] Humanities and Social Sciences/Linguistics |
Source: | Terminologica TOTh 2019 | Terminologie & Ontologie : Théories et Applications TOTh 2019 | Terminologie & Ontologie : Théories et Applications, Costa, Rute; Humbley, John; Kockaert, Hendrik; Roche, Christophe, Jun 2019, Le Bourget du Lac, France HAL |
Description: | International audience; The aim of this paper is to organise lexical and conceptual knowledge by analysing a domain-specific corpus. The domain we focus on is the cork industry. Through the analysis of the corpus, we have found that some common verbs in Portuguese, such as “choose” and “separate”, acquire a specialised value in the field under study. This was the starting point for the analysis of the terminological collocations in which verbs are the core constituents. For the analysis of these verbal terminological collocations, we used natural language processing techniques, building simple and complex CQL structures with RegEx. The outcome of this analysis allows us to introduce a distinction between polylexical terms and terminological collocations. The terminological collocation is highly relevant in specialised discourse but, unlike terms, it is defined not by conceptual criteria but by morphological and syntactic criteria. |
Database: | OpenAIRE |
External link: |