The MoNoPoli database

Autor: Mathilde HUGUIN
Přispěvatelé: Analyse et Traitement Informatique de la Langue Française (ATILF), Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), ANR-17-CE23-0005, Fiammetta Namer, Nabil Hathout, Stéphanie Lignon, Magda Ševčíková, Zdeněk Žabokrtský, ANR-17-CE23-0005,DEMONEXT,Dérivation Morphologique en Extension(2017)
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: Proceedings of the Third International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2021)
Third International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2021)
Third International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2021), Fiammetta Namer; Nabil Hathout; Stéphanie Lignon; Magda Ševčíková; Zdeněk Žabokrtský, Sep 2021, Nancy, France. pp.76-85
Third International Workshop on Resources and Tools for Derivational Morphology (Derimo 2021)
Third International Workshop on Resources and Tools for Derivational Morphology (Derimo 2021), Sep 2021, Nancy, France
HAL
Popis: International audience; In this article we present our method to build a derivational database of French deanthroponyms, which we call MoNoPoli for Mots construits sur Noms propres de personnalités Politiques, ‘complex words based on politician proper names’. MoNoPoli contains 6,545 complex wordsamounting to a total of 55,030 tokens and includes almost only neologistic forms. TheWeb is the only conceivable resource for collecting them: it alone gives massive access to discourse genres that contain neologisms. To feed the database, a program automatically generates the set of allpossible derived words. Generated forms are then used as queries on theWeb. Attested forms are kept with their context. This method provides a potential alternative to collect data that cannot be found elsewhere. Finally, this article describes some of the remarkable results obtained withthe analysis of the deanthroponyms of MoNoPoli.
Databáze: OpenAIRE