Výsledky vyhledávání - "Constantine Lignos"

Elektronická kniha

Autor: Constantine Lignos, Laurel MacKenzie, Meredith Tamminga

This volume explores how the patterning of surface variation can shed light on the grammatical representation of variable phenomena. The authors explore variation in several domains, addressing intra- and inter-dialectal patterns, using diverse sourc

Zobrazit plný text záznamu

ParaNames: A Massively Multilingual Entity Name Corpus

Autor: Jonne Sälevä, Constantine Lignos

We introduce ParaNames, a multilingual parallel name resource consisting of 118 million names spanning across 400 languages. Names are provided for 13.6 million entities which are mapped to standardized entity types (PER/LOC/ORG). Using Wikidata as a

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::85a1c9c6231adb4074853c834d3ce392
http://arxiv.org/abs/2202.14035

Zobrazit plný text záznamu

Detecting Unassimilated Borrowings in Spanish: An Annotated Corpus and Approaches to Modeling

Autor: Elena Álvarez-Mellado, Constantine Lignos

This work presents a new resource for borrowing identification and analyzes the performance and errors of several models on this task. We introduce a new annotated corpus of Spanish newswire rich in unassimilated lexical borrowings -- words from one

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::66dda1c1f1832e11b7e4635a30c716e6

Zobrazit plný text záznamu

MasakhaNER: Named entity recognition for African languages

Autor: Julia Kreutzer, Ayodele Awokoya, Ignatius Ezeani, Rubungo Andre Niyongabo, Happy Buzaaba, Adewale Akinfaderin, Samuel Oyerinde, Stephen Mayhew, Emmanuel Anebi, Mofetoluwa Adeyemi, Kelechi Ogueji, Abdoulaye Diallo, Seid Muhie Yimam, Jade Abbott, Joyce Nakatumba-Nabende, Victor Akinode, Blessing Sibanda, Catherine Gitau, Chester Palen-Michel, Shamsuddeen Hassan Muhammad, Degaga Wolde, Graham Neubig, Tendai Marengereke, Paul Rayson, Derguene Mbaye, Eric Peter Wairagala, Daniel D'souza, Tosin P. Adewumi, Jonathan Mukiibi, Chris Chinenye Emezue, David Ifeoluwa Adelani, Shruti Rijhwani, Iroro Orife, Verrah Otiende, Maurice Katusiime, Yvonne Wambui, Dibora Gebreyohannes, Kelechi Nwaike, Salomey Osei, Chiamaka Chukwuneke, Henok Tilaye, Deborah Nabagereka, Thierno Ibrahima Diop, Orevaoghene Ahia, Jesujoba O. Alabi, Sebastian Ruder, Davis David, Mouhamadane Mboup, Samba Ngom, Tajuddeen R. Gwadabe, Bonaventure F. P. Dossou, Temilola Oloyede, Perez Ogayo, Clemencia Siro, Gerald Muriuki, Aremu Anuoluwapo, Nkiruka Odu, Tobius Saul Bateesa, Abdoulaye Faye, Israel Abebe Azime, Constantine Lignos

Publikováno v: Transactions of the Association for Computational Linguistics
Transactions of the Association for Computational Linguistics, The MIT Press, 2021, ⟨10.1162/tacl⟩
Transactions of the Association for Computational Linguistics, 2021, ⟨10.1162/tacl⟩

We take a step towards addressing the under-representation of the African continent in NLP research by creating the first large publicly available high-quality dataset for named entity recognition (NER) in ten African languages, bringing together a v

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d1930ae0735e2b37961957cb5eb49a8e
https://hal.inria.fr/hal-03350962/file/adelani_TACL2021.pdf

Zobrazit plný text záznamu

Macro-Average: Rare Types Are Important Too

Autor: Jonathan May, Thamme Gowda, Weiqiu You, Constantine Lignos

Publikováno v: NAACL-HLT

While traditional corpus-level evaluation metrics for machine translation (MT) correlate well with fluency, they struggle to reflect adequacy. Model-based MT metrics trained on segment-level human judgments have emerged as an attractive replacement d

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ef5c09b95e6dbfdf1060f7aa9d077525
https://aclanthology.org/2021.naacl-main.90

Zobrazit plný text záznamu

TMR: Evaluating NER Recall on Tough Mentions

Autor: Jingxuan Tu, Constantine Lignos

Publikováno v: EACL (Student Research Workshop)

We propose the Tough Mentions Recall (TMR) metrics to supplement traditional named entity recognition (NER) evaluation by examining recall on specific subsets of "tough" mentions: unseen mentions, those whose tokens or token/type combination were not

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0a9801199a50234fed8bbe34108e32f6
https://doi.org/10.18653/v1/2021.eacl-srw.21

Zobrazit plný text záznamu

The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation

Autor: Constantine Lignos, Jonne Sälevä

Publikováno v: EACL (Student Research Workshop)

This paper evaluates the performance of several modern subword segmentation methods in a low-resource neural machine translation setting. We compare segmentations produced by applying BPE at the token or sentence level with morphologically-based segm

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2ecfd0ffd5431135228b2919114d39fd
https://doi.org/10.18653/v1/2021.eacl-srw.22

Zobrazit plný text záznamu

If You Build Your Own NER Scorer, Non-replicable Results Will Come

Autor: Marjan Kamyab, Constantine Lignos

Publikováno v: Insights

We attempt to replicate a named entity recognition (NER) model implemented in a popular toolkit and discover that a critical barrier to doing so is the inconsistent evaluation of improper label sequences. We define these sequences and examine how two

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::71d736330f3c6ec2ff5a0ebcbfa466c3
https://doi.org/10.18653/v1/2020.insights-1.15

Zobrazit plný text záznamu

Combining rule-based and statistical mechanisms for low-resource named entity recognition

Autor: Ryan Gabbard, Constantine Lignos, Marjorie Freedman, Jay DeYoung, Ralph Weischedel

Publikováno v: Machine Translation. 32:31-43

We describe a multifaceted approach to named entity recognition that can be deployed with minimal data resources and a handful of hours of non-expert annotation. We describe how this approach was applied in the 2016 LoReHLT evaluation and demonstrate

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::957f2b3ccae9553520e0f825c07a12d5
https://doi.org/10.1007/s10590-017-9208-0

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání