Search results - "Fransen, Theodorus"

Report

MaCmS: Magahi Code-mixed Dataset for Sentiment Analysis

Author: Rani, Priya, Negi, Gaurav, Fransen, Theodorus, McCrae, John P.

The present paper introduces new sentiment data, MaCMS, for Magahi-Hindi-English (MHE) code-mixed language, where Magahi is a less-resourced minority language. This dataset is the first Magahi-Hindi-English code-mixed dataset for sentiment analysis t

External link: http://arxiv.org/abs/2403.04639

Report

Weakly-supervised Deep Cognate Detection Framework for Low-Resourced Languages Using Morphological Knowledge of Closely-Related Languages

Author: Goswami, Koustava, Rani, Priya, Fransen, Theodorus, McCrae, John P.

Exploiting cognates for transfer learning in under-resourced languages is an exciting opportunity for language understanding tasks, including unsupervised machine translation, named entity recognition and information retrieval. Previous approaches ma

External link: http://arxiv.org/abs/2311.05155

Report

Findings of the LoResMT 2021 Shared Task on COVID and Sign Language for Low-resource Languages

Author: Ojha, Atul Kr., Liu, Chao-Hong, Kann, Katharina, Ortega, John, Shatam, Sheetal, Fransen, Theodorus

We present the findings of the LoResMT 2021 shared task which focuses on machine translation (MT) of COVID-19 data for both low-resource spoken and sign languages. The organization of this task was conducted as part of the fourth workshop on technolo

External link: http://arxiv.org/abs/2108.06598

Report

ULD@NUIG at SemEval-2020 Task 9: Generative Morphemes with an Attention Model for Sentiment Analysis in Code-Mixed Text

Author: Goswami, Koustava, Rani, Priya, Chakravarthi, Bharathi Raja, Fransen, Theodorus, McCrae, John P.

Code mixing is a common phenomenon in multilingual societies where people switch from one language to another for various reasons. Recent advances in public communication over different social media sites have led to an increase in the frequency of co

External link: http://arxiv.org/abs/2008.01545

A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment

Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual diction

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0494fc2941f2fa3a520a46349d894974
https://doi.org/10.5281/zenodo.3842647

Automatic morphological analysis and interlinking of historical Irish cognate verb forms

Author: Fransen, Theodorus

The main aim of the author’s research project is to use computational approaches to gain more insight into the historical development of Irish verbs. One of the objectives is to investigate how a link between the electronic Dictionary of the Irish

External link: https://explore.openaire.eu/search/publication?articleId=od______1513::e478010ba9f81b3fbe514191f7348fd3
http://hdl.handle.net/10379/16235

A Comparative Study of Different State-of-the-Art Hate Speech Detection Methods in Hindi-English Code-Mixed Data

Author: Rani, Priya, Suryawanshi, Shardul, Goswami, Koustava, Chakravarthi, Bharathi Raja, Fransen, Theodorus, McCrae, John Philip

Published in: Second Workshop on Trolling, Aggression and Cyberbullying at LREC 2020

Hate speech detection in social media communication has become one of the primary concerns to avoid conflicts and curb undesired activities. In an environment where multilingual speakers switch among multiple languages, hate speech detection becomes

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::68d1a9a0db73dab1fd7932ed2e2917b4

Cardamom: Comparative Deep Models for Minority and Historical Languages

Author: McCrae, John Philip, Fransen, Theodorus

This paper gives an overview of the Cardamom project, which aims to close the resource gap for minority and under-resourced languages by means of deep-learning-based natural language processing (NLP) and exploiting similarities of closely-related lan

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1f32e25e261e0b57c2de2d9490e7843b

Developing automatic speech recognition for Scottish Gaelic

Author: Evans, Lucy, Lamb, William, Sinclair, Mark, Alex, Beatrice

Published in: Evans, L, Lamb, W, Sinclair, M & Alex, B 2022, Developing automatic speech recognition for Scottish Gaelic. in T Fransen, W Lamb & D Prys (eds), Proceedings of the 4th Celtic Language Technology Workshop at LREC 2022 (CLTW 4). pp. 110-120, The 4th Celtic Language Technology Workshop at LREC 2022, Marseille, France, 20/06/22. <http://www.lrec-conf.org/proceedings/lrec2022/workshops/CLTW4/index.html>

This paper discusses our efforts to develop a full automatic speech recognition (ASR) system for Scottish Gaelic, starting from a point of limited resource. Building ASR technology is important for documenting and revitalising endangered languages; it

External link: https://explore.openaire.eu/search/publication?articleId=od______3094::7f7520b207204f1580b8f78e4939cccb
https://hdl.handle.net/20.500.11820/f1409d74-e8e6-4c59-99e2-9f3f0dd80b1e

Handwriting recognition for Scottish Gaelic

Author: Sinclair, Mark, Lamb, William, Alex, Beatrice

Published in: Sinclair, M, Lamb, W & Alex, B 2022, Handwriting recognition for Scottish Gaelic. in T Fransen, W Lamb & D Prys (eds), Proceedings of the 4th Celtic Language Technology Workshop at LREC 2022 (CLTW 4). pp. 60-70, The 4th Celtic Language Technology Workshop at LREC 2022, Marseille, France, 20/06/22. <http://www.lrec-conf.org/proceedings/lrec2022/workshops/CLTW4/index.html>

Like most other minority languages, Scottish Gaelic has limited tools and resources available for Natural Language Processing research and applications. These limitations restrict the potential of the language to participate in modern speech technolo

External link: https://explore.openaire.eu/search/publication?articleId=od______3094::a342dbf5ff9e61b0c8cb2722bdc30f21
https://www.pure.ed.ac.uk/ws/files/281143539/SinclairEtal2022HandwritingRecognitionForScottishGaelic.pdf
