Showing 1 - 10 of 45 for search: '"Mark Hepple"'
Published in:
ACM Transactions on Asian and Low-Resource Language Information Processing. 18:1-26
Part-of-speech (POS) tagging is a well-established technology for most Western European languages and a few other world languages, but it has not been evaluated on Igbo, an agglutinative African language. This article presents POS tagging experiments
Published in:
Web of Science
Although researchers and practitioners are pushing the boundaries and enhancing the capacities of NLP tools and methods, works on African languages are lagging. A lot of focus on well resourced languages such as English, Japanese, German, French, Rus
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::92e137580d7de33084aa37e1fb9d0361
http://arxiv.org/abs/2004.00648
Published in:
ACM Transactions on Asian and Low-Resource Language Information Processing. 17:1-23
Igbo, an African language with around 32 million speakers worldwide, is one of the many languages having few or none of the language processing resources needed for advanced language technology applications. In this article, we describe the approach
Published in:
NAACL-HLT (Student Research Workshop)
Igbo is a low-resource language spoken by approximately 30 million people worldwide. It is the native language of the Igbo people of south-eastern Nigeria. In Igbo language, diacritics - orthographic and tonal - play a huge role in the distinguishing
Published in:
Text, Speech, and Dialogue ISBN: 9783030007935
TSD
NLP research on low resource African languages is often impeded by the unavailability of basic resources: tools, techniques, annotated corpora, and datasets. Besides the lack of funding for the manual development of these resources, building from scr
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::7ba5deff03c61f50aa4c1948bd06f45c
https://doi.org/10.1007/978-3-030-00794-2_31
Published in:
Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications.
Properly written texts in Igbo, a low resource African language, are rich in both orthographic and tonal diacritics. Diacritics are essential in capturing the distinctions in pronunciation and meaning of words, as well as in lexical disambiguation. U
Published in:
Lecture Notes in Computer Science ISBN: 9783319566078
ECIR
Automatic summarization of reader comments in on-line news is a challenging but clearly useful task. Work to date has produced extractive summaries using well-known techniques from other areas of NLP. But do users really want these, and do they suppo
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::74c19d786be9171fb794e8ab1f59df95
https://doi.org/10.1007/978-3-319-56608-5_77
Authors:
Emma Barker, Emina Kurtic, Mark Hepple, Monica Lestari Paramita, Ahmet Aker, Robert Gaizauskas, Adam Funk
Published in:
INLG
Scopus-Elsevier
We present a supervised approach to automatically labelling topic clusters of reader comments to online news. We use a feature set that includes both features capturing properties local to the cluster and features that capture aspects from th
Authors:
Mark Hepple, Ikechukwu E. Onyenwe
Published in:
Text, Speech, and Dialogue ISBN: 9783319455099
TSD
The effective handling of previously unseen words is an important factor in the performance of part-of-speech taggers. Some trainable POS taggers use suffix (sometimes prefix) strings as cues in handling unknown words (in effect serving as a proxy fo
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::d5eec783a0893096b587dd8b5cf4c30b
https://doi.org/10.1007/978-3-319-45510-5_24
Published in:
Text, Speech, and Dialogue ISBN: 9783319455099
TSD
Igbo is a low-resource African language with orthographic and tonal diacritics, which capture distinctions between words that are important for both meaning and pronunciation, and hence of potential value for a range of language processing tasks. Suc
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::ec862f8c0d3b641cf714b8ccc29e303c
https://doi.org/10.1007/978-3-319-45510-5_23