Zobrazeno 1 - 6
of 6
pro vyhledávání: '"Asma Mekki"'
Publikováno v:
Jordanian Journal of Computers and Information Technology, Vol 8, Iss 4, Pp 370-387 (2022)
In written text, orthographic noise is a common concern for NLP, especially when operating social network comments and raw documents. This is mainly due to its orthographic conventions and morphological ambiguity. We propose to automatically normaliz
Externí odkaz:
https://doaj.org/article/7c78d7cd5c5641709c5e96085e0da766
Publikováno v:
ACM Transactions on Asian and Low-Resource Language Information Processing.
Tokenization represents the way of segmenting a piece of text into smaller units called tokens. Since Arabic is an agglutinating language by nature, this treatment becomes a crucial preprocessing step for many Natural Language Processing (NLP) applic
The aim of this research is to better understand public perceptions of COVID-19 pandemic patterns and to identify key themes of concern expressed by Tunisian dialect social media users throughout the epidemic. We collected around 23K comments written
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::bb526fe5fe460082fc79f22bfb4f44dc
https://doi.org/10.21203/rs.3.rs-2321298/v1
https://doi.org/10.21203/rs.3.rs-2321298/v1
Publikováno v:
Language Resources and Evaluation. 56:357-385
Sentence boundary detection (SBD) is an essential step for a very large number of natural language processing applications such as parsing, information retrieval, automatic summarization, machine translation, etc. In this paper, we tackle the problem
Publikováno v:
Lecture Notes in Computer Science ISBN: 9783031165634
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::0560771e4aebce9831d8edabc8c0706b
https://doi.org/10.1007/978-3-031-16564-1_5
https://doi.org/10.1007/978-3-031-16564-1_5
Publikováno v:
2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA).