Zobrazeno 1 - 10
of 556
pro vyhledávání: '"text normalization"'
Publikováno v:
Journal of Computing Research and Innovation, Vol 9, Iss 2 (2024)
TF-IDF is a technique used to extract features in the field of text classification. The TF-IDF approach extracts feature by considering the frequencies of terms and their inverse document frequencies. The performance of various feature extraction met
Externí odkaz:
https://doaj.org/article/4da686b7aa224939b878fb711dec7632
Autor:
Salvatore Spina
Publikováno v:
Umanistica Digitale, Iss 16, Pp 125-140 (2023)
This article examines the impact of Artificial Intelligence on the archival heritage digitization processes, specifically regarding the manuscripts’ automatic transcription, their correction, and normalization. It highlights how digitality has comp
Externí odkaz:
https://doaj.org/article/dfa42036af714afd9b925ee2f964d4c8
Publikováno v:
PeerJ Computer Science, Vol 10, p e1704 (2024)
In text applications, pre-processing is deemed as a significant parameter to enhance the outcomes of natural language processing (NLP) chores. Text normalization and tokenization are two pivotal procedures of text pre-processing that cannot be overst
Externí odkaz:
https://doaj.org/article/eaacd770376e48d9bc806d191ef6c55a
Publikováno v:
Journal of King Saud University: Computer and Information Sciences, Vol 36, Iss 1, Pp 101807- (2024)
Text normalization (TN) for text-to-speech (TTS) synthesizer is the transformation of non-standard words like times, ordinal numbers, equations, ranges, dates, etc. into standard words that have similarities with their pronunciations. An essential pa
Externí odkaz:
https://doaj.org/article/83dd3ad0caea4deb8cdf0735f1bff698
Publikováno v:
IEEE Access, Vol 11, Pp 72704-72716 (2023)
Neural-based sequence-to-sequence methods (Seq2Seq) have proven to be highly effective for Context-sensitive Thai spelling correction. However, they also inherit the drawbacks of Seq2Seq, such as a fixed vocabulary and large data requirements. Howeve
Externí odkaz:
https://doaj.org/article/3b37c1fb1f67481a83321c1a4186a706
Autor:
Seniz Demir, Berkay Topcu
Publikováno v:
Engineering Science and Technology, an International Journal, Vol 35, Iss , Pp 101192- (2022)
User generated texts on the web are freely-available and lucrative sources of data for language technology researchers. Unfortunately, these texts are often dominated by informal writing styles and the language used in user generated content poses pr
Externí odkaz:
https://doaj.org/article/09f93496a16e428bbe35dbccf275bef5
Autor:
Kaur, Jagroop, Singh, Jaswinder
Publikováno v:
International Journal of Intelligent Computing and Cybernetics, 2020, Vol. 13, Issue 4, pp. 407-435.
Externí odkaz:
http://www.emeraldinsight.com/doi/10.1108/IJICC-08-2020-0096
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Publikováno v:
IEEE Access, Vol 8, Pp 36202-36209 (2020)
This paper proposes a deep learning model based on a recurrent neural network (RNN) to solve the problem of text normalization for speech synthesis. Traditional rule-based models cannot take advantage of contextual information and do not handle text
Externí odkaz:
https://doaj.org/article/33e68ea21e9849bea51303d140274ab4
Publikováno v:
IEEE Access, Vol 8, Pp 133403-133419 (2020)
Text correction systems (e.g., spell checkers) have been used to improve the quality of computerized text by detecting and correcting errors. However, the task of performing spelling correction and word normalization (text correction) for Thai social
Externí odkaz:
https://doaj.org/article/ad77cc203d494aa7a9164e9a59c7419d