Výsledky vyhledávání - "Vesa Siivola"

Morph-based speech recognition and modeling of out-of-vocabulary words across languages

Autor: Ebru Arisoy, Murat Saraclar, Matti Varjokallio, Mikko Kurimo, Mathias Creutz, Andreas Stolcke, Teemu Hirsimäki, Antti Puurula, Vesa Siivola, Janne Pylkkönen

Publikováno v: ACM Transactions on Speech and Language Processing. 5:1-29

We explore the use of morph-based language models in large-vocabulary continuous-speech recognition systems across four so-called morphologically rich languages: Finnish, Estonian, Turkish, and Egyptian Colloquial Arabic. The morphs are subword units

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::80123acba702709396ffc998858c808c
https://doi.org/10.1145/1322391.1322394

Zobrazit plný text záznamu

On Growing and Pruning Kneser–Ney Smoothed $ N$-Gram Models

Autor: Teemu Hirsimäki, Vesa Siivola, Sami Virpioja

Publikováno v: IEEE Transactions on Audio, Speech and Language Processing. 15:1617-1624

N-gram models are the most widely used language models in large vocabulary continuous speech recognition. Since the size of the model grows rapidly with respect to the model order and available training data, many methods have been proposed for pruni

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::3725e7f284a587caec7da58b56aba954
https://doi.org/10.1109/tasl.2007.896666

Zobrazit plný text záznamu

Unlimited vocabulary speech recognition with morph language models applied to Finnish

Autor: Vesa Siivola, Mikko Kurimo, Mathias Creutz, Janne Pylkkönen, Sami Virpioja, Teemu Hirsimäki

Publikováno v: Computer Speech & Language. 20:515-541

In the speech recognition of highly inflecting or compounding languages, the traditional word-based language modeling is problematic. As the number of distinct word forms can grow very large, it becomes difficult to train language models that are bot

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::26b8b5ae393b5bf119bb32aaa8f2f480
https://doi.org/10.1016/j.csl.2005.07.002

Zobrazit plný text záznamu

Language identification for text chats

Autor: Vesa Siivola, Bryan Pellom, Meagan Sills

Publikováno v: Interspeech 2011.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::09f8350e15513e7eeeb20c89088b329f
https://doi.org/10.21437/interspeech.2011-733

Zobrazit plný text záznamu

Morfessor and variKN machine learning tools for speech and language technology

Autor: Mikko Kurimo, Mathias Creutz, Vesa Siivola

Publikováno v: INTERSPEECH

This paper introduces two recent open source software packages developed for unsupervised natural language modeling. The Morfessor program segments words automatically into morpheme-like units without any rule-based morphological analyzers. The VariK

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::348e10f3577624e433ab8bab34cd9363
https://doi.org/10.21437/interspeech.2007-446

Zobrazit plný text záznamu

Unlimited vocabulary speech recognition for agglutinative languages

Autor: Ebru Arisoy, Vesa Siivola, Teemu Hirsimäki, Janne Pylkkönen, Tanel Alumäe, Antti Puurula, Mikko Kurimo, Murat Saraclar

Publikováno v: HLT-NAACL
Scopus-Elsevier

It is practically impossible to build a word-based lexicon for speech recognition in agglutinative languages that would cover all the relevant words. The problem is that words are generally built by concatenating several prefixes and suffixes to the

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5f19d87478a623616206ec6e3ecc8f1f
https://doi.org/10.3115/1220835.1220897

Zobrazit plný text záznamu

Growing an n-gram language model

Autor: Bryan L. Pellom, Vesa Siivola

Publikováno v: INTERSPEECH

Traditionally, when building an n-gram model, we decide the span of the model history, collect the relevant statistics and estimate the model. The model can be pruned down to a smaller size by manipulating the statistics or the estimated model. This

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::6d8b49c5ba32e048d6a85bc6fff4af81
https://doi.org/10.21437/interspeech.2005-24

Zobrazit plný text záznamu

Unlimited vocabulary speech recognition based on morphs discovered in an unsupervised manner

Autor: Vesa Siivola, Teemu Hirsimaki, Mathias Creutz, Mikko Kurimo

Publikováno v: 8th European Conference on Speech Communication and Technology (Eurospeech 2003).

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::0025778e37c0008c762bafacaf2255aa
https://doi.org/10.21437/eurospeech.2003-640

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání