Výsledky vyhledávání - "Maxim Korenevsky"

Knihovna AV ČR, v. v. i.

Odhlásit
Přihlášení
Jazyk
- English
- Čeština
Instituce

Pokročilé vyhledávání

Zahrnout EIZ

Zachovat současné nastavení filtrů

EXPAND:"fulltext"

Domovská stránka
Vyhledávání: "Maxim Korenevsky"
Navrhnout nákup titulu

Zobrazeno 1 - 10 of 21 pro vyhledávání: '"Maxim Korenevsky"'

Řazení

Vybrat vše | Vybrané výsledky:

Vybrat výsledek číslo 1 1

LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring

Autor: Yuri Y. Khokhlov, Andrei Andrusenko, Maxim Korenevsky, Ivan Medennikov, Mariya Korenevskaya, Aleksandr Laptev, Anton Mitrofanov, Aleksei Romanenko, Ivan Podluzhny, Aleksei Ilin

Neural network-based language models are commonly used in rescoring approaches to improve the quality of modern automatic speech recognition (ASR) systems. Most of the existing methods are computationally expensive since they use autoregressive langu

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5cf51c8c93b6c2f3481391c44d0dce6a
http://arxiv.org/abs/2104.02526

Zobrazit plný text záznamu

Vybrat výsledek číslo 2 2

The STC System for the CHiME-6 Challenge

Autor: Ivan Medennikov, Maxim Korenevsky, Tatiana Prisyach, Yuri Khokhlov, Mariya Korenevskaya, Ivan Sorokin, Tatiana Timofeeva, Anton Mitrofanov, Andrei Andrusenko, Ivan Podluzhny, Aleksandr Laptev, Aleksei Romanenko

Publikováno v: 6th International Workshop on Speech Processing in Everyday Environments (CHiME 2020).

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::eb8a120bfddc8342d115441a79caf076
https://doi.org/10.21437/chime.2020-9

Zobrazit plný text záznamu

Vybrat výsledek číslo 3 3

Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario

Autor: Ivan Podluzhny, Ivan Sorokin, Yuri Y. Khokhlov, Andrei Andrusenko, Tatiana Prisyach, Mariya Korenevskaya, Aleksei Romanenko, Aleksandr Laptev, Maxim Korenevsky, Ivan Medennikov, Tatiana Timofeeva, Anton Mitrofanov

Publikováno v: INTERSPEECH

Speaker diarization for real-life scenarios is an extremely challenging problem. Widely used clustering-based diarization approaches perform rather poorly in such conditions, mainly due to the limited ability to handle overlapping speech. We propose

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4ae9f336565297623daccc3ef226a606

Zobrazit plný text záznamu

Vybrat výsledek číslo 4 4

Phase term modeling for enhanced feature-space VTS

Autor: Maxim Korenevsky

Publikováno v: Speech Communication. 89:84-91

HIghlightsVector Taylor Series (VTS) is a popular approach in robust speech recognition.Speech distortion model taking phase term into account is more accurate.Phase term can be modeled as a Gaussian random vector.Phase term modeling improves speech

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::435cbc1085a47d2c38e66121bb2f4541
https://doi.org/10.1016/j.specom.2017.03.001

Zobrazit plný text záznamu

Vybrat výsledek číslo 5 5

The STC System for the CHiME 2018 Challenge

Autor: Ivan Sorokin, Ivan Podluzhny, Aleksei Romanenko, Maxim Korenevsky, Andrei Andrusenko, Tatiana Prisyach, Ivan Medennikov, Tatiana Timofeeva, Anton Mitrofanov, Yuri Y. Khokhlov, Aleksandr Laptev, Mariya Korenevskaya

Publikováno v: 5th International Workshop on Speech Processing in Everyday Environments (CHiME 2018).

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::b24936c98df2867dc5b6f2a3f2875771
https://doi.org/10.21437/chime.2018-1

Zobrazit plný text záznamu

Vybrat výsledek číslo 6 6

A Free Synthetic Corpus for Speaker Diarization Research

Autor: Maxim Korenevsky, Nico Axtmann, David Suendermann-Oeft, Najmeh Sadoughi, Michael Brenndoerfer, Amanda L. Robinson, Mark Miller, Erik Edwards, Greg P. Finley

Publikováno v: Speech and Computer ISBN: 9783319995786
SPECOM

A synthetic corpus of dialogs was constructed from the LibriSpeech corpus, and is made freely available for diarization research. It includes over 90 h of training data, and over 9 h each of development and test data. Both 2-person and 3-person dialo

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::e5f5be605c2eef975348aa12af96f39a
https://doi.org/10.1007/978-3-319-99579-3_13

Zobrazit plný text záznamu

Vybrat výsledek číslo 7 7

Speaker Diarization: A Top-Down Approach Using Syllabic Phonology

Autor: David Suendermann-Oeft, Amanda L. Robinson, Mark Miller, Michael Brenndoerfer, Erik Edwards, Nico Axtmann, Greg P. Finley, Maxim Korenevsky, Najmeh Sadoughi

Publikováno v: Speech and Computer ISBN: 9783319995786
SPECOM

A top-down approach to speaker diarization is developed using a modified Baum-Welch algorithm. The HMM states combine phonemes according to structural positions under syllabic phonological theory. By nature of the structural phonology, there are at m

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::75523d0ff686e577ecc913844294cbf2
https://doi.org/10.1007/978-3-319-99579-3_14

Zobrazit plný text záznamu

Vybrat výsledek číslo 8 8

Exploring End-to-End Techniques for Low-Resource Speech Recognition

Autor: Maxim Korenevsky, Ivan Medennikov, Vladimir Bataev, Alexander Zatvornitskiy

Publikováno v: Speech and Computer ISBN: 9783319995786
SPECOM

In this work we present simple grapheme-based system for low-resource speech recognition using Babel data for Turkish spontaneous speech (80 h). We have investigated different neural network architectures performance, including fully-convolutional, r

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::bb8b5b95c24a6768c3f594b2ab4adcec
https://doi.org/10.1007/978-3-319-99579-3_4

Zobrazit plný text záznamu

Vybrat výsledek číslo 9 9

Semi-Supervised Acoustic Model Retraining for Medical ASR

Autor: Nico Axtmann, Maxim Korenevsky, Michael Brenndoerfer, Najmeh Sadoughi, David Suendermann-Oeft, Erik Edwards, Greg P. Finley, Wael Salloum, Amanda L. Robinson, Mark Miller

Publikováno v: Speech and Computer ISBN: 9783319995786
SPECOM

Training models for speech recognition usually requires accurate word-level transcription of available speech data. For the domain of medical dictations, it is common to have “semi-literal” transcripts available: large numbers of speech files alo

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::b645588d32900fcea8dab1602591db28
https://doi.org/10.1007/978-3-319-99579-3_19

Zobrazit plný text záznamu

Vybrat výsledek číslo 10 10

Detecting Section Boundaries in Medical Dictations: Toward Real-Time Conversion of Medical Dictations to Clinical Reports

Autor: Amanda L. Robinson, Najmeh Sadoughi, Maxim Korenevsky, David Suendermann-Oeft, Mark Miller, Nico Axtmann, Greg P. Finley, Michael Brenndoerfer, Erik Edwards

Publikováno v: Speech and Computer ISBN: 9783319995786
SPECOM

We present a section boundary detection framework specifically for clinical dictations. Detection is cast as a semi-supervised binary tagging problem and solved using a neural network model composed of a stack of embeddings, unidirectional long-short

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::0e4723470686e61838151e539fd8d800
https://doi.org/10.1007/978-3-319-99579-3_58

Zobrazit plný text záznamu

Vybrat vše | Vybrané výsledky:

1
2
3
Další »
[3]

Vyhledávací nástroje:

RSS
Poslat e-mailem

Upřesnit hledání

Omezení vyhledávání

Plný text Recenzováno Digitální knihovna AV ČR

Zdroje

Pouze tištěné dokumenty

Zahrnout EIZ

14 speech recognition
11 02 engineering and technology
11 0202 electrical engineering, electronic engineering, information engineering
8 020206 networking & telecommunications
8 03 medical and health sciences
7 0305 other medical science
7 030507 speech-language pathology & audiology
5 artificial neural network
4 acoustic model
4 artificial intelligence
4 business
4 business.industry
4 computer
4 computer.software_genre
4 decoding methods
3 01 natural sciences
3 020201 artificial intelligence & image processing
3 dictation
3 natural language processing
3 speaker diarisation
3 symbols
3 symbols.namesake
3 voice activity detection
2 algorithm
2 audio and speech processing (eess.as)
2 bottleneck
2 cluster analysis
2 computation and language (cs.cl)
2 computer science - computation and language
2 deep neural networks
2 electrical engineering and systems science - audio and speech processing
2 feature (machine learning)
2 feature vector
2 fos: computer and information sciences
2 fos: electrical engineering, electronic engineering, information engineering
2 gaussian
2 hidden markov model
2 keyword search
2 language model
2 overfitting
2 quality (physics)
2 reduction (complexity)
2 speaker adaptation
2 spontaneous speech
2 taylor series
2 term (time)
2 test set
1 0103 physical sciences
1 010301 acoustics
1 0104 chemical sciences

13 springer international publishing
4 isca
1 arxiv
1 elsevier bv
1 ieee

1 2015 ieee workshop on automatic speech recognition and understanding (asru)
1 interspeech 2015
1 interspeech 2017
1 speech communication

21 OpenAIRE

Od:

do:

Možnosti vyhledávání

Tematická mapa
Historie vyhledávání
Pokročilé vyhledávání

Objevte více

Abecední procházení

Hledáte pomoc?

Tipy pro vyhledávání

načítá se......