Search results - "Aleksandr Diment"

Sound Event Detection in the DCASE 2017 Challenge

Authors: Bhiksha Raj, Toni Heittola, Emmanuel Vincent, Tuomas Virtanen, Benjamin Elizalde, Aleksandr Diment, Annamaria Mesaros

Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, Institute of Electrical and Electronics Engineers, 2019, 27 (6), pp. 992-1006. ⟨10.1109/TASLP.2019.2907016⟩

Each edition of the challenge on Detection and Classification of Acoustic Scenes and Events (DCASE) contained several tasks involving sound event detection in different setups. DCASE 2017 presented participants with three such tasks, each having spec

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::de2d58b9c9bffbb81cbb99889e68ad4c
https://doi.org/10.1109/taslp.2019.2907016

Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking

Authors: Tuomas Virtanen, Aleksandr Diment, Joonas Nikunen

Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 26:281-295

In this paper we propose a method for separation of moving sound sources. The method is based on first tracking the sources and then estimation of source spectrograms using multichannel non-negative matrix factorization (NMF) and extracting the sourc

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::02a36a515eb2596c6881e3a9da8ec6f1
https://doi.org/10.1109/taslp.2017.2774925

Detection of Typical Pronunciation Errors in Non-native English Speech Using Convolutional Recurrent Neural Networks

Authors: Eemi Fagerlund, Aleksandr Diment, Tuomas Virtanen, Adrian Benfield

Published in: IJCNN

A machine learning method for the automatic detection of pronunciation errors made by non-native speakers of English is proposed. It consists of training word-specific binary classifiers on a collected dataset of isolated words with possible pronunci

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::ea082dfe29c25238982ffb7fe61c1230
https://doi.org/10.1109/ijcnn.2019.8851963

Binaural rendering of microphone array captures based on source separation

Authors: Aleksandr Diment, Miikka Vilermo, Tuomas Virtanen, Joonas Nikunen

Published in: Speech Communication. 76:157-169

A method for binaural rendering of sound scene recordings is proposed. Source signals and their directions of arrival are estimated using a microphone array. A low-rank NMF model for separation of sound sources is used. Speech intelligibility test with ov

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::c00525da19e27988241492c0ff577fb6
https://doi.org/10.1016/j.specom.2015.09.005

Transfer learning of weakly labelled audio

Authors: Aleksandr Diment, Tuomas Virtanen

Published in: WASPAA

Many machine learning tasks have been shown solvable with impressive levels of success given large amounts of training data and computational power. For the problems which lack data sufficient to achieve high performance, methods for transfer learnin

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::82f2c83a078eeff21ffa03669faf1124
https://doi.org/10.1109/waspaa.2017.8169984

A convolutional neural network approach for acoustic scene classification

Authors: Stefano Squartini, Tuomas Virtanen, Aleksandr Diment, Michele Valenti, Giambattista Parascandolo

Published in: IJCNN

This paper presents a novel application of convolutional neural networks (CNNs) for the task of acoustic scene classification (ASC). We here propose the use of a CNN trained to classify short sequences of audio, represented by their log-mel spectrogr

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::98cba9fa9df572ed113c6ea5ed1eebb8
https://trepo.tuni.fi/handle/10024/129205

Noise-robust detection of whispering in telephone calls using deep neural networks

Authors: Aleksandr Diment, Mikko Parviainen, Alex Glasman, Tuomas Virtanen, Roman Zelov

Published in: EUSIPCO

Detection of whispered speech in the presence of high levels of background noise has applications in fraudulent behaviour recognition. For instance, it can serve as an indicator of possible insider trading. We propose a deep neural network (DNN)-base

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::cdc213283a7991d602475d5f124d86e2
https://doi.org/10.1109/eusipco.2016.7760661

Archetypal analysis for audio dictionary learning

Authors: Tuomas Virtanen, Aleksandr Diment

Published in: WASPAA

This paper proposes dictionary learning with archetypes for audio processing. Archetypes refer to so-called pure types, which are a combination of a few data points and which can be combined to obtain a data point. The concept has been found useful i

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::15eb24b6a5d34ec009bea4f7467ddaef
https://doi.org/10.1109/waspaa.2015.7336903

Group Delay Function from All-Pole Models for Musical Instrument Recognition

Authors: Toni Heittola, Aleksandr Diment, Padmanabhan Rajan, Tuomas Virtanen

Published in: Lecture Notes in Computer Science (ISBN: 9783319129754); CMMR

In this work, the feature based on the group delay function from all-pole models (APGD) is proposed for pitched musical instrument recognition. Conventionally, the spectrum-related features take into account merely the magnitude information, whereas

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::2027ad1a1137e2bf5ddb9bfdc0e48520
https://doi.org/10.1007/978-3-319-12976-1_37
