Zobrazeno 1 - 10
of 83
pro vyhledávání: '"Pierre Dumouchel"'
Autor:
Edward Hill, David Han, Pierre Dumouchel, Najim Dehak, Thomas Quatieri, Charles Moehs, Marlene Oscar-Berman, John Giordano, Thomas Simpatico, Debmalya Barh, Kenneth Blum
Publikováno v:
PLoS ONE, Vol 8, Iss 7, p e69043 (2013)
Addictions to illicit drugs are among the nation's most critical public health and societal problems. The current opioid prescription epidemic and the need for buprenorphine/naloxone (Suboxone®; SUBX) as an opioid maintenance substance, and its grow
Externí odkaz:
https://doaj.org/article/e51d3a12a6944012ba13e7ea80065d6f
Autor:
Edward Hill, David Han, Pierre Dumouchel, Najim Dehak, Thomas Quatieri, Charles Moehs, Marlene Oscar-Berman, John Giordano, Thomas Simpatico, Debmalya Barh, Kenneth Blum
Publikováno v:
PLoS ONE, Vol 8, Iss 8 (2013)
Externí odkaz:
https://doaj.org/article/c19d5ac022f042c4a4b9b049c2dec63a
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24:1106-1118
In this paper, we present our audio fingerprinting system that detects a transformed copy of an audio from a large collection of audios in a database. The audio fingerprints in this system encode the positions of salient regions of binary images deri
Publikováno v:
Multimedia Tools and Applications. 75:9145-9165
This paper presents a novel audio fingerprinting method that is highly robust to a variety of audio distortions. It is based on an unconventional audio fingerprint generation scheme. The robustness is achieved by generating different versions of the
Publikováno v:
ICASSP
This paper describes a video fingerprinting system that is highly robust to audio and video transformations. The proposed system adapts a robust audio fingerprint extraction approach to video fingerprinting. The audio fingerprinting system converts t
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 22:217-227
Speaker clustering is a crucial step for speaker diarization. The short duration of speech segments in telephone speech dialogue and the absence of prior information on the number of clusters dramatically increase the difficulty of this problem in di
Publikováno v:
IEEE Transactions on Audio, Speech, and Language Processing. 21:2290-2300
The speed of modern processors has remained constant over the last few years but the integration capacity continues to follow Moore's law and thus, to be scalable, applications must be parallelized. The parallelization of the classical Viterbi beam s
Publikováno v:
RO-MAN
Audition is a rich source of spatial, identity, linguistic and paralinguistic information. Processing all this information requires acquisition, processing and interpretation of sound sources, which are instantaneous, invisible and noisy signals. Thi
Publikováno v:
IEEE Transactions on Audio, Speech, and Language Processing. 19:788-798
This paper presents an extension of our previous work which proposes a new speaker representation for speaker verification. In this modeling, a new low-dimensional speaker- and channel-dependent space is defined using a simple factor analysis. This s
Publikováno v:
IEEE Transactions on Audio, Speech, and Language Processing. 16:980-988
We propose a new approach to the problem of estimating the hyperparameters which define the interspeaker variability model in joint factor analysis. We tested the proposed estimation technique on the NIST 2006 speaker recognition evaluation data and