The 2004 MIT Lincoln Laboratory Speaker Recognition System
Autor: | Douglas A. Reynolds, Pedro A. Torres-Carrasquillo, Carl Quillen, William M. Campbell, Terry P. Gleason, Douglas E. Sturim, André Gustavo Adami |
---|---|
Rok vydání: | 2006 |
Předmět: |
Computer science
business.industry Speech recognition Speech corpus Perceptron Mixture model computer.software_genre Speaker recognition Support vector machine Speaker diarisation ComputingMethodologies_PATTERNRECOGNITION Word usage NIST Artificial intelligence Language model business computer Natural language processing |
Zdroj: | ICASSP (1) |
DOI: | 10.1109/icassp.2005.1415079 |
Popis: | The MIT Lincoln Laboratory submission for the 2004 NIST speaker recognition evaluation (SRE) was built upon seven core systems using speaker information from short-term acoustics, pitch and duration prosodic behavior, and phoneme and word usage. These different levels of information were modeled and classified using Gaussian mixture models, support vector machines and N-gram language models and were combined using a single layer perceptron fuser. The 2004 SRE used a new multi-lingual, multi-channel speech corpus that provided a challenging speaker detection task for the above systems. We describe the core systems used and provide an overview of their performance on the 2004 SRE detection tasks. |
Databáze: | OpenAIRE |
Externí odkaz: |