Language Identification of Mandarin, Holo, and Hakka based on GMM and PPM models
Autor: | Chung-Ming Tsai, 蔡仲明 |
---|---|
Rok vydání: | 2007 |
Druh dokumentu: | 學位論文 ; thesis |
Popis: | 95 In this thesis, different models and model structures are studied to identify among the three languages: Mandarin, Holo, and Hakka. Acoustic features are represented as Mel-frequency cepstrum coefficients (MFCC). In addition, Legendre polynomials and discrete cosine transforms (DCT) are used to approximate the pitch contour of a voice segment. For the two kinds of features, Gauussian mixture models (GMM) are constructed respectively. Also, each frame’s feature vector is tokenized in order to construct a prediction by partial matching (PPM) model to modelize the characteristics embedded in a sequence of tokens. According to the models studied, a practical language identification system has been built. In offline tests, identification success rate can reach 97% in average. In initial online (i.e. inputting with a telephone) tests, the system can obtain a success rate of 73% in average. |
Databáze: | Networked Digital Library of Theses & Dissertations |
Externí odkaz: |