Chinese Base Noun Phrase Based on Multi-Class Support Vector Machines and Rules of Post-Processing
Autor: | Meihua Wang, Shuicai Shi, Tao Wang, Runsheng Gan |
---|---|
Rok vydání: | 2010 |
Předmět: |
business.industry
Computer science Speech recognition InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL Phrase search Base (topology) computer.software_genre Noun phrase Support vector machine Statistical classification ComputingMethodologies_PATTERNRECOGNITION Determiner phrase Chunking (psychology) ComputingMethodologies_DOCUMENTANDTEXTPROCESSING Artificial intelligence business Hidden Markov model computer Natural language processing |
Zdroj: | DBTA |
DOI: | 10.1109/dbta.2010.5658598 |
Popis: | In the paper, Chinese base noun phrase chunking is considered as a classification problem, and the paper proposes an approach, combines SVM-based method and rules of post-processing method, to distinguish Chinese base noun phrase. But the paper introduce threshold in multi-class SVM algorithm and study the usefulness of threshold for Chinese base noun phrase chunking, complete analyses of the result in this paper, then according to the special structures of Chinese base noun phrase, customize some appropriate rules to process the result. From overall experiments, the method achieves a higher accuracy in the final results. |
Databáze: | OpenAIRE |
Externí odkaz: |