Spoken dialogue grammar induction from crowdsourced data
Autor: | Elias Iosif, Ioannis Klasinas, Elisavet Palogiannidi, Alexandros Potamianos |
---|---|
Rok vydání: | 2014 |
Předmět: |
Perplexity
Grammar Computer science business.industry Speech recognition media_common.quotation_subject Crowdsourcing computer.software_genre Grammar induction Task (project management) TheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGES Rule-based machine translation Selection (linguistics) Artificial intelligence business computer Natural language processing media_common |
Zdroj: | ICASSP |
DOI: | 10.1109/icassp.2014.6854193 |
Popis: | We design and evaluate various crowdsourcing tasks for eliciting spoken dialogue data. Task design is based on an array of parameters that quantify the basic characteristics of the elicitation questions, e.g., how open-ended is a question. The crowdsourced data are used for and evaluated on the unsupervised induction of semantic classes for speech understanding grammars. We show that grammar induction performance is significantly affected by the crowdsourcing task parameters, e.g., paraphrasing tasks prime high lexical entrain-ment and result in poor corpus/grammar quality. The task parameters along with perplexity filters are used for corpus selection achieving grammar induction performance that is comparable to that of using in-domain spoken dialogue data. |
Databáze: | OpenAIRE |
Externí odkaz: |