Active Learning via Membership Query Synthesis for Semi-Supervised Sentence Classification

Autor: Ines Rehbein, Raphael Schumann
Rok vydání: 2019
Předmět:
Zdroj: CoNLL
DOI: 10.18653/v1/k19-1044
Popis: Active learning (AL) is a technique for reducing manual annotation effort during the annotation of training data for machine learning classifiers. For NLP tasks, pool-based and stream-based sampling techniques have been used to select new instances for AL while gen erating new, artificial instances via Membership Query Synthesis was, up to know, considered to be infeasible for NLP problems. We present the first successfull attempt to use Membership Query Synthesis for generating AL queries, using Variational Autoencoders for query generation. We evaluate our approach in a text classification task and demonstrate that query synthesis shows competitive performance to pool-based AL strategies while substantially reducing annotation time
Databáze: OpenAIRE