Learning Context Using Segment-Level LSTM for Neural Sequence Labeling

Authors:	Youhyun Shin, Sang-goo Lee
Year of publication:	2020
Subject:	Acoustics and Ultrasonics; Computer science; business.industry; Feature extraction; Context (language use); Pattern recognition; Sequence labeling; 030507 speech-language pathology & audiology; 03 medical and health sciences; Computational Mathematics; Tree (data structure); Code segment; Computer Science (miscellaneous); Limit (mathematics); Artificial intelligence; Electrical and Electronic Engineering; 0305 other medical science; Hidden Markov model; business; Word (computer architecture)
Source:	IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28:105-115
ISSN:	2329-9304 2329-9290
DOI:	10.1109/taslp.2019.2948773
Description:	This article introduces an approach that learns segment-level context for sequence labeling in natural language processing (NLP). Previous approaches limit their basic unit for feature extraction to the word, because sequence labeling is a token-level task in which labels are annotated word by word. However, the text segment is the ultimate unit for labeling, and segment information can easily be obtained from labels annotated in an IOB/IOBES format. Most neural sequence labeling models expand their learning capacity by employing additional layers, such as a character-level layer, or by jointly training NLP tasks with common knowledge. The architecture of our model is based on the charLSTM-BiLSTM-CRF model, and we extend the model with an additional segment-level layer called segLSTM. We therefore suggest a sequence labeling algorithm called charLSTM-BiLSTM-CRF-segLSTM $^{sLM}$, which employs an additional segment-level long short-term memory (LSTM) that trains features by learning adjacent context in a segment. We demonstrate the performance of our model on four sequence labeling datasets, namely Penn Treebank, CoNLL 2000, CoNLL 2003, and OntoNotes 5.0. Experimental results show that our model performs better than state-of-the-art variants of BiLSTM-CRF. In particular, the proposed model enhances the performance of tasks for finding appropriate labels of multiple-token segments.
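
The abstract notes that segment information can be read directly off IOB/IOBES labels. The following is a minimal sketch of that step only, assuming well-formed IOBES tags of the form `B-X`, `I-X`, `E-X`, `S-X`, and `O`. The function name `iobes_to_segments` is hypothetical and is not taken from the article; it does not reproduce the authors' segLSTM layer.

```python
from typing import List, Tuple


def iobes_to_segments(tags: List[str]) -> List[Tuple[int, int, str]]:
    """Return (start, end_inclusive, label) for each segment in an IOBES sequence.

    Assumes well-formed tags; a stray "E-" without a preceding "B-" is skipped.
    """
    segments = []
    start = None  # index of the currently open "B-" tag, if any
    for i, tag in enumerate(tags):
        if tag == "O":
            start = None
            continue
        prefix, _, label = tag.partition("-")
        if prefix == "S":          # single-token segment
            segments.append((i, i, label))
            start = None
        elif prefix == "B":        # segment begins
            start = i
        elif prefix == "E" and start is not None:  # segment ends
            segments.append((start, i, label))
            start = None
        # "I" tags simply continue the open segment
    return segments


tags = ["B-PER", "E-PER", "O", "S-LOC", "B-ORG", "I-ORG", "E-ORG"]
segments = iobes_to_segments(tags)
```

Each returned span marks the tokens that a segment-level layer could treat as one unit; how those spans are then encoded is specific to the article's model and is not shown here.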
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::1a82e0d0912c4d002a5a7a3eb0404b73 https://doi.org/10.1109/taslp.2019.2948773