Indonesian Language Term Extraction using Multi-Task Neural Network

Autor: Joan Santoso, Esther Irawati Setiawan, Fransiskus Xaverius Ferdinandus, Gunawan Gunawan, Leonel Hernandez
Jazyk: angličtina
Rok vydání: 2022
Předmět:
Zdroj: Knowledge Engineering and Data Science, Vol 5, Iss 2, Pp 160-167 (2022)
Druh dokumentu: article
ISSN: 2597-4602
2597-4637
DOI: 10.17977/um018v5i22022p160-167
Popis: The rapidly expanding size of data makes it difficult to extricate information and store it as computerized knowledge. Relation extraction and term extraction play a crucial role in resolving this issue. Automatically finding a concealed relationship between terms that appear in the text can help people build computer-based knowledge more quickly. Term extraction is required as one of the components because identifying terms that play a significant role in the text is the essential step before determining their relationship. We propose an end-to-end system capable of extracting terms from text to address this Indonesian language issue. Our method combines two multilayer perceptron neural networks to perform Part-of-Speech (PoS) labeling and Noun Phrase Chunking. Our models were trained as a joint model to solve this problem. Our proposed method, with an f-score of 86.80%, can be considered a state-of-the-art algorithm for performing term extraction in the Indonesian Language using noun phrase chunking.
Databáze: Directory of Open Access Journals