Indonesian Language Term Extraction using Multi-Task Neural Network
Autor: | Joan Santoso, Esther Irawati Setiawan, Fransiskus Xaverius Ferdinandus, Gunawan Gunawan, Leonel Hernandez |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2022 |
Předmět: | |
Zdroj: | Knowledge Engineering and Data Science, Vol 5, Iss 2, Pp 160-167 (2022) |
Druh dokumentu: | article |
ISSN: | 2597-4602 2597-4637 |
DOI: | 10.17977/um018v5i22022p160-167 |
Popis: | The rapidly expanding size of data makes it difficult to extricate information and store it as computerized knowledge. Relation extraction and term extraction play a crucial role in resolving this issue. Automatically finding a concealed relationship between terms that appear in the text can help people build computer-based knowledge more quickly. Term extraction is required as one of the components because identifying terms that play a significant role in the text is the essential step before determining their relationship. We propose an end-to-end system capable of extracting terms from text to address this Indonesian language issue. Our method combines two multilayer perceptron neural networks to perform Part-of-Speech (PoS) labeling and Noun Phrase Chunking. Our models were trained as a joint model to solve this problem. Our proposed method, with an f-score of 86.80%, can be considered a state-of-the-art algorithm for performing term extraction in the Indonesian Language using noun phrase chunking. |
Databáze: | Directory of Open Access Journals |
Externí odkaz: |