A model for posttransliteration suggestion for Balinese palm leaf manuscript with text generation and LSTM model

Authors: M W A Kesiman, I M D Maysanjaya
Publication year: 2021
Subject:
Source: Journal of Physics: Conference Series. 1810:012011
ISSN: 1742-6596, 1742-6588
DOI: 10.1088/1742-6596/1810/1/012011
Description: The main challenge in building an automatic transliteration system for Balinese palm leaf manuscript (Lontar) collections is that recognition errors in a small portion of the glyphs of the Balinese script can widely affect the transliteration results. This is due to the fundamental nature of the Balinese script, which is a complex alphasyllabic script. This paper presents an initial proposal for a general scheme and model that suggests several possible transliterations for Lontar collections using text generation and an LSTM. An Edit-Insert-Replace model was proposed for application to the existing word collection dataset, and a Bidirectional LSTM model with a specific feature extraction method was built for training the post-transliteration suggestion module. This module helps suggest several possible transliterations based on the initial transliteration produced by the previous system.
Database: OpenAIRE
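
The description mentions generating candidate transliterations from an existing word collection but does not define the Edit-Insert-Replace operations. The following is a minimal sketch of one plausible reading, assuming single-character insertion, deletion, and replacement on Latin-script transliterations. The function names `generate_variants` and `suggest`, the default alphabet, and the sample lexicon are all hypothetical and are not taken from the paper. The Bidirectional LSTM ranking stage is not shown.

```python
import string


def generate_variants(word, alphabet=string.ascii_lowercase):
    """Return every string exactly one character edit away from `word`.

    Assumption: an edit is a single-character insert, delete, or replace.
    """
    variants = set()
    for i in range(len(word) + 1):
        head, tail = word[:i], word[i:]
        # Insert one character at position i.
        for ch in alphabet:
            variants.add(head + ch + tail)
        if tail:
            # Delete the character at position i.
            variants.add(head + tail[1:])
            # Replace the character at position i.
            for ch in alphabet:
                variants.add(head + ch + tail[1:])
    # Replacing a character with itself reproduces the input; drop it.
    variants.discard(word)
    return variants


def suggest(initial, lexicon, alphabet=string.ascii_lowercase):
    """Keep only the variants that appear in a known word collection."""
    return sorted(generate_variants(initial, alphabet) & set(lexicon))


if __name__ == "__main__":
    # Hypothetical word collection, for illustration only.
    lexicon = {"bali", "balian", "kali", "bal"}
    print(suggest("bali", lexicon))
```

In a pipeline like the one described, such generated candidates would presumably be scored by the trained model rather than filtered by simple lexicon membership; the filter here only keeps the example self-contained.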