Research on business English grammar detection system based on LSTM model

Autor: Huang Xiaojie
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Journal of Intelligent Systems, Vol 33, Iss 1, Pp p. 209-14 (2024)
Druh dokumentu: article
ISSN: 2191-026X
DOI: 10.1515/jisys-2023-0309
Popis: In order to solve the problems that the current English grammar correction algorithms are not effective, the error correction ability is limited, and the error correction accuracy needs to be improved, this study proposes an automatic grammar correction method for business English writing based on two-way long short-term memory (LSTM) and N-gram. First, this study considers article and preposition errors as a special sequence labeling task, and proposes a Grammar error checking (GEC) method for sequence labeling based on bidirectional LSTM. During training, english as a second language (ESL) corpus and supplementary corpus are used to label specific articles or prepositions. Second, for noun simple-plural errors, verb form errors, and subject-verb inconsistency errors, a large number of news corpora are used to count the frequency of N-gram, and a GEC method based on ESL and news corpora N-gram voting strategy is proposed. Experimental results show that the overall F 1 value of the method designed in this study on the GEC data of CoNLL2013 is 33.87%, which is higher than the F 1 value of UIUC. The F 1 value of article error correction is 38.05%, and the F 1 value of preposition error correction is 28.89%. It is proved that this method can effectively improve the accuracy of grammar error correction and solve the gradient explosion problem of traditional error correction model, which is of great significance to further strengthen the practicality of automatic grammar error correction technology.
Databáze: Directory of Open Access Journals