Effective offline handwritten text recognition model based on a sequence-to-sequence approach with CNN–RNN networks
Authors: | R. Geetha, T. Padmavathy, T. Thilagam |
---|---|
Year of publication: | 2021 |
Subject: | 0209 industrial biotechnology; Sequence; Computer science; Deep learning; Pattern recognition; 02 engineering and technology; Convolutional neural network; 020901 industrial engineering & automation; Recurrent neural network; Artificial intelligence; Salient; Encoding (memory); Factor (programming language); ComputingMethodologies_DOCUMENTANDTEXTPROCESSING; 0202 electrical engineering, electronic engineering, information engineering; 020201 artificial intelligence & image processing; Software; Decoding methods |
Source: | Neural Computing and Applications. 33:10923-10934 |
ISSN: | 1433-3058 0941-0643 |
DOI: | 10.1007/s00521-020-05556-5 |
Description: | Automatic text recognition systems could play an important role in creating a paperless environment by digitizing and processing existing paper documents. Handwriting recognition using deep learning has been widely explored. The availability of large quantities of data and a variety of algorithmic innovations have made deep neural networks easier to train. Various techniques for recognizing text in handwritten documents have been proposed in the literature. This paper proposes a hybrid handwritten text recognition (H2TR) model based on deep neural networks and the sequence-to-sequence (Seq2Seq) approach. The hybrid model combines the salient features of a convolutional neural network (CNN) and a recurrent neural network (RNN) with long short-term memory (LSTM). A CNN extracts features from the handwritten image. The extracted features are then modelled with a sequence-to-sequence approach and fed to an RNN-LSTM, which encodes the visual features and decodes the sequence of letters present in the handwritten image. The proposed model is tested on the IAM and RIMES handwriting databases and shows competitive letter accuracy and word accuracy. |
Database: | OpenAIRE |
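
The description reports letter accuracy and word accuracy but this record does not say how they are computed. The sketch below assumes a common convention: accuracy is one minus the edit-distance error rate, measured over characters for letter accuracy and over whitespace-separated tokens for word accuracy. The function names `edit_distance`, `letter_accuracy`, and `word_accuracy` are hypothetical and are not taken from the paper, whose exact definitions may differ.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists of tokens)."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # cost of deleting the first i reference items
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def _accuracy(ref_seqs, hyp_seqs):
    """Assumed definition: 1 - (total edits / total reference length)."""
    total = sum(len(r) for r in ref_seqs)
    if total == 0:
        raise ValueError("references must contain at least one item")
    errors = sum(edit_distance(r, h) for r, h in zip(ref_seqs, hyp_seqs))
    return 1.0 - errors / total


def letter_accuracy(references, hypotheses):
    """Character-level accuracy over paired reference/hypothesis strings."""
    return _accuracy(references, hypotheses)


def word_accuracy(references, hypotheses):
    """Word-level accuracy; words are whitespace-separated tokens."""
    return _accuracy([r.split() for r in references],
                     [h.split() for h in hypotheses])


if __name__ == "__main__":
    refs = ["the quick brown fox"]
    hyps = ["the quick brwn fox"]  # one character missing in "brown"
    print("letter accuracy:", letter_accuracy(refs, hyps))
    print("word accuracy:", word_accuracy(refs, hyps))
```

Under this definition a single dropped letter costs one character edit but a full word error, so word accuracy falls faster than letter accuracy. The value can also become negative when a hypothesis contains many spurious insertions.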