Optical Character Recognition for English and Tamil Using Support Vector Machines
Authors: | N. Valliappan, L. Thaneshwaran, Arun S. Nair, K. P. Soman, S. Ponmathavan, R. Ramanathan |
---|---|
Year of publication: | 2009 |
Subject: |
Computer science
Speech recognition, Feature extraction, Pattern recognition, Character encoding, Optical character recognition, Optical character recognition software, Support vector machine, Numeral system, Tamil language, Artificial intelligence, Character recognition |
Source: | 2009 International Conference on Advances in Computing, Control, and Telecommunication Technologies. |
Description: | Optical Character Recognition is an evergreen area of research and is widely used in various real-time applications. This paper proposes a new technique for Optical Character Recognition using Gabor filters and Support Vector Machines (SVM). The method proves to be very effective, with Gabor filters used for feature extraction and SVM used for developing the model. The proposed model is trained and validated for two languages, English and Tamil, and the results are found to be very encouraging. The model works for the entire character set in both languages, including symbols and numerals. In addition, the model can recognise characters in six different fonts in English and twelve different fonts in Tamil. The average recognition accuracy is 97% for English and 84% for Tamil, achieved in just three iterations of training. The method may be a suitable candidate for future applications in this area. |
Database: | OpenAIRE |
External link: |
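
The abstract above names two stages: Gabor filters for feature extraction and an SVM for the model. The following is only a minimal sketch of that general pipeline, not the authors' implementation. Everything specific in it is an assumption made for illustration: the filter size, wavelength, and orientations, the use of a linear SVM trained by subgradient descent on the hinge loss, the two-class setup, and the toy stripe images that stand in for character glyphs. The helper names (`gabor_kernel`, `gabor_features`, `train_linear_svm`, `predict`, `stripes`) are hypothetical.

```python
import math

def gabor_kernel(size, theta, wavelength, sigma):
    """Real part of a Gabor kernel: Gaussian envelope times a cosine wave."""
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates so the wave varies along direction theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr * xr + yr * yr) / (2 * sigma * sigma))
            row.append(envelope * math.cos(2 * math.pi * xr / wavelength))
        kernel.append(row)
    # Subtract the mean so a uniform image produces zero response.
    mean = sum(sum(row) for row in kernel) / (size * size)
    return [[v - mean for v in row] for row in kernel]

def gabor_features(image, thetas, size=5, wavelength=4.0, sigma=2.0):
    """One feature per orientation: mean absolute filter response."""
    h, w = len(image), len(image[0])
    feats = []
    for theta in thetas:
        k = gabor_kernel(size, theta, wavelength, sigma)
        total, count = 0.0, 0
        for i in range(h - size + 1):
            for j in range(w - size + 1):
                r = sum(k[a][b] * image[i + a][j + b]
                        for a in range(size) for b in range(size))
                total += abs(r)
                count += 1
        feats.append(total / count)
    return feats

def train_linear_svm(X, y, lam=0.01, epochs=50, lr=0.1):
    """Subgradient descent on the L2-regularised hinge loss; labels are +1/-1."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            violated = margin < 1
            for j in range(len(w)):
                grad = lam * w[j] - (yi * xi[j] if violated else 0.0)
                w[j] -= lr * grad
            if violated:
                b += lr * yi
    return w, b

def predict(w, b, x):
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1

def stripes(n, vertical, phase):
    """Toy stand-in for a glyph: an n x n image of 2-pixel-wide stripes."""
    def value(c):
        return 1.0 if ((c + phase) // 2) % 2 == 0 else 0.0
    return [[value(j if vertical else i) for j in range(n)] for i in range(n)]

# Two orientations (assumed): 0 responds to vertical stripes, pi/2 to horizontal.
THETAS = [0.0, math.pi / 2]

# Tiny training set: vertical stripes are class +1, horizontal stripes class -1.
train_images = [stripes(8, True, 0), stripes(8, False, 0),
                stripes(8, True, 2), stripes(8, False, 2)]
train_labels = [1, -1, 1, -1]
X = [gabor_features(img, THETAS) for img in train_images]
w, b = train_linear_svm(X, train_labels)

# Classify an unseen image (a phase not used in training).
print(predict(w, b, gabor_features(stripes(8, True, 1), THETAS)))
```

A real recogniser along these lines would need a larger filter bank, segmented character images, and a multi-class SVM (for example one-vs-rest), none of which are specified in this record.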