Exploratory Analysis of MNIST Handwritten Digit for Machine Learning Modelling
Autor: | Mohd Razif Shamsuddin, Azlinah Mohamed, Shuzlina Abdul-Rahman |
---|---|
Rok vydání: | 2018 |
Předmět: |
Normalization (statistics)
Computer science business.industry 020208 electrical & electronic engineering ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION 02 engineering and technology Machine learning computer.software_genre Convolutional neural network Image (mathematics) 0202 electrical engineering electronic engineering information engineering Benchmark (computing) NIST 020201 artificial intelligence & image processing Artificial intelligence business computer MNIST database |
Zdroj: | Communications in Computer and Information Science ISBN: 9789811334405 |
DOI: | 10.1007/978-981-13-3441-2_11 |
Popis: | This paper is an investigation about the MNIST dataset, which is a subset of the NIST data pool. The MNIST dataset contains handwritten digit images that is derived from a larger collection of NIST data which contains handwritten digits. All the images are formatted in 28 × 28 pixels value with grayscale format. MNIST is a handwritten digit images that has often been cited in many leading research and thus has become a benchmark for image recognition and machine learning studies. There have been many attempts by researchers in trying to identify the appropriate models and pre-processing methods to classify the MNIST dataset. However, very little attention has been given to compare binary and normalized pre-processed datasets and its effects on the performance of a model. Pre-processing results are then presented as input datasets for machine learning modelling. The trained models are validated with 4200 random test samples over four different models. Results have shown that the normalized image performed the best with Convolution Neural Network model at 99.4% accuracy. |
Databáze: | OpenAIRE |
Externí odkaz: |