An effective classification approach for big data with parallel generalized Hebbian algorithm
Autor: | Mostafa Abdulghafoor Mohammed, Tole Sutikno, Royida A. Ibrahem Alhayali, Ahmed Hussein Ali |
---|---|
Rok vydání: | 2021 |
Předmět: |
Data processing
Control and Optimization Artificial neural network Computer Networks and Communications Computer science business.industry Big data Information technology computer.software_genre Reduction (complexity) Hardware and Architecture Control and Systems Engineering Generalized Hebbian Algorithm Spark (mathematics) Computer Science (miscellaneous) Data mining Big data Generalized Hebbian algorithm Machine learning Neural network Principal component analysis Spark Radoop Electrical and Electronic Engineering business Instrumentation computer Information Systems Curse of dimensionality |
Zdroj: | Bulletin of Electrical Engineering and Informatics. 10:3393-3402 |
ISSN: | 2302-9285 2089-3191 |
DOI: | 10.11591/eei.v10i6.3135 |
Popis: | Advancements in information technology is contributing to the excessive rate of big data generation recently. Big data refers to datasets that are huge in volume and consumes much time and space to process and transmit using the available resources. Big data also covers data with unstructured and structured formats. Many agencies are currently subscribing to research on big data analytics owing to the failure of the existing data processing techniques to handle the rate at which big data is generated. This paper presents an efficient classification and reduction technique for big data based on parallel generalized Hebbian algorithm (GHA) which is one of the commonly used principal component analysis (PCA) neural network (NN) learning algorithms. The new method proposed in this study was compared to the existing methods to demonstrate its capabilities in reducing the dimensionality of big data. The proposed method in this paper is implemented using Spark Radoop platform. |
Databáze: | OpenAIRE |
Externí odkaz: |