CHI-BD: A fuzzy rule-based classification system for Big Data classification problems
Authors: | José Antonio Sanz, Mikel Galar, Humberto Bustince, Mikel Elkano |
---|---|
Publication year: | 2018 |
Subject: | Fuzzy rule; Logic; Computer science; Big data; Machine learning; Artificial intelligence; Data mining; Engineering and technology; Information systems; Electrical engineering, electronic engineering, information engineering; Artificial intelligence & image processing |
Source: | Fuzzy Sets and Systems. 348:75-101 |
ISSN: | 0165-0114 |
DOI: | 10.1016/j.fss.2017.07.003 |
Description: | Previous Fuzzy Rule-Based Classification Systems (FRBCSs) for Big Data problems consist of concurrently learning multiple Chi et al. FRBCSs whose rule bases are then aggregated. The problem with this approach is that different models are obtained when the configuration of the cluster varies, and the models become less accurate as more computing nodes are added. Our aim with this work is to design a new FRBCS for Big Data classification problems (CHI-BD) that provides exactly the same model as the one the original Chi et al. algorithm would obtain if it could be executed on this quantity of data. To do so, we take advantage of the suitability of the Chi et al. algorithm for the MapReduce paradigm, solving the problems of the previous approach, which leads us to obtain the same model (i.e., the same classification accuracy) regardless of the number of computing nodes considered. |
Database: | OpenAIRE |
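The abstract's central claim, that a Chi-style rule learner can be expressed in MapReduce so that the resulting model does not depend on how the data is split across nodes, can be illustrated with a small sketch. This is not the CHI-BD implementation: the three evenly spaced triangular fuzzy sets on [0, 1], the product matching degree, the per-class sum used to pick each rule's consequent, and all function names (`membership`, `map_example`, `learn_partition`, `merge`, `learn`) are assumptions made for illustration only. The record itself does not describe how CHI-BD computes rule weights.

```python
from collections import defaultdict
from functools import reduce

NUM_LABELS = 3  # assumption: three triangular fuzzy sets per feature on [0, 1]


def membership(x, label):
    """Triangular membership of x for evenly spaced labels on [0, 1]."""
    centre = label / (NUM_LABELS - 1)
    width = 1.0 / (NUM_LABELS - 1)
    return max(0.0, 1.0 - abs(x - centre) / width)


def map_example(features):
    """Map step: pick the best-matching label per feature.

    Returns the rule antecedent and the product of the chosen memberships.
    """
    antecedent, degree = [], 1.0
    for x in features:
        best = max(range(NUM_LABELS), key=lambda lab: membership(x, lab))
        antecedent.append(best)
        degree *= membership(x, best)
    return tuple(antecedent), degree


def new_table():
    """antecedent -> class -> summed matching degree."""
    return defaultdict(lambda: defaultdict(float))


def learn_partition(partition):
    """Work done independently on one node."""
    partial = new_table()
    for features, cls in partition:
        antecedent, degree = map_example(features)
        partial[antecedent][cls] += degree
    return partial


def merge(left, right):
    """Reduce step: add per-class sums for identical antecedents."""
    for antecedent, sums in right.items():
        for cls, degree in sums.items():
            left[antecedent][cls] += degree
    return left


def learn(data, num_partitions):
    """Split the data, learn partial tables, merge, then pick consequents."""
    partitions = [data[i::num_partitions] for i in range(num_partitions)]
    total = reduce(merge, map(learn_partition, partitions), new_table())
    return {ant: max(sorted(sums), key=sums.get) for ant, sums in total.items()}


# Hypothetical toy data: (features, class label).
data = [
    ((0.10, 0.20), "a"),
    ((0.15, 0.10), "a"),
    ((0.05, 0.05), "b"),
    ((0.90, 0.80), "b"),
    ((0.50, 0.45), "a"),
    ((0.60, 0.50), "b"),
    ((0.95, 0.10), "a"),
]

# The same rule base comes out whether one, three or seven "nodes" are used.
assert learn(data, 1) == learn(data, 3) == learn(data, 7)
```

Because the reduce step only adds per-class totals for identical antecedents, and the consequent is chosen after all partial tables are merged, the partitioning does not change the outcome in this sketch (up to floating-point rounding in near-tie cases). Resolving rule conflicts inside each partition before merging would instead make the result depend on the split, which is the behaviour the abstract attributes to the earlier approach.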