Hoeffding Trees with Nmin Adaptation

Autor: Emiliano Casalicchio, Niklas Lavesson, Håkan Grahn, Eva García-Martín, Veselka Boeva
Rok vydání: 2018
Předmět:
information systems and management
FOS: Computer and information sciences
Computer Science - Machine Learning
Computational complexity theory
Computer science
data stream mining
Decision tree
Machine Learning (stat.ML)
02 engineering and technology
Machine Learning (cs.LG)
energy aware machine learning
Software
Statistics - Machine Learning
020204 information systems
0202 electrical engineering
electronic engineering
information engineering

signal processing
energy efficiency
computer networks and communications
Computer Sciences
Data stream mining
business.industry
green artificial intelligence
Hoeffding trees
Energy consumption
probability and uncertainty
Tree (data structure)
Datavetenskap (datalogi)
statistics
Scalability
020201 artificial intelligence & image processing
statistics
probability and uncertainty

business
Algorithm
Energy (signal processing)
Zdroj: DSAA
DOI: 10.1109/dsaa.2018.00017
Popis: Machine learning software accounts for a significant amount of energy consumed in data centers. These algorithms are usually optimized towards predictive performance, i.e. accuracy, and scalability. This is the case of data stream mining algorithms. Although these algorithms are adaptive to the incoming data, they have fixed parameters from the beginning of the execution. We have observed that having fixed parameters lead to unnecessary computations, thus making the algorithm energy inefficient. In this paper we present the nmin adaptation method for Hoeffding trees. This method adapts the value of the nmin parameter, which significantly affects the energy consumption of the algorithm. The method reduces unnecessary computations and memory accesses, thus reducing the energy, while the accuracy is only marginally affected. We experimentally compared VFDT (Very Fast Decision Tree, the first Hoeffding tree algorithm) and CVFDT (Concept-adapting VFDT) with the VFDT-nmin (VFDT with nmin adaptation). The results show that VFDT-nmin consumes up to 27% less energy than the standard VFDT, and up to 92% less energy than CVFDT, trading off a few percent of accuracy in a few datasets.
Accepted at: The 5th IEEE International Conference on Data Science and Advanced Analytics (DSAA 2018)
Databáze: OpenAIRE