High-throughput fuzzy clustering on heterogeneous architectures
Autor: | Jesús Soto, José M. Cecilia, Baldomero Imbernón, Juan M. Cebrian, José M. García |
---|---|
Rok vydání: | 2020 |
Předmět: |
Fuzzy clustering
Parallel fuzzy clustering Computer Networks and Communications Computer science 020206 networking & telecommunications 02 engineering and technology computer.software_genre Fuzzy logic Fuzzy minimals ARQUITECTURA Y TECNOLOGIA DE COMPUTADORES Data set Hardware and Architecture Order (exchange) Factor (programming language) 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Data mining Cluster analysis computer Throughput (business) Software computer.programming_language |
Zdroj: | RiuNet. Repositorio Institucional de la Universitat Politécnica de Valéncia instname |
ISSN: | 0167-739X |
DOI: | 10.1016/j.future.2020.01.022 |
Popis: | [EN] The Internet of Things (IoT) is pushing the next economic revolution in which the main players are data and immediacy. IoT is increasingly producing large amounts of data that are now classified as "dark data'' because most are created but never analyzed. The efficient analysis of this data deluge is becoming mandatory in order to transform it into meaningful information. Among the techniques available for this purpose, clustering techniques, which classify different patterns into groups, have proven to be very useful for obtaining knowledge from the data. However, clustering algorithms are computationally hard, especially when it comes to large data sets and, therefore, they require the most powerful computing platforms on the market. In this paper, we investigate coarse and fine grain parallelization strategies in Intel and Nvidia architectures of fuzzy minimals (FM) algorithm; a fuzzy clustering technique that has shown very good results in the literature. We provide an in-depth performance analysis of the FM's main bottlenecks, reporting a speed-up factor of up to 40x compared to the sequential counterpart version. This work was partially supported by the Fundacion Seneca del Centro de Coordinacion de la Investigacion de la Region de Murcia under Project 20813/PI/18, and by Spanish Ministry of Science, Innovation and Universities under grants TIN2016-78799-P (AEI/FEDER, UE), RTI2018-096384-B-I00, RTI2018-098156-B-C53 and RTC-2017-6389-5. |
Databáze: | OpenAIRE |
Externí odkaz: |