Towards Federated Learning Approach to Determine Data Relevance in Big Data
Autor: | Ronald Doku, Chunmei Liu, Danda B. Rawat |
---|---|
Rok vydání: | 2019 |
Předmět: |
Information privacy
Information Age Computer science business.industry Big data 020206 networking & telecommunications 020207 software engineering Cryptography 02 engineering and technology Data science Data modeling Data integrity 0202 electrical engineering electronic engineering information engineering Haystack business Heap (data structure) |
Zdroj: | IRI |
Popis: | In the past few years, data has proliferated to astronomical proportions; as a result, big data has become the driving force behind the growth of many machine learning innovations. However, the incessant generation of data in the information age poses a needle in the haystack problem, where it has become challenging to determine useful data from a heap of irrelevant ones. This has resulted in a quality over quantity issue in data science where a lot of data is being generated, but the majority of it is irrelevant. Furthermore, most of the data and the resources needed to effectively train machine learning models are owned by major tech companies, resulting in a centralization problem. As such, federated learning seeks to transform how machine learning models are trained by adopting a distributed machine learning approach. Another promising technology is the blockchain, whose immutable nature ensures data integrity. By combining the blockchain's trust mechanism and federated learning's ability to disrupt data centralization, we propose an approach that determines relevant data and stores the data in a decentralized manner. |
Databáze: | OpenAIRE |
Externí odkaz: |