Computational Science and Data Mining
Autor: | Flaviu Adrian Mărginean |
---|---|
Rok vydání: | 2003 |
Předmět: | |
Zdroj: | Lecture Notes in Computer Science ISBN: 9783540401964 International Conference on Computational Science |
DOI: | 10.1007/3-540-44863-2_63 |
Popis: | The last decade has witnessed an impressive growth of Data Mining through algorithms and applications. Despite the advances, a computational theory of Data Mining is still largely outstanding. This paper discusses some aspects relevant to computation in Data Mining from the point of view of the Machine Learning theoretician. Computational techniques used in other fields that deal with learning from data, such as Statistics and Machine Learning, are potentially very relevant. However, the specifics of Data Mining are such that most often those techniques are not directly applicable but require to be re-cast and reanalysed within Data Mining starting from first principles. We illustrate this with a PAC-learnability analysis for a Data Mining-like task. We show that accounting for Data Mining specific requirements, such as inference of weak predictors and agnosticity assumptions, requires the generalisation of the classical PAC framework in novel ways. |
Databáze: | OpenAIRE |
Externí odkaz: |