Zobrazeno 1 - 10
of 22
pro vyhledávání: '"Mamdouh Refaat"'
Autor:
Mamdouh Refaat
Are you a data mining analyst, who spends up to 80% of your time assuring data quality, then preparing that data for developing and deploying predictive models? And do you find lots of literature on data mining theory and concepts, but when it comes
Autor:
Mamdouh Refaat
Publisher Summary Sampling is used to facilitate the analysis and modeling of large datasets. There are several types of sampling schemes. Three of these schemes are commonly used in data mining—random sampling, balanced sampling, and stratified sa
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::29e0de14324f6b422d668ca7e0d19bd0
https://doi.org/10.1016/b978-012373577-5/50010-7
https://doi.org/10.1016/b978-012373577-5/50010-7
Autor:
Mamdouh Refaat
Publisher Summary Principal component analysis (PCA) is one of the oldest and most used methods for the reduction of multidimensional data. The basic idea of PCA is to find a set of linear transformations of the original variables such that the new s
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::f39e81f6de4391e323392e560a0be5f9
https://doi.org/10.1016/b978-012373577-5/50017-x
https://doi.org/10.1016/b978-012373577-5/50017-x
Autor:
Mamdouh Refaat
Publisher Summary The tasks performed by data mining techniques can be classified in terms of either the analytical function they entail or their implementation focus. The first classification scheme takes the point of view of the data mining analyst
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::7bc424548cd08d4fefa233de368420e7
https://doi.org/10.1016/b978-012373577-5/50004-1
https://doi.org/10.1016/b978-012373577-5/50004-1
Autor:
Mamdouh Refaat
Publisher Summary Exploratory Data Analysis (EDA) is known as one of the fundamental steps of the data mining process. It is a set of procedures aimed at understanding the data and the relationships among variables. Most data mining modeling packages
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::3f688310e1cd947a2bb058d86b220745
https://doi.org/10.1016/b978-012373577-5/50009-0
https://doi.org/10.1016/b978-012373577-5/50009-0
Autor:
Mamdouh Refaat
Publisher Summary The definition of a metric of predictive power requires a dependent variable to define the predictive aspect of the question. Various measures of predictive power differ in how they weigh the different errors and how we plan to use
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::5afdadcb2554085bb9cac1b3c88b3548
https://doi.org/10.1016/b978-012373577-5/50014-4
https://doi.org/10.1016/b978-012373577-5/50014-4
Autor:
Mamdouh Refaat
Publisher Summary This chapter presents the methods used to measure the association between two variables when one or both of them is continuous. The option to avoid dealing with continuous variables is to bin them before using them. Binning is used
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::504c92ddeada176995354909c258b854
https://doi.org/10.1016/b978-012373577-5/50016-8
https://doi.org/10.1016/b978-012373577-5/50016-8
Autor:
Mamdouh Refaat
Publisher Summary This chapter describes the main tool used in the analysis of nominal and ordinal variables—contingency tables and discusses the different measures of association between the variables. Contingency is a fundamental tool in the anal
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::ddb41dd277ea81d4ea09a7828bb17d62
https://doi.org/10.1016/b978-012373577-5/50015-6
https://doi.org/10.1016/b978-012373577-5/50015-6
Autor:
Mamdouh Refaat
Missing values exist in abundance in databases, and procedures for treating them are by far the most recurring theme in data mining modeling. This chapter discusses three basic strategies to treat missing values—(1) ignore the record, (2) substitut
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::4aa904d25f8f31741fcef64dfd995d1b
https://doi.org/10.1016/b978-012373577-5/50013-2
https://doi.org/10.1016/b978-012373577-5/50013-2
Autor:
Mamdouh Refaat
Publisher Summary This chapter provides a case study to show how all the data elements can be put together to automate the data preparation process. The basic steps of the data preparation process are outlined, which include—data acquisition and in
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::3b315313de857ab243d7bc5642b72e03
https://doi.org/10.1016/b978-012373577-5/50020-x
https://doi.org/10.1016/b978-012373577-5/50020-x