Time-series prediction of shellfish farm closure: A comparison of alternatives
Autor: | John McCulloch, Sumon Shahriar, Claire D'Este, Greg Smith, Greg Timms, Ashfaqur Rahman |
---|---|
Rok vydání: | 2014 |
Předmět: |
Engineering
Feature extraction Decision tree Closure (topology) Aquaculture Aquatic Science computer.software_genre lcsh:Agriculture (General) Time series Time series data Data mining lcsh:T58.5-58.64 Artificial neural network lcsh:Information technology business.industry Bayesian network Forestry lcsh:S1-972 Shellfish farm closure Computer Science Applications Term (time) Support vector machine Animal Science and Zoology business Agronomy and Crop Science computer |
Zdroj: | Information Processing in Agriculture, Vol 1, Iss 1, Pp 42-50 (2014) |
ISSN: | 2214-3173 |
DOI: | 10.1016/j.inpa.2014.05.001 |
Popis: | Shellfish farms are closed for harvest when microbial pollutants are present. Such pollutants are typically present in rainfall runoff from various land uses in catchments. Experts currently use a number of observable parameters (river flow, rainfall, salinity) as proxies to determine when to close farms. We have proposed using the short term historical rainfall data as a time-series prediction problem where we aim to predict the closure of shellfish farms based only on rainfall. Time-series event prediction consists of two steps: (i) feature extraction, and (ii) prediction. A number of data mining challenges exist for these scenarios: (i) which feature extraction method best captures the rainfall pattern over successive days that leads to opening or closure of the farms?, (ii) The farm closure events occur infrequently and this leads to a class imbalance problem; the question is what is the best way to deal with this problem? In this paper we have analysed and compared different combinations of balancing methods (under-sampling and over-sampling), feature extraction methods (cluster profile, curve fitting, Fourier Transform, Piecewise Aggregate Approximation, and Wavelet Transform) and learning algorithms (neural network, support vector machine, k-nearest neighbour, decision tree, and Bayesian Network) to predict closure events accurately considering the above data mining challenges. We have identified the best combination of techniques to accurately predict shellfish farm closure from rainfall, given the above data mining challenges. |
Databáze: | OpenAIRE |
Externí odkaz: |