Abstrakt: |
Abstract: To build a highly effective model for selecting trees for thinning under various forestry goals, this study examined the generalizability and accuracy of models built with several ensemble learning algorithms and the m-fold cross-validation method. These techniques can improve discrimination accuracy by combining multiple learning results that are not individually very accurate. WEKA, a machine learning toolkit for data mining written in Java, was used to verify the results of the simulation models. The dataset contained 503 samples. Six pattern-recognition algorithms were used: five classification-type models and one function-type model. It was found that: (1) without cross-validation, two of the pattern-recognition algorithms showed comparatively high discrimination accuracy; (2) with cross-validation, discrimination accuracy decreased overall but did not differ greatly from the accuracy without cross-validation; and (3) taking generalizability into account, the constructed model achieved a discrimination accuracy of around 70%. To construct more effective models, the model design should use particular algorithms or incorporate re-sampling methods such as ensemble learning and cross-validation. Additionally, for small sample datasets, ensemble learning is an effective method for constructing efficient models. |
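The two techniques named in the abstract can be illustrated with a minimal sketch. This is not the study's WEKA workflow: the function names (`m_fold_splits`, `majority_vote`, `accuracy`), the choice of m = 10, the toy "thin"/"keep" labels, and the three hand-written model outputs are all assumptions made for illustration. Only the sample count of 503 comes from the abstract. The sketch shows how m-fold cross-validation partitions the samples, and how a majority vote over several moderately accurate models can be more accurate than any one of them.

```python
import random
from collections import Counter

def m_fold_splits(n_samples, m, seed=0):
    """Yield (train_idx, test_idx) pairs for m-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold k takes every m-th shuffled index, so fold sizes differ by at most one.
    folds = [indices[k::m] for k in range(m)]
    for k in range(m):
        test_idx = folds[k]
        train_idx = [i for j, fold in enumerate(folds) if j != k for i in fold]
        yield train_idx, test_idx

def majority_vote(predictions):
    """Combine per-model label lists into one label per sample by majority vote."""
    return [Counter(labels).most_common(1)[0][0] for labels in zip(*predictions)]

def accuracy(predicted, actual):
    """Fraction of samples whose predicted label matches the actual label."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)

# Toy illustration (made-up labels): three weak models, each wrong on a
# different pair of samples, so every sample still gets two correct votes.
truth   = ["thin", "keep", "thin", "keep", "thin", "keep"]
model_a = ["keep", "thin", "thin", "keep", "thin", "keep"]  # wrong on samples 0 and 1
model_b = ["thin", "keep", "keep", "thin", "thin", "keep"]  # wrong on samples 2 and 3
model_c = ["thin", "keep", "thin", "keep", "keep", "thin"]  # wrong on samples 4 and 5
ensemble = majority_vote([model_a, model_b, model_c])

if __name__ == "__main__":
    # 503 samples as in the abstract; m = 10 is an assumed value.
    for k, (train_idx, test_idx) in enumerate(m_fold_splits(503, 10)):
        print(f"fold {k}: train={len(train_idx)} test={len(test_idx)}")
    print("single-model accuracy:", accuracy(model_a, truth))
    print("ensemble accuracy:", accuracy(ensemble, truth))
```

In this toy case the errors of the three models never overlap, which is the most favourable situation for voting; with correlated errors the gain from combining models would be smaller.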