Enhancing the Decision Tree Algorithm to Improve Performance Across Various Datasets

Authors: Pandu Pratama Putra, M Khairul Anam, Sarjon Defit, Arda Yunianta
Language: English, Indonesian
Year of publication: 2024
Subject:
Source: Intensif: Jurnal Ilmiah Penelitian Teknologi dan Penerapan Sistem Informasi, Vol 8, Iss 2 (2024)
Document type: article
ISSN: 2580-409X, 2549-6824
DOI: 10.29407/intensif.v8i2.22280
Description: Background: The Village Fund is an initiative by the central government to promote equitable regional development. However, it has also led to corruption. Many Indonesians share their opinions on the Village Fund on social media platforms such as X (formerly Twitter), and news coverage is extensive on portals such as detik.com. Objective: This study aims to classify data from social media and news coverage to better understand the discussion around the Village Fund. Methods: The research improves the decision tree algorithm by integrating other algorithms and techniques, such as XGBoost and SMOTE. High accuracy is vital for the public credibility of machine learning classifications. The study uses two different datasets, which require different testing approaches. For the news portal dataset, a single test with seven labels is conducted, followed by enhancement with XGBoost. The X dataset undergoes two tests, with 1200 and 3078 entries, using three labels. Conclusion: The evaluation results indicate that the highest accuracy achieved on the news portal data was 82%, thanks to a combination of decision tree algorithms with various parameters and the balancing effect of SMOTE. For the X dataset with 3078 entries, the highest accuracy reached 95%, attributed to the application of ensemble techniques, particularly boosting.
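The abstract names SMOTE as the class-balancing step but does not describe it. As a rough illustration only, the sketch below shows the core idea of SMOTE in plain Python: new minority-class points are created by interpolating between an existing minority sample and one of its nearest minority neighbours. The function names (`smote`, `balance`), the toy data, and the labels "majority" and "minority" are hypothetical and are not taken from the article; the authors' actual pipeline, features, and parameters are not given in this record.

```python
import random
from collections import Counter


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points, each placed on the segment between a
    randomly chosen minority sample and one of its k nearest minority
    neighbours. Requires at least two minority samples."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Other minority samples, sorted by squared Euclidean distance to base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(a + gap * (b - a) for a, b in zip(base, neighbour))
        )
    return synthetic


def balance(X, y, k=5, seed=0):
    """Oversample every class up to the size of the largest class."""
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, count in counts.items():
        if count < target:
            minority = [x for x, lbl in zip(X, y) if lbl == label]
            new_points = smote(minority, target - count, k=k, seed=seed)
            X_out.extend(new_points)
            y_out.extend([label] * len(new_points))
    return X_out, y_out


# Toy, made-up data: six "majority" points and two "minority" points.
X = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (0.3, 0.3), (0.2, 0.4), (0.4, 0.2),
     (2.0, 2.0), (2.2, 2.1)]
y = ["majority"] * 6 + ["minority"] * 2

X_bal, y_bal = balance(X, y)
print(Counter(y_bal))
```

In practice one would more likely use an existing implementation, such as the `SMOTE` class in the imbalanced-learn package, and apply it to the training split only so that synthetic points do not leak into the evaluation data. The balanced data would then be passed to a decision tree or a boosted ensemble such as XGBoost, as the abstract describes.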
Database: Directory of Open Access Journals