Information Retrieval System to Find Articles and Clauses in UUD 1945 Using Vector Space Model Method

Autor: R. Subandi, M. J. Hakim, O. R. Sulaeman, R. Setiyawan, Windu Gata, E. Wahyudi, B. Pratama
Rok vydání: 2020
Předmět:
Zdroj: Journal of Physics: Conference Series. 1471:012017
ISSN: 1742-6596
1742-6588
DOI: 10.1088/1742-6596/1471/1/012017
Popis: This study aims to find articles and clauses from the 1945 Constitution (UUD 1945) using the Vector Space Model method that calculates the similarity of many documents. One document is represented by one clause from each article of the 1945 Constitution. The next step is pre-processing by deleting unnecessary words (stopwords) and changing it into basic words (stemmer) in the Indonesian language. Each document will be indexed to speed up query and simplify the weighting. Words weighting in documents is performed using the TF-IDF (Term Frequency-Inverse Document Frequency) algorithm by calculating the frequency of words in documents and all documents. The document search results will be presented in the ranking with the largest number of scoring appears at the top (descend sorting). The word search in this system more or less takes 90-100 milliseconds in 73 documents.
Databáze: OpenAIRE