The Movie Recommendation System using Content Based Filtering with TF-IDF¬¬-Vectorization and Levenshtein Distance

Autor: null Omkar Kunde, null Omkar Gaikwad, null Prathamesh Kelgandre, null Rohan Damodhar, null Prof. Mrs. M. M. Swami
Rok vydání: 2022
Zdroj: International Journal of Advanced Research in Science, Communication and Technology. :257-263
ISSN: 2581-9429
DOI: 10.48175/ijarsct-3648
Popis: In this busy life people like to do things to make their mind calm and watching movies is one of the thing but due to large data of a movie exist in the world it is very difficult for the user to select a movie. They have to spend a lot of time in searching and selecting movie. This procedure is time consuming and difficult. So recommendation system make the things easy. Recommendation engines are trained to produce fast and accurate suggestions to users. This paper describes a movie recommendation system using content based filtering and data is processed using Term-frequency Inverse document frequency technique (TF-IDF) for vectorization. Cosine similarity method is used for similarity measure. The system is presented to the user through a web-hosted user-interface which offers a system architecture by considering the initial problem usually faced by recommendation systems, namely the cold start problem. The problem of lack of user preferences data is trying to be overcome by utilizing movies data. The raw data is processed using the TF-IDF algorithm and Vector Space Model to generate a data model. Then levenshtien distance with cosine similarity will improve the performance of existing system. Advantages of the system include efficient recommendations, correct suggestions. Future enhancements include user profiling, documentations and data acquirement through web scraping.
Databáze: OpenAIRE