Datasets for phishing websites detection

Autor: Grega Vrbančič, Iztok Fister, Jr., Vili Podgorelec
Jazyk: angličtina
Rok vydání: 2020
Předmět:
Zdroj: Data in Brief, Vol 33, Iss , Pp 106438- (2020)
Druh dokumentu: article
ISSN: 2352-3409
DOI: 10.1016/j.dib.2020.106438
Popis: Phishing stands for a fraudulent process, where an attacker tries to obtain sensitive information from the victim. Usually, these kinds of attacks are done via emails, text messages, or websites. Phishing websites, which are nowadays in a considerable rise, have the same look as legitimate sites. However, their backend is designed to collect sensitive information that is inputted by the victim. Discovering and detecting phishing websites has recently also gained the machine learning community’s attention, which has built the models and performed classifications of phishing websites. This paper presents two dataset variations that consist of 58,645 and 88,647 websites labeled as legitimate or phishing and allow the researchers to train their classification models, build phishing detection systems, and mining association rules.
Databáze: Directory of Open Access Journals