Description: |
A disaster is an unforeseen event that can have a tremendous impact on human life and on the environment. Online sources generate huge numbers of news articles every day, and as that volume grows it has become difficult for users to find disaster-relevant news, so articles need to be extracted and classified to make them easy to access. This paper presents an automated system that scrapes news from various online sources and identifies disaster-relevant news. The articles are scraped with the Scrapy framework, and a model is trained using machine learning algorithms to classify each article as disaster or non-disaster. The system also uses a geoparsing model, built using Named Entity Recognition (NER), to identify the focus location of the extracted news articles. |