Parallel Frequent Pattern Mining on Natural Language-Based Social Media Data
Autor: | Sri Khetwat Saritha, Shubhangi Chaturvedi |
---|---|
Rok vydání: | 2018 |
Předmět: |
Association rule learning
Process (engineering) Computer science 020206 networking & telecommunications 02 engineering and technology Data mining algorithm Newspaper World Wide Web 020204 information systems Scalability Spark (mathematics) 0202 electrical engineering electronic engineering information engineering Social media Natural language |
Zdroj: | Advances in Intelligent Systems and Computing ISBN: 9789811315008 |
Popis: | Social media data on Web sites such as Twitter, Facebook, LinkedIn, YouTube, and Instagram is increasing tremendously because of their significant number of users. Newspaper, radio, television provide one-way communication, whereas social media provides many-to-many communication. Thus, analysis of social media data can produce many hidden information. Frequent patterns in social media can generate hidden information that can be useful. In this paper, we discussed how parallel frequent pattern mining algorithm is useful in finding patterns of natural language-based social media data. We present a process to retrieve frequent patterns (or rules) from a social media using thresholds of support and confidence. The parallel computing is achieved with the help of a scalable Apache Spark program. The retrieved patterns can be useful in making decisions related to social media. |
Databáze: | OpenAIRE |
Externí odkaz: |