A Study of Changes in College English Online Phrases Supported by Clustering Algorithm

Autor: Chen Rulin, Lin Ling, Huang Xing
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Applied Mathematics and Nonlinear Sciences, Vol 9, Iss 1 (2024)
Druh dokumentu: article
ISSN: 2444-8656
DOI: 10.2478/amns-2024-2513
Popis: The hyperlinks on the Sina Weibo platform are tracked by a web crawler so as to obtain the textual resources of university English online phrases on the platform. To process microblog text data, this paper proposes the use of multiple linguistic expressions of translation machines to enhance the text feature representation and apply it to clustering to enhance the clustering results. Based on commonly used algorithms such as K-means, DBSCAN, and hierarchical clustering, we analyze the clustering effect of each algorithm according to internal and external evaluation indexes and finally determine the clustering algorithms to be used for the change of university English network terms. Using the English network word “hand” as an example, this paper observes a fluctuating trend in the heat of this word from January to June 2023, with peaks in heat generation in January and February, respectively, and a peak heat of about 10,500 in February. In terms of spatial variation, the heat level of the English word “hand” is high in the east and south regions and low in the west and north regions, but in February, due to the return of the population to their hometowns for the Lunar New Year, the north region exceeds the east and south regions with a heat level of 11086.77.
Databáze: Directory of Open Access Journals