Výsledky vyhledávání - "Focused crawler"

Akademický článek

A Semantic and Optimized Focused Crawler Based on Semantic Graph and Genetic Algorithm

Autor: Wenhao Huang, Xiaoyan Li, Xiao Zhou, Deyu Qi, Jianqing Xi, Wenjun Liu, Feiyu Zhao

Publikováno v: Symmetry, Vol 16, Iss 11, p 1439 (2024)

A focused crawler automatically retrieves, organizes, and extracts specific topic-related information from the internet for analysis and application. Currently, most focused crawlers assess the relevance of web pages to a given topic through methods

Externí odkaz: https://doaj.org/article/3e467fdbeb244a428762a6d8efcba329

Zobrazit plný text záznamu

Akademický článek

Applying particle swarm optimization-based dynamic adaptive hyperlink evaluation to focused crawler for meteorological disasters

Autor: Jingfa Liu, Zhihe Yang, Xueming Yan, Duanbing Chen

Publikováno v: Complex & Intelligent Systems, Vol 10, Iss 1, Pp 233-255 (2023)

Abstract Traditional semantic-based focused crawlers calculate the topical priority of hyperlink by linearly integrating topical similarity evaluation metrics and empirical weights. However, the manually pre-determined weights may introduce bias in e

Externí odkaz: https://doaj.org/article/831d7b9209234feb86d56cb3a488e8e0

Zobrazit plný text záznamu

Akademický článek

WebCollectives: A light regular expression based web content extractor in Java

Autor: Hayri Volkan Agun

Publikováno v: SoftwareX, Vol 24, Iss , Pp 101569- (2023)

Conventional web crawling methods typically involve a sequence of distinct steps for downloading and extracting web content. A noteworthy limitation of these conventional crawling approaches is their lack of a focus-based crawling strategy. The softw

Externí odkaz: https://doaj.org/article/148e0ad744fe452fb44a6ea9a400fd17

Zobrazit plný text záznamu

Akademický článek

A focused crawler based on semantic disambiguation vector space model

Autor: Wenjun Liu, Yu He, Jing Wu, Yajun Du, Xing Liu, Tiejun Xi, Zurui Gan, Pengjun Jiang, Xiaoping Huang

Publikováno v: Complex & Intelligent Systems, Vol 9, Iss 1, Pp 345-366 (2022)

Abstract The focused crawler grabs continuously web pages related to the given topic according to priorities of unvisited hyperlinks. In many previous studies, the focused crawlers predict priorities of unvisited hyperlinks based on the text similari

Externí odkaz: https://doaj.org/article/bd3c078c9735449babd77852b93f0dfb

Zobrazit plný text záznamu

Akademický článek

An Enhanced Semantic Focused Web Crawler Based on Hybrid String Matching Algorithm

Autor: Sakunthala Prabha K. S., Mahesh C., Raja S. P.

Publikováno v: Cybernetics and Information Technologies, Vol 21, Iss 2, Pp 105-120 (2021)

Topic precise crawler is a special purpose web crawler, which downloads appropriate web pages analogous to a particular topic by measuring cosine similarity or semantic similarity score. The cosine based similarity measure displays inaccurate relevan

Externí odkaz: https://doaj.org/article/812fa957c2ab40c385e351924731f2be

Zobrazit plný text záznamu

Akademický článek

A Critique Empirical Evaluation of Relevance Computation for Focused Web Crawlers

Autor: Joe Dhanith Pal Nesamony Rose Mary, Surendiran Balasubramanian, Raja Soosaimarian Peter Raj

Publikováno v: Brazilian Archives of Biology and Technology, Vol 64 (2022)

Abstract Analogous to the spectacular growth of information-superhighway, The Internet, demands for coherent and economical crawling methods are translucent to shoot up. Consequently, many innovative techniques have been put forth for efficient crawl

Externí odkaz: https://doaj.org/article/fb5f10adf0124a8e9af96412aa4fb35f

Zobrazit plný text záznamu

Akademický článek

An Enhanced Focused Web Crawler for Biomedical Topics Using Attention Enhanced Siamese Long Short Term Memory Networks

Autor: Joe Dhanith Pal Nesamony Rose Mary, Surendiran Balasubramanian, Raja Soosaimarian Peter Raj

Publikováno v: Brazilian Archives of Biology and Technology, Vol 64 (2022)

Abstract The Internet is chosen to be one among the primary source of biomedical information. To retrieve necessary biomedical information, the search engine needs an efficient, focused crawler mechanism. But the area of research concerned with the f

Externí odkaz: https://doaj.org/article/554234a305d24a35a15376787df9cf76

Zobrazit plný text záznamu

Akademický článek

A Focused Event Crawler with Temporal Intent

Autor: Hao Wu, Dongyang Hou

Publikováno v: Applied Sciences, Vol 13, Iss 7, p 4149 (2023)

Temporal intent is an important component of events. It plays an important role in collecting them from the web with focused crawlers. However, traditionally focused crawlers usually only consider factors such as topic keywords, web page content, and

Externí odkaz: https://doaj.org/article/1fe93b8c2b664d37be4acdf65a21164d

Zobrazit plný text záznamu

Akademický článek

Optimized Focused Web Crawler with Natural Language Processing Based Relevance Measure in Bioinformatics Web Sources

Autor: Mani Sekhar S. R., Siddesh G. M., Manvi Sunilkumar S., Srinivasa K. G.

Publikováno v: Cybernetics and Information Technologies, Vol 19, Iss 2, Pp 146-158 (2019)

In the fast growing of digital technologies, crawlers and search engines face unpredictable challenges. Focused web-crawlers are essential for mining the boundless data available on the internet. Web-Crawlers face indeterminate latency problem due to

Externí odkaz: https://doaj.org/article/7f0b65f286824a2bbd5e217da0afd593

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání