Zobrazeno 1 - 10
of 855
pro vyhledávání: '"Focused crawler"'
Publikováno v:
Symmetry, Vol 16, Iss 11, p 1439 (2024)
A focused crawler automatically retrieves, organizes, and extracts specific topic-related information from the internet for analysis and application. Currently, most focused crawlers assess the relevance of web pages to a given topic through methods
Externí odkaz:
https://doaj.org/article/3e467fdbeb244a428762a6d8efcba329
Publikováno v:
Complex & Intelligent Systems, Vol 10, Iss 1, Pp 233-255 (2023)
Abstract Traditional semantic-based focused crawlers calculate the topical priority of hyperlink by linearly integrating topical similarity evaluation metrics and empirical weights. However, the manually pre-determined weights may introduce bias in e
Externí odkaz:
https://doaj.org/article/831d7b9209234feb86d56cb3a488e8e0
Autor:
Hayri Volkan Agun
Publikováno v:
SoftwareX, Vol 24, Iss , Pp 101569- (2023)
Conventional web crawling methods typically involve a sequence of distinct steps for downloading and extracting web content. A noteworthy limitation of these conventional crawling approaches is their lack of a focus-based crawling strategy. The softw
Externí odkaz:
https://doaj.org/article/148e0ad744fe452fb44a6ea9a400fd17
Autor:
Wenjun Liu, Yu He, Jing Wu, Yajun Du, Xing Liu, Tiejun Xi, Zurui Gan, Pengjun Jiang, Xiaoping Huang
Publikováno v:
Complex & Intelligent Systems, Vol 9, Iss 1, Pp 345-366 (2022)
Abstract The focused crawler grabs continuously web pages related to the given topic according to priorities of unvisited hyperlinks. In many previous studies, the focused crawlers predict priorities of unvisited hyperlinks based on the text similari
Externí odkaz:
https://doaj.org/article/bd3c078c9735449babd77852b93f0dfb
Publikováno v:
Cybernetics and Information Technologies, Vol 21, Iss 2, Pp 105-120 (2021)
Topic precise crawler is a special purpose web crawler, which downloads appropriate web pages analogous to a particular topic by measuring cosine similarity or semantic similarity score. The cosine based similarity measure displays inaccurate relevan
Externí odkaz:
https://doaj.org/article/812fa957c2ab40c385e351924731f2be
Publikováno v:
Brazilian Archives of Biology and Technology, Vol 64 (2022)
Abstract Analogous to the spectacular growth of information-superhighway, The Internet, demands for coherent and economical crawling methods are translucent to shoot up. Consequently, many innovative techniques have been put forth for efficient crawl
Externí odkaz:
https://doaj.org/article/fb5f10adf0124a8e9af96412aa4fb35f
Publikováno v:
Brazilian Archives of Biology and Technology, Vol 64 (2022)
Abstract The Internet is chosen to be one among the primary source of biomedical information. To retrieve necessary biomedical information, the search engine needs an efficient, focused crawler mechanism. But the area of research concerned with the f
Externí odkaz:
https://doaj.org/article/554234a305d24a35a15376787df9cf76
Autor:
Hao Wu, Dongyang Hou
Publikováno v:
Applied Sciences, Vol 13, Iss 7, p 4149 (2023)
Temporal intent is an important component of events. It plays an important role in collecting them from the web with focused crawlers. However, traditionally focused crawlers usually only consider factors such as topic keywords, web page content, and
Externí odkaz:
https://doaj.org/article/1fe93b8c2b664d37be4acdf65a21164d
Publikováno v:
Cybernetics and Information Technologies, Vol 19, Iss 2, Pp 146-158 (2019)
In the fast growing of digital technologies, crawlers and search engines face unpredictable challenges. Focused web-crawlers are essential for mining the boundless data available on the internet. Web-Crawlers face indeterminate latency problem due to
Externí odkaz:
https://doaj.org/article/7f0b65f286824a2bbd5e217da0afd593
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.