A method for focused crawling using combination of link structure and content similarity

Jamali, M ; Sharif University of Technology | 2006

  1. Type of Document: Article
  2. DOI: 10.1109/WI.2006.19
  3. Publisher: Institute of Electrical and Electronics Engineers Inc., 2006
  4. Abstract:
  5. The rapid growth of the world-wide web poses unprecedented scaling challenges for general-purpose crawlers and search engines. A focused crawler aims to selectively seek out pages that are relevant to a pre-defined set of topics. Besides specifying topics by keywords, it is customary to also use exemplary documents to compute the similarity of a given web document to the topic. In this paper we introduce a new hybrid focused crawler, which uses the link structure of documents as well as the similarity of pages to the topic to crawl the web. © 2006 IEEE
  6. Keywords:
  7. Search engines ; Technical writing ; Content similarity ; Focused crawler ; Focused crawling ; Link structure ; Rapid growth ; Web document ; Web crawler
  8. Source: 2006 IEEE/WIC/ACM International Conference on Web Intelligence, WI'06, Hong Kong, 18 December 2006 through 22 December 2006 ; 2006 , Pages 753-756 ; 0769527477 (ISBN); 9780769527475 (ISBN)
  9. URL: https://ieeexplore.ieee.org/document/4061466
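
Illustrative sketch (not taken from the paper): the abstract describes a crawler that ranks pages by combining link structure with content similarity to exemplary documents. The Python below shows one generic way such a combination could drive a best-first crawl frontier. It is not the authors' algorithm. The scoring formula, the weight `alpha`, the function names, and the toy in-memory web (`PAGES`, `LINKS`, `EXEMPLARS`) are all assumptions made for illustration.

```python
import heapq
import math
from collections import Counter


def term_vector(text):
    """Bag-of-words term frequencies (lowercased, whitespace-tokenised)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def focused_crawl(pages, links, seeds, exemplars, alpha=0.7, limit=10):
    """Best-first crawl over a toy web; returns URLs in visiting order.

    pages: url -> text, links: url -> list of outgoing urls.
    Assumed scoring (hypothetical, not from the paper): an unvisited URL is
    ranked by alpha * (best topic similarity among crawled pages linking to
    it) + (1 - alpha) * (a link score that grows with the number of crawled
    pages linking to it).
    """
    topic = term_vector(" ".join(exemplars))
    parent_sims = {}  # url -> topic similarities of crawled parents
    frontier = [(-1.0, url) for url in seeds]  # max-heap via negated scores
    heapq.heapify(frontier)
    seen, visited = set(), []
    while frontier and len(visited) < limit:
        _, url = heapq.heappop(frontier)
        if url in seen or url not in pages:
            continue  # stale heap entry or unknown page
        seen.add(url)
        visited.append(url)
        sim = cosine(term_vector(pages[url]), topic)
        for target in links.get(url, []):
            if target in seen:
                continue
            sims = parent_sims.setdefault(target, [])
            sims.append(sim)
            content_score = max(sims)
            link_score = 1.0 - 1.0 / (1 + len(sims))
            score = alpha * content_score + (1 - alpha) * link_score
            heapq.heappush(frontier, (-score, target))
    return visited


# Toy data, invented for this example only.
PAGES = {
    "seed": "web crawler search engine index",
    "a": "focused crawler topic relevance web",
    "b": "cooking recipes pasta sauce",
    "c": "crawler link structure web graph",
    "d": "football scores league",
}
LINKS = {"seed": ["a", "b"], "a": ["c"], "b": ["d"]}
EXEMPLARS = ["focused web crawler", "web crawler link structure"]

order = focused_crawl(PAGES, LINKS, ["seed"], EXEMPLARS, limit=3)
print(order)
```

In this toy run the crawler follows the link from the on-topic page `a` to `c` before it fetches the off-topic page `b`, even though `b` was discovered earlier. A real crawler would fetch pages over HTTP and would likely use a stronger similarity measure such as TF-IDF weighting.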