Loading...
Search for: focused-crawling
0.011 seconds

    A method for focused crawling using combination of link structure and content similarity

    , Article 2006 IEEE/WIC/ACM International Conference on Web Intelligence, WI'06, Hong Kong, 18 December 2006 through 22 December 2006 ; 2006 , Pages 753-756 ; 0769527477 (ISBN); 9780769527475 (ISBN) Jamali, M ; Sayyadi, H ; Hariri, B. B ; Abolhassani, H ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2006
    Abstract
    The rapid growth of the world-wide web poses unprecedented scaling challenges for general-purpose crawlers and search engines. A focused crawler aims at selectively seek out pages that are relevant to a pre-defined set of topics. Besides specifying topics by some keywords, it is customary also to use some exemplary documents to compute the similarity of a given web document to the topic. In this paper we Introduce a new hybride focused crawler, which uses link structure of documents as well as similarity of pages to the topic to crawl the web © 2006 HEE