To search, Click below search items.

 

All Published Papers Search Service

Title

Self Adjusting Refresh Time Based Architecture for Incremental Web Crawler

Author

A.K. Sharma, Ashutosh Dixit

Citation

Vol. 8  No. 12  pp. 349-354

Abstract

Due to the deficiency in their refresh techniques [12], current crawlers add unnecessary traffic to the already overloaded Internet. Moreover there exist no certain ways to verify whether a document has been updated or not. In this paper, an efficient approach is being proposed for building an effective incremental web crawler [13]. It selectively updates its database and/ or local collection of web pages instead of periodically refreshing the collection in batch mode thereby improving the ¡°freshness¡± of the collection significantly and bringing new pages in more timely manner. It also detects web pages which frequently undergo up-dation and dynamically calculates the refresh time of the page for its next update.

Keywords

World Wide Web, Search engine, Incremental Crawler, Hypertext, Browser

URL

http://paper.ijcsns.org/07_book/200812/20081250.pdf