To search, Click below search items.


All Published Papers Search Service


Self Adjusting Refresh Time Based Architecture for Incremental Web Crawler


A.K. Sharma, Ashutosh Dixit


Vol. 8  No. 12  pp. 349-354


Due to the deficiency in their refresh techniques [12], current crawlers add unnecessary traffic to the already overloaded Internet. Moreover there exist no certain ways to verify whether a document has been updated or not. In this paper, an efficient approach is being proposed for building an effective incremental web crawler [13]. It selectively updates its database and/ or local collection of web pages instead of periodically refreshing the collection in batch mode thereby improving the “freshness” of the collection significantly and bringing new pages in more timely manner. It also detects web pages which frequently undergo up-dation and dynamically calculates the refresh time of the page for its next update.


World Wide Web, Search engine, Incremental Crawler, Hypertext, Browser