next up previous
Next: Theoretical Framework Up: Introduction Previous: Introduction

Contributions

This paper makes the following contributions:

The revisitation policies we propose are highly practical. They incur very little per-page space and time overhead. Furthermore, unlike some previously-proposed policies, ours do not rely on global optimization methods, making them suitable for use in a large-scale parallel crawler. Lastly, our policies automatically adapt to shifts in page change behavior.

Our revisitation policies are based on an underlying theory of optimal page revisitation, presented next.



Chris Olston 2008-02-15