Given that web servers are autonomous and pages can change arbitrarily at any time, it is important to mitigate the risk associated with waiting a long time between refreshes. Our online revisitation policies (Sections 4.2 and 4.3) aim to refresh a page whenever the estimated utility penalty of not doing so exceeds , in a best-effort manner. We wish also to guarantee that in the worst case, the utility penalty incurred without performing a refresh is at most , where is a risk control parameter.
Let denote the maximum divergence value allowed under the chosen page divergence metric. ( under our fragment staleness metric.) The maximum loss in utility incurred during time units is . To cap the utility loss between refreshes at , we restrict the refresh period to remain less than or equal to at all times.