WWW 2008, April 21-25, 2008, Beijing, China.
2008
978-1-60558-085-2/08/04
3
<688>>
Categories and Subject Descriptors:
G.1.6 [Mathematics of Computing]: Mathematical Software - Algorithm Design and Analysis
General Terms: Algorithms, Theory, Economics
Keywords: Sponsored search auction, keyword bidding, multiple-choice knapsack problem, stochastic optimization
Keyword Bidding Models. For simplicity, assume that the default advertiser has a budget over a fixed time horizon, discretized into time periods . He is interested in a single keyword with expected value-per-click .
There are bidders at time for this keyword and their bids are sorted in decreasing order . There are ad slots, and are assigned to the top- bids as follows: bidder gets slot ; for each user click on his ad, bidder is charged a price , if or a minimum fee (usually 10¢). Each slot has a click-through-rate (CTR), denoted , which is defined as the total number of clicks on an ad divided by the total number of impressions (displays). Therefore the default advertiser can obtain slot by bidding slightly over ; for each user click, he incurs a cost of , obtains an expected revenue and profit .
Online Knapsack Problems.
Let us start with the relatively simple single-slot
case. At time , is the maximum bid among bidders to , and is the number of clicks at period .
Winning at time costs the advertiser and earns him a profit of where
Our Assumptions. In general, no online algorithm can achieve any non-trivial
competitive ratio (the ratio between the output of the given
algorithm and the offline optimum) for Online-KP [4]. Fortunately, in our setting, we make two reasonable
assumptions on the knapsack items, which allow us to develop
interesting online algorithms. These two assumptions are:
<753>>
In Section 3, we design both determinstic and randomized algorithms for the online knapsack problem, while both algorithms have competitive ratio .We also show a matching lower bound. Therefore our algorithm is provably optimal in the worst-case sense.
In Section 3.1, we translate the online knapsack algorithm into bidding strategies for the single-slot auction, for both profit and revenue maximization. In Section 4, we give a -competitive online algorithm for the multiple-choice knapsack problem (MCKP)We translate the algorithm for Online-MCKP to bidding strategies for the multiple-slot case, and obtain both profit-maximizing and revenue-maximizing bidding strategies.
Our experimental work (restricted by limited evaluation data), reported in Section 5, suggests that two heuristics, sniping and parameter tuning, help to improve the performance of our bidding algorithms. With both sniping and parameter tuning enabled, our bidding algorithms (for both profit and revenue maximization) achieve an output value which is consistently more than 90% of the optimum by the omniscient bidder.
Keyword Bidding. Sponsored search auctions have attracted a lot of attention, for both auctioneer revenue maximization and advertiser bidding optimization. Among all these work, Mehta etc al. [5] studied the auctioneer revenue maximization with budget-constrained bidders, using a trade-off function (compare it to our threshold function) to grant queries to bidders, and the technique they use is probably most similar to the threshold function we use.
Online Algorithms. Awerbuch et al. [2] studied the online call routing which generalizes the online classical knapsack problem. More recently, Buchbinder et al. [3] designed online algorithms for fractional versions of general packing problems which imply an -competitive algorithm for the online knapsack problem.
Randomized Algorithm: Let be the continuous distribution from to , with the following density function: , for , and for , where Algorithm ONLINE-KP-RANDOMIZED simply picks a threshhold from the distribution ; at each time, it picks item iff its efficiency is at least and its budget allows.
Deterministic Algorithm: Our deterministic algorithm
for Online-KP works against all adversaries..
A matching lower bound:
we can translate the algorithms ONLINE-KP-THRESHOLD to bidding strategies for single-slot keyword auctions for both profit and revenue maximization, based on Eq. (1). Details are omitted due to space constraints.
The Online-MCKP. The Online-MCKP is a generalization of the Online-KP. At each time period, at most one item can be selected from the item-set . The goal again is to maximize the total value of items selected.
The Algorithm Online-MCKP-Threshold is a generalization of Alg
Online-KP-Threshold, and it works as follows:
1
Let and be the same as before.
At time , let
.
If
, pick element with maximum .
Multiple-Slot Bidding. Next we translate Alg. ONLINE-MCKP-THRESHOLD to bidding strategies for both profit and revenue maximization. It turns out the revenue maximization strategy is the same as the single-slot case, thus omitted. The profit-maximization bidding strategy is below:
In this section, we evaluate our bidding algorithms using both synthetic and real-world data, and discuss two useful heuristics: sniping and parameter tuning.
The Sniping Heuristic. Simulation of the bidding algorithm described above with synthetic dataset shows that the bidding algorithm is too conservative, and leaves a significant fraction of budget unspent. Thus a potential performance improvement is sniping towards the end of the auction. If the bidder has knowledge (reliable estimates) about the click traffic () and the click-through rates, then the bidding strategies can be modified to increase its bid appropriately. Theoretically, we can prove that sniping strictly improve the performance of the algorithm. For details, see the technical report [6].
Parameter Tunning. If the lower bound in the online knapsack problem is too small, we can replace it with a larger value for the threshold function . This essentially discards items with very low efficiency, and the loss is minimal if the optimal solution consists of items with relatively high efficiency . It turns out tuning the parameter makes a significant performance improvement.
Evaluation using Real Bidding Data. Next we report some experimental results on evaluating bidding algorithms for multiple-slot auctions using real bidding data. We scraped bidding data from the now defunct Overture webpage [1] with continous crawling for about two weeks, for one of the most dynamic and expensive keyword ``auto insurance.'' There are totally distinct time periods in our collected data, and most top-5 bids are larger than $10. For the experiments, we use , and three different values . We evaluated both the profit-maximizing and revenue-maximizing strategies with and without sniping. For all these experiments, we use for profit maximization and for revenue maximization, and .
Since results are very similar for different parameter values, we
summarize them in Table 5. For all the examples we
run, sniping improves the bidding performance significantly while
exhausting the budget.
Table 5 seems to tell us, for almost all values,
with parameter tuning of , the performance ratio (
) is around
70%-75% without sniping, and 90%-95% with sniping.
Profit-Maximization Bidding Performance | ||||||
V | OPT | ALG | ALG/ | budget | ALG | ALG/ |
OPT | left | (sniping) | OPT | |||
8 | 3779 | 2751 | 73% | 225.5 | 3541 | 94% |
10 | 4974 | 4059 | 82% | 116.1 | 4607 | 93% |
12 | 6169 | 4463 | 72% | 240.8 | 5842 | 95% |
We use worst-case competitive analysis, comparing our bidding strategy with the omniscient bidder who know everything in advance. In practice, other bidders do not behave in the worst-case but bid according to their own strategies. It would be interesting if one could attain bidding algorithms with better performance with the capabilities to learn other agents' bidding strategies or bidding price distributions. Incorporating previous work on stochastic knapsack problems together with average-case analysis (e.g. Lueker []) might be an essential ingredient. In a companion paper [], we try to address these problems.
There is a small gap of 1 in the lower and upper bounds for the competitive ratio of the Online-MCKP. As an open problem, it will be nice to close the gap.
Acknowledgement: We thank the following people for helpful discussions and feedback: Gagan Aggarwal, Terence Kelly, Victor Naroditskiy, Paul O'Brien, David Pennock, Jim Saxe and Bob Tarjan. We also thank an anonymous referee for suggesting randomized threshold algorithms.