Professional Documents
Culture Documents
∗ †
Ye Chen Dmitry Pavlov John F. Canny
eBay Inc. Yandex Labs Computer Science Division
2145 Hamilton Ave 330 Primrose Road, Suite 306 University of California
San Jose, CA 95125 Burlingame, CA, 94010 Berkeley, CA 94720
yechen1@ebay.com dmitry-pavlov@yandex- jfc@cs.berkeley.edu
team.ru
29, 2006.
0.0 0.2 0.4 0.6 0.8 1.0
[9] J. Dean and S. Ghemawat. Mapreduce: Simplified
View recall
data processing on large clusters. Communications of
the ACM, 51(1):107–113, 2008.
Figure 3: ROC plots before and after latency re- [10] N. E. Gibbs, W. G. Poole, Jr., and P. K. Stockmeyer.
moval for the “Technology” category A comparison of several bandwidth and profile
reduction algorithms. ACM Transactions on
Mathematical Software (TOMS), 2(3):322–330, 1976.
[11] D. D. Lee and H. S. Seung. Algorithms for
6. DISCUSSION non-negative matrix factorization. Advances in Neural
Behavioral targeting is intrinsically a large-scale machine Information Processing Systems (NIPS), 13:556–562,
learning problem from the following perspectives. (1) To 2000.
fit a BT predictive model with low generalization error and [12] D. A. Spielman and S.-H. Teng. Smoothed analysis of
a desired level of statistical confidence requires massive be- algorithms: Why the simplex algorithm usually takes
havioral data, given the sparseness the problem. (2) The polynomial time. Journal of the ACM, 51(3), 2004.
dimensionality of feature space for the state-of-the-art BT
model is very high. The linear Poisson regression model uses
granular events (e.g., individual ad clicks and search queries)
as features, with a dimensionality ranging from several hun-
dred thousand to several million. (3) The number of BT
models to be built is large. There are over 450 BT-category
models for browser and login cookies need to be trained on
a regular basis. Furthermore, the solution to training BT
models has to be very efficient, because: (1) user interests