This action might not be possible to undo. Are you sure you want to continue?
ISM293 University of California, Santa Cruz Spring 2009 Instructors: Ram Akella, Andrei Broder, and Vanja Josifovski
Logistics of the course Introduction & Overview of Computational Advertising
Classic advertising Web advertising
Graphical ads Sponsored search Content match
The marketplace size The actors: Publishers, Advertisers, Users, & “Ad agency” Advertising as information Trends in Computational Advertising
Connection with recommender systems Mobile advertising Open markets
Prof. Ram Akella
Professor Information Systems and Technology Management at University Of California Santa Cruz Ph.D., EECS, Indian Institute of Science, Bangalore, India, 1976-1982 B.S., EE, Indian Institute of Technology, Madras, India, 1971-1976 http://www.soe.ucsc.edu/~akella/ firstname.lastname@example.org
yahoo. from University of Florida. Vanja Josifovski Principal Research Scientist at Yahoo! Research. Research Area: Computational Advertising Previously at IBM Research working on databases and enterprise search M. vanjaj@yahoo-inc. PhD from Linkopings University in Sweden.Sc.com/bouncer_user/88 4 .Instructor Dr.com http://research.
contextdriven information supply. M. Andrei Broder Fellow and Vice President for Search & Computational Advertising in Yahoo! Research.com/bouncer_user/17 5 . in Computer Science at Stanford University under Don Knuth.yahoo.Sc. B. and randomized algorithms. web search. Summa cum Laude from Technion. Research interests: computational advertising.D.com http://research. and Ph.Instructor Dr. Chief Scientist of Yahoo’s Advertising Technology Group. broder@yahoo-inc. Sc.
ucsc. please use the course group email to reach the instructors and TAs 6 .ucsc.edu Chunye Wang cwang@soe.Contacts Course Website: www.ucsc.edu/mailman/listinfo/ism293-group Lecture: 6-9:30 PM Wed.ucsc. by appointment only! Room: TBD For now.edu Course Group Email: Ism293-group@soe. Office Hours: 5~6 PM Wednesdays.ucsc.edu/classes/ism293/Spring09/ TAs: Bin An email@example.com Subscribing to ISM293-group email (enrolled students only) http://www.soe.
Sponsored Search 7. Marketplace Design 4. Machine Learning Techniques 5-6. Information Retrieval for Computational Advertising 3. Introduction and Overview 2. Behavioral Targeting 10.Lecture Content 1. or optional topics: Emerging Adverting Media or Evaluation of On-line Advertising system 7 . Overflow lecture: catch up. Content Match 8 Graphical Ads and Guaranteed Delivery 9.
5 hours In depth: 1 hour Discussion: 20~30 Minutes 8 .General lecture structure Overview: 1~1.
9 . TAs will lead the boot camps. We initially want to cover Linear Algebra.Boot camp Boot camp is twice per week. Ram Akella will lead some boot camps as well. and Elementary Statistics and Probability Generally. expand on some topics. in order to help students fill in gaps. and Wed. & understand the course better. Prof. 5:00 to 6:00 PM Mon. the Lemur toolkit.
Homework One homework per week Based on reading research paper. 10 . typically answer a question in the following style: Why does the algorithms in the paper work? Try to extend the idea in the paper from a different view of angle? How to modify it for a new scenario? Some homeworks will be conventional exercise style.
and so on (TBD) 11 . Indexing. Examples: Crawling Ads.Project There will be one course project divided into several mini projects.
Exam Two quizzes or one midterm (TBD) One final exam 12 .
Grading Homework 20% Exam 40% Project 40% 13 .
ucsc.Questions? We welcome suggestions about all aspects of the course! E-mail to Ism293firstname.lastname@example.org 14 .
Introduction to Computational Advertising 15 .
features. techniques. It does not necessarily reflect the views of Yahoo! Inc or any other entity Algorithms.Disclaimers This talk presents the opinions of the authors. etc mentioned here might or might not be in use by Yahoo! Or any other company 16 .
etc 17 .Computational advertising – main challenge Find the "best match" between a given user in a given context and a suitable advertisement. Examples Context = Web search results Sponsored search Context = Publisher page Content match. newspapers. banners Other contexts: mobile. video.
What is “Computational Advertising”? New scientific sub-discipline. at the intersection of Large scale search and text analysis Information retrieval Statistical modeling Machine learning Classification Optimization Microeconomics Recommender systems 18 .
Establishing a new discipline… 19 .
Establishing a new discipline… 20 .
2.Key messages 1. New application domains and new techniques are emerging every day ⇒ Good area for research + new businesses 21 . Computational advertising = A principled way to find the "best match" between a given user in a given context and a suitable advertisement. Advertising is a form of information. The financial scale for computational advertising is huge ⇒ Small constants matter ⇒ Expect plenty of further research 3. possibly contradictory utility functions 4. ⇒ Adding ads to a context is similar to the integration problem of other types of information ⇒ Finding the “best ad” is a type of information retrieval problem with multiple.
Classic Advertising 22 .
Brand advertising Goal: create a distinct favorable image 23 .
etc. now or soon 24 .Direct marketing Advertising that involves a "direct response”: buy. vote. donate. subscribe.
Japan .Long history….1806 25 USA.1890 .
Lots of computational this and that … Computational Biology Computational Chemistry Computational Finance Computational Geometry Computational Neuroscience Computational Physics Computational Mechanics Computational Economics … All are about mixing an old science with large scale computing capabilities 26 .
newspapers. TV.What’s computational about it? Classical: Relatively few venues – magazines. etc High cost per venue ($3Mil for a Super Bowl TV ad) No personalization possible Targeting by the wisdom of ad-people Hard to measure ROI Computational – almost the exact opposite: Billions of opportunities Billions of creatives Totally personalizable Tiny cost per opportunity Much more quantifiable 27 . handbills. billboards.
travel.a. etc.Revenue flow basics What do advertisers pay? CPM = cost per thousand impressions Typically used for graphical/banner ads (brand advertising) Could be paid in advance “Guaranteed delivery” CPC = cost per click Typically used for textual ads CPT/CPA = cost per transaction/action a. referral fees or affiliate fees Typically used for shopping (“buy from our sponsors”). … but now also used for textual ads (risk mitigation) What do publishers get? Whatever advertisers pay minus rev-share (revenue-share) paid to intermediaries What do intermediaries get? Whatever advertisers pay minus TAC (traffic acquisition costs) paid to publishers 28 .k.
US Online Advertising Spending 29 .
FH 2007) Source: IAB 30 .Still growing (15% FH 2008 vs.
US Online vs. Offline advertising spend 31 .
The advertising $$ budget vs. the human time budget 32 .
Computational Advertising Landscape 33 .
Graphical ads 34 Graphical ad .
More graphical ads 35 .
Ads directly driven by the content of a web page – “context match” (a.a. etc) 2.k. 36 . “contextual ads”. “paid search”.a.Textual ads Ads driven by search keywords – “sponsored search” (a.k. “context driven ads”. “keyword driven ads”. etc) Textual ads are heavily related to Search and IR 1.
Sponsored search: Text-based ads driven by a keyword search 37 .
More sponsored search: Text-based ads driven by a keyword search 38 .
Content match 39 .
& “Ad agency” Publishers Advertisers Ad agency Users 40 . Users.The actors: Publishers. Advertisers.
Google) Content match: Pub = AA (Yahoo! content) Pub = Adv (“House Ads”) 41 .Dual roles Publishers Advertisers Ad agency Users Sponsored search: Pub = AA (Yahoo!.
Advertising as information “I do not regard advertising as entertainment or an art form. 1974] Irrelevant ads are annoying. 1985] “Advertising as Information” [Nelson. etc are mostly ads and advertorials 42 . Skiing. but as a medium of information…. relevant ads are interesting Vogue.” [David Ogilvy.
Finding the “best ad” as an Information Retrieval (IR) problem 1. Build a search engine that produces quickly the ads that maximize the scoring function 2. environment. user. Devise a scoring function = predicates on q-features and d-features + weights 4. 43 . • Analyze the “query” and extract queryfeatures Query = full context (content. etc) Analyze the documents (= ads) and extract doc-features 3.
Behind the scenes… Setting the ad retrieval problem: Ads corpus = Bid phrase(s) + Title + Creative + URL + Landing Page + … Query features = Search Keywords + Outside Knowledge Expansion + Context features Context features (for sponsored search) = Location + User data + Previous searches + … Context features (for context match) = Location + User data + Page topic + Page keywords … Search problem similar to web search. but Ad database is smaller Ad database entries are “small pages” [+ URL] Ranking depends also on bids Ranking depends also on click-through-rate 44 .
U-QF. users Quality (utility) Factor (QF) is different A-QF. P-QF The ad agency has its own economic interest Might have different types of ads that are not easily compared Might have economics/contractual obligations that need to be fulfilled. advertisers.What is the best match? An ad has different utilities for publishers. 45 .
A bit deeper: how does the matching happens? 46 .
& L. Josifovski. Riedel] What is more important: the words or the context? Contextual ad matching based on a combination of semantic and syntactic features.A Semantic Approach to Contextual Advertising [SIGIR 2007] [AB. Classify both ads and pages into a 6000 nodes commercial taxonomy The class information captures the “about-ness” of pages and ads 47 . Fontoura. M. V.
My Atomic skis floated on the fresh powder… ….Matching ads and pages winter sports skiing snowboarding JOE’S SKIING BLOG …. pages 48 (imperfect) match Atomic boards 75% off!! .
term vector cosine 49 .Semantic and syntactic score Semantic component – weighted taxonomy distance Syntactic component .
Final score 50 .
Results 51 .
The crystal ball chapter: Trends in Computational Advertising A very biased sample 52 .
The Recommender Systems Connection 53 .
Long term loop: improve the ranking function ML based ranking 2. ads) pair Closest to recommender systems 54 .The ad matching problem as a recommendation challenge The traditional IR approach is based on a fixed query results correspondence For ads we Need CTR probability or user utility rather than top-K results Have a continuous click-through feedback Challenge: incorporate the feedback 1.Short term loop: use the statistics we have for a particular (query.
B. Agarwal.C.A generic view – dyadic interaction systems D.. Chen “Feature based factorization model for dyadic data” [In preparation] Dyadic Interaction data pervasive Recommendation systems (user-movie. Goal: predict response for unknown dyads Better match-making. user-book) Web advertising (match ads to webpage/query) Content Optimization (match articles to users) Unit of measurement : dyad (i.. webpage. user-music. click-rates. ads.…. prediction 55 .… Measure some response: ratings. genres. j= movies.… Often have meta-data on dyadic elements Demographics. j) i= user..
Diklic. Agarwal. D..Challenge: clicks are very rare Need to aggregate clicks/recommendations Pages belong to site section to sites Page ∈ CNN/sports ∈ CNN Ads belong to Campaign to Advertisers Ad ∈ Ford Focus ∈ Ford Similar to preference estimation for Annie Hall ∈ Woody Allen Comedies ∈ Comedies D. 2007] 56 . and M. [KDD. Sayyadian. A. Chakrabarti. V. J. B. Estimating rates of rare events at multiple resolutions. D.
Pandey.Challenge: how to find new ads? Similar to music recommender systems: need to explore new songs but need to keep some similarity Can take advantage of semantic taxonomy S. Chakrabarti. Josifovski. Agarwal. and V. 2007] 57 . Bandits for taxonomies: A model-based approach. D. D. [SDM.
How popular is mobile advertising?
(Limbo’s Mobile Advertising Report, US Users, 4th quarter 2007)
Specific features (and targeting dimensions) for mobile advertising
High precision demographics Location
Geo (GPS or tower triangulation) Location functionality: Shopping mall, at home, airport, etc Personal context: working, on vacation, etc
Alone, with colleagues, with family, with friends, travelling, …
Language Handset capabilities Bandwidth Operator Time of day Speed (walking vs. driving) Recent history Etc
or wireless network Direct purchase (tickets. … High res touch screen. take-out food. Bluetooth (two way). closer to web experience Lots of things we haven’t thought about … 61 . G1-Phone. etc) Prevalent high precision GPS I-Phone.What can change the game? Instant couponing One time use coupons sent via SMS/MMS to handset Handset as contactless credit/cash card Interaction with point-of-sale via optical readers (one way).
What can change the game? Inference from noisy data What can you infer from lots of people reporting their position every few seconds? Real time traffic … Opening hours … Night clubs going out of fashion … Very accurate life-style predictions Interesting gossip ☺ 62 .
Marketplace 63 .
60 AdSense Ad.45 bid Transparency and value 64 .75 via Network… Bids $0.Advertiser agencies: The Open Exchange Bids $0.com Bids $0.65—WINS! Has ad impression to sell -AUCTIONS … which becomes $0.50 Bids $0.
Lewis & D. etc … Advertisers Move towards performance advertising Fine granularity ROI computation. etc. Reiley “Measuring the Effects of Advertising on Sales via a Controlled Experiment on Yahoo!” … Advertising agencies Market becoming more efficient (arbitrage. As usual willing to trade privacy for functionality OpenSocial. so little time… Publishers & media Internet Protocol Television (IPTV) Variable-data printing (VDP) newspapers … Users Fluid legal landscape wrt regulation. exchange) Less is more – manage user attention Single user multi-media campaigns … 65 . privacy. Universal Profile. analytics See R.So many topics.
Summary 66 .
New application domains and new techniques are emerging every day ⇒ Good area for research + new businesses 67 . The financial scale for computational advertising is huge ⇒ Small constants matter ⇒ Expect plenty of further research 3. 2. Advertising is a form of information. possibly contradictory utility functions 4. ⇒ Adding ads to a context is similar to the integration problem of other types of information ⇒ Finding the “best ad” is a type of information retrieval problem with multiple.Key messages 1. Computational advertising = A principled way to find the "best match" between a given user in a given context and a suitable advertisement.
com http://research.com 68 68 .Thank you! email@example.com firstname.lastname@example.org.
This talk is Copyright 2009. Authors retain all rights. No publication or further distribution in full or in part permitted without explicit written permission 69 . including copyrights and distribution rights.
This action might not be possible to undo. Are you sure you want to continue?