Notice: This blog is no longer updated. You may find a broken link or two

You can follow my new adventures @mikeonwine


So you own a small site, you have about 5000 page views a day, and you start thinking to yourself, “Hey, it’d be awesome if I could make some money off of my great website.” So you start shopping around for advertisers and quickly get the sense that all small sites work primarily with google. Now why would this be? There are four key ways in which advertisers can monetize your inventory:

  1. Branded Display — e.g. Ad.com, Niche-networks
  2. Contextual — e.g. Google Adsense, Yahoo, Quigo
  3. Behavioral — e.g. Tribal Fusion, Tacoda
  4. Performance Display — e.g. Fastclick/Valueclick, CPA Empire

Before I delve into these into more detail, everyone has to understand something: advertising is all about reaching relevant eyeballs. Whatever the method used, the end goal is the same — if I’m selling sexy shoes, I want people to buy sexy shoes. I can go about this a couple ways but at the end of the day, I’m going to compare the amount of money I spent on advertising and the amount of profit I made from the sexy shoes that I sold. The smaller and the easier it is to purchase my good, the more likely I am going to be focused on ROI & performance. If I’m Netflix selling $4.99 subscriptions, it’s easy to track who signed up from where as I will have tens of thousands of people signing up per month. On the other hand, it’s difficult for Ford to track ad campaign performance… it’s hard to prove you bought a new Ford F150 because you saw an ad for it on automotive.com.

Ok… so eyeballs it is… lets look at the four methods above now:

Brand: If your site is good enough to be on the brand wagon you’re in luck. Branded advertisers have high standards on the inventory they will run on. The demand that it is relevant to their campaigns (e.g. Ford will want to run only on Auto sites) and they are willing to pay a premium to get there. Campaigns are generally CPM with little or no concern for performance. The problem here is that unless a site’s users are extremely desireable it’s hard for a site with low volume to get onto a branded site list. Ford wants to sell cars — to do this they want to find car buyers and associate the ‘Ford’ brand with their target audience. They might run campaigns for a new sexy sportscar on gaming sites and ads for more fuel efficient sedans on the New York Times travel section (who knows?).

Contextual: Thanks to Google this has become the big mama of online advertising. Contextual engines scrape your pages content and choose advertisers that match the content. Got a site about shoes, expect to see shoe advertisements. Now, going back to the eyeball idea, if I’m selling shoes, I want people interested in shoes. If you have a site that reviews the latest and sexiest shoes, then I will want my ad on there. On the other hand, if you have a personal blog and are complaining about how the shoe salesman smelled bad at the mall today, I probably don’t want my ad on your page. Contextual rocks on relevant page content and sucks on most Web 2.0 sites. When was the last time you saw a relevant text ad on Myspace.com?

Behavioral: Unlike Brand and Contextual, this is the first type of ad where we don’t care much at all about the page-content. We have an idea of what the users eyeballs are like and want to show him ads that he is going to enjoy the most. If you’re not famiiar with behavioral networks, read my post here. The biggest challenge here is for a site to get INTO the behavioral network. Even though for the purposes of serving one ad they don’t care what your site content is like, they will most surely want to use your site and your users to expand their reach & user information databases. Tribal Fusion, for example, requires 2k uniques per day, “professional site design” and a “highly active user-base”.

Performance Banners: Ok, so I saved my favorite (and perhaps worst for a small site) for last. Performance based advertising means that somebody is tracking clicks and/or conversions (actions) on their campaign and either only paying out on an actual “action” or adjusting bids on different sites according to the performance. Don’t be fooled by networks that pay you on a ‘CPM’, unless the ads they are showing are brand advertisements, somebody somewhere is tracking ROI & performance, and it’s impacting the CPM you’re getting paid. So why doesn’t this work for small sites you may ask? Well, long story short… small sites suck. Ok, it’s a bit of an exaggeration, small sites suck for performance would be a more accurate statement.

Lets go back to eyeballs. Your site has eyeballs, and I’m going to pay you everytime one of your eyeballs buys a pair of sexy shoes. Lets say the shoes cost $30, my production cost is $0.50 (yay for sweat shops!) so I’m willing to pay you $14.50 for every person from your site that buys one of my pairs of sexy shoes. Now there is one other advertiser in the network I’m working with he is selling sports gear (hats and stuff) and is willing to pay $10.00 to the publisher for every purchase.

Now, when a users comes to your site, which ad should the network show? Lets say your site is brand new to the network. Well, this network has no contextual technology to tell me what the site is about and he also doesn’t have a behavioral engine, so what can the network do? The only thing he can, test his campaigns on the site and learn what the performance is like.

Ok, for the next 5000 impressions (1k unique users?) the network randomly chooses between the sexy shoes and the sports paraphernalia. Now what… what do you think the chances are that somebody actually bought a pair of shoes or a hat? On the average site, I would imagine, right around 5-10%. Think about it, if the average CPM on your site is around ~$0.50, and this offer were to be competitive, you shouldn’t be expecting a single conversion until you’ve run at least 30,000 impressions. For better statistical accuracy on the performance of this campaign on one site, think more like 300,000 impressions. Starting to see why small sites suck? The amount of volume necessary to ‘optimize’ the average campaign is simply too much for the measly 5k impressions a day. Imagine if the network had 500 campaigns. How can learning be done on 500 campaigns with just 5000 impressions a day?

Of course there are various techniques to limit the amount of learning that needs to be done, but the fact remains that significant volume must be sent to an performance advertiser before one can get a good idea of the actual performance.

So, what should the small site guy do? Think about your eyeballs & page content. Is your content very relevant to the eyeballs? Then go contextual. Do you have particularly valuable eyeballs? Go for behavioral or brand. None of the above? Try performance based networks, but don’t expect good CPMs until you get high enough volume for some sort of learning to be done.

Next up? Not sure, probably some thoughts on learning techniques, merging behavioral and performance, and other ideas…

How do behavioral networks work?

February 28th, 2007

I really wanted to write a post about the difficulties in monetizing small websites but I realized to best have that discussion I would have to explain behavioral networks first! So, here goes.

A few months ago I was trying to explain to my girlfriend how advertising works. After a couple discussions she said a rather interesting thing — “How come the travel industry seems to be the biggest advertiser out there? All I see are travel ads, no matter where I go!”. Well, turns out my girlfriend is a travel fanatic and it’s behavioral networks that have picked up on the fact and no matter what the content of the site she’s visiting they manage to find a relevant travel related ad to show her. Try it, go to travel.nytimes.com and click through to a couple articles. Then go to a random section of the site, say ‘health’, and chances are you’ll see a travel ad! (see screenshot below) Sounds like voodoo or big brother right? Wrong!

NYT Travel Behavioral
Second behavioral ad on the new york times.

And peeking into my cookies, indeed there is some travel related fun in there (hard to tell what all means, since most cookies are encrypted):

NYTimes Cookie Data

There are many different behavioral players out there in the world each with varied success. Perhaps the most successful behavioral company on the internet is this small company based out of California called Yahoo. There are also a couple ad-networks that dabble in this space, one of the most popular is Tribal Fusion. Each approaches the challenges in a different way, but the core of how it works is the same for all.

So how does it work? It’s actually an amazingly simple concept. Lets say we have four websites:

  1. gizmodo.com
  2. engadget.com
  3. slashdot.org
  4. myspace.com

I am a user, and I visit each of these websites. Each of these websites also works with a brand new (not real of course) behavioral network called “BigBrother2.0″. On every page of these sites there is a little thing known as an ad tag that points to BigBrother’s adserver and requests an ad. Just for fun, lets assume that gizmodo also know your age & gender and passes that information to BigBrother. So you visit the first page, and an adrequest goes to BigBrother as follows:

http://ad.bigbrother.com/request?site=gizmodo.com&age=25&gender=m

When your browser requests the URL above, the following sequence of events happens:

  1. Browser Requests content from ad.bigbrother.com
  2. Server requests cookies (if any)
  3. Browser sends over cookies
  4. Server does some crunching, picks and ad
  5. Server returns new cookie data & url to get get ad
  6. Browser shows ad

Now the magic here is in the cookies. Even though you are visiting the site gizmodo.com, the cookie is under the domain ‘ad.bigbrother.com’. In step 5 where the server returns new cookie data, they’ll put in there when you last visited gizmodo.com and how many times you’ve been there today.

Now lets say I spend the morning checking out my tech-news, all the while receiving ads from ad.bigbrother.com. Each page I view gives them a little bit more information about me, and even though I’m on three separate sites, because the cookie spans all three they get a holistic view of what I”m doing.

So now I go to Myspace.com. This is a big site. Normally it’d be difficult to pick a good ad to show you. You’re one of the 18 billion myspace users and all you really do is upload photos and post silly comments about your friends on their blogs. BUT, when Myspace decides to let BigBrother2.0 show an ad something interesting happens. Lets walk through the steps of this new ad call in more detail:

http://ad.bigbrother.com/request?site=myspace.com
  1. Browser Requests content from ad.bigbrother.com
  2. Server requests cookies (if any)
  3. Browser sends over cookies
    1. Cookie contains your age & gender (25 & male) (encrypted)
    2. Cookie shows you visited gizmodo.com 12 times this morning
    3. Cookie shows you visited slashdot.org 3 times this morning
    4. Cookie shows you visited engadet.com once this morning
  4. Server does some crunching, picks and ad
    1. Server plugs user data (25, male, 16 visits to tech sites) into a ‘profile engine’
    2. Profile engine spits back ‘categories’, lets say ‘male, technology’
    3. Server looks for ad campaigns targeted to ‘male & technology’
    4. Server picks highest paying ‘male & technology’ targeted campaign
  5. Server returns new cookie data & url to get get ad
  6. Browser shows ad

Instead of showing you a random ‘punch-the-monkey’ ad, BigBrother is able to show you a highly relevant and targeted advertisement that you are far more likely to click on. The end result? Better ads for you and more money for the websites that you visit.

You can see from the above that building a behavioral adserver really isn’t the most difficult thing in the world. All you need to do is track history & have some sort of ‘mapping engine’ that says that slashdot.org is a tech site. The real challenge for behavioral companies is the classic chicken & egg problem. Without a rich user-base, the ad-network won’t be able to sell deals. Without deals the network won’t be able to convince publishers to put their tags on pages. This makes it very difficult for new behavioral players to enter the marketplace, although emerging marketplaces such as the RMX will make it easier in the future.

So what’s my opinion on all this? As long as the data that is being stored is encrypted and no Personally Identifiable Information (PII) is stored I’m all for behavioral targeting. As I mentioned before, the more relevant the ad is to my interests, the more pleasant my browsing experience is going to be.

So, time and time again you see people rant and rave about how your privacy is being seriously compromised by the use of cookies. I must say, if you just read around on google, advertiser cookies are one of the most misunderstood beasts out there. Nowadays, practically every spyware removal program flags advertiser cookies as ‘SPYWARE/ADWARE’. Try it, google up tribalfusion cookie, or yieldmanager cookie, etc. etc.

Take this SpywareNuker page:

Some of the Cookie.Advertising.com components are listed below. The list is compiled as a reference. The list might not be complete and it doesn’t represent instructions for manual removal. We DO NOT recommend manual removal. Incorrect removal of certain software might make your computer unstable or even unusable.
Removal of adware component might affect the related ad-supported software.

Sorry to say, but this is total bullshit. Ad.com’s cookie doesn’t infect a thing and can be removed easily and safely by yourself. So what are adserver cookies and what’s stored in them? The most basic information that most adserver will want to know is:

  • which ads you’ve seen, and how many times you’ve seen them
  • which ads you’ve clicked on
  • which ads you’ve converted on

Yes — that’s right, adservers know when you buy things after clicking on an ad — but that’s another post. Now more advanced behavioral companies will also want to track which sites you’ve been to and what behaviors (or segments) you belong to. Allowing adservers to track you using cookies means you won’t repeatedly see the same ad, and you should see more relevant ads to your interests as you continue your path along the internet.

So how do companies store this information? There are two ways — client-side or server-side. Client-side involves storing all the information about you in your cookie whereas server-side the cookie simply stores an ID and all information about you is stored on some database somewhere. Doubleclick as seen in the below screenshot seems to have gone the server-side route and simply stores my “id” which maps to some data they have on me in their databases. The server-side route is great if the advertiser wants to store a lot of data since there are size limiations on cookies and you don’t necessarily want to transfer 10kb back and forth on every ad call. It’s not so great as it requires some serious database infrastructure to handle 10-100k read/writes per second to a single database.

Doubleclick Cookie

Burst seems to have gone the client-side route and stores all ads I’ve seen from them directly in my cookie files. It’s actually interesting to note that the cookie data from Burst is not-encrypted (big nono!).

Burst Cookie Files

Ok, so what does this all mean for the end-user? Well, indeed, there are companies out there that are tracking most of everything that you do online. Ad companies like Tribal Fusion know what you’re interested in and what you like to browse for. Hell, if Google succeeds in pushing Checkout to the world they’ll know everything you’ve bought, how much you paid for it and much more! All in all this sounds scary but it really shouldn’t be. Why?

First off, none of the information _should_ be personally identifiable. Cookies are tied to your computer, not your name. So if you switch machines often, or clear your cookies, all history is immediately erased. Now, some argue that if your browsing history is stored it should be possible to figure out exactly who you are. Lets be honest, why go through all the trouble if there are much easier ways of figuring out what you’re doing online? If you’re worried, go download an anonymous browsing tool (there are plenty), or visit the little known-of ‘opt-out’ pages that almost all online advertising companies have. Some examples: Advertising.com Opt-out, Yieldmanager Opt-out, Doubleclick Opt-out. If you do go the opt-out route, don’t clear your cookies because that’s the only way they’ll know you don’t want them to track you!

So whichever route you take, don’t forget that the websites you visit can provide you with free content because of the money they receive from advertisers. Cookies are one tool that advertisers use to help track revenue and regularly clearing them can cost your favorite sites money. My personal choice? I clear my cookies every month or so but am perfectly happy to let companies track my behavior to show me more relevant ads. Hell, I’d much rather look at an ad for some new tech gadget than ‘punch the monkey’!