October 18th, 2009
One of my readers posted the following comment on my first post on RTB:
In your second diagram you show the interaction between the publisher adserver and multiple networks. Does this potentially multiple source back and forth not slow down the adserving in the same way a series of dumb redirects would? Especially when you consider that presumably if Network 1 came back with the best price out of 3 or four networks, once the publisher ad server knew that it would need to go back to it and request the actual ad again. It would be interesting to see some realistic HTTP traces for this stuff.
This is indeed a great question. Technically it looks like there are the same # of requests going back and forth in RTB versus a traditional ad-call. Although this is the case, RTB is going to be significantly faster… and here’s why.
Technically a browser downloading content from an adserver is a five step process:
* DNS lookup of the adserver domain name
* Establishment of a TCP connection
* Requesting content
* Acknowledge of request & sending back content
* Terminating the TCP connection
Assume for this case that a DNS lookup takes about 100ms. Each of these steps requires a number of packets to go from the local computer up to the adserver and a series of response packets. Here’s the # required for each step:
* TCP Connection: Two packets up, and one packet down (SYN, SYN-ACK, AKC)
* Requesting content: One packet up (minimum)
* Request acknowledgement and content: One packet down (minimum) & one packet up
* Terminating the connection: One packet
So the minimum number of packets sent back and forth is 7. If the latency from an end-user is 50ms to the adserver, this means it will take *at least* 450ms (100ms DNS + 350ms ad-request) to request the ad.
Now you’d think this would be the same for real-time, but it’s not! There are three reasons a request between two serving systems is much faster:
* Better connectivity — Adservers are hosted in datacenter that generally have much better internet connectivity than the average end-user. This means lower latency between the two adserving systems.
* No DNS lookup — The RTB system can cache DNS lookups for all RTB partners, effectively removing this 100ms.
* Persistent TCP connections — Any intelligent RTB integration would use persistent TCP sessions between the sell and buy side systems. This means a connection is established once and reused thousands of times after that.
With the above three, here’s how requesting a “bid” looks from sell to buy side:
* Requesting content: One packet
* Acknowledge of request & sending back content: One packet
So assume 25ms latency between systems (rather than 50) and the minimum time for an RTB request between systems is only 50ms compared to the 450ms it would take for an actual end-user or 9 times faster. The slower the end users connection and the faster RTB will be.
Conclusion — yes, adserving individual requests becomes a little bit slower but the removal of redirects makes the overall process signficantly faster.
For those technically curious, here’s are tcpdumps that prove this.
Browser to adserver:
15:45:04.380042 IP 10.0.1.31.59541 > 220.127.116.11.http: Flags [S], seq 50484529, win 65535, [...] 15:45:04.397395 IP 18.104.22.168.http > 10.0.1.31.59541: Flags [S.], seq 661028066, ack 50484530 [...] 15:45:04.397529 IP 10.0.1.31.59541 > 22.214.171.124.http: Flags [.], ack 1, win 65535, length 0 15:45:04.397831 IP 10.0.1.31.59541 > 126.96.36.199.http: Flags [P.], seq 1:1288, ack 1, win 65535, length 1287 15:45:04.424466 IP 188.8.131.52.http > 10.0.1.31.59541: Flags [.], seq 1:1461, ack 1288, win 62780, length 1460 15:45:04.424472 IP 184.108.40.206.http > 10.0.1.31.59541: Flags [P.], seq 1461:1543, ack 1288, win 62780, length 82 15:45:04.424546 IP 10.0.1.31.59541 > 220.127.116.11.http: Flags [.], ack 1543, win 65535, length 0
Adserver to adserver with persistent connections:
20:00:10.709152 IP 18.104.22.168.41096 > 22.214.171.124.80: . ack 1023 win 7154 20:00:10.754844 IP 126.96.36.199.80 > 188.8.131.52.41096: P 1023:2045(1022) ack 501 win 62780
- Redirects and Integration, Part I: Limitations
- Redirects and Integration, Part II: Hacking Around the Browser
- Architecting for immediate need or future flexibility
- Can’t we all just 302? Report on redirect timings
- Mashable/Google/Malvertising Follow-Up