Competition and Crowd-out for Brand Keywords in Sponsored Search

Competition and Crowd-out for Brand Keywords in
Sponsored Search∗
Andrey Simonov
University of Chicago (Booth)
Chris Nosko
University of Chicago (Booth)
Justin M. Rao
Microsoft Research
September 30, 2015
Abstract
On search keywords that include trademarked terms, the brand owner (focal brand)
and other relevant firms compete for consumers. For the focal brand, paid clicks have
a direct substitute in the organic link that appears directly below the advertisements.
This crowd-out can create a large wedge between the real causal effect and commonlyreported “nominal measures,” such as click-through rates. We quantify this wedge
using a large-scale, fully randomized experiment on Bing, which allows us to study
2,500 brands with a sufficient sample size. We find a positive, statistically significant
impact of brand ads, with a modest size, on the order of 1–4%. There is significant
heterogeneity, with more well-known brands showing a smaller causal effect. Competitors greatly impact the market. First, if the focal brand is not present in the top slot,
competitors “steal” 10-20% of clicks on average. Second, when the focal brand is in
the top slot, competitors steal only a modest fraction of clicks, 1-5%, but dramatically
shift the focal brand’s traffic from the organic link to the paid link, raising the focal
brand’s costs. Taken together, the results indicate the primary value of branded paid
links is defensive in nature.
∗
We thank Christian Perez, Matthew Goldman and Giorgos Zervas for helpful comments. Most of this
work was completed when Simonov was an intern at Microsoft Research. All opinions represent our own and
not those of our employers.
1
Introduction
In sponsored search advertising, brand keywords, queries that include a company’s trademarked name, are a substantial portion of advertising expenditure. At first glance this
practice seems sensible: Consumers learn about products through a variety of channels
and seek them out online using a search engine. Entering a specific brand signals product
awareness. Click-through rates (CTR) tend to be higher than similar non-brand keywords.
Further, since bidding on another firm’s trademarked term is legal, competing firms can step
in and siphon off traffic from the focal brand (as in theoretical model of Sayedi et al. (2014)).
The focal brand might want to bid aggressively to minimize “traffic stealing.” On the other
hand, the focal brand also occupies the first organic result, which is shown just below the
paid link(s), creating the possibility that the paid link simply crowds-out free clicks. Indeed
a recent paper, Blake, Nosko and Tadelis (2015), find almost complete crowd-out for a single, well-known brand, eBay. Using a controlled experiment, they document that when eBay
stopped bidding on its own keywords 99.5% of traffic was retained via the organic link.
We are thus left with a bit of a mystery. Advertising on brand keywords is substantial, yet
the presence of an almost complete substitute (the organic link) suggests that it is practically
worthless for the focal brand. Moreover, there may be no need to continue to advertise to
users who intentionally seek out a particular brand – these are folks that will find their way to
your product regardless of search advertising. Competing explanations of why firms continue
to bid on brand keywords have fundamentally different implications for the marketplace and
our understanding of firm behavior. The first class of explanations posits that advertisers
focus on “nominal measures” of ad performance, such as CTR, rather than the causal impact
on customer visits, which leads to an irrational interest in keywords that appear attractive
through this lens. The second class of explanations points to eBay as being unrepresentative
in some way. It is the second largest e-retailer with a very established brand and thus may
be immune to competitive effects among customers using “eBay” in a search query. Under
this explanation, other firms very well could be making rational choices.
In this paper we arbitrate between these explanations with a large-scale, fully randomized
experiment run on Bing. On both Bing and Google, the maximum allowable number of ads
placed above the organic search results is 4. In the experiment this cap was exogenously
reduced to 0, 1, 2 or 3. For example, if a query had a single ad that cleared the auction
1
reserve price, comparing conditions “Cap 0” (0 ads allowed) and “Cap 1” (only 1 ad allowed)
effectively replicates the eBay experiment. We have such experiments for tens of thousands
of brands, but focus our analysis on the top 2,500 brands, which account for 98.7% of brand
searches. The included firms have widely varying brand quality metrics.
We start by measuring the extent to which organic search results serve as substitutes
for paid search links when no competing ads are present (matching the setting of Blake et
al.). We find that the results in Blake et al. are a bit of an outlier. In our sample of firms,
sponsored links on branded search queries drives significant incremental traffic—total clicks
to a firm’s website increases by 2-3 percentage points on average. This effect is significantly
larger for lesser known brands, while the strongest brands in our sample show effects closest
to that of eBay. But while incremental traffic is relatively modest, “crowd-out” of free clicks
on the brand’s link in the top organic slot tends to be quite large.1 When we experimentally
introduce the brand ad about half the traffic, or 40 percentage points of CTR, is shifted
from the organic link to the paid ad. Since crowd-out exceeds the causal effect by more than
a factor of ten, the nominal “cost per click” (CPC), the standard pricing metric reported
by online advertising platforms, strongly diverges from the real “cost per incremental click”
(CPIC).
For the case where brands do not face competing ads we measure the wedge between
CPIC and CPC for a broad cross-section of firms. For firms that show a significant ad effect,
CPIC is on average 11 times larger than the CPC. For all firms, we establish a lower bound
ratio of 22.2 While these multipliers might seem too high to rationalize, we must keep in mind
that the focal brand always pays a much lower CPC than competitors because the pricing
rule of the generalized second price (GSP) auction rewards link relevancy and clickability. To
put our CPIC estimates in context we compare them to two quantities that should capture
the real costs of customer acquisition on the platform: 1) the CPC focal brands pay on
keywords when they do not occupy a high organic position 2) the CPC competitors pay on
brand keywords. CPIC estimates exceed these comparison measures in the vast majority of
cases.3
1
All the brands in our sample always occupied the top slot.
We cannot reliably compute CPIC at the firm level when the estimated causal effect is close to zero.
As such, we rely on the convexity of the underlying function and use Jensen’s equality to produce a lower
bound by estimating the function at the average value of the arguments.
3
We cannot observe the potential differences in the value of an ad click on different search queries or click
type, nor the overall value of a click (given bid shading in the GSP Gomes and Sweeney (2014)), so these
comparisons are by no means perfect. We note that more than 90% of firms have the same “landing page”
2
2
We have thus far focused on the case of focal brands not facing aggressively bidding rival
firms. If competitors are present—a situation commonly observed in our data—the focal
brand can choose to 1) either “defend” and bid aggressively to occupy the top slot or 2)
not bid and allow competitors to fill the top of the page, pushing the brand’s organic link
further down.4 In the first case, the focal brand’s ad in the top slot provides a largely effective
defense—competitors are able to steal only 1-5% of clicks, which depends on the number of
competitors present and brand attributes. It turns out that in this case competing ads have
a much larger impact on crowd-out rates. When facing no competitors (Cap 1 condition),
about 50-60% of clicks to the focal brand’s website go through the paid link and the remaining
half or so are free. When we randomly add in competitors, the fraction of paid clicks increases
by 10% for the first competitor added, 9% for the second and 5% for the third, reaching 84%
for a full slate. Thus increased competitors both lower the effectiveness (relatively small
effect) and raise the costs (relatively large effect) of the focal brand’s defensive position.
Now we look at the case where the focal brand does not advertise but competitors do.
These firms come in two varieties. The first, which we’ll call “reseller” types, are firms
that produce a product and sell it themselves, but allow other licensed retailers to sell it
too (e.g. laptop manufacturers). These firms typically get only about 30% of total clicks
on brand searches. The second variety is the exclusive seller, such as a travel website or
car insurance company. These firms typically get more than 80% of total clicks. Using
the experimental variation to forcibly remove competitors’ ads reveals that, in both cases,
competing advertisers get 20% of total clicks—far higher than the 1-5% we observed earlier
when the focal brand was present in the top slot. For reseller types, only about one-third
of this 20% comes from substitution from the focal brand’s links, with the rest coming from
clicks that would have gone to other firms present in the organic links. In contrast, for
exclusive types, nearly all the competing ad clicks come from the focal brand. Based on
observed click metrics, exclusive types are a much better match to the brands that choose
to advertise, indicating that competitors pose a real threat to traffic acquisition if the brand
is not present.
Our results indicate that the returns to advertising for the focal brand hinge on presence of
competitors. In the absence of competitors, ads have a positive and statistically significant
(where a user goes after clicking on both.)
4
It is exceedingly rare that a brand bids and does not occupy the top slot. Conditional on bidding, they
bid to win.
3
effect, but associated crowd-out produces CPIC/CPC ratios (which give a multiplier on
nominal costs to get real costs) that are suspiciously high: 11 for firms with a significant ad
effect and 22 overall. Computing CPIC when both the brand and competitors are present is
more difficult because it requires us to estimate a counterfactual we cannot directly observe at
the firm level. Nonetheless, using average estimates from other firms indicates that exclusive
type firms would lose a substantial fraction of traffic—roughly 25% of total clicks—if they
discontinued their ad, while the impact on reseller type firms is more muted.5 This translates
to CPIC/CPC ratios of 3–5 for exclusive types, with the exact figure depending on the
number of competitors present and brand attributes. Given the relatively low CPC paid by
focal brands, these multipliers generally place real costs below our comparison measures and
thus are rationalizable.
How does firm behavior align with our findings? We saw that without the defensive value
induced by competition, the real costs of brand ad clicks are quite high and the overall returns
are, at the very least, questionable. In this case, a slight majority, 54%, of firms choose not
to advertise and only a small fraction of the firms that do advertise show ad effects and
crowd-out rates that rationalize their choices. The remaining firms appear to excessively
focus on average rather than marginal costs, as has been previously been documented in
other contexts (de Bartolome, 1995; Ito, 2014), or suffer from an incentives failure of some
sort. When competitors are present, the real-to-nominal cost ratios are generally easy to
rationalize by the data, and here we observe that the vast majority exclusive-type brands
choose to advertise. Finally, advertising on a rival’s keywords produces a double-dividend: it
necessitates a relatively costly defense, which increases with the number of competitors, and
generates a modest amount of stolen clicks. Consistent with these observations, we observe
the practice is widespread. Taken together, the majority of firms exhibit behavior that is
broadly consistent with our findings, but a non-negligible fraction make choices that do not
appear to be supported by the data.
5
The returns to these firms is more complex, since “competing firms” are advertising the product the
focal brand produce, although they almost always sell other brands on their website.
4
2
Background and Context
We define brand keywords as queries that consist of a trademarked term and where the
trademark holder occupies the top organic slot. Competitors using a trademarked term to
guide their bidding is a contentious practice as the focal brand dislikes the fact that their
competitors can target a user that has expressed a particular interest in them. Indeed the
brand firm may use other forms of advertising to generate searches for their products, as
shown in Lewis and Nguyen (2014), and not want this traffic prone to competition. Although
there have been trademark infringement lawsuits on the matter courts have consistently
upheld the practice. The use of trademarked terms within a competitors ad is not allowed6 ,
an exception was granted in 2009 to licensed resellers, since a reasonable consumer would
not be confused by the practice (the good being sold be the reseller is genuine). Chiou and
Tucker (2012) study this change and find that this change did not damage the focal brand
because it made the competing resellers less distinct.
In empirical studies of brand search, early experimental evidence comes from Reiley
et al. (2010), which showed that organic links and ads are substitutes for each other. This
substitution pattern is overwhelmingly present in our experimental data as well, and has also
been found in structural work (Jeziorski and Segal, 2014).7 Reiley et al. (2010) further show
that more ads can increase total CTR for the ad placed in the top slot because organic links
act as slightly better substitutes for ads in their study. As discussed in the introduction,
Blake et al. (2015) ran a large experiment with eBay and the results of the experiments
led the firm to discontinue brand search ads. In the experimental period eBay did not face
competing ads.
The final relevant strand of literature involves user behavior in selecting options on the
search results page and how this relates to bidding incentives. The first set of papers in this
literature help explain why the position of a competitor’s ad on the page causally influences
a user. Granka et al. (2004) presents eye-tracking evidence that most users use a “top-down
cascade” approach to searching the page. Craswell et al. (2008) takes this result to the
field and shows that “position effects” can be large near the top of the page—the same link
6
For example, while travel website Expedia is free to bid on “priceline,” it cannot include “priceline” in
their ad text.
7
One paper, Yang and Ghose (2010), sharply diverges from this result. Using a structural model, they
assert that there is a positive interdependence between organic ranking and search click-through rate.
5
gets substantially more clicks based on position alone. Later work has shown that search
is more complex than simple top-down traversals (Dupret and Piwowarski, 2008), but still
supports the finding that being pushed further down the page tends to lower CTR. In a
(mostly) theory paper, Desai et al. (2014) model interactions between competitors with
brand keywords. They discuss the prisoner’s dilemma nature of branded keyword bidding.
Jerath et al. (2011) present a related theoretical model where there are low and high quality
firms and discuss why inferior firms have more incentive to locate at the top of the page
because otherwise no one will find them. In contrast, a superior firm may get more clicks
despite being lower down the page. In our setting this could correspond to the brand’s
organic link still getting the majority of clicks even if there are rival ads above it. One could
broadly read our empirical evidence as consistent with many of these theoretical predictions.8
2.1
Experiment and Data Description
The data in our study come from a fully randomized experiment on the Bing search engine.
On Bing, the sponsored listings that appear at the top of the page, above the organic listings,
are known as the “mainline.” A maximum of four mainline ads could be shown on a given
query—the same practice employed by Google. Absent an experiment, the number of ads
and their composition is endogenously determined by firms’ bids. A cross sectional regression
that looked at differences in the number of advertisements by keyword would conflate true
effectiveness of advertisements with differing environments across keywords. We use our
experimental variation to control for these confounding factors.
The experiment was conducted on a small fraction of U.S.-located users over nine days
in January of 2014. Users’ searches were randomized into one of five conditions. Four of
these conditions had some treatment, and corresponded to limiting the maximum number
of mainline ads at 0, 1, 2, and 3. The fifth condition was a control group, and corresponded
to the maximum of 4 mainline ads, the typical production setting.9 The treatment limited
the number of ads that could be shown, but often this cap did not bind. For instance, in
the treatment group that limited mainline ads to a maximum of 3 (“cap 3” to employ the
8
We do not, however, observe brands occupying the lower adverting slots. The context of brand search is
somewhat different than generic product search, so we would not expect all the predictions to be borne out.
9
The reason “all the rest” of the traffic is not the control group is that other experiments are running
concurrently, so a “clean control” is preferred.
6
terminology we will use throughout), if there were not enough bidders that met the reserve
price to fill the 3 slots, then fewer than 3 ads were shown. We carefully control for this
issue by selecting only queries that matched into bidding data where an ad would have been
shown in the absence of the experiment. See the Appendix A for more detail on this process.
To identify brands, we extracted 87,000 retailer and brand names from the Open Directory Project.10 A search is characterized as a brand query if and only if (1) the query
is in this list, meaning it is a verified firm brand, and (2) the query matches the domain
name in the first organic position. We focus only on brands that are in the first organic
link because this selects true brand keywords. Queries that generate brands that are not in
the first organic position might be searches of a different nature, perhaps not meant to get
directly to the brand page, but to a broader set of sites.
Figure 1: Brand search example
This example has two mainline ads: own brand ad in mainline 1 and competitor’s ad in mainline
2.
10
dmoz.org, the project uses volunteer annotators to “classify the web.”
7
Figure 1 provides an example of a brand query. Queries are simplified using standard
techniques, e.g. we treat “Macy’s,” “macys.com,” “macys,” and “macy’s” as the same query.
We focus on searches with 0 or 1 clicks on the page, ignoring rare instances of 2 or more
clicks.11 The majority of brands have very few exposures in the dataset. Table 1 presents
the number of brands binned by the number of observations for those brands. 64.7% of all
brands in the control group have less than 10 exposures but represent only 0.19% of all traffic,
whereas 96% of traffic comes from the 1045 brands that have 1000 or more exposures. The
starting point for our sample selection will be the 2517 companies with over 350 exposures.12
These firms offer us relatively precise estimates and cover 98.7% of market activity.
Number of
exposures
in Control
1
2
3
4 - 10
11 - 100
101 - 1000
> 1000
Total
Table 1: Ad coverage in the control condition
Number of Percentage Percentage Percentage
Percentage
brands
of brands
of traffic
of own ads of competitor’s ads
in Control
(%)
(%)
in ML1 (%)
in ML1 (%)
4869
23.1
0.02
3
30.6
2773
13.1
0.02
4.1
32.4
1686
8
0.02
6.3
30.8
4315
20.5
0.12
10.2
34.5
4200
19.9
0.64
19.8
34.6
2202
10.4
3.6
42.64
28.5
1045
5
95.6
43.8
13.6
21090
100
100
14.4
31.4
Percentage of ads is computed across companies. For example, companies with 4 exposures and
companies with 10 exposures are given the same weight in group 4-10. Total frequency is also
computed across companies, unweighted.
In our working sample of firms, 32.7% of companies advertise on their own brand keywords
more than 90% of the time. 39% of brands do not advertise on their own keyword at all.
The remaining brands advertise selectively, turning brand advertising on and off, or (more
likely) advertising in some geographic regions but not others. More detailed information is
given in the Appendix A, Figure 12.
11
In these occurrences, the searcher often visits all advertisers, making it less interesting to study. Further,
search engines often refund clicks from such patterns.
12
With this selection rule we are balancing the number of firms against the inclusion of brands that don’t
provide meaningful information because they are so small. We have done substantial robustness around this
threshold and the results are not materially impacted.
8
3
Results: Causal Effect of Brand Advertisements
In this section we measure how effective advertising is in driving traffic to the focal brand’s
website.
3.1
Ad Effect without Competitors Present
We begin by looking at firms that chose to advertise on their own keyword over 90% of the
time during our sample period. 32.7% of the 2,517 high-exposure firms are in this set.13
For 42.7% of this set (352) a competitor occupied the second paid slot less than 20% of the
time. This represents the most extreme form of potential substitution between paid and
organic search links because the link directly below the mainline advertisement goes to the
brand’s website with limited-to-no competition to stand in the way of the firm getting the
click. To estimate ad effectiveness for this case, we compare the cap 1 treatment group for
cases when the top ad went to the brand website to the cap 0 treatment condition where no
advertisements were shown for the same set of queries because the experiment forcibly (and
randomly) removed all paid search links.
Our primary outcome of interest is the probability that an individual arrives at the
website of the searched brand. We estimate this probability using the server logs that
separately record clicks on a brand’s organic link and those on the paid link. If a user does
not click one of these links it is either because she chose to visit a competitor or did not
click any link. Figure 2 plots the probability that a user arrived at a brand’s website by
the cap 0 vs. cap 1 groups. The figure shows that advertising on one’s one keyword drives
an incremental 2.27% of traffic to a brand’s website. This average effect is overwhelmingly
statistically significant, but we note that the y-axis is “zoomed in” and the overall magnitude
of the effect is perhaps economically small.
Next, we look at whether the ad effect varies by observable features in the data, focusing on a sub-sample of 493 companies with a sufficient amount of traffic to get reliable
company-specific estimates of the probability to get a click (the results are not sensitive to
13
If we focus on companies that advertise on their own keyword at least once, these companies correspond
to 53.6% of companies and 19.6% of traffic
9
Figure 2: The Effect of Advertising on One’s Own Keyword
this choice).14 The first step is to check if there is more variance in our estimates than would
be expected from sampling variation alone. Figure 3 presents an estimate of the density
across firms. In this sub-sample, the average effect is 0.021 (this is consistent with a smaller
effect for larger firms), and the standard deviation is 0.048. We plot a fitted normal density
over the distribution of effects, which shows the empirical distribution has heavier tails. We
formally test and readily reject the hypothesis that there is no heterogeneity.15
Having verified there is real heterogeneity, what is the source? In our comparison of
experimental conditions cap 0 vs. cap 1, we use all firms with a sufficient sample size in the
top position—this includes both those firms that would have faced competing ads in slots 2–4
and those that are typically alone on the page. The presence of competitors is presumably
endogenous to many unobservable factors such as the brand capital (as in Simon and Sullivan
(1993)) of the focal firm. A natural hypothesis is that “stronger” brands are less susceptible
to competition and thus competing bidders is a signal of a weaker brand. If weaker brands
benefit more from advertising, then we expect a larger effect for those brands that face
competing bidders in the normal state of affairs.
14
We keep companies with more than 80 exposures in each condition.
We perform a series of standard tests for normality, including Shapiro-Wilk, Jarque-Bera, DÁgostino
and other tests. All of them reject normality of the distribution.
15
10
Figure 3: There is heterogeneity in the effect of own ad in mainline 1
Table 2 divides the sample into groups depending on the probability (across queries and
time) that a competitor would have occupied the 2nd slot in the absence of the experiment.
For firms whose competitors never clear the reserve price on their keyword, the incremental
effect is 1.4% on average, whereas for firms who almost always face competition, the effect
is 4.9% on average. These differences are significant well beyond conventional levels and the
results are also confirmed in a regression framework given in Appendix B, Table 10. We can
also cut the data by the number of other competitive bidders for a given query, as given in
Table 9 in Appendix B. The table shows shows that greater competition, in terms of bidders
that clear the reserve price, is associated with a large experimental estimate.
We now examine if the effect size varies with a firm’s “brand capital.” Blake et al. showed
that eBay’s brand ads had almost no impact on website traffic. As the authors note, eBay
is a very strong brand—it is the second largest e-retailer. Our broader sample allows us to
look at a wide range of firms, from those that are quite similar to eBay to those that the vast
majority of consumers would not recognize by name. We proxy for brand capital with data
collected from Alexa.com, a widely used website ranking service. In particular, we collect
US and global rankings, “bounce rate” (the probability of a visit less than 30 seconds), the
fraction of traffic from search (well-known firms get more direct navigation via the URL
window), pages viewed per day and time spent per day. Table 8 (Appendix B) provides a
11
Table 2: Effect is higher for brands with more competitors on their keywords
Total
Competitors’ ads frequency in mainline 2
0-0.1 0.1 - 0.25 0.25 - 0.75 0.75 - 0.9 0.9 - 1
Ncomp
493
192
99
125
34
43
pnoad
pad
pad − pnoad
0.779
(0.001)
0.800
(0.001)
0.811
(0.002)
0.825
(0.001)
0.790
(0.003)
0.808
(0.002)
0.756
(0.003)
0.776
(0.002)
0.739
(0.005)
0.783
(0.004)
0.708
(0.005)
0.757
(0.003)
0.021
(0.002)
0.014
(0.002)
0.018
(0.003)
0.020
(0.004)
0.044
(0.007)
0.049
(0.006)
where
Ncomp - number of companies
pnoad - probability to get to brand’s website when there are no ads
pad - probability to get to brand’s website when there is own ad in mainline 1 and no competitor’s
ads
summary of brand capital measures.
Table 3 presents the results of four regressions of own brand ad effect on brand capital
measures. Specification (1) shows that companies with higher rankings tend to have a lower
incremental effect of one’s own ad in mainline 1. On average, the effect of an own brand ad
for a very well-known company (log(US) ranking around 2.5) is 3 percentage points lower
than for an unknown brand (log(US) ranking around 11). As we add more controls for our
brand capital measure in specification (2), the coefficient on log ranking is still significant. In
specification (3) we add controls for how much space the brand links have on the webpage:
the number of “deep links” (sub links below the main link), the number of “detail cards”
(detailed informational panels including things like maps and ratings), and the number of
organic links of one’s own brand on the first page.16 The coefficient falls by roughly 30% and
is no longer statistically significant. This drop combined with the negative coefficients on
the features that indicate richer and larger content for the focal brand indicate that some of
the effect was coming through stronger brands getting better visual representations. Finally,
in specification (4) we add in the frequency of times competing bidders occupy slot 2. The
coefficients do not change much and we can use this specification to confirm the larger effect
when the ad typically faces competition.
16
For a description and examples of deeplinks and dcard please see Appendix C
12
Table 3: Companies with higher brand capital have lower own ad effect
Dependent variable:
Effect of own ad in Position 1
(1)
(2)
0.004∗∗
(0.001)
0.009∗
(0.005)
0.006
(0.005)
−0.002∗
(0.001)
−0.002∗∗∗
(0.001)
−0.001
(0.002)
Other brand capital controls
−0.008
(0.012)
No
0.024
(0.020)
Yes
0.062∗∗∗
(0.024)
Yes
0.006
(0.005)
−0.002
(0.001)
−0.001∗∗
(0.001)
−0.0002
(0.002)
0.017∗∗
(0.008)
0.052∗∗
(0.024)
Yes
Observations
R2
Adjusted R2
492
0.013
0.011
492
0.038
0.026
492
0.079
0.062
492
0.089
0.070
Log US Website Traffic
(3)
Deep links
Details card
# brand organic links
(4)
Freq. competitor in Slot 2
Constant
∗
Note:
13
p<0.1;
∗∗
p<0.05;
∗∗∗
p<0.01
We conclude that brands with higher brand capital gain less from occupying the top
adverting slot. This appears to be both a combination of consumers being less likely to
substitute away from going to a firm’s website when that firm has higher brand capital and
because stronger brands get more visually appealing organic search results. Regardless of
the precise causal channel the implication for managers remains the same—firms with higher
brand capital benefit less from bidding on their own brand keywords.
3.2
Ad Effect with Competitors Present
First, we look at situations where competitors are the only bidder that clears the reserve
price. In this case, competing links are above the focal brand’s first link (the organic result)
on the search results page. This is clearly a selected sample because the brand’s decision
not to advertise is a choice. In particular firms that choose to not “protect” their own brand
keywords are presumably different than those that do.17 However, for firms that chose not
to advertise on their own keyword the experiment gives us clean variation to estimate the
causal competitive effect. We start by comparing cap 1 to cap 0, where the competing ad
has been removed at random.
In order to mitigate selection effects we break up the set of firms that do not advertise on
their own keywords but face competition into two sets. To do so we use the probability, in
the absence of any advertisements (a case created by the cap 0 condition in our experiment),
that consumers click on the firm’s organic unpaid link. The case where consumers click
relatively infrequently on the brand’s organic link indicates that many firms sell a genuine
version of the intended good. For example, one can buy a Dell laptop directly from Dell or
through many e-retailers. We will call this the “reseller” case. In contrast, consider a search
for a travel booking tool like Expedia.com. For these types of searches there is only one
“genuine version” of the good and the branded firm garners a high fraction of clicks. We
thus refer to such cases as the “exclusive” type.
We assign types using the probability of arrival distribution, which is given in Figure 4
for the 49 firms that do not bid on their own keywords and have a competitor in mainline
1 above 90% of the time in the control group for our experiment. The rates here are much
17
Recall that one of the downsides of the experiment is that we were not able to place ads in situations
where they wouldn’t have existed otherwise – we were only able to subtract them.
14
Figure 4: Heterogeneity in firms that do not bid on their own brand keywords
lower as compared to the average arrival rate of about 80% observed in the last section.
There is also substantially more more heterogeneity, with a large mass of firms below 0.30
(Figure 14 in Appendix E shows a similar histogram for 493 companies which advertise on
their keyword more than 90% of the time). We divide the firms with the two vertical lines
shown at arrival rates of 50% and 70%. Based on click behavior, the exclusive types form a
more suitable comparison group for the firms that typically advertise on their keywords.
Figure 5 graphs the difference in the number of clicks on the brand’s own organic link and
the competitor’s ads across the cap 0 and cap 1 conditions. The left-panel gives the reseller
type and the right panel the exclusive type. When we experimentally add the competitor’s
advertisement two effects occur. First, the percentage of traffic to the brand’s website drops.
The drop is 8 percentage points for the reseller case and 17 percentage points for the exclusive
case. These drops represent “traffic stealing.” Second, the competitor gains a large number
of clicks—around 20% in both cases. This means that for the exclusive case, almost all the
clicks garnered by the competing ad come at the expense of the focal brand, whereas for the
reseller case less than half of the clicks came from the focal brand, with the rest coming from
other resellers with links further down the page.
We now move to the case where the focal firm wins the auction and competitors occupy
15
Figure 5: The incremental effect of bidding on one’s competitors’ keywords for “reseller
type” (left panel) and “exclusive type” (right panel) firms.
slots 2, 3, and 4. We narrow our sample down to firms where, in the absence of the experiment, 4 mainline ads would be shown.18 This holds the set of firms constant and the
experimental conditions systematically remove advertisements. Figure 6 displays the causal
effect of occupying each of the mainline slots. Starting with the left panel, the first point
displays the total traffic that goes through both the organic and paid link to the brand’s
website when only the brand ad is present. The second through fourth points display the
traffic to the brand’s website adding in competitors into the mainline slots 2 through 4,
respectively. The relationship is downward sloping, as expected and the full slate of competitive ads reduces the traffic to a brand’s own website by 4.3 percentage points on average.
In the right panel we separate advertisers by a median split of website traffic. The lower line
represents below-median firms. We can see the smaller firms have a lower baseline CTR and
a slightly greater loss of clicks when facing 3 or 4 competitors.
Figure 7 breaks down the stolen traffic by competitor position. Two points are worth
noting. First, the amount of traffic garnered by the second and third place bidders is not
18
Again, we deal with a bit of a selected sample – keywords where both the brand and competitors are
already bidding. We have already shown these firms tend to be weaker brands, for instance. However these
are presumably the firms for which these estimates matter the most.
16
Figure 6: The effect of competitive ads in mainline slots 2-4
(a) Average effect
(b) Upper and Lower 50% by Brand Capital (US
Ranking)
significantly affected by the number of additional ads below them, though there is slight
evidence that the second ad prefers to face more ads below it, consistent with Reiley et al.
(2010). Second, the higher placed bidders secure more clicks. This is a natural consequence
of the auction logic and should not be interpreted as a causal effect of position, though some
of the effect is likely due “position effects,” Goldman and Rao (2015).
We started by examining the impact of the focal brand ad in isolation and observed a
statistically significant effect of about 2.5 percentage points. This effect tended to be larger
for weaker or more susceptible brands. Within the sample of firms that we observe that do
not bid on their own keyword and have a competitor in mainline 1, this competitor receives
around 20% of clicks for those keywords.
Taken together, our estimates provide some justification for the notion that firms should
bid on their own keyword in order to protect their brand keywords from competitors. In the
absence of a focal brand, ad clicks to the brand’s website decrease by an average of between
8 and 17 percentage points depending on the type of firm. We showed that such “defensive
positions” are relatively effective. When the focal brand is present, competitors in slots 2–4
17
Figure 7: The effect of competitive ads in mainline slots 2-4
steal far fewer clicks, around 4 percentage points, or one-fourth of the effect for the exclusive
types (the appropriate comparison group) when the brand ad is not present. This evidence
indicates that the marginal effect of the brand ad is quite high in this case—on the order
of 15 percentage points. In contrast when competitors are not present the effect is around
2 percentage points. This comparison suggests the primary value of a focal brand’s ad is
defensive.
4
Results: Crowd-out and Cost Per Incremental Click
Up until now we have combined paid and organic clicks together in order to measure the
causal impact of the ads. Of course the firm is not agnostic to the source of traffic: If the
traffic goes through the paid link, they must pay the search engine for each click; if it goes
through the organic link, it is free. When crowd out occurs, for every marginal click there
are inframarginal clicks that the firm must pay for as well. In this section we define “cost
per incremental click,” which accounts for the real cost of acquiring traffic.
18
4.1
Crowd-out and CPIC without Competitors Present
Figure 8 shows the percentage of traffic going to one’s own brand website through the paid
link in mainline 1 for the sample of focal firms consistently advertising on their keywords.
On average, 46.3% of clicks go through the paid link and most of the firms fall between 30%
and 70%.
Figure 8: Histogram of the fractions of total clicks going through paid links
Figure 9 visually shows how many of these clicks are “cannabilizing” organic clicks by
using the experimental variation induced by the cap 0 vs. cap 1 conditions. The difference
in the overall height of the bars represents incremental clicks to a brand’s website, i.e. the
average effect of 2.27% we previously documented. The drop in the organic segment (pink)
shows the degree to which the paid link cannibalizes organic clicks—roughly half of the
traffic now goes through the paid link. Putting both figures together, on average for every
incremental click the firm pays for 19 inframarginal ones. It is important to note, however,
that this calculation implicitly sets the counterfactual of no competetive ads. Since our
interest is to compute an accurate costs per incremental click at the firm level, we need to
choose the appropriate counter-factual: what would happen if the firm did not advertise.
19
For firms that typically do not face competing ads, the simple Cap 0 to Cap 1 comparison
is valid and we use this as a starting point.
Figure 9: Organic click cannibalization compared to incremental clicks
We compute CPIC as:
CP IC =
xp1
CP C
(p1 − p0 )
where x is the fraction of clicks going through an ad, p1 is the probability of getting to
one’s own website with the own brand ad in mainline 1, and p0 is the probability of getting
to one’s own website without the own brand ad in mainline 1. The CPIC/CPC ratio is
a natural measure of crowdout—it gives how many clicks a firm has to pay for to garner
one incremental click. However, a challenge in computing this ratio is that CPIC is only
a coherent object if there is a positive effect of own brand ad (otherwise it is infinite or
negative). In our sample, the estimate is significantly positive for 16% of companies. We
start by computing CPIC for these 43 firms. The median is 9.1, the mean is 11.07 and the
distribution is shown in Figure 10.
To estimate this ratio for all firms we rely on the assumption of a non-negative ad effect
and the convexity of the relationship between CPIC and effect size. This convexity is due
20
to effect size showing up in the denominator of CPIC. Using Jensen’s inequality, evaluating
the CPIC function at the expected value of it’s arguments will be a lower bound for the
expectation of the function.19 For these firms, the average average effect of an own brand
advertisement is 1.6,%20 and the average amount of traffic going to the own brand website
through the paid link is 36%. The resulting estimate of the CPIC to CPC ratio is 22.5.
Figure 10: Histogram of CPIC/CPC ratio for the estimates for the 43 firms that do not
typically face competing ads and show a significantly positive ad effect.
Table 4 presents a detailed summary of the costs paid per click by the focal brand and
competing firms. The focal brand’s nominal CPC is low, between $0.06–0.15 depending on
the sample and the weighting. But given the amount of traffic that goes through these links,
the amount of money involved is far from trivial. Competitors typically pay many multiples
of the brand’s price, between $0.47–0.86 on average, despite occupying lower positions. This
disparity is a function of the pricing rule of the GSP auction—more relevant and clickable ads
get a lower per click price. Real costs, given by CPIC, are much higher. We can estimate
CPIC at the firm level for those with a significant ad effect. It comes in at $1.12 in the
unweighted average and $1.42 when we weight the average by searches. To etimate the
average across all firms, we use the lower bound approach previously described and estimate
$3.50 and $2.52 for the unweighted and weighted figure respectively. All of these averages
19
We provide more details on this in the Appendix D.
Recall these are only firms that do not face competition. Effects for these companies are presented in
columns 1 and 2 of Table 2.
20
21
are higher than the CPC competitors are paying, both at the average and firm level, which is
especially true when weighting by impressions. A second natural comparison point for CPIC
is the CPC focal brands pay on keywords when they do not occupy high organic position and
thus nearly all clicks are marginal. Even for the firms with a significant effect, CPIC exceeds
this figure for 93% (51% unweighted) of the companies that show a significantly positive ad
effect. We are unable to make firm level comparisons when the effect size is not significant
and positive, but note that the lower bound given by evaluating the CPIC at average values
well exceeds the comparison measures of CPC.
Table 4: Cost of Clicks for Focal Brands and Competitors
Unweighted average
Measure
All brands Significant
¯
CP C brand ($)
0.15
0.15
CP¯ C compet ($)
0.86
0.78
‡
CP¯ C brand−other ($)
1.36
1.43
∗
¯
CP ICbrand ($)
3.50
1.12
CP ICbrand > CP Ccompet (%)
51.2
CP ICbrand > CP Cown−other (%)
55.9
N
268
43
Weighted average†
All brands Significant
0.09
0.06
0.61
0.47
1.14
0.92
∗
2.52
1.42
92.7
90.6
268
43
† Weights
‡ CPC
are number of exposures
paid by the brand on other keywords when they are not high in the organic listing. It is
computed for 85% of companies in the sample where a match is possible.
∗ Lower bound computed with average values as described in the text.
Taken together, the evidence indicates that these firms, which did not face competition in
our sample period, are spending a substantial amount of money on brand keyword advertising
when the CPIC calculations indicate that they could acquire clicks for a lower real cost via
other keywords. Similarly, firms are paying more to attract these customers than their
competitors are.
What explains this pattern of results? One way to rationalize behavior is to posit a higher
valuation for clicks for brand searches as compared to more generic “product searches.” For
any given effect size there is a valuation premium that rationalizes the implicit CPIC. While
we cannot rule such valuations out, it seems questionable that users already familiar enough
with a brand to enter a targeted query are substantially more valuable than users acquired
through more generic keywords or those specifically searching for competitors. Indeed, since
22
brand searchers are already aware of the focal brand, the “brand awareness” channel of
advertising effectiveness is shut down (Laurent et al., 1995), which would point to a prior that
these users are less valuable. The second class of explanations points to bounded rationality
on the part of the decision makers, such as the online media buyer at firms.21 If decision
makers focus on vanilla CPC, the familiar and widely reported metric, as opposed to CPIC,
which must be computed via experimentation, then the low CPC, often ten times less than
what the brand pays on generic keywords, makes brand searches look quite attractive. This
explanation is supported by past work showing that people struggle to “think marginally,”
sub-optimally focusing on average as opposed marginal measures (de Bartolome, 1995).22
Finally, it could be that the proximate decision makers know the CPIC but choose to report
CPC to their less-savvy superiors in order to frame the purchase as a “good deal” (potentially
by adding them to a larger portfolio to reduce the average (nominal) “customer acquisition
cost” of online advertising).
While we cannot conclusively reject the first explanation, the scales seem to tip against
it. We have no way to differentiate between the bounded rationality and principle-agent
explanations and thus leave this unpacking too future work. Finally we note that among
the set of firms in our sample that do not face competition, 54.4% choose not to advertise
and some that do advertise have observable behavior that is easily rationalizable. This
indicates that if there are indeed biases (or incentive problems) at play they are by no means
universally present.
4.2
Crowd-out with Competitors Present
We now shift our focus to the case when one or more competitors clears the auction’s reserve
price and the brand’s ad still occupies the top slot.23 Competitors impact crowd-out because
their ads appear at the top of the page, shifting the brand’s organic link further down. Given
that top-down visual search is common, this pushes more traffic through the paid search link
for the brand.
21
At smaller firms, this individual would buy the advertising through the platforms online portal. For
larger firms, they likely handle a relationship with a “search engine management” firm.
22
We wouldn’t argue that firms don’t intuitively understand that some of this cannibalization is happening.
The question is to what extent firms measure and adjust their bids based on it.
23
It is exceedingly rare for the brand to occupy slots 2–4. Bids reveal that if they choose to bid, they win
the auction easily.
23
To examine the magnitude of this effect, we extract the set of keywords that, in the
absence of the experiment, have 4 ads present. We then use experimental variation to study
the effect of ads in slot 1 (focal brand) and slots 2-4 (competitors). In each case we compare
the total traffic that goes to the brand’s own website and the fraction of that traffic that
goes through the organic versus the paid link.
Figure 11: The effect of competition in increasing cannibalization
Figure 11 plots these effects. The relative uniformity in the bar height confirms what we
previously documented in terms of a modest brand ad effect and minimial traffic stealing
when the brand occupies the top slot. Far more dramatic differences emerge when we focus
on the paid/organic traffic split. When only the brand ad is present, 60% of clicks go through
the paid link. (This is higher than our previous estimates because the sample of firms here
is restricted to those that face 3 competing ads in the control condition.) This increases to
70%, 78% and 84% with one, two and three competitors present respectively.
Table 5 extends the sample of firms to include those that typically face one (first column)
two (second column) and three (corresponding to Figure 11) competing ads. The effect is
consistent across these samples of firms. A competitor’s ad in mainline 2 causally increases
24
the fraction of ads going through the focal brand’s paid link by 10-12 percentage points, an
additional 8–9 percentage points for the third slot and an additional 6 percentage points for
slot 4. When facing a full slate of competitors, the ad gets 84.3% of brand clicks.
Table 5: Crowd-out by the number of ads a brand faced in the control and number of
competitors present
Number of competitors in Control
Condition
1
2
3
Cap 1
53.9% 56.6%
59.5%
(0.29) (0.42)
(1.33)
Cap 2
65.5% 68.0%
69.9%
(0.32) (0.39)
(1.30)
Cap 3
77.3%
78.2%
(0.38)
(1.18)
Cap 4
84.3%
(0.91)
N
784
682
564
Standard errors in parentheses. In Cap 0, not shown, is 0% in all cases.
It is clear why brand’s have fought to ban competing firms from bidding on their trademarked terms. If competitors do not advertise, a slim majority of brands choose not to
advertise, meaning all clicks are free. Those that do advertisers have crowd-out levels of
roughly 54–60%. If competitors bid aggressively, the vast majority of companies choose to
advertise in order to “defend their turf.” This practice can be explained by the large click
stealing we observed if no defense is present. Such defense is largely effective, but, as competition increases, would-be free clicks are causally pushed to the paid link, raising costs for
the focal brand. A competitor can “tax” a rival through this channel, while stealing a few
clicks along the way. It is not surprising, then, that this is a commonly observed practice.24
4.3
CPIC with Competitors Present
To compute CPIC, we need the relevant counterfactual of what would happen if the focal
brand did not advertise. When competitors are absent, if the firm stopped bidding, then no
ads would be present and thus we can measure CPIC at the firm level by comparing Cap 0 to
24
One could imagine that either collusion or repeated game dynamics lead to a “don’t bid on mine and I
won’t bid on yours” situation. To the extent that this occurs, it does not appear to be common.
25
Cap 1. In contrast, when competitors are present, if the focal brand stopped bidding, then a
competitor would occupy the top slot. There is not a comparison of experimental conditions
that provides us with this counterfactual for the set of firms that choose to advertise. Instead,
for a different set for firms, namely those that do not advertise, we can estimate the impact
of competitors by experimentally varying the level of competition, which was presented in
the second part of the last section. Although we are unable to estimate CPIC at the firm
level, we can use the analysis to give back-of-the-envelope estimates.
In Section 3.2 we noted that there were two types of firms that chose not to advertise,
resellers and exclusive types, and that only the exclusive types seemed to match firms that
typically do advertise. For exclusive types, competitors were able to steal about 20 percentage
points of clicks, though there was heterogeneity in the effect. By advertising, the causal effect
of the ad roughly cancels out click stealing, meaning the relevant “effect size” is about 20
percentage points. The faction of traffic going through the paid link is 65–70%, 77–78%
and 84% with 1–3 competitors present respectively. This gives average CPIC/CPC ratios
ranging from 3.2 to 4.2—far lower than the ratios we observed in the absence of competition.
Returning to Table 4, brands are paying about $0.06–$0.15 cents per click on their own
keywords, while competitors are paying $0.47–$0.78. Further, focal brands tend to pay
a bit more than competitors on generic keywords. Multipliers in the range of 4 are thus
easy to justify. If we halve the ad effect, then the ratio enters close call territory, meaning
conditional on competitors advertising, many brands “enjoy” a surplus, even though they
of course prefer competitors to be banned outright. In contrast, for very strong brands that
have a low causal effect, defense may not be warranted.
5
Conclusion
In this paper, we’ve described and documented the competitive environment for brand keyword advertising. This market has the interesting property that a free substitute exists in
close proximity to the paid advertisement. While this is a particularly strong substitute in
the case of brand keywords, the dynamic is generally present in sponsored search. Consider,
for example, the decision of a large firm such as Geico of whether or not to advertise on
keywords such as “car insurance.” Based on their size and relevance, Geico enjoys a high
organic ranking. While not as dramatic as the case of brand search, the organic channel still
26
creates a wedge between the commonly reported CPC and the real cost of a marginal click,
given by CPIC. Since the largest firms tend to advertise on these types of generic keywords,
this wedge will commonly exist, albeit with varying size.
The importance of this wedge depends critically on the level of competition. In the
absence of competition, advertising on one’s own keyword drives a statistically significant but
economically modest amount of traffic to the brand’s website, with a larger effect observed
for weaker brands. The modest increase, however, comes at a relatively large cost: roughly
half of would-be free clicks are shifted to the paid link, which creates a large wedge between
nominal and real costs. We argued that the observed CPIC/CPC ratios for many firms that
choose to advertise in the absence of competition are hard to explain with fully rational agents
and complete incentives. Although we note that 54% of firms elect not to advertise in this
case. When competitors bid aggressively on a brand’s keyword the situation fundamentally
changes. “Defensive advertising” by the brand is an effective way to prevent click stealing
and the implicit causal effect of these ads indicates it is an easily rationalizable expenditure.
Consistent with this observation, the vast majority of brands that face competition choose to
advertise. Conditional on the brand occupying the top slot, the level of competition matters:
adding competitors gives rise to a modest increase in click stealing and a substantial increase
in crowd-out. This makes the defense simultaneously less effective and more expensive, and
it is thus unsurprising that we commonly observe firms advertising on their rivals’ keywords.
Taken together, these factors substantially complicate estimating the returns to search
advertising. Although CPC is widely used and easy to understand, it often does not capture
true marginal costs of traffic. In contrast, CPIC is an economically sound metric but it
is complicated because it depends on the relevant counterfactual, which may not be easily
observable, can change over time or is an unfamiliar concept to decision makers. We do,
however, observe evidence that many firms broadly understand these complexities. For
example, advertising on brand keywords is far more common when competitors are present.
Nonetheless, when we have the resolution to study behavior at the firm level, there is evidence
that the depth of these complexities is not fully understood or captured by within-firm
incentives.
27
References
Chiou, L. and Tucker, C. (2012). How does the use of trademarks by third-party sellers
affect online search? Marketing Science, 31(5):819–837.
Craswell, N., Zoeter, O., Taylor, M., and Ramsey, B. (2008). An experimental comparison
of click position-bias models. In Proceedings of the 2008 International Conference on Web
Search and Data Mining, pages 87–94. ACM.
de Bartolome, C. A. (1995). Which tax rate do people use: Average or marginal? Journal
of Public Economics, 56(1):79–96.
Desai, P. S., Shin, W., and Staelin, R. (2014). The company that you keep: When to buy a
competitor’s keyword. Marketing Science, 33(4):485–508.
Dupret, G. E. and Piwowarski, B. (2008). A user browsing model to predict search engine
click data from past observations. In Proceedings of the 31st annual international ACM
SIGIR conference on Research and development in information retrieval, pages 331–338.
ACM.
Goldman, M. and Rao, J. M. (2015). Experiments as instruments: Heterogenous position
effects in sponsored search. In Available on SSRN.
Gomes, R. and Sweeney, K. (2014). Bayes–nash equilibria of the generalized second-price
auction. Games and Economic Behavior, 86:421–437.
Granka, L. A., Joachims, T., and Gay, G. (2004). Eye-tracking analysis of user behavior in
www search. In Proceedings of the 27th annual international ACM SIGIR conference on
Research and development in information retrieval, pages 478–479. ACM.
Ito, K. (2014). Do consumers respond to marginal or average price? evidence from nonlinear
electricity pricing. THE AMERICAN ECONOMIC REVIEW, 104(2):537–563.
Jerath, K., Ma, L., Park, Y.-H., and Srinivasan, K. (2011). A “position paradox” in sponsored
search auctions. Marketing Science, 30(4):612–627.
Jeziorski, P. and Segal, I. (2014). What makes them click: Empirical analysis of consumer
demand for search advertising. Technical report, Working papers, Haas School of Business.
28
Laurent, G., Kapferer, J.-N., and Roussel, F. (1995). The underlying structure of brand
awareness scores. Marketing Science, 14(3 supplement):G170–G179.
Lewis, R. A. and Nguyen, D. T. (2014).
play advertising’s competitive spillovers
http://dx.doi.org/10.2139/ssrn.2374414.
A samsung ad for the ipad?
disto search.
Available at SSRN
Reiley, D. H., Li, S.-M., and Lewis, R. A. (2010). Northern exposure: A field experiment
measuring externalities between search advertisements. In Proceedings of the 11th ACM
conference on Electronic commerce, pages 297–304. ACM.
Sayedi, A., Jerath, K., and Srinivasan, K. (2014). Competitive poaching in sponsored
search advertising and its strategic impact on traditional advertising. Marketing Science,
33(4):586–608.
Simon, C. J. and Sullivan, M. W. (1993). The measurement and determinants of brand
equity: a financial approach. Marketing science, 12(1):28–52.
Yang, S. and Ghose, A. (2010). Analyzing the relationship between organic and sponsored search advertising: Positive, negative, or zero interdependence? Marketing Science,
29(4):602–623.
29
6
Appendices
6.1
Appendix A: Matching with Auction Data
Experiment we use randomly restricts the number of possible paid links on top of the search
page. The control group corresponds do the default, which is a maximum of 4 advertisements
in the mainline (Cap 4). There are 4 experimental conditions: Cap 0, 1, 2 and 3. The idea
is similar to the control: e.g. Cap 0 does not allow any advertisements in the mainline, and
Cap 3 allows at most 3 advertisement in the mainline.
This design of the experiment restricts us to studying only the cases where advertisement
is eligible to be shown in the mainline, which means that in the absence of experiment
advertisement will be shown in the mainline. E.g. we cannot study the effect of advertisement
for a company that does not advertise on its own keyword: there will be no own brand
advertisements in both Cap 0 and Cap 1 condition.
Thus, we restrict our attention to cases where companies advertise. We are still facing a
challenge: if a company advertises only 50% of the time on a given query and selects search
traffic where the effect will be higher25 , we cannot compare occasions with advertisement to
the treatment condition where the ad will be removed. E.g. if we would like to estimate
the effect of own brand advertisement in mainline 1 when in 50% of the cases company
advertises, and in 50% of the cases there is no advertisement, comparing occasions in Cap 1
conditions with a paid link shown to the entire Cap 0 conditions will bias the estimates.
To find the right treatment group, we need to allocate the occasions where ad was actually
removed from the mainline. In the example above, we would like to compare occasions with
own brand paid link in Cap 1 to occasions in Cap 0 when own brand paid link would have
been shown. To find such occasions, we collect the auction data for the search queries in the
experiment. The allocation of positions in mainline follows the standard GSP auction rules:
players submit the bids for price of a click, platform computes the ”rankscore” of a given
player, and players are allocated the positions in mainline based on their rankscores. Given
that the reservation level is cleared, company with the highest rankscore gets position 1,
company with the second highest rankscore gets position 2, etc. Rankscore is proportional
25
E.g. using geo-targeting
30
to the bid and a probability of click on the ad as computed by the platform
RSj ∝ bj pclickjα
(1)
where bj is a bid of company j, pclickj - probability of company j to get a click, and α is
the tuning parameter.
This implies that knowing the rankscores of bidders and reservation level for a search
query allows to say which advertisement would be shown in the mainline in the absence of
the experiment. To get this information, we exploit auction data collected by the advertising
team. Experiment that we use was designed by removing the potential advertising slots from
mainline, but bidding data was still collected.
Table 6: Summary of matching of the experimental and auction data
Condition
Cap 0
Cap 1
Cap 2
Cap 3
Control
Searches
Searches
%
Total
Matched Matched, % ML 1
3162615 1506827
47.6
30.6
6342073 3568054
56.3
41.8
6338914 3568918
56.3
41.8
6348311 3577819
56.4
41.9
22209220 12506083
56.3
41.9
of eligible ads in
ML 2 ML 3 ML 4
9.2
4.6
2.4
11.2
4.7
2.9
10.9
6.1
3.2
11
5.7
4.1
11
5.7
3.5
Table 6 presents the summary of matching experimental data and collected auction data
for Cap 0-4. For Cap 1, 2, 3 and 4, around 56.3% of search queries in experimental data
where matched with the auction data. A search query will not be recored in the auction
data if no advertiser submitted a non-trivial bid26 , so the unmatched data can correspond to
queries with no bidders.The majority of unmatched queries correspond to occasions where
no advertisements were shown, which supports this explanation.
Cap 0 condition has a higher percentage of unmatched search queries. This indicates
the problem with the matching, given that the experiment was constructed to be balanced
between the treatment and control groups. We further find that percent of advertisement
eligible for the mainline 1 position in Cap 0 is substantially different from the percent of
advertisements eligible for mainline 1 position in Cap 1, 2, 3 and 4.
26
As defined by the platform
31
This creates a potential problem for using the occasions with eligible brand advertisements for Cap 0 condition. Consider the case of estimating the effect of own brand advertisement in mainline 1. Using the matched data, we would like to compare occasions
with own brand advertisement from Cap 1 condition to occasions with eligible own brand
advertisement from Cap 0 conditions. We know that some occasions with eligible own brand
advertisement are missing from Cap 0. If this mismatch is correlated with the probability
of a click on own brand weblink, our estimate of advertisement effect will be biased.
To check if there is a selection problem in Cap 0 matching, we estimate the effect of own
brand advertisement in mainline 1 for companies which always advertise in mainline 1 on
their own keyword27 . For these companies, comparison of Cap 0 to Cap 1 provides the casual
effect of own brand advertisement: we know that, if not the experiment, search results in
Cap 0 will have own brand advertisement in mainline 1. We also can estimate the effect
using only eligible own advertisement occasions in Cap 0 and Cap 1. If the estimates of the
effect based on two methods are different, we can confirm that the occasion in Cap 0 which
have eligible own brand advertisement in mainline 1 are correlated with the probability of
click on own brand website.
Table 7: Effect of own brand ad in mainline 1 is significantly underestimated when using
eligible ads
Ncomp
p̂own0
p̂own1
p̂own1 − p̂own0
All queries
391
0.7867
(0.0022)
0.8035
(0.0015)
0.0168
(0.0027)
When own ad is eligible
391
0.8115
(0.0029)
0.8179
(0.0015)
0.0063
(0.0033)
p̂own0 is the probability of a click on own brand link in Cap 0
p̂own1 is a similar probability in Cap 1
Table 7 presents the estimation results. Advertisement effect estimate based on all traffic
is 1.68 percent points. Advertisement effect estimated based only on traffic with eligible
own advertisement is 0.63 percent points. The difference in two estimates is statistically
27
Companies that have own brand advertisement in mainline 1 more than 99% of the time
32
significant28 .
We thus confirm that the occasions of eligible own brand ads in mainline 1 in Cap 0 are
correlated with the probability to get a click on own brand weblink. This restricts us from
using the eligible advertisements occasions to compare Cap 0 and Cap 1. Instead, we focus
only on companies that have a paid link in mainline 1 more than 90% of the time. For these
companies, comparison of Cap 0 and Cap 1 gives casual effect of advertisement.
Figure 12 shows that around 50% of companies advertise at least 10% of the time, with
around 33% advertising more than 90% of the times. Restricting the analysis to the later
group gives us 824 companies which always advertise on their keyword.
28
Difference in estimates is 0.0105, with standard error of the difference being 0.0043, which corresponds
to a t-stat of 2.46
33
Figure 12: Frequency of ads in mainline 1 for 2517 most popular brand queries
(a) Frequency of own ads in ML1
(b) Frequency of competitor’s ads in ML1
(c) Frequency of ads in ML1
34
6.2
Appendix B: Additional Table
Table 8: Brand capital measures
Mean
log(expos)
7.59
log(Rankglobal )
10.52
log(RankU S )
9.05
bounce rate, (%)
35.1
time spent per day (minutes)
4.6
pages viewed per day
4.5
search traffic (%)
19.85
Standard deviation
1.41
2.14
1.96
14.3
2.98
2.76
9.19
Table 9: Effect is higher for brands with more bidders on their keywords
Total
Quantiles of number of bidders
0-0.1 0.1 - 0.25 0.25 - 0.75 0.75 - 0.9 0.9 - 1
Ncomp
493
48
74
246
74
51
# of bidders 11.55
3.7
6.22
10.88
16.72
22.42
pnoad
pad
pad − pnoad
0.779
(0.001)
0.800
(0.001)
0.819
(0.003)
0.833
(0.002)
0.807
(0.003)
0.825
(0.002)
0.780
(0.002)
0.802
(0.001)
0.743
(0.004)
0.766
(0.003)
0.749
(0.005)
0.776
(0.003)
0.021
(0.001)
0.014
(0.004)
0.018
(0.004)
0.022
(0.002)
0.023
(0.005)
0.027
(0.006)
35
Table 10: Companies with higher competition have higher own ad effect
Dependent variable:
Effect of own ad in ML1
(1)
Frequency of competitor in ML2
(2)
(3)
∗∗∗
0.032
(0.007)
Number of bidders
Constant
0.012∗∗∗
(0.003)
0.001
(0.0004)
0.014∗∗∗
(0.005)
Observations
R2
Adjusted R2
493
0.043
0.041
493
0.005
0.003
∗
Note:
6.3
p<0.1;
∗∗
Appendix C: Deeplinks and Dcard Example
36
p<0.05;
0.034∗∗∗
(0.007)
−0.0002
(0.0004)
0.013∗∗∗
(0.005)
493
0.044
0.040
∗∗∗
p<0.01
Figure 13: Deeplinks example
37
6.4
Appendix D: Lower bound on CPIC/CPC
We compute the CPIC/CPC ratio for a company with average effect size. This estimate
gives a lower bound on the average CPIC/CPC ratio as CPIC/CPC is a convex function in
the effect size:
xp1
CP IC/CP C =
p 1 − p0
Fix the xp1 and assume the effect size p1 − p0 > 0. Denote the LHS of the CPIC/CPC
equation as f (∆(p)), where ∆(p) = p1 − p0 Then, by Jensen’s inequality, E(f (∆(p))) >
f (E(∆(p))). Hence, computing CPIC/CPC in the average effect point E(∆(p)) gives a
lower bound on the expected value of CPIC/CPC.
38
6.5
Appendix E
Figure 14: Probability to get a click for firms that bid on their own brand keywords
(a) Cap 0
(b) Cap 0 versus Cap 1
39
6.6
Appendix F: Competition and Brand Capital Relationship
Table 11: Companies with higher brand capital have less competition
Dependent variable:
Frequency of competitor in ML2
(1)
(2)
0.052∗∗∗
(0.009)
Constant
−0.126∗
(0.075)
0.028∗∗∗
(0.009)
−0.023∗∗∗
(0.008)
−0.022∗∗∗
(0.003)
−0.039∗∗∗
(0.012)
0.483∗∗∗
(0.109)
Observations
R2
Adjusted R2
492
0.065
0.063
492
0.216
0.209
Note:
∗
log(us)
Deeplinks
Dcard
Number of own organic links
40
p<0.1;
∗∗
p<0.05;
∗∗∗
p<0.01