The Adoption of Energy Efficient Light Bulbs and Information

The Adoption of Energy Efficient Light Bulbs and Information Diffusion of Multidimensional
Product Characteristics: A Text-Mining Analysis Using Amazon Product Webpages
Yu-li Ko1 & Kenneth L. Simons2
Date of This Version:
5/23/2016
WORKING PAPER: Please do not cite or circulate without permission.
* This work was supported in part by the Engineering Research Centers Program of the National Science
Foundation under NSF Cooperative Agreement No. EEC-0812056 and partially by New York State under NYSTAR
contract C090145. Any Opinions, findings and conclusions or recommendations expressed in this material are those
of the author(s) and do not necessarily reflect those of the National Science Foundation or New York State.
1
2
Rensselaer Polytechnic Institute, [email protected]
Rensselaer Polytechnic Institute, [email protected]
The Adoption of Energy Efficient Light Bulbs and Information Diffusion of
Multidimensional Product Characteristics: A Text-Mining Analysis Using Amazon Product
Webpages
1. Introduction
Currently, electric lighting consumes almost 20% of global electricity production (Waide
& Tanishima, 2011). Fast adoption of technologies and further innovations can effectively
reduce energy consumption in the lighting sector. In this context, policies have supported two
historically important changes in the electric lighting industry: phase-out of inefficient
incandescent lamps and the adoption of efficient light-emitting diode (LED) lamps. However,
energy efficient products tend to diffuse slowly despite their cost-saving benefits (Jaffe &
Stavins, 1994a; Jaffe & Stavins, 1994b; Sorrell, 2004), and the previous experience with
compact fluorescent lamps is a well-known example of slow adoption of energy efficient
products. Multiple explanations for the “energy paradox” have been suggested, such as risk,
hidden costs, difficulties in financing, split incentives between renters and landlords, bounded
rationality, and imperfect information (Sorrell, 2004). While all these reasons would play a role,
this study focuses on imperfect information, specifically, consumer communication regarding
unobservable product characteristics. If consumers communicate less about unobservable
product characteristics, information on improved efficiency may not diffuse as fast as observable
characteristics, resulting in a suboptimal adoption rate. To understand the problem, 103,003
product reviews web-scraped from Amazon.com were classified into product characteristic
categories, including efficiency, color, brightness and the price of the bulb.
This study is evolved from a Lancasterian perspective of multidimensional product
characteristics (Lancaster, 1966), combining insights from studies on search goods, experience
goods, and credence goods (Nelson, 1970, 1981; Darby & Karni, 1973). The classification of
search goods, experience goods, and credence goods depends on how easy it is to gather
information for consumers (Nelson, 1970; Darby & Karni, 1973). A search good is a product for
which consumers can assess its value before purchase. For an experience good, consumers can
assess its value after purchase. For a credence good, it is hard for consumers to assess its value
because assessment requires specialized knowledge and/or high cost. Later literature tends to
emphasize “search attributes”, “experience attributes,” and “credence attributes” (Nelson, 1981),
rather
than
product-level
classifications
embracing
a
Lancasterian
perspective
of
multidimensional product characteristics. This study applies such attribute categories to
understand technology adoption better, especially energy efficiency improvements. Unbalanced
information diffusion, originated from the cost of evaluation, may adversely impact the adoption
of technological change in credence-good attributes, like energy efficiency.
To empirically assess this claim, a text-mining method is applied to 103,003 consumer
reviews collected from Amazon product webpages in the light bulb category between September
2010 and June 2014. The comments were first parsed into words. The 2,000 most frequent words
are detected by text-mining, and manually classified into categories of most discussed product
characteristics including efficiency, longevity, price, brightness, color, and heat emission. By
using frequent words, the analysis focuses on major attributes. These product characteristics are
further classified into search attributes, experience attributes, and credence attributes. Efficiency
and longevity are classified as credence attributes, price is classified as a search attribute, and
brightness, color and heat emission are classified as experience attributes. Then, total counts of
comments for each characteristic/attribute are calculated by month. Using these data, consumers’
communication for each characteristic/attribute is quantified.
Prior studies analyzed product attribute categories using Amazon reviews. Mudambi and
Schuff (2010) studied whether product attribute categories affect the subjective helpfulness of
the review. They picked a music CD, an MP3 player, and a video game to represent experience
goods, and a digital camera, a cell phone, and a laser printer to represent search goods. This
classification fits the dominant attributes of products studied in prior literature. Their statistical
results suggest that extreme reviews are less helpful for experience goods compared to search
goods. Hong, Chen, and Witt (2012) developed theoretical models to classify products into
experience goods and search goods using consumer reviews. They applied the theory using
Amazon reviews based on the theoretical models. Siering and Muntermann (2013) analyzed
review contents to understand how reviews discussing credence goods differ from reviews
discussing search or experience goods, and found differences in product evaluations, quality
assessment, and expressed feelings of need. While the prior studies reveal how product attribute
categories affect consumer reviews, this study focuses on consumer communication about
product attributes within a product category.
Among 103,003 comments, 55% are about LED bulbs, 20% are about CFL bulbs, 12%
are about halogen bulbs, and 13% are about incandescent bulbs. According to the U.S. Energy
Information Administration (2014), the market share of LED bulbs in 2013 was 2.3%, and that of
CFL bulbs was 15%-20%. A clear pattern is that consumers’ tendency to talk was greater for
LED light bulbs, which are new products. In all, 26.6% of comments discussed price. Since price
is shared information across all consumers, the marginal utility of this information to other
consumers is nearly zero, and the marginal cost of observation is also nearly zero. No risk of
fraud or ambiguity is associated with this information. Therefore, this can be a benchmark value
to understand the fraction of other product characteristics. For the three experience attributes,
32.4% of comments discussed brightness, 26.8% discussed color, and 6.0% discussed heat. For
the two credence attributes, 9.8% discussed efficiency and 2.6% discussed longevity. The 9.8%
frequency of efficiency comments is not much higher than the 6.0% frequency of heat-related
comments. Considering that heat is not a main characteristic of a light bulb, 9.8% is surprisingly
low. In sum, 26.6% of comments were classified as search-attribute related, 47.3% of comments
were classified as experience-attribute related, and only 12.0% of comments were classified as
credence-attribute related. The results suggest that consumer communication is biased towards
experience attributes such as brightness and color, rather than credence attributes such as
efficiency. Results are mostly similar across technologies and over time.
The results support the need for command-and-control policies like phase-out of
incandescent bulbs, active public advertisement, and education. Policies must aid the diffusion of
information on credence attributes such as energy efficiency. Information on technological
change in efficiency may not diffuse as fast as observable characteristics, and this can create two
disadvantages for incentivizing further improvement in energy efficiency. First, products with
superior efficiency may experience a smaller market than is socially desirable, disincentivizing
technological change in efficiency improvement. Second, social reputations for technological
change in efficiency improvement may be insufficient, psychologically discouraging innovators.
This bias found in consumer communication on light bulbs has broader and general implications
on technology policies, suggesting the need to promote technological change in unobservable
characteristics more intensively.
In section 2, light bulb technologies and some history are briefly introduced. In section 3,
research topics are explained. In section 4, the data and method are explained. In section 5,
results are summarized. The last section provides a summary and discussion.
2. Light Bulb Technologies
Currently, the global light bulb market is served mainly by four technologies:
incandescent lamps, halogen lamps, compact fluorescent lamps (CFLs), and LED lamps. These
lamp types differ on a key dimension of technological change, energy efficiency. For electric
lighting, energy efficiency means the degree to which electrical energy input is converted to light
output. Light output per unit of electrical energy is commonly measured in units of lumens per
watt. Energy efficiency of the four major lighting technologies is reported in Table 1. For climate
change abatement, improvement in energy efficiency has become important.
Most incandescent bulbs convert less than 5% of electrical energy to light, and the other
95% is mostly wasted as heat. As seen in Table 1, energy efficiency of 60-watt incandescent
light bulbs is only 16 lumens per watt. In addition, bulb life is very short. Because of their low
efficiency, governments have been mandating a phase-out of incandescent lamps. Table 2
summarizes phase-out policies of major industrialized regions.
The efficiency of halogen bulbs is not much better than that of incandescent bulbs, but
life is significantly longer. Halogen lamps are incandescent lamps with small amounts of halogen
elements, such as iodine or bromine, added. The halogen elements help the incandescent filament
last longer.
CFLs were introduced in the 1970s. The efficiency of 60-watt CFLs is 67 lumens per
watt, which is significantly better than for incandescent or halogen lamps. Bulb life is also longer.
The adoption rate of CFLs has varied greatly by country. In the U.S., adoption has been
especially low despite supportive government policy.
LED lamp technology is the most recent technology. The efficiency and longevity of
LED lamps exceeds previous technologies such as incandescent lighting, CFLs, and halogen
lamps, and there is still good potential for efficiency improvement. However, up-front cost has
been relatively high. It is expected that technological change can be fast, further reducing
production cost and improving efficiency. Despite the policy objective, adoption has been slow.
As of 2013, the market share of LED bulbs in 2013 was only 2.3%.
3. Research Topics
The cost of electric lighting using existing fixtures consists of two components. The first
component is the price of the light bulb distributed and discounted over the life of a light bulb.
The second component is electricity cost, which is reduced by improvements in energy
efficiency. With improvements in efficiency and longevity, energy efficient light bulbs reduced
the cost of electric lighting, even taking account of the price of the light bulb (Sanderson &
Simons, 2014). However, energy efficient light bulbs have experienced slow diffusion. In
general, energy efficient products are known to diffuse slowly despite their cost saving features.
An old debate in energy economics has addressed this “energy efficiency gap” or “energy
paradox.” It has been argued that consumers are unlikely to adopt energy efficient technology
quickly even if it is beneficial to them. For example, an empirical study concludes that
consumers appear to be “indifferent between one dollar in discounted future gas costs and only
76 cents in vehicle purchase price” (Allcott & Wozny, 2012, p. 31). Multiple explanations have
been suggested for the energy paradox, such as risk, hidden costs, difficulties in financing, split
incentives between renters and landlords, bounded rationality, and imperfect information (Sorrell,
2004).
This paper researches biased diffusion of information on product characteristics, which
can be unfavorable to efficiency. When a product has multidimensional characteristics, and cost
of evaluation differs by each attribute, information diffusion may not be balanced across all
product characteristics. Such unbalanced information diffusion can adversely impact adoption of
technological changes in credence-good attributes, in this case energy efficiency. This implies
that information on technological change in efficiency may not diffuse as fast as observable
characteristics, resulting in a suboptimal adoption rate of energy-efficient products. This can
hinder a critical climate change abatement method.
A product could have multiple characteristics (Lancaster, 1966). For example, in the case
of electric lighting, brightness and color are dominant product characteristics. If consumers talk
less about some product characteristics, two main reasons may exist. On one hand, each
characteristic’s contribution to utility can differ across products, and the difference may provoke
consumer communication. A 60-watt-equivalent replacement LED light bulb may be brighter or
more bluish compared to a particular previously-experienced 60-watt incandescent light bulb,
and a consumer may or may not like the brightness or color difference. Actual variance in
characteristics and consumers’ utility from each characteristic can affect what characteristics
consumers talk about.
On the other hand, consumers may talk less about some characteristics due to the cost of
observation, and this part is the main research topic of this paper. Not all product characteristics
are equally easy to evaluate. Products can be classified as search goods, experience goods, and
credence goods depending on how easy it is to gather information about them for consumers
(Nelson, 1970; Darby & Karni, 1973). A search good is a product for which consumers can
figure out its value before purchase. An experience good is a product for which consumers can
figure out its value after purchase. A credence good is a product for which it is hard for
consumers to figure out its value, because evaluation requires specialized knowledge and/or high
cost. A later literature tends to emphasize “search attributes,” “experience attributes,” and
“credence attributes” (Nelson, 1981). Brightness and color are experience attributes while
efficiency is a credence attribute. The study investigates whether consumers communicate less
about credence good characteristics like efficiency.
4. Data and Method
4.1 Data Collection
To understand biases in consumer communication, online consumer reviews can be a
good source of information. Studies using online reviews have become popular in academia. For
e-commerce, consumer comments are an essential part of consumer communication.
Product webpages were collected from the U.S. Amazon internet retailer, well known by
its internet address of amazon.com. Amazon consumer reviews are representative and relatively
reliable. Amazon is the world’s largest online retailer, with net sales of $61.09 billion in 2012.
Alexa Internet, Inc. ranks Amazon as having the world’s sixth-highest internet traffic rank as of
April 3, 2016. Amazon consumer ratings show good correlation with expert ratings (Mudambi &
Schuff, 2010).
The collected products belong to the light bulb category. For this study, subcategories are
restricted to incandescent bulbs, halogen bulbs, CFL bulbs (excluding fluorescent tubes), and
LED bulbs, because these are the most common residential-use bulbs. The text was web-scraped
during July 8, 2013 to early July 2014, and is an almost complete collection of consumer reviews.
This dataset has many strong points. It is an almost complete collection of reviews in the
period, avoiding sample selection issues. Use of textual instead of spoken words facilitates
processing by computer. With the growth of e-commerce, understanding online communication
is important by itself. One advantage over interviews is that there is no personal interaction
between interviewees and interviewers.
Use of Amazon reviews may yield differences compared to the general US or world
populations. Review writers are all web users, and mainly US consumers. The writers’ age
distribution may be biased somewhat toward a younger generation. In the U.S., CFL penetration
was largely unsuccessful, signaling that U.S. consumers may be less sensitive to energy
efficiency than consumers worldwide. Consumers who experience a large gap between
expectation and reality may be disproportionately likely to leave comments, and ratings are
likely to follow a J-curve distribution (Hu, Pavlou, & Zhang, 2009). One- or two-star customer
ratings would generally signal a low level of consumer surplus while four- or five-star ratings
would signal a high level of consumer surplus. Consumers are likely to say nothing about
product characteristics if they experience moderate satisfaction. In this period, incandescent
lightbulbs were phased-out in the U.S., with production of 100-watt bulbs ceasing in 2012, then
75-watt bulbs ceasing production, then 60- and 40-watt bulbs ceasing production in 2014.
Producers could still sell incandescent bulbs. While the phase-out was anticipated by producers
and many consumers, the majority of incandescent light bulb consumers may not have known its
details. This event, and other U.S. government policies, may have provoked awareness of
efficiency, so it is possible that this increased comments using efficiency-related words. Since
Amazon’s page views greatly increased in this period, newer time periods and products are also
represented more than older periods and products.
4.2 Data Preparation Method
4.2.1 Data Clean-Up
Originally 269,462 comments in the light bulb category were collected, including revised
versions of the same review on the same product by the same reviewer. The original dataset went
through clean-up processes. As a result of clean-up, 103,003 comments were left.
First, unusual lighting products were excluded from the dataset, to focus on normal white
light bulbs. Color-changing bulbs, LED strip lights, and lanterns were excluded. Amazon’s
categorization of products is not perfect, so exclusions to focus on normal white light bulbs may
be imperfect.
Second, only comments written between September 2010 and June 2014 are used. In the
early part of the original dataset, few reviews were available. Amazon’s growth is one reason for
an increased number of reviews over time, as is accumulated consumer experience writing
reviews so that consumers begin to leave reviews with increased frequency. If an Amazon seller
inactivates a product, the associated product web page is no longer searchable. Without
truncation, old reviews would over-represent products that survived. Amazon employees were
consulted regarding this issue, but no information was available from them. From September
2010, the number of comments per month for each product category exceeded 30, which was
chosen as a cutoff for sample inclusion. Data for early July 2014, at the end of the sample, were
excluded so that when data are reported on a monthly basis, analyses include (nearly) all reviews
during each month.
4.2.2 Keyword Classification
Text mining is a computerized method that processes natural language text to find
patterns, and recent research in economics makes use of the technique. In this paper, a comment
is classified into a subject category if the comment contains keywords associated with the
category. To classify comments, a list of frequent words was constructed by text-mining, and the
2000 most frequent words were manually classified, where relevant, into product characteristics
categories. Words not associated with a product characteristic were ignored. The classification is
not mutually exclusive. For example, if a comment contains “white” and “efficiency,” the
comment is classified into the category of color and also into the category of efficiency. Since
the classification depends on the pool of keywords, selecting keywords is a central problem of
data preparation for this study.
The critical point of this classification is to minimize the possibility of incorrect
classification of a comment. A sufficient number of keywords should be used, and ambiguity
should be investigated and reduced.
To avoid biases in selecting keywords, a list of most frequent words in the reviews is
generated first. Among 42,215 words, the most frequent 2,000 words were examined manually.
Keywords from the list were identified that consumers use to describe product characteristics.
These keywords were categorized into product characteristics including color, brightness, heat,
efficiency, longevity, and price. The classification did not start with preset categories to avoid
bias. Since the classification is based on the most frequent keywords, these are product
characteristics that consumers talk about often. It would be impractical to go through all 42,215
words, and largely meaningless to do so. Examination halted at the 2,000th most frequent word,
because meaningful keywords were rarely found near the 2,000th word.
Note that the
distribution of words usually follows a power-law distribution, and the frequency of the most
frequent words is overwhelmingly high compared with the frequency of less frequent words. For
example, the 2,000 most frequent words are 4.7% of the single words that ever appear, but
represent 87.4% of the words in the text. Therefore, neglecting low frequency words is less
problematic.
Naturally, rare terms are not included, and thus truncation of the words used may differ
across product characteristics. To illustrate the problem, consider a total set of words that a
person can mention to talk about a certain topic. For example, for color, a total set of color-
related terms can cover the spectrum of visible light, and these terms can capture comments that
talk about color very well, as the set of words is complete and dense. At the same time, some
color-related terms are rarer than other terms. Violet would not be as frequent as red. Since 2,000
frequent words are used, rarer color-related terms are not captured. If few words are associated
with a product characteristic, the associated words may be more likely to appear within the list of
2,000 most frequent words. This would be a problem if the words that people use to describe a
product characteristic are overall very rare and differ across people, causing the words not to
make it to the top 2,000 list. However, since the 2,000 most frequent words cover 87.4% of the
words in the text, the problem is likely to be negligible.
In case there is no ambiguity, a word is easy to classify. For example, “white” can be
classified in the category of color without ambiguity. Table 3 is the list of non-ambiguous words
and their classifications. This non-ambiguity takes account of the context of light bulb reviews.
When “blue” is detected, it will be interpreted as blue color. It is possible that it means blue
emotion, but within the context of light bulb reviews, it is judged that ambiguity is insignificant.
Ambiguity of a word can be a great source of biases, so is treated carefully. An
assumption in the frequency analysis is that the distribution of signs, namely words and
collocations, reflects the distribution of meanings. Text analysis favors rare and technical terms
that assign one sign to one meaning. In case of frequent and non-technical terms that assign one
sign to multiple meanings, text analysis may be biased. Since this analysis is about product
characteristics, it is likely that a term is assigned to one meaning. However, there are cases that
require caution. For example, when “cool” is detected, it will be interpreted as cool color, but it
is possible that it means cool temperature or something nice.
If at least one keyword appears in the text, the text is classified under the associated
characteristic. Therefore, if a consumer uses multiple keywords in her review, classification is
likely to become more accurate. For example, if a consumer writes that “the bulb’s color is
white,” the review is categorized as related to color for two reasons, both “color” and “white.”
The availability of multiple keywords for a characteristic thus reduces ambiguity and chance
fluctuations in classification.
There are some resolutions for the remaining ambiguity, and each has its own problems.
First, ambiguous terms can be neglected. Second, one ambiguous term can be assigned to one
characteristic category according to a manually analyzed distribution. Third, ambiguous terms
can be assigned to multiple characteristics categories according to a manually analyzed
distribution. For example, suppose that 4 out of 10 appearances of ‘cool’ are temperature cool,
and 6 out of 10 appearances of ‘cool’ are color cool (meaning a bluish white light). Then, the
first way neglects cool due to ambiguity, the second way assigns all the comments that include
‘cool’ to color cool, and the third way assigns 60% of all the comments that include cool to color
cool and 40% to temperature cool. In this paper, the second way is used, and to complement the
method, a word strongly associated with the word is used if possible. For each ambiguous word,
twenty comments that include the word were randomly retrieved and examined to determine
classification rules. A list of ambiguous words and their treatment methods is found in Table 4.
The first way is sometimes used when meanings did not generally pertain to any of the product
characteristics analyzed. Two-word phrases that more specifically capture a meaning were also
sometimes used; for example, when ‘long’ is neglected for ambiguity, “last long” is included to
capture longevity-related comments.
Technical words that describe product characteristics are included in the list of keywords,
because it appears that customers do use technical words to talk about product characteristics.
For example, lumen and output are used to describe brightness. The words 2700k, 3000k, 6000k,
and kelvin are used to describe color temperature, so these are classified as color-related
keywords.
As seen in Table 5, it may look like the number of keywords for each category unfairly
differs, but this is not a problem. For example, there are 17 color-related keywords while there
are only 5 brightness-related keywords. This reflects two underlying patterns in the dataset.
Color-related keywords are indeed frequent, and color related terms have a finer spectrum. In
comparison, brightness-related terms are simpler using only variants of bright and dark. Thus,
there is an inherent difference in the number of terms in English language. However, the
numbers of comments classified with these keywords are aggregated by product characteristics,
so the number of keywords for each category is unlikely to cause any bias. In fact, the analysis
results indicate that the number of comments for brightness is generally higher than the number
of comments for color despite the smaller number of color-related keywords.
The product characteristics are further classified according to whether they are search,
experience, or credence characteristics. Color and brightness are experience attributes, because
they are observed after purchase. Heat is also an experience attribute. The price of the bulb is a
search attribute. Efficiency and longevity are credence attributes. The borderlines between these
categories are not necessarily sharp. Longevity is technically observable. However, measurement
of longevity is costly, in terms of time and record-keeping, and thus it is classified as a credence
attribute.
4.3 Method
Simple statistical methods are used. The fraction of reviews discussing each product
characteristic and each attribute (search/experience/credence) is calculated and compared. From
the dataset, the frequency with which reviews mention each characteristic is calculated, as the
number of reviews that match a certain product characteristic category, divided by the total
number of reviews. If these rates are overall higher for experience attributes, this would support
the claim that information on experience attributes is easier to diffuse. Since the dataset is an
almost complete collection of the consumer reviews in the light bulb category on Amazon,
simple statistics directly represent the population. Additionally, to check the robustness of the
results, comments on ‘Energy Star’ products, differences across technologies, and temporal
trends are studied.
First, differences across product characteristic and search/experience/credence attributes
by bulb technologies are examined. LED bulb reviewers may be more sensitive to credenceattributes such as efficiency and longevity.
Second, the monthly dynamics of the fraction of reviews discussing product
characteristics are studied to see if there are temporal trends. From the dataset, the frequency
with which reviews mention each characteristic is calculated on a monthly basis, as the number
of reviews that match a certain product characteristic category, divided by the total number of
reviews that month. Since this period went through technological change, including phase-out of
incandescent bulbs and commercial development of LED bulbs, an examination of temporal
trends allows consideration of potential impacts of these shifts. New products, especially LED
products, were introduced over time, and might have more comments. Over the time period
analyzed, LED bulbs’ market share increased. The industry experienced severe price reductions,
and LED bulbs benefited from technological change, yielding cheaper, better looking, brighter,
more-reliable bulbs with better dimming and other features.
Third, the fraction of reviews discussing each product characteristic and each attribute
(search/experience/credence) is calculated and compared with the dataset restricted to products
that explicitly include ‘Energy Star’ in the product description. Energy Star is a U.S.
Environmental Protection Agency program that certifies energy efficient products. For the
Energy Star analyses, 31,958 comments (31% of the total comments) are analyzed. This analysis
is done to check whether consumers’ tendency to talk about search or experience attributes is
affected by information on efficiency from the product descriptions.
5. Results
First note that the majority of comments was about LED bulbs even though the market
share of LED bulbs was low, as shown in Figure 1. Note that 55% of the total comments are
about LED bulbs, 20% are about CFL bulbs, 12% are about halogen bulbs, and 13% are about
incandescent bulbs. It appears that consumers talked more about the new technology than the
incumbent technology. The number of LED bulb products for sale became substantially higher
than for the incumbent bulb types, so the mean number of comments per product was not
necessarily higher.
Table 5 Column (1) summarizes the aggregated results. Among 103,003 comments, 26.6%
of comments discussed price, which is a search-attribute. Since price is shared information
across all consumers, the marginal utility of this information to other consumers is nearly zero,
and the marginal cost of observation is also nearly zero. No risk of fraud or ambiguity is
associated with this information. Therefore, this can be a benchmark value to understand the
normalized frequency of other attributes. For the three experience attributes, 32.4% of comments
discussed brightness, 26.8% discussed color, and 6.0% discussed heat. For the two credence
attributes, 9.8% discussed efficiency, and 2.6% discussed longevity. In sum, 26.6% of comments
were classified as search-attribute related, 47.3% of comments were classified as experienceattribute related, and only 12.0% of comments were classified as credence-attribute related. For
bulbs, consumers were highly likely to talk about experience attributes, and fairly likely to talk
about search attributes. They were least likely to talk about credence attributes.
Table 5 Columns (2)-(5) summarize the results of the analysis by bulb technology. While
differences are observed, the fraction of experience attributes is clearly higher than the fraction
of credence attributes across all bulb technologies.
Figure 2 depicts the monthly trend of reviews that are related to bulb price, which is a
search-attribute. The fraction for price mostly stays between 25% to 35%, and a falling trend is
observed. Thus, 25-35% of customer reviews discuss price. Figure 3 gives the same information
for different bulb types. The decrease in discussion of price occurred across all bulb types, most
notably for incandescent and LED bulbs. Discussion of price was greatest for halogen bulbs,
which are relatively high-priced among incumbent bulb types, and for LED bulbs, whose price
started out high but has declined rapidly.
Figure 4 summarizes monthly trends of experience attributes, including color, brightness,
and heat. The trend shows little variance over time. Among the customer reviews, 25-35%
discuss color, 30-40% discuss brightness, and 5-10% discuss heat. The fraction for brightnessrelated keywords is similar. Keywords for heat, which is not a dominant attribute of a light bulb,
appear in around 5-10% of comments. It appears that the marginal utility change from heat is
relatively minor to ordinary consumers, since heat is not the main function of a bulb and does not
ordinarily pose problems. Color and brightness, however, are central aspects of consumer utility
from light bulbs, and the importance is likely to be reflected in the fractions along with the low
cost of observation.
Figure 5 summarizes monthly trends of credence attributes, including efficiency and
longevity. The percentages are fairly stable over time. Roughly 10-15% of reviews discussed
efficiency, and about 3% discussed longevity. Thus, it appears that consumers are less likely to
write about efficiency or longevity. This confirms the hypothesis that credence attributes are less
like to be discussed or communicated among consumers.
Figure 6 shows the monthly trend for efficiency, by bulb technology. LED and CFL
reviews were most likely to include efficiency-related keywords, with about 7-15% of CFL
reviews discussing efficiency, and roughly 7-17% of LED reviews discussing efficiency. The
discussion of efficiency for CFL bulbs trended downward over time. For incandescent and
halogen bulbs, efficiency was discussed in about 7-10% of reviews, albeit with considerable
fluctuation over time partly due to relatively small sample sizes. It appears that reviewers of LED
and CFL products, which might be chosen for efficiency reasons, were most likely to consider
efficiency in assessments of light bulb products. Nonetheless, discussion of this key attribute of
light bulbs is very infrequent.
LED light bulbs experienced substantial efficiency gains over this period, whereas
incandescent, halogen and CFL bulbs were mature technologies with little if any efficiency
improvement over the sample period. One might expect that the changes in efficiency over time
would stimulate more discussion, but any such effect regarding efficiency apparently was very
modest. Improvements in other aspects of LED bulbs were also important, including for price,
color, brightness, heat, and longevity.
6. Discussion
Only 9.8% of comments discussed efficiency. For comparison, 26.6% of comments
discussed price, and 32.4% of comments discussed brightness. In general, while 47.3% of
costumer reviews consider experience attributes, only 12.0% consider credence attributes. The
9.8% considering efficiency is only slightly higher than the fraction of heat-related comments,
which is 6.0%. Since heat is not a main characteristic of a bulb, 9.8% is surprisingly low. This
substantial difference in frequency is consistent with information on efficiency being less likely
to diffuse relative to information on color or brightness. It appears that experience attributes are
much more likely to be mentioned in review comments than credence attributes.
As efficiency-related terms are less often communicated compared to experience-related
terms, the rate of information diffusion is affected. This implication for the rate of diffusion has
consequences for adoption and innovation. If efficiency improvement is not associated with
improvements in experience attributes – and this is generally true for LED lighting – then the
products that embed efficiency innovations would not be adopted rapidly. This would disincentivize innovations in efficiency. This also suggests an incentive for biased technological
change, favoring improvements in experience attributes. Technological change in credence good
qualities may be slower.
Facing this, on the firm’s side, first, it is competitively essential to improve experience
and search good qualities along with any improvements that might be made in credence good
qualities. In the case of LEDs, the changing technology is changing in multiple characteristics,
and improvement in energy efficiency alone is not sufficient for rapid diffusion of the technology
when consumers weigh other characteristics more than unobservable energy efficiency. Superior
performance in light quality should be coupled with better efficiency. Improvement in efficiency
alone is not sufficient for fast diffusion when a product has multiple characteristics. This would
be one reason for the energy efficiency gap. When a technology is bundled, it is desirable to
invest more in improving other characteristics to some extent. For new technologies, the
potential to improve other characteristics, that are more salient to consumers, should be assessed.
Cost is not the only concern of consumers, and new technologies with improvement in one
characteristic may not satisfy the needs of consumers for other characteristics. Second, this
credence good quality can be diffused by expert consumers. Expert consumers might be trusted,
if their expertise becomes apparent in reviews, and their ratings and uses may affect future
consumers. Targeting this group may help. Consumers with technical knowledge often
voluntarily provide information including information about credence good quality. Technical
comments often are written in the early stage of product introduction. This would imply a role of
consumer experts in diffusion, especially for products with important credence good qualities.
The results support two government policies. First, to spur adoption, it may be helpful to
support technological changes in other characteristics along with efficiency, even though
efficiency improvement is the main policy target. This implies that technology policy that only
promotes energy efficiency may be likely to fail. If government subsidy is given only to improve
energy efficiency, the resulting products may be of ordinary, or even low, quality in terms of
other characteristics that are salient to consumers, causing low adoption. Thus government
agencies supporting efficiency improvement should also support improvement of other
characteristics. A gap in perceived product quality may explain why some consumers still use
incandescent lamps, in preference to CFLs and to early generations of LED bulbs. LED products
with better color, brightness, and other observable features can penetrate the market faster, by
building on improvements in experience attributes. Utility gains from characteristics other than
energy efficiency are high, and energy-efficient products do not yield enough utility to overcome
uncertainties or a lack of information about efficiency. This yields complex implications for the
phase-out of incandescent bulbs. Efforts to improve light quality may be reduced without
competitive pressure from incandescent bulbs, although competition among CFL and LED bulbs
still provide competitive pressure to improve these features. On the other hand, if consumers are
not aware of some information, phase-out may be justified.
Second, the lack of consumer attention to important non-salient features justifies the need
for command-and-control policies like phase-out of incandescent bulbs, active public
advertisement, and education. Globally, governments have pursued the policies, but policies can
be more intensive. More active top-down information diffusion can be effective. It would also be
helpful to make the credence good quality visible. Technological change to improve visibility of
credence good qualities, such as a more visible electricity meter, or internet-of-things data
collection leading to display of the electricity cost of specific devices, may be helpful. In this
way, credence good qualities like efficiency might be converted into experience or search
qualities. Energy auditing also may help to make credence information visible. Education or
advertisements may help. Incentivizing expert citizens may supply more information.
Technological literacy differs across people, and information cost differs, so methods to simplify
the information and its flow may help; for example if the term “efficiency” is too technical and
ambiguous, “dollar savings in your area” may better communicate information.
The U.S. government and other governments have actively tried to address some of these
problems of consumer information diffusion for energy-saving products including light bulbs.
The U.S. Energy Star program is a government labeling campaign that has provided trustworthy
information, for products including light bulbs, about efficiency. In addition, the U.S.
Department of Energy’s Lighting Facts program requires LED light bulb manufacturers to
provide detailed information on product characteristics including brightness, color, and
efficiency on their product labels, based on test results that follow industry standard procedures
(U.S. Department of Energy, 2012).
The program provides credence to consumers and
incentivizes firms to improve multiple characteristics. The U.S. Department of Energy has
published reports on product characteristics of energy efficient light bulbs. However, this may
not sufficiently help fast diffusion of information on credence attributes. Many academic and
government reports already suggested that more active information diffusion policy may be
necessary such as consumer education and public advertisement, and the results of this paper
supports the direction.
7. Conclusion
To understand the role of imperfect information in the energy paradox, 103,003 Amazon
review comments on light bulbs were analyzed with text-mining methods. A new method is
suggested based on search attributes, experience attributes, and credence attributes classification.
While 30% of comments include color-related keywords and a similar fraction of comments
include brightness-related keywords, only 9.8% of comments include efficiency-related
keywords. This fraction is only slightly higher than the fraction of heat-related comments, which
is 6.0%. The fraction of comments including longevity is much lower. It appears that the pattern
is consistent across different bulb types.
The evidence confirms that efficiency is not a characteristic for which information is easy
to diffuse relative to observable characteristics, especially color and brightness. If information on
efficiency is not easy to diffuse despite cost savings, the rate of adoption for energy efficient
products will be slow.
The study has general implications for technological change. It is likely that
technological change that cannot be directly observed by consumers would be slow due to slow
diffusion. This type of technological change is less incentivized. Public communication of
characteristics including credence characteristics can be important, as is further confirmed by
U.S. government efforts to diffuse this sort of information. Especially when unobservable
characteristics involve an externality, this is potentially problematic.
There are some weaknesses associated with text-mining. Text-mining can yield noisy
results despite efforts to minimize noise. In future studies, to check sensitivity, two methods can
be used. First, only helpful reviews might be used to see if there is difference for good quality
reviews. Reviewers who write good quality reviews may write more about credence-good
attributes. Second, only highly rated products might be used to see if there is a difference for
good quality products. Reviews who buy good quality products may write more about credencegood attributes. Noise may be further reduced. Analysis might be restricted to certain bulb types
such as 60W and 40W equivalents. Since the impact of phase-out differs by wattage of bulbs,
which decides variable cost per bulb, the difference in comments can be analyzed in future
studies. This might provide unique evidence on the diffusion side of the energy paradox. Cleanup might be done better to address collocated words, such as the word pair “warm up.”
More extended research could be done. Similar analysis might be done with other
products, including other energy-efficient products. Information on credence good qualities may
be more likely to be spread by expert consumers, who have better access to and knowledge about
credence information, and this issue might be studied. Combining the work with sentiment
analysis, it can be studied whether incandescent light bulb consumers leave less technical and
more emotional reviews. Cross validation might be made using social networking services like
Twitter and Facebook. The same framework might be applied to other products, including other
energy-efficient products. Firm dummies might be included to see how company dominance is
established in the new LED sector. Further studies are needed to reveal more details.
References
Allcott, H., & Wozny, N. (2012). Gasoline prices, fuel economy, and the energy paradox. (No.
w18583). NBER Working Paper. Cambridge, MA: National Bureau of Economic Research.
Darby, M. R., & Karni, E. (1973). Free competition and the optimal amount of fraud. The
Journal of Law & Economics, 16(1), 67-88.
Hong, Y., Chen, P. Y., & Hitt, L. M. (2014). Measuring product type with dynamics of online
product review variances: A theoretical model and the empirical applications (No. 14-03).
NET
Institute
Working
Paper.
Retrieved
from
http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2506328 (Date Last Accessed, April, 16,
2016)
Hu, N., Pavlou, P. A., & Zhang, J., (2009) Why do online product reviews have a J-Shaped
distribution? Overcoming biases in online word-of-mouth communication. Communications
of the ACM, 52 (10), 144-147. Retrieved from http://dx.doi.org/10.2139/ssrn.2380298 (Date
Last Accessed, April, 16, 2016)
Jaffe, A. B., & Stavins, R.N. (1994a). The energy-efficiency gap what does it mean? Energy
Policy, 22(10), 804-810.
Jaffe, A. B., & Stavins, R.N. (1994b). The energy paradox and the diffusion of conservation
technology. Resource and Energy Economics, 16(2), 91-122.
Lancaster, K. J. (1966). A new approach to consumer theory. The Journal of Political Economy,
74(2). 132-157.
Mudambi, S. M., & Schuff, D. (2010). What makes a helpful review? A study of customer
reviews on Amazon.com. MIS Quarterly, 34(1), 185-200.
Nelson, P. J. (1970). Information and consumer behavior. Journal of Political Economy, 78(2),
311-329.
Nelson, P. J. (1981). Consumer information and advertising. In M. Galatin & R.D. Leiter (Eds.),
Economics of Information (pp. 42-77). Hingham, MA: Martinus Nijhoff Publishing Kluwer
Boston, Inc.
Sanderson, S. W., & Simons, K. L. (2014). Light emitting diodes and the lighting revolution:
The emergence of a solid-state lighting industry. Research Policy, 43(10), 1730-1746.
Siering, M. & Muntermann, J. (2013, August 16). Credence goods and online product reviews:
An exploration of the product type concept in the social commerce era. Paper presented at
The 19th Americas Conference on Information Systems: Hyperconnected World: Anything,
Anywhere,
Anytime,
Chicago,
IL.
Retrieved
from
http://aisel.aisnet.org/amcis2013/eBusinessIntelligence/GeneralPresentations/9/
Accessed, April, 16, 2016)
(Date
Last
Sorrell, S. (2004). The economics of energy efficiency. Cheltenham, UK: Edward Elgar.
U.S. Department of Energy (2012). LED lighting facts partner participation manual version 4.0.
(n.p.): Author. Retrieved from
http://www.lightingfacts.com/Downloads/LF_PartnerParticipationManual_V4.pdf (Date Last
Accessed, April, 16, 2016)
U.S. Energy Information Administration. (2014, November 4). LED light bulbs keep improving
in efficiency and quality. (n.p.): Author. Retrieved from
https://www.eia.gov/todayinenergy/detail.cfm?id=18671 (Date Last Accessed, April, 16, 2016)
Waide, P., & Tanishima, S. (2011). Light’s labor’s lost Policies for energy-efficient lighting.
Paris, France: International Energy Agency.
https://www.iea.org/publications/freepublications/publication/light2006.pdf
(Date
Last
Accessed, April, 16, 2016)
Tables and Figures
Table 1. Efficiencies of 60-Watt Replacement Lamps by Technology
Incandescent
Halogen
CFL
LED
16
20
67
83
Efficiency
(lumens/watt)
Table 2. Phase-Out Schedule of Incandescent Bulbs by Country (In U.S. and Other Countries)
Country
Date
U.S.
Jan/1/2012
Ban on 100-watt bulbs
Jan/1/2013
Ban on 75-watt bulbs
Jan/1/2014
Ban on 40-watt and 60-watt bulbs
Europe
Contents
Sept/1/2011 Ban on inefficient 60-watt light bulbs
Sept/1/2012 Ban on manufacturing and importing incandescent bulbs of any
type
Japan
Terminating production and sales of incandescent bulbs was
recommended by the government.
China
Oct/1/2012
Ban on imports and sales of 100-watt and higher incandescent
bulbs
Oct/1/2014
Ban on 60-watt and higher incandescent bulbs
Oct/1/2016
Ban on 15-watt and higher incandescent bulbs
Table 3. Non-Ambiguous Keywords Assigned to Product Characteristics
Number
Product
Attribute
Keywords
of
Characteristics
Keywords
Search
Bulb price
discount, overpriced, price, pricey
4
Experience
Brightness
brightly, brightness, brighten, bright, output,
5
Color
amber, black, blue, blueish, bluish, color,
22
colored, colors, green, greenish, hue, kelvin,
orange, pink, red, RGB, white, yellow,
yellowish, 2700k, 3000k, 6000k
Credence
Heat emission
heat, hot
2
Efficiency
bill, efficiency, efficient, electricity, energy
5
Longevity
longevity
1
Keyword
warm,
Table 4. Ambiguous Keywords and Applied Rules
Classification
Usage in Sampled Comments and Applied Rules
color
13 comments used warm as color-related, 2 comments
used “warm” as heat-related, 6 comments used “warm”
warmth
as in “warming up”. Thus, the expression “warming up”
is cleaned from the text and “warm” was used as a
color-related keyword. Warmth shows a similar pattern,
so is classified as color-related.
cheap,
price
19 comments used cheap as price-related, 1 comment
used “cheap” to describe production method, 0
cheaply
comments used “cheap” as efficiency-related. Thus,
“cheap” is classified as a price-related keyword.
“Cheaply” is also classified as a price-related keyword
accordingly.
expensive,
Price
17 comments used “expensive” as price-related, 1
comment used “expensive” ambiguously, 2 comments
inexpensive
used “expensive” for other products (not bulbs). Thus,
“expensive” is classified as a price-related keyword.
“Inexpensive” is classified as a price-related keyword
accordingly.
long,
longevity
6 comments used “longer” as physical-length-related, 1
longer
(used as “last
comment used “longer” to describe warming up time, 8
long”)
used “longer” as longevity-related, 5 used “longer” as in
Keyword
Classification
Usage in Sampled Comments and Applied Rules
no longer/any longer. When longer is used to describe
longevity, it is usually combined with last as “last
longer”. Thus, “last long” and variants are used to
capture the usage.
cost, costs
price
13 comments used “cost” as price-related, 3 comments
used “cost” as efficiency-related, 1 comment used
“cost” as longevity-related, 2 comments are ambiguous,
and 1 comment is about return-to-sender cost. When
cost is used to describe efficiency, it is “cost of lighting”
“energy cost” and “cost of operation”. For “energy
cost”, the keyword energy will capture the comment
without using “cost” as an efficiency-related keyword.
“Costs” is classified as a price-related keyword
accordingly.
cool, cooler
not used
7 comments used “cool” as color-related, 3 comments
used “cool” as heat-related, 2 comments used “cool” as
in “cool down”, 9 comments used “cool” as nice. To
avoid noise, “cool” is not used. In case cool is used to
describe color, the sentence usually contains other color
related terms such as “cool white”, so the majority of
such comments would be captured by other colorrelated keywords. Reviews that include cooler show a
Keyword
Classification
Usage in Sampled Comments and Applied Rules
similar pattern, and cooler is not used.
save,
efficiency
14 comments used “save” as efficiency-related, 2
saving,
comments used “save” as price-related, 3 comments
saver,
used “save” for other costs such as travel cost, 1
saved
comment used “save” ambiguously.
temperature
not used
17 comments used “temperature” as color-related, 2
comments used “temperature” as heat-related, 1
comment used “temperature” ambiguously. Thus,
“temperature” can be used to classify color, but in most
cases “temperature” is used as “color temperature”, so it
is unnecessary to use temperature.
forever
reduce
longevity
14 comments used “forever” as longevity-related, 5
(used as “last
comments used warming up time related, and 1 is other.
forever” and
The usage is usually “last forever”. “Last forever” and
variants)
its variants are used.
not used
16 comments used “reduce” for irrelevant other
meanings, and 4 comments used “reduce” as efficiencyrelated. When it is efficiency-related, other terms such
energy, electricity, save are used as well, so other
keywords can capture efficiency-related contents.
durable,
durability
not used
8 comments used “durable” for sturdiness, 12 comments
are ambiguous, and 1 comment used “durable” for
Keyword
Classification
Usage in Sampled Comments and Applied Rules
longevity. When “durable” was used ambiguously, it
was likely to be about sturdiness, but this was unclear.
cold
not used
9 comments used “cold” for color, and 12 comments
used “cold” for other meanings. Since other colorrelated keywords can capture comments, “cold” is not
used.
dollar, buck
price
19 comments used “dollar” for price, and 1 comment
used “dollar” for another meaning. “Buck” is similar.
economical
not used
16 comments used “economical” ambiguously, 2
comments used “economical” for efficiency, and 2
comments used “economical”’ for price. It seems that
customers use economical to mean everything related to
cost in general including longevity, bulk purchase, price,
and efficiency. Due to the ambiguity, “economical” is
not used.
Table 5. Frequency of Comments about Each Product Characteristic, by Bulb Type
Total
LED
CFL
Halogen Incandescent
Price
26.6%
26.2%
24.6%
34.9%
24.0%
Color
26.8%
33.4 %
25.5%
13.0%
19.2%
Brightness
32.4%
39.8 %
30.9%
19.2%
19.3%
Heat emission
6.0%
7.8%
4.3%
4.3%
4.4%
Efficiency
9.8%
11.9%
10.2%
5.4%
6.7%
Longevity
2.6%
2.4 %
3.0 %
3.2%
2.5%
Search Total
26.6%
26.2%
24.6%
34.9%
24.0%
Experience Total
47.3%
57.1%
44.5%
28.7%
33.4%
Credence Total
12.0%
13.7%
12.7%
8.5%
8.8%
N
103,003
58,815
23,551
13,633
15,976
Figure 1. Number of Reviews by Month and by Technology
Figure 2. Fraction of Reviews Discussing Bulb Price
Figure 3. Fraction of Reviews Discussing Bulb Price by Technology
Figure 4. Fraction of Reviews Discussing Experience Attributes
Figure 5. Fraction of Reviews Discussing Credence Attributes
Figure 6. Fraction of Reviews Discussing Efficiency by Technology