Measuring price change for consumer electronics using scanner data Crown copyright © This work is licensed under the Creative Commons Attribution 3.0 New Zealand licence. You are free to copy, distribute, and adapt the work, as long as you attribute the work to Statistics NZ and abide by the other licence terms. Please note you may not use any departmental or governmental emblem, logo, or coat of arms in any way that infringes any provision of the Flags, Emblems, and Names Protection Act 1981. Use the wording ‘Statistics New Zealand’ in your attribution, not the Statistics NZ logo. Liability While all care and diligence has been used in processing, analysing, and extracting data and information in this publication, Statistics New Zealand gives no warranty it is error free and will not be liable for any loss or damage suffered by the use directly, or indirectly, of the information in this publication. Citation Statistics New Zealand (2014). Measuring price change for consumer electronics using scanner data. Available from www.stats.govt.nz. ISBN 978-0-478-42940-4 (online) Published in November 2014 by Statistics New Zealand Tatauranga Aotearoa Wellington, New Zealand Contact Statistics New Zealand Information Centre: [email protected] Phone toll-free 0508 525 525 Phone international +64 4 931 4600 www.stats.govt.nz Contents List of tables and figures ................................................................................................... 4 1 Using scanner data in the CPI ...................................................................................... 5 Consumer electronics categories that use scanner data ................................................ 5 Contribution to the CPI .................................................................................................... 5 2 Scanner data from market research company GfK ................................................... 6 3 Benefits of using scanner data for consumer electronics........................................ 8 Scanner data allows more accurate price measurement ................................................ 8 Scanner data allows us to re-use existing data............................................................... 8 Scanner data accurately reflects seasonality in quantities ............................................. 8 Scanner data reflects product substitution .................................................................... 10 4 New methods to create price indexes ....................................................................... 11 The need for new methods ............................................................................................ 11 Rolling year GEKS (RYGEKS) index ............................................................................ 11 Imputation Törnqvist rolling year GEKS (ITRYGEKS) index......................................... 12 5 Implementing the ITRYGEKS index in production................................................... 14 Deriving the quarterly index from quarterly average prices and expenditure shares ... 14 Using two months of the quarter for production ............................................................ 14 Integration with other changes arising from the 2014 CPI review ................................ 15 References ........................................................................................................................ 16 3 List of tables and figures List of tables 1. Number of characteristics in each category (August 2014) ........................................ 6 2. Number of distinct products in each category (August 2014) ..................................... 7 List of figures 1. Seasonality of quantity, expenditure, and average price – digital cameras................ 9 2. Quality-adjustment of the price index – digital cameras ........................................... 10 3. Data incorporated into each quarterly index at time of production ........................... 15 4 1 Using scanner data in the CPI Measuring price change for consumer electronics using scanner data describes the methodology we introduced for creating price indexes from consumer electronics scanner data. From the September 2014 quarter, we incorporated retail transaction data, or ‘scanner data’, into the consumers price index (CPI), to measure price change for consumer electronics categories. Consumer electronics categories that use scanner data We use scanner data for 12 consumer electronics categories in the CPI: heat pumps desktop computers laptop computers tablet computers multi-function devices cellphone handsets digital cameras digital camera memory cards television sets set-top boxes for television sets DVD, Blu-ray players and player/recorders home theatre and stereo systems. Contribution to the CPI In terms of expenditure weighting: heat pumps contribute a fifth of the ‘major household appliances’ class, which contributes 0.71 percent to the all groups CPI expenditure weight for the June 2014 quarter cellphone handsets contribute over 90 percent of the ‘telecommunication equipment’ class, which contributes 0.29 percent to the all groups CPI the remaining 10 categories contribute four-fifths of the ‘audio-visual and computing equipment’ subgroup, which contributes 1.16 percent to the all groups CPI. 5 2 Scanner data from market research company GfK The market research company GfK supplies us with the scanner data. GfK adds information about product features, or characteristics, to the data collected from retailers. Total monthly sales values and quantities sold are available, across retailers, for each product, along with extensive information about the characteristics of each product. The scanner data represents hundreds of thousands of transactions each month. The number of characteristics in the data range from 10 for digital camera memory cards to 77 for digital cameras, as shown in Table 1 for August 2014, the latest month data is available. The coverage of the data is between 80 and 95 percent of transactions for most of the categories, with lower coverage for some categories, such as heat pumps. Table 1 Number of characteristics in each category (August 2014) 1. Number of characteristics in each category (August 2014) Category Heat pumps Characteristics 27 Desktop computers 56 Laptop computers 71 Tablet computers 73 Multi-function devices 53 Cellphone handsets 59 Digital cameras 77 Digital camera memory cards 10 Television sets 62 Set-top boxes for television sets 49 DVD, Blu-ray players, and player/recorders 50 Home theatre and stereo systems 62 6 Measuring price change for consumer electronics using scanner data The number of distinct products represented in the data for each category, in August 2014, is shown in Table 2. Table 2 Number of distinct products in each category (August 2014) 2. Number of distinct products in each category (August 2014) Category Heat pumps Products 72 Desktop computers 107 Laptop computers 445 Tablet computers 148 Multi-function devices 102 Cellphone handsets 392 Digital cameras 228 Digital camera memory cards 254 Television sets 325 Set-top boxes for television sets 24 DVD, Blu-ray players, and player/recorders 129 Home theatre and stereo systems 224 7 3 Benefits of using scanner data for consumer electronics We have moved to using scanner data to measure price change for consumer electronics because it: allows more accurate price measurement allows us to re-use existing data accurately reflects seasonality in quantities reflects product substitution. Scanner data allows more accurate price measurement Until now, we have relied on sampling consumer electronics prices across several dimensions – categories, products, outlets, and time. For each consumer electronics category, a product was priced at each of about 60 appliance retailers and department stores. When a product was no longer available, we replaced it with a similar product, based on discussion with retailers about market share and features. We based quantities, or expenditure shares, on information we acquired during the Household Economic Survey reference period, and updated every three years. In contrast, scanner data has the potential to give a more complete picture of both prices and quantities sold at any point in time. With information on the characteristics of each product, we are also able to use statistical methods to explicitly quality-adjust the price indexes. Information on region of sale is not available in the data, so we will use national movements in each of the regions. Scanner data allows us to re-use existing data Market research companies already collate consumer electronics scanner data for businesses, so it is good practice to re-use the data to generate official statistics. This reduces fieldwork, and the respondent load associated with collection, which involves observing products and prices in stores, and discussing product changes with store staff. Scanner data accurately reflects seasonality in quantities Scanner data for consumer electronics has monthly information on both prices and quantities. Quantities can be seasonal, as shown in Figure 1 for digital cameras. Correspondingly, total expenditure on digital cameras is also highly seasonal. Note that average price, unadjusted for quality change, also shows seasonality, corresponding to cheaper cameras being bought around the Christmas period. 8 Measuring price change for consumer electronics using scanner data Figure 1 1. Seasonality of quantity, expenditure, and average price – digital cameras Seasonality of quantity, expenditure and average price - digital cameras July 2008 to June 2011 Base: July 2008 month (=1) Index 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0 J A S O N D J F M A M J J A S O N D J F M A M J J A S O N D J F M A M J 08 09 10 11 Total expenditure Average price Quantity sold Note: This graph is based on earlier research data (Statistics New Zealand, 2012). Note that each of the three series is expressed as an index and based to 1 in July 2008, for confidentiality reasons. Source: Statistics New Zealand The current fixed-basket approach to price measurement, when applied to seasonal prices and quantities, has the potential to over- or under-state actual price movements when we combine seasonal prices with quantities that were fixed at an average annual level. Figure 2 shows that when we appropriately incorporate the seasonal prices and quantities of digital cameras using the imputation Törnqvist rolling year GEKS (ITRYGEKS) index (discussed in Chapter 4), the resulting quality-adjusted price movement is no longer seasonal. 9 Measuring price change for consumer electronics using scanner data Figure 2 2. Quality-adjustment of the price index – digital cameras Quality adjustment of the price index - digital cameras July 2008 to June 2011 Base: July 2008 month (=1) 1.2 Index 1.0 0.8 0.6 0.4 0.2 0.0 J A S O N D J F M A M J J A S O N D J F M A M J J A S O N D J F M A M J 08 09 10 11 ITRYGEKS index Average price Note: It is possible, and valid, for quality-adjusted price indexes for some goods (eg fresh fruit and vegetables) to still display seasonality even after appropriate incorporation of seasonal quantities. Source: Statistics New Zealand Scanner data reflects product substitution Because scanner data provides prices and quantities for the most detailed level of product specification (ie the barcode level), we can use it to incorporate new products in the index at the time they are introduced, and to reflect their relative importance based on actual quantities sold. We can also use scanner data to empirically test substitution effects across different product categories, to infer the appropriate level to fix expenditure shares between CPI basket and weight reviews. 10 4 New methods to create price indexes To produce price indexes from scanner data, we are using the imputation Törnqvist rolling year GEKS (ITRYGEKS) index. The ITRYGEKS is an extension of the rolling year GEKS (RYGEKS) index. This chapter provides more detail about the: need for new methods rolling year GEKS (RYGEKS) index imputation Törnqvist rolling year GEKS (ITRYGEKS) index. The need for new methods Consumer electronics is a rapidly changing product class, so it is particularly important that we use methods that will appropriately adjust for the change in quality of the products purchased. In addition, because consumer electronics products can have short life cycles, it is important to introduce new products into the index in such a way that the implicit price movement associated with their introduction is appropriately reflected in the index. That is, if a new product has a low introductory price, relative to its set of features, then the price index should reflect this price decrease. In addition to this requirement for appropriate quality adjustment, the CPI is nonrevisable. This places another constraint on methods for creating price indexes from scanner data using statistical models, where the models are generally based on a past window of data – usually a year – ending with the most recent quarter. The two key reasons why traditional index number methods do not work well in the case of scanner data are: the high level of ‘churn’ – products appearing and disappearing from the market volatile prices and quantities due to discounting, which leads to a bias called ‘chain drift’ when chained superlative indexes such as a chained Törnqvist are used to continually update the basket in the presence of high churn. Over the past five years, we have collaborated on research to determine an index method that is appropriate for producing non-revisable, quality-adjusted price indexes from scanner data. Rolling year GEKS (RYGEKS) index Ivancic, Diewert, and Fox (2011) proposed a method for producing price indexes from scanner data that uses all the prices and quantities in the data, and is free of chain drift. Called the rolling year GEKS (RYGEKS) index, it is based on the Gini, Eltetö and Köves, and Szulc (GEKS) index used for multilateral spatial price indexes such as purchasing price parities – which compare prices in different countries at a point in time. Within a window of time (usually just over one year – ie five quarters for a quarterly index, or 13 months for a monthly index), the RYGEKS index between periods t1 and t2 is the geometric mean of all the superlative bilateral indexes (such as the Törnqvist index or, as used in Ivancic et al (2011), the Fisher index) between: t1 and all the other periods in the window, and t2 and all the other periods in the window. The monthly RYGEKS, based on a 13-month rolling estimation window, is as follows: 11 Measuring price change for consumer electronics using scanner data For the first window (ie T=0 to 12), the RYGEKS index is equal to the GEKS index: 0T RYGEKS P P 0T GEKS 12 P P 0t tT 1 13 (equation 1) t 0 Where P ij is any superlative index (eg a Törnqvist index) between periods i and j. From t=13 onwards, RYGEKS links on the most recent movement from the GEKS calculated on the next window (ie from t=1 to 13, then from t= 2 to 14, and so on) as follows: 13 0,13 0,12 PRYGEKS PGEKS P12,t Pt ,13 1 13 t 1 14 0,14 0,13 PRYGEKS PRYGEKS P13,t P t ,14 12 P 0t Pt ,12 t 0 1 13 P 13 1 13 12, t Pt ,13 1 13 t 1 (equation 2) t 2 and so on. However, a limitation of the RYGEKS method is that it does not reflect the implicit price movements of new or disappearing products entering or leaving the market. For example, if the initial price of the latest model of a cellphone is high relative to its set of features, then this implicit price increase is not reflected in the RYGEKS index. Imputation Törnqvist rolling year GEKS (ITRYGEKS) index Jan de Haan, of Statistics Netherlands, proposed an extension of the RYGEKS index, called the imputation Törnqvist rolling year GEKS (ITRYGEKS). A paper on the method by de Haan and Statistics NZ senior researcher Frances Krsinich was recently published in the Journal of Business and Economic Statistics (de Haan and Krsinich, 2014). The ITRYGEKS method is an extension of the GEKS method. Like the GEKS, the ITRYGEKS can utilise all the information in the data, while remaining free of chain-drift. In addition, it reflects the implicit price movements of new and disappearing products by imputing price movements based on statistical modelling of the relationship between price and product features. Unlike the RYGEKS method described above, which is based on superlative indexes ij such as the Törnqvist or Fisher (ie the P in equations (1) and (2), the ITRYGEKS index is based on ‘bilateral time-dummy hedonic’ indexes. The ‘Törnqvist’ in its name refers to the fact that it is algebraically equivalent to a Törnqvist index based on real and predicted prices, as shown in equation (4). A bilateral time-dummy hedonic index between any periods 0 and t is derived from a statistical regression model based on the data from periods 0 and t.1 The estimating equation for the bilateral time-dummy hedonic regression model is: K ln pit t Dit k z ik it (equation 3) k 1 1 This can be generalised to any two periods i and j. 12 Measuring price change for consumer electronics using scanner data Where: ln pit is the log of the average monthly price for product i in period t. is the intercept term. Dit has the value 1 if the observation relates to period t (t ≠ 0) and the value 0 if the observation relates to period 0. z it is the quantity of the kth characteristic, or feature, for product i.2 By definition, this will be the same in both periods.3 it is an error term with an expected value of 0. Since the model described by equation (3) controls for changes in the product characteristics, exp is a measure of quality-adjusted price change between periods 0 and t (de Haan and Krsinich, 2014). t In addition, when the average4 expenditure shares are used as weights for the matched products, and half of the expenditure shares for the unmatched products (in the periods they are available), the ITRYGEKS index from period 0 to t can be expressed as follows: 0t PITRYGEKS p iU 0 t p t i 0 i si0 sit 2 pˆ iU D0 t p t i 0 i si0 2 p iU N0 t pˆ t i 0 i sit 2 (equation 4) Where: U 0t is the set of matched products i with respect to periods 0 and t. U D0t is the set of 'disappearing' products i with respect to periods 0 and t – that is, products that exist in period 0 but not in period t. U N0t is the set of 'new' products i with respect to periods 0 and t. s it is the expenditure share of product i in period t. Equation (4) demonstrates that the ITRYGEKS index is equivalent to a Törnqvist index for the matched products, and for new and disappearing products, the ITRYGEKS index applies a Törnqvist formula to prices predicted from time-dummy hedonic models for the period in which there is no price available. The derivation of (4) is given in de Haan and Krsinich (2014). 2 For categorical characteristics, such as those on the consumer electronics scanner data, each characteristic k will be represented by a set of dummy variables corresponding to all possible values of characteristic k – that is, variables which are set to 1 (or 0) in the presence (or absence) of that value of the characteristic. 3 A product corresponds to a distinct set of characteristics, or features. Therefore, any change in characteristic would result in a different product. 4 Across both time periods. 13 5 Implementing the ITRYGEKS index in production This chapter summarises key decisions we made before using the ITRYGEKS index to produce price indexes for consumer electronics categories in the CPI: deriving the quarterly index from quarterly average prices and expenditure shares using two months of the quarter for production integration with other changes arising from the 2014 CPI review Deriving the quarterly index from quarterly average prices and expenditure shares We receive monthly scanner data from GfK, but the New Zealand CPI is a quarterly index. We can either derive the quarterly index from a monthly index, or pre-aggregate the data to a quarterly level before deriving a quarterly index. While it is useful to produce a monthly index as part of the monitoring and analysis process, it is conceptually more appropriate to derive the quarterly index from quarterly average prices and expenditure shares. This ensures the prices for products sold in each month of the quarter are appropriately weighted for price deflation, to produce quarterly volume indexes in the National Accounts. Using two months of the quarter for production Consumer electronics data for just the first two months of the quarter are available in time to incorporate into the CPI. There are four options for how to deal with this limitation in production. Using back-data, we assessed each of these options against the benchmark of the index we could calculate if all three months of the quarter were available. The four options were to: 1. base the published index for the most recent quarter on the first two months of the quarter, with complete back-data feeding into the estimation (ie the third month of the quarter will be incorporated into the five-quarter estimation window for the following quarter’s index calculation) 2. base the published index on three months of data, lagged by one month 3. base the published index on only the middle month of each quarter 4. base the published index on only the first two months of each quarter. Option 1 performed best of all four options, and sits very close to the benchmark index. Therefore, we derived the ITRYGEKS index for the latest quarter from the first two months of that latest quarter, with the third month of the quarter then being incorporated into the five-quarter estimation window used to calculate the following quarter's CPI, as shown in Figure 3. 14 Measuring price change for consumer electronics using scanner data Figure 3 Data incorporated into each quarterly index at time of production 3. Data incorporated into each quarterly index at time of production 2013Q3 2013Q4 2014Q1 2014Q2 2014Q3 2014Q4 2015Q1 2014Q3 2014Q4 2015Q1 Figure 3 shows that, for example, to calculate the quarterly price movement for the third quarter of 2014 (ie the top row), we use full quarterly data for each of the quarters from quarter three of 2013 through to quarter two of 2014, and the first two months of data for the third quarter of 2014. Integration with other changes arising from the 2014 CPI review We implemented the ITRYGEKS index in production along with the CPI basket and weight changes arising from the 2014 CPI Review. The basket has been realigned to reflect the consumer electronics categories that we are measuring using scanner data. The price movement used in the CPI for the June to September 2014 quarters is a likefor-like movement based on the scanner data, linked in on the new price reference period – the June 2014 quarter. Initially, we fixed the expenditure shares at the item level of the CPI basket for consumer electronics categories. However, we will review this decision in the future to determine whether we will use information in the scanner data to reflect substitution across consumer electronics categories, between the three-yearly CPI basket and weight reviews. 15 References de Haan, J, & Krsinich, F (2014). Scanner data and the treatment of quality change in nonrevisable price indexes. Journal of Business and Economic Statistics, 32(3). Available from www.tandfonline.com Ivancic, L, Diewert, WE, & Fox, KJ (2011). Scanner data, time aggregation and the construction of price indexes. Journal of Econometrics, 161(1). Available from www.sciencedirect.com Statistics New Zealand (2012). A fresh look at patterns in gadget sales. Economic News, April 2012. Available from www.stats.govt.nz 16
© Copyright 2026 Paperzz