Marketing Strategies for Retail Customers Based

Marketing Strategies for Retail Customers
Based on
Predictive Behavior Models
Glenn Hofmann
HSBC
Salford Systems Data Mining 2005
New York, March 28 – 30
0
Objectives
• Inform about effective approach to direct marketing in retail:
– Creation of single activity score from comprehensive data.
– Effect on top line sales and ROI.
– Ease of customer targeting and strategy development.
• Get feedback and ideas for improvements.
1
Agenda
1. Background
2. Goals and vision
3. Data preparation
4. CART Modeling
5. Marketing strategies
6. Marketing results
2
Background: HSBC
• Second largest financial services organization in the world.
• 232,000 employees in 76 countries.
• Over 110 million customers worldwide.
• US$1,154 billion in assets (June 2004)
• Top 10 credit card issuer in U.S.
www.hsbc.com
3
Background: Retail and store credit cards
• Retailers employ credit cards to
– Increase sales (buy now – pay later).
– Collect customer information.
– Provide additional revenue stream.
• 20 – 70% penetration (sales on store-branded cards).
• About 50% convenience users, 50% revolvers.
4
Background: Marketing challenges
• Increase sales at major events
(holidays, back-to-school, end-of season-sales).
• Decrease attrition
– Rich initial offer causes many customers to use card only once.
– Large turn-over due to competition.
• Reactivate inactive customers.
• Keep high spenders loyal.
5
Goals
Desired characteristics for analytic tools:
• Enable accurate targeting of relevant segments (high lift).
• Ease of use for marketers at all levels.
• High ROI from direct mail campaigns.
å Effective marketing strategy and execution.
6
Vision: Score idea
Credit risk:
• Financial + credit card
related account and
transaction data
• Demographic data
Single number
per customer
FICO
score
Direct
Marketing:
• Account information
• Transactions
• Demographic data
Single number
per customer
AIA
score
7
Data preparation: Customer-level information
Account
- Opening date
- Payment history
- Payment problems
- Credit limits
- Blocks
- Name, Address
Transactions
Item-level purchases
and returns with
- Date, Store
- SKU/UPC, Amount
- Department
Demographics
(3rd party append)
- Age
- Gender
- Income
- Family status
- Children
Cluster code
(3rd party
append) E.g.,
- Personicx
- Cohorts
- Prism
- MOSAIC
Summarize on
customer level
Transform, prepare, combine, create new variables
Hundreds of customer-level variables
8
Data preparation: Criteria for variables
• Predictive value
– Make key variables more predictive through adjustments
(e.g. gender, activity criterion).
– Continuous better than discrete.
• Robustness w.r.t. monthly and seasonal changes
– Data quality: Classification not based on missing/nonmissing
(e.g. demographic story) or other volatile information (e.g. account balance).
– Standardize seasonal data, examples:
Purchases in last 3 months (Dec. vs. March)
Purchases since 1/1/2002
å Standardize by values for all customers
(but not other variables: e.g. number of visits in lifetime of account).
9
Data preparation: Additional predictors
Individual customer-level variables
Summarize by store
Store-based :
- Square footage
- FTE
- Sales
Summarize by zip
Zip-level census:
- Mean income
- # households
- Pop. density
Summarize by cluster
Cluster-level:
- Mean income
- Net worth
- # children distr.
Reattach categorized information to each customer
Deviations of individuals
from category summaries
Final set of predictor variables (several thousand)
10
Data preparation: Summaries by category
Categories: store, zip code, household cluster
Continuous variables
(spend, #visits, age)
å
Mean, Median, Standard Deviation
Discrete variables
(gender, activity level,
rewards level)
å
Absolute and relative frequencies
for each possible value
(infrequent values grouped in “other”)
11
CART modeling: General tips
• Work on random samples (100,000 customers).
• Set categorical variable penalties.
• Reduce default depth level and number of nodes.
• Use random subset for validation.
12
CART modeling: Implementation
1. Model subsets of predictors (customer, store, zip, cluster).
å Find relevant variables (importance).
Put all relevant variables in single set.
2. Create final model (final list of variables).
å Make set with needed variables only (smaller file).
3. Use bagging to create grove file.
4. Score entire population
(average predicted probabilities from multiple trees).
13
CART modeling: Issues and paradoxes
• Categorical variables can cause trouble if in top nodes.
å Play with removal and penalties.
• Joining subset of predictors å unexpected behavior (limitation of CART)
Store summaries
Individual demographics
30% error
30% error
Individual demographics and store summaries
35% error
“Solution”: Play with removal of variables
Other ideas?
14
CART modeling: Results
Score expresses likelihood of customer to use the card within the next 12 months.
Scores distribution
Classification matrix
(using .35 score cut-off point)
Predicted
True
Active
Inactive
Active
90%
10%
Inactive
14%
86%
Gains chart
(for identifying future active customers)
Important predictor variables?
15
Vision: Score idea
Credit risk:
• Financial + credit card
related account and
transaction data
• Demographic data
Single number
per customer
FICO
score
Direct
Marketing:
• Account information
• Transactions
• Demographic data
Single number
per customer
AIA
score
Proxy for many
response models
• More accurate than RFM
• Easier to use than RFM
Effective driver of
marketing strategy
and selection
16
Vision: characteristics
Advantages: One score instead of multitude of scores for different
products (campaigns)
- People with higher credit score are less risky.
- People with higher AIA score are more active.
Challenges:
- People with high credit score are not always most profitable.
- People with high active score do not always give largest
incremental in marketing campaigns.
Scope:
- FICO for all credit products
- AIA limited only by availability of transaction data
• Currently: merchant and tender (PLCC) specific
• Potential: product segment specific
(apparel, power sport, etc.)
17
Marketing: Single score implementation
Recuperating inactive customers (x+ months without purchase):
å High all-information activity scores.
Special events: Detection of customers with high incremental
spend potential
å Highest AIA scores (strong correlation with campaign response).
Detection of low-incremental customers (marketing exclusion):
å Low scores.
18
Marketing: Special strategies (with further information)
Detection of high-potential customers among inactives:
å High demographic-potential activity scores, lower
all-information activity scores.
Attrition prevention (early detection):
å Active customers with low or declining AIA score, but
reasonable past purchase behavior or demographic potential.
Early loyalty enrolment (Gold card + soft benefits):
å High loyalty score (propensity to meet rewards threshold in near
future).
Conversion of high spenders from competition or other tender:
å Profitable customers with less frequent store visits / card use than
indicated by demographic potential.
19
Results: Seasonal sales event
Incremental contribution of direct mail
Only implementation difference:
Which set of 300,000 customers is
targeted by mail piece.
Confidentiality note: Dollar values are fixed multiples of actual values.
20
Results: Reactivation mailing
Incremental contribution of direct mail
* Model selection from customers 8+ months inactive, using highest all-information active scores.
** Selection by straightforward inactivity criterion, selecting all customers 8, 10 and 11 months inactive.
Confidentiality note: Dollar values are fixed multiples of actual values.
21
Results: Early rewards enrolment
Confidentiality note: Dollar values are fixed multiples of actual values.
22
Results: Early rewards enrolment
Confidentiality note: Dollar values are fixed multiples of actual values.
23
Summary
1. Predictive scores for customer behavior can
•
•
•
•
Simplify effective selection.
Enable powerful direct marketing strategies.
Increase top line sales and ROI substantially.
Potentially provide entire industry with powerful std. tool.
2. Comprehensive customer data ensures modeling success
• Reattached category summaries substantially improve model.
3. (Once developed,) scores are easy to use for marketers.
24
Audience feedback
I welcome
• Ideas for improvements.
• Information on other approaches.
• Your experiences.
[email protected]
25