UCSD MGT 100 Week 09
Importance of Customer Acquisition
Market size: How many customers experience the core need?
Diffusion: How does the served market change over time?
CLV: How profitable are customer relationships?
Einav et al. (2021) analyzed all US Visa CC transaction data
- >$1T spent in 32B transactions by 428MM cards at 1MM stores from 2016-19
- Assume card~=customer
\(\text{BrandRev.} = \sum spend \equiv \sum \frac{stores}{1} \frac{cards}{stores} \frac{transactions}{cards} \frac{spend}{transactions}\)
Research question: How well does each factor explain brand revenue?
Regressed log revenue on log RHS with merchant and year fixed effects
Weather forecasts
- What to do, what to wear
Stock prices
- Buy or sell, how much, when
Safety
- Where to live, how to transport
Lifespan
- Schooling, savings/spending, work/retire
Product quality
- Buy now/later, return, warranty, insurance
Correlations alone can enable powerful predictions
Causal drivers will improve predictions
- But we do not need to know all the causal drivers to make predictions
- E.g., future behavior often predicted using past behavior
Predictive analytics are not prescriptive or diagnostic
- Predictive analytics are oft misused & overinterpreted
Predictive Analytics typically have wide error bars
- Often ignored by those who don't understand ... and those who do
- Accounting for data variability, estimation error and model uncertainty .. CIs get wide
Market size (\(N\)): # of people who might pay to address the core need in a given time period
- Alternatively measured in $, units or volume
- Noisy but helps inform potential returns to investments
- Typical investor's first question: How big is the market?
$100B market is viewed differently than a $100MM market
- How will you know if you got the right answer?
- What happens if you overestimate market size?
“Marketing myopia:” Neglecting nontraditional competitors, e.g. Zoom v. Uber or Carnival v. Whistler
USA population : ~340 million
Assumption : \(TAM\approx SAM\) (why? pros, cons?)
Assumption : Avg mattress lasts 7 years (pros, cons?)
Market size \(\approx\) 47.1 million people annually
Average mattress price : $283, across all bed sizes
Market size \(\approx\) $13.3B/year
Let’s check Grand View Research & ISPA
\(M\) : Market size (we’ll estimate this)
\(t\) : Time periods
\(A(t)\) : Accumulated sales before time \(t\)
- AKA "installed base"
- A(0)=0 by assumption
\(\frac{dA(t)}{dt}\) : number of new adopters in time \(t\)
\(R(t)\) : Remaining customers who have yet to adopt, \(R(t)\equiv M-A(t)\)
\[\frac{dA(t)}{dt}=pR(t)+q\frac{A(t)}{M}R(t)\]
\(p\) : “coefficient of innovation”
\(q\) : “coefficient of imitation”
- p and q assumed constant
\[\frac{dA(t)}{dt}=pR(t)+q\frac{A(t)}{M}R(t)\]
This is a first-order diffEQ with analytic solution
\[A(t)=M\frac{1-e^{-(p+q)t}}{1+\frac{q}{p}e^{-(p+q)t}}\]
If you have sales data by time, you can use Nonlinear Least Squares to estimate \(p\), \(q\) and \(M\), i.e. choosing parameters to minimize square errors \((LHS-RHS)^2\)
Or, notice that \(\frac{dA(t)}{dt}=pR(t)+q\frac{A(t)}{M}R(t)\) \(=p(M-A(t)) + q* \frac{A(t)}{M}(M-A(t))\) \(=pM + (q-p)A(t)+\frac{q}{M} A(t)^2\)
We can regress \(\frac{dA(t)}{dt}\) on a quadratic in installed base
- If you want, recover p, q & M from the parameter estimates
- (3 equations with 3 unknowns)
Extensions: Multiple markets, hazard models,
types of “influence”
\[\hat{\beta}=(X'X)^{-1}X'Y\]
\[min_\beta (Y-X\beta)'(Y-X\beta)\]
\[min_\beta [X'(Y-X\beta)]^2\]
"In theory, there's no difference between theory & practice. In practice, there is."
Some theorical models offer multiple estimators. Some have no estimators
Estimates often differ due to assumptions and numerical properties
Subfield that invents estimators and studies their properties: "econometrics"
Diffusion depends on Relative Advantage, Perceived Complexity, Compatibility, Observability, Risk, Divisibility (aka “Trialability”)
- Summarized 40 years of research, incredibly influential on practice
- Provided diagnostics to interpret Bass (1969)'s predictive analytics
- E.g., a prototype could be evaluated on these 6 dimensions then modified
- Early example of HARKing but likely useful
Heterogeneity might drive adoption timing
- Adoption driven by consumer preferences, needs, income, risk attitudes?
Markets typically evolve after introduction
- Production becomes more efficient & reliable; costs fall; price may fall
- New features, technology generations, safety improves
- Competitors introduce variants targeting unserved customers
- Network effects, e.g. smartphone compatibility with chargers or accessories
- Complementors, e.g. Verizon stores, iFixIt, Genius Bar
- Consumer preferences, e.g. expected reliability rises over time
Your first job includes a bet on a market
- Mature market: Big, reputable employers, established career tracks
- But markets typically decline at some point
- Growing market: Exciting, high risk, high reward, more opportunity
- But market may not take off as you expect
- Will your skillset enable a transition if needed?
Today’s safe option might not be safe!
- Mature markets will decline
- New markets will grow
- Consider diffusion trends, not just current market size
Best available indicator of unserved market
Google Trends reports search volume indices by keyword, place, time, service
- Also identifies keyword topics, trending terms & related queries
- Samples the query database, reports estimates not actual counts
- Requires a minimum query volume for reporting
- Free, so it could get sunsetted
CLV is the most powerful customer analytics metric
- Expresses the firm's value of an individual customer relationship as the net present value of expected future customer profits
- Pioneered by catalogue retailers in the 1980s
- Has spread widely, but not yet everywhere
- CLV metrics enable quantification & serious discussion of new policies
Housing First has certainly not solved homelessness
- "Chronic" means disabled and unhoused for 1+ yrs, or 4x in 3 yrs
- ~28% of CA homelessness is chronic (2019)
- UT originally claimed 90% reduction, then revised their metric definitions
- Reliable efficacy metrics are rare
- Housing First has been implemented haphazardly
- UT built new apartments. CA cities mostly use shelters, SROs, vouchers
- Key Q: Require wraparound services? E.g. Addiction treatment, etc
- Key Q: Does Housing First somehow encourage homelessness?
I claim: Quantification enables bold policy shifts
- U.S. HUD adopted Housing First as preferred approach to homelessness in 2014
- CLV quantifies policy costs and benefits & enables ex-post evaluations
- We then can use data to refine CLV estimates and policies
Marketing campaigns should be profitable if Avg. Customer Acquisition Cost (CAC) \(<\) CLV
- Caveat: So long as acquired customers have CLV>=avg CLV of existing customers
- Often, managers impose a "fudge factor" as a speedbump
Suppose the tennis club has a chance to pay $20k for a billboard. It will be seen by 100K people with an expected conversion rate of .1%. Should we do it?
Similar “break-even” calculations possible for
- partnerships, opening new stores, price promotions, etc. Anything that requires an upfront outlay to potentially acquire new customers, increase current customer retention, or develop current customer spending
CLV popularity rose alongside CRM data systems
- e.g. Oracle, SAP, Salesforce
- Services and retailers used CLV to set customer experiences: high-CLV flyers got upgraded, high-CLV lodgers got better rooms, high-CLV shoppers got sterling return service and attention, high-CLV callers got shorter wait times and more consideration
In the 00s, consulting firms published claims that 20-30% of customers were unprofitable. Many firms tried to “fire” unprofitable customers
- American Express offered some cardholders $300 to cancel their cards. Best Buy stopped notifying some shoppers about upcoming promotions. Banks used minimum balances and teller fees to drive away some accountholders
Customers talk to each other; firing customers is a brand risk
CRM data may be incomplete, disconnected, error prone
- E.g., can you connect customer identity across different credit card #s?
- Think about measurement error in retention rate
CLV models are predictive analytics, not prescriptive or diagnostic
- CLV contains no customer purchase model
- Heterogeneous CLV-based policies may be self-fulfilling:
Treat someone as unprofitable, they may act that way
Guiding Principles
- Firm & customer both can benefit from higher customer value. CLV metrics for value creation benefits everyone
- Firm & customer have opposing interests in price. CLV for margin extraction may generate perverse customer incentives
- Customer dissatisfaction may reveal CLV flaws; requires careful attention
- Hence many firms now measure point-of-sale satisfaction
Assume price is $100, margin $50, retention 50%,
discount rate 10%, horizon T=1
CLV = 50 + .5 * 50 / 1.1 = $72.72
Suppose you consider increasing price to $120, holding all else constant
Then, CLV = 70 + .5 * 70 / 1.1 = $101.82
Should you raise the price?
What is this analysis missing?
How could we resolve that problem?
Hint: CLV is predictive analytics, not diagnostic or prescriptive
- Suppose you tutor a high school student for $250/month. There is a 10% chance the student will move and find a new tutor in any given month. There are six months remaining in the school year, at which point the tutoring arrangement will definitely end.
- What missing information do you need to calculate the current value of the tutoring arrangement?
Innovation Diffusion and New Product Growth Models (Peres et al. 2009)
Customer-Base Valuation in a Contractual Setting with Heterogeneity (Fader & Hardie 2010)
Exploring the Distribution of Customer Lifetime Value (Fader & Hardie 2017)