MGT 100 Week 1
“The greatest value of a picture is when it forces us to notice what we never expected to see.”
- Boxplot Inventor John Tukey
- Great visualizations raise new questions
describes people, entities, and activities involved in producing, compiling, transforming & sharing data
- Verifiability requires source data & cleaning scripts
Crucial in determining quality, reliability, trustworthiness
- Investigating provenance requires and deepens domain expertise
- Typically turns up unexpected information, and sometimes errors
When can you trust provenance information?
- Best to treat provenance as hypotheses to be verified in the data
- Usually, provenance descriptions are missing or imperfect
- Financial info is most likely to be accurate due (accounting, auditing)
- Considerations: Consent; Privacy; Missingness; Permissible uses
- Sometimes, provenance descriptions are marketing documents
Data Origin, Measures, Quality, Verifiability
- How were the data produced?
- Who collected it, when and why?
- How is each variable measured? (surveys, APIs, sensors, etc; units)
- Is there a data dictionary or changelog?
- How can I verify individual datapoints?
- Certified by any third party? Who relies on them, for what?
Processing & Transformation
- What cleaning or manipulation was done?
- Who performed it and why? Can I see their scripts?
- Were missing values imputed? How?
- How were outliers handled?
Eyeballs can interpret pictures quickly
- Human brains are great at interpreting visual patterns, predictions
- Can detect unknown errors
Understandable to managers
- Viz choices enable narrative understanding, another common brain function
Viz are lingua franca across disciplines
- Easily replicated -> more easily trusted
- Trust can is a major issue in some corporate cultures
Viz succeed when they raise deeper questions
- Asking the next question indicates acceptance
- "I wonder why that is...." or "Maybe that's because..."
- Viz usualy won't settle the matter; hence, the first step
Usually start axes from 0, choose scales judiciously
Show all relevant data, label accurately. Don’t edit, trim
Discretize and smoothe judiciously
No Customers, No Business
No Customers ->
No Revenue ->
No Profit ->
No Business
QED
More Customers, More Profits
More Customers ->
More Revenue ->
More Profit
QED
- These are empirical tendencies, not logical necessities
- Individual customers can be unprofitable if (price-cost)<0
Marketing Objective: Maximize Long-term Profits
- Long-term focus mostly aligns our interest with customers
- Goal is not only to acquire customers; also, keep and develop them
- Sidesteps or reduces most ethical dilemmas
- Short-run objective may profit before liability; I won't teach
Consumer surplus
- Suppose customer pays $20/month, watches 60 hours: $0.33 per entertainment hour, or $0.13/hour with ads, or less with shared accounts
- A la carte rentals are more like $1-2+/hour
- Non-video entertainment tends to be much more expensive
- Social benefits may accrue from shared viewing
- Other streaming services may be competitive
Producer surplus
- NFLX spent $13B on content in 2023, 230 million subscribers, about $4.70/user/month
- Cost structure: High fixed, low marginal
- Ads likely earning around $4-5 per recipient/month
- Net profit margin about 20-22% in a competitive category
Businesses have 4, and only 4, ways to make money:
Acquire, develop, retain and “fire” customers
This is called Customer relationship management (CRM): week 9
Marketing mix (“4 P’s”): Improve product offerings, prices, promotion, distribution
Incorporate customer heterogeneity for targeting, personalization, recommendations, product development…
Privacy and security, e.g. misuse, theft, regulatory compliance
CONSUMER PANEL DATA
- The Consumer Panel Data include longitudinal data beginning in 2004 with annual updates. These data track a panel of 40,000–60,000 US households and their purchases of fast-moving consumer goods from a wide range of retail outlets across all US markets.
RETAIL SCANNER DATA
- Retail Scanner Data consist of weekly pricing, volume, and store environment information generated by point-of-sale systems from more than 90 participating retail chains across all US markets. Data begin in 2006 and include annual updates.
Do analytics make or justify decisions?
High- or low-trust environment? Tolerance for uncertainty?
- You have limited credibility. You may only get a few strikes
Do messengers get rewarded or shot?
Are data available and integrated?
Do teams work together or compete?
Analytics matters more in B2C than B2B (why?)
Selection effects are usually large
treatment effects are usually small
Key exceptions: Price or "free" giveaways
Demographics don’t predict behavior very well
Agencies lie about data sometimes
“If it’s written in LaTeX, it’s probably correct”
- I am assuming you read the syllabus carefully
Survey the field broadly, pointers for deeper learning
Focus on conceptual understanding:
“When it comes to LLMs, skillful prompting leaves amateurs in the dust.”
Strict communication policies
1. Website for class content
2. Canvas for study groups & grades
3. Piazza for all asynchronous interaction. No email or canvas messages
4. After class, break or office hours for live discussions
Read the syllabus carefully
Attend and contribute as suggested in the syllabus
Budget 5-10 hours/week
Between classes:
1. Step through script carefully, understand everything.
2. Do homework questions on your own
3. Check homework with group, resolve differences
4. Monitor Piazza and read for the next class
5. Compile notes and homework answers to facilitate exam prep
We assign attending students to study groups in week 2
It is OK to share homework scripts and answers
- We do not collect or assess homeworks.
- We do not provide homework answers.
- Exam questions will test your familiarity and understanding of homework answers. More details forthcoming
We encourage you to use Gen AI thoughtfully; we use it too
- LLMs are poor substitutes for human understanding
- Be advised, you may get what you pay for
- Free models are sometimes worse than useless
Common language helps communication
Core need: identifiable problem a customer wants to solve. Could be functional, emotional, social, profit-motivated, etc. Related: desire, want, pain point
Core benefit: Customer’s desired outcome of a purchase. E.g., commuters need to get to school, not necessarily cars
Consumer: Entity that experiences the core benefit
Customer: Entity that purchases and pays
Product/service/experience: Distinct offering that provides the core benefit
Features: Aspects of a product that provide additional tangible or intangible benefits
Value proposition: utility( Core benefit + features - price )
Contribution margin: Price — marginal cost
Competitor: Any paid or free alternative that addresses the core need. E.g., commute by bike, walk, bus, trolley, Uber, scooter, skateboard; work from home
Market: Potential customer group with common core need
Segment: Distinct subgroup of similar customers
Targeting: Which segment(s) a firm tries to serve
Positioning: Specification of product features to suit targeted segments
Marketing: Practice of meeting customer needs profitably
- Marketing: Business discipline that focuses most on customers
- Ads & sales: Worthless without good value prop and positive margin
- Poor implementation commonly leads to confusion with bullshit ("persuasive speech without regard for the truth" --Frankfurt 2005)
3/4/5 C’s: Customer, Competitor, Company;
Context; Complementors
STP: Segmentation, Targeting, Positioning AKA Marketing Strategy
4/…/10 P’s:
Price, Product, Promotion, Place AKA distribution AKA Marketing Tactics
- You need to know these well if you interview for marketing roles
- Generations of marketing professionals were educated to think this way, e.g. MGT 103 and Harvard MBAs
- Still relevant, but less, thanks to customer data abundance & analytics
“The good news about computers is that they do what you tell them to do. The bad news is that they do what you tell them to do.”
Conjecture:
(debugging difficulty) is exponential in (lines of code)
We can code fast or slow
Good habit: Test chunks as you code
Test = Manipulate text, verify output matches expectation
“Go slow to go fast”
y <- f(g(x)) is the same as
y <- x |>
g |>
f
Why?
Old pipe was %>% ; remains widely used
Customer analytics :
Using customer data to improve decisions
Data Viz should be the first step in customer analytics
Marketing : Meeting customer needs profitably
Analytics types:
Descriptive, Diagnostic, Predictive, Prescriptive
Summarize, select, filter, arrange, mutate, group_by