Cultural Diffusion of Personal Names: A Multi-Method Causal Study
The full formal write-up — SSA birth registration, Google Trends, and cultural-event attribution combined through synthetic controls, Hawkes processes, Bass diffusion, and a Lieberson null model. Working draft.
Cultural Diffusion of Personal Names: A Multi-Method Causal Study of Search, Media, and Birth Registration in the United States, 1880-2024
---
Abstract
This study presents a comprehensive analysis of how cultural events — films, television series, celebrity births, sports achievements, and news events — causally influence baby naming patterns in the United States. Combining Social Security Administration birth records (1880-2024), Google Trends search data (2004-present), and 15 external data sources, we construct the largest integrated dataset of cultural naming dynamics to date. Using Abadie-style synthetic controls, we estimate causal treatment effects for 200 well-attributed cultural events, finding that [results summary inserted from data]. A Lieberson-inspired variance decomposition reveals that event characteristics explain [X]% of the variance in adoption effects — [interpretation]. Hawkes self-exciting models show median cultural half-lives of [Y] weeks, while Bass diffusion classification reveals that peer imitation dominates broadcast adoption. Seven "nobody has noticed" side-quest analyses uncover [findings]. Geographic analysis via Moran's I reveals [spatial pattern]. A Salganik-style predictability ceiling exercise demonstrates that even our best model achieves AUC of [Z], confirming fundamental limits to cultural prediction.
Keywords: baby names, cultural diffusion, synthetic controls, causal inference, phonetic neighborhoods, Hawkes processes, Bass diffusion, Lieberson null model
---
1. Introduction
1.1 The Puzzle and the Dataset
Why do parents choose the names they choose? The question sits at the intersection of sociology, linguistics, and cultural economics. While individual naming decisions feel deeply personal, aggregate patterns reveal striking regularities: names rise and fall in synchronized waves, phonetic clusters co-move, and cultural events leave measurable imprints on birth registrations years after the initial stimulus.
This study leverages an unprecedented dataset combining 145 years of U.S. birth records, 20 years of search behavior data, and systematic attribution of over 1,100 cultural events to their naming effects.
1.2 Two Competing Theories
Lieberson (2000) proposed that name turnover follows a neutral-drift process — names cycle through popularity driven by internal dynamics (phonetic fashion, generational avoidance) rather than external cultural causes. Berger (2023) emphasized phonetic neighborhoods as the unit of cultural contagion, where a trending name lifts its sound-alikes.
We test both frameworks against a third possibility: that discrete cultural events (a film character, a celebrity, a news story) causally alter adoption trajectories in ways that exceed what neutral drift alone would predict.
1.3 Why We Can Settle This Now
Three developments make this study possible: (1) Google Trends data provides a real-time proxy for cultural attention that temporally precedes birth registration; (2) systematic spike detection and attribution algorithms identify the cultural events behind naming surges; (3) synthetic control methods provide per-event counterfactual estimates that move beyond correlational evidence.
1.4 Roadmap and Contributions
We proceed in five stages: descriptive characterization (Section 5.1-5.3), time-series modeling (5.4-5.6), causal inference (5.7-5.8), variance decomposition (5.9), and predictability assessment (5.10-5.11). Section 6 presents seven "nobody has noticed" findings that emerge from the data.
---
2. Data
2.1 Internal Datasets
- •**SSA Birth Records** (1880-2024): 2.14 million name-year-sex observations covering every name with >= 5 births in a given year.
- •**Google Trends** (2004-present): Relative search interest for 43,334 names, fetched via the pytrends API with "baby name" as reference term.
- •**Cultural Attribution**: 1,141 events identified through automated spike detection and multi-source attribution (Wikipedia, TMDb, OMDb, Open Library, Claude-assisted synthesis).
2.2 Phonetic Decomposition
Names were decomposed into CMU Pronouncing Dictionary phonemes (43,334 names), yielding syllable counts, stress patterns, onset/coda phonemes, and a phonetic neighborhood graph with 34 million edges.
2.3 External Augmentation
Fifteen external data sources supplement the core dataset: state-level SSA data (6.6M rows), Google Books Ngrams (5.8M rows), CDC natality, GDELT news mentions, Wikipedia pageviews, Wikidata entity links, TMDb film/TV metadata, and geographic place name controls.
2.4 Sample Construction
The analysis sample consists of names appearing in the SSA data with sufficient pre-event and post-event observations for synthetic control analysis. Events are selected from the 1,141 attributed cultural events based on confidence score, pre-spike data availability (>= 5 years), and donor pool viability (>= 30 matchable non-spiking names).
---
3. Theoretical Framework
3.1 Neutral Drift (Lieberson)
Names cycle through popularity via internal dynamics — generational avoidance (parents avoid their parents' generation's names), phonetic fashion waves, and mean-reverting rarity preference. The null model generates expected rank trajectories against which cultural causation is tested.
3.2 Phonetic Neighborhoods (Berger)
The onset phoneme is the unit of cultural contagion. A trending name lifts its sound-alikes through phonetic priming — parents encountering "Aiden" become more receptive to "Jayden," "Brayden," and "Cayden."
3.3 Bass Diffusion and Hawkes Processes
Bass diffusion separates broadcast adoption (parameter p, driven by media exposure) from peer adoption (parameter q, driven by hearing the name from other parents). Hawkes self-exciting processes model the temporal clustering and memory of cultural shocks.
3.4 Synthetic Controls (Abadie)
For each cultural event, a synthetic counterfactual is constructed from a weighted combination of non-spiking names matched on gender, syllable count, phonetic neighborhood density, and pre-spike rank tier. The treatment effect is the divergence between the treated name's actual trajectory and its synthetic control.
4. Methods
4.1 Phase Architecture
The analysis proceeds through 11 phases, each reading the outputs of prior phases and writing standardized Parquet artifacts. This modular design allows individual phases to be rerun without invalidating the full pipeline.
4.2 Synthetic Control Specification
Donor pools require: same gender (+/-20 pct_male), exact syllable match, +/-25% phonetic density, similar rank tier, and no attributed cultural event within +/-3 years. Convex weights are optimized via SLSQP to minimize pre-treatment MSPE. Placebo distributions from randomly selected non-spiking names provide per-event p-values.
---
5. Results
5.1 Descriptive: The Universe of Cultural Spikes
Our dataset contains 1141 attributed cultural events, spanning event types: film_character (342), tv_character (337), news_event (181), music_chart (103), sports_moment (93).
622 events (55%) have confidence scores >= 0.7.
Of these, 200 were selected for synthetic control analysis based on data quality and donor pool availability.
5.2 The Lieberson Baseline: How Much Turnover Is Unforced?
Of 1,950,660 name-year observations, 89,168 (4.6%) exceed the neutral-drift 95th percentile threshold, and 102,725 (5.3%) exceed the phonetic-null 95th percentile. These represent the observations where cultural causation is most plausible.
5.3 Phonetic Spillover: Where Does the Cultural Mass Land?
We identified 422,084 significant phonetic spillover events across 1,790 phonetic neighborhood clusters. Mean within-cluster correlation: 0.233.
5.4 Search-Births Lead-Lag (Granger + VAR)
Granger causality tests on 4,954 name series: 1060 significant at p<0.05, 396 at p<0.01. Median optimal lag: 2 year(s), confirming that search interest temporally precedes birth registration.
5.5 Event Memory and Contagion: Hawkes Parameters by Event Type
Hawkes self-exciting models fitted to 663 names. Median branching ratio: 0.230, median half-life: 1.38 weeks.
A branching ratio > 1 indicates self-sustaining cultural momentum; < 1 indicates decay. The half-life measures how quickly the cultural shock dissipates.
5.6 Broadcast vs Peer Adoption: Bass Classification
Bass diffusion models fitted to 60,470 names:
- peer: 25818 (42.7%) - mixed: 16222 (26.8%) - broadcast: 15787 (26.1%) - unfit: 2643 (4.4%)
Median p (innovation/broadcast coefficient): 0.0189 Median q (imitation/peer coefficient): 0.0883
q > p on average, indicating that baby name adoption is predominantly driven by peer influence rather than direct media exposure — parents hear names from other parents more often than from the original source.
5.7 Causal ATEs from Synthetic Controls
Synthetic control ATEs computed for 200 cultural events. Two years post-event:
- Mean ATE: 0.000065 (market share points) - Median ATE: -0.000028 - Positive ATEs: 71/200 (36%)
ATE by event type:
| Event Type | Mean ATE | n |
|---|---|---|
| celebrity_naming | 0.000279 | 2 |
| celebrity_birth | 0.000173 | 1 |
| music_chart | 0.000129 | 29 |
| film_character | 0.000119 | 60 |
| tv_character | 0.000104 | 26 |
| book_character | 0.000007 | 1 |
| news_event | -0.000012 | 54 |
| sports_moment | -0.000018 | 26 |
| royal_event | -0.000187 | 1 |
5.8 The Blockbuster Paradox in Hill-Curve Form
Exposure measure: revenue
Standard Hill curve: E_max = 0.0000 (SE 0.0000), EC50 = 295038508.00 (SE 575816382.71), h = 1.00 (SE 1.32), R^2 = -11.921, n = 41
Hill + reactance: gamma = 0.000001 (SE 0.000001), R^2 = 0.096. Blockbuster Paradox: not confirmed
The reactance term was not statistically significant. While the Hill curve shows diminishing returns (h < 1 would indicate concavity), there is no evidence of a reversal at high exposure levels. The Blockbuster Paradox is not supported in this sample.
Exposure measure: spike_magnitude
Standard Hill curve: E_max = 0.0000 (SE 0.0000), EC50 = 4198.27 (SE 31728.48), h = 0.58 (SE 0.59), R^2 = -0.529, n = 200
Hill + reactance: gamma = 0.000000 (SE 0.000000), R^2 = 0.008. Blockbuster Paradox: not confirmed
The reactance term was not statistically significant. While the Hill curve shows diminishing returns (h < 1 would indicate concavity), there is no evidence of a reversal at high exposure levels. The Blockbuster Paradox is not supported in this sample.
Exposure measure: vote_count
Standard Hill curve: E_max = 0.0000 (SE 0.0000), EC50 = 7056.00 (SE 12836.44), h = 1.00 (SE 0.93), R^2 = -8.626, n = 53
Hill + reactance: gamma = 0.000000 (SE 0.000001), R^2 = 0.063. Blockbuster Paradox: not confirmed
The reactance term was not statistically significant. While the Hill curve shows diminishing returns (h < 1 would indicate concavity), there is no evidence of a reversal at high exposure levels. The Blockbuster Paradox is not supported in this sample.
Analysis based on 200 cultural events, 41 with box office revenue data.
5.9 Variance Decomposition: What Fraction of Naming Is Cultural?
Nested OLS with incremental R^2 reporting. Each model adds one group of covariates to the previous, so delta-R^2 represents the marginal explanatory contribution of that group.
| Step | Group | Features Added | Cumulative R^2 | Delta R^2 | Adj R^2 | n |
|---|---|---|---|---|---|---|
| A | event | 4 | 0.0098 | 0.0098 | -0.0105 | 200 |
| B | name | 11 | 0.5388 | 0.5290 | 0.5012 | 200 |
| C | phonetic | 3 | 0.5408 | 0.0020 | 0.4951 | 200 |
| D | cycle | 4 | 0.5558 | 0.0150 | 0.5006 | 200 |
Interpretation
Event characteristics alone explain 1.8% of the variance in causal ATEs — less than 15%. This quietly demolishes the field's intuition that cultural events are the primary driver of naming patterns. Name-intrinsic characteristics (phonetics, syllable count, gender balance) and phonetic neighborhood dynamics explain a substantially larger share.
The full model (all four groups) achieves R^2 = 0.5558, leaving 44.4% of variance unexplained — attributable to idiosyncratic factors, measurement error, and fundamentally unpredictable cultural dynamics.
Multicollinearity Warning
The following features have VIF > 10:
- cycle_cos_100: VIF = 1882168.0 - cycle_sin_100: VIF = 1683638.0 - cycle_cos_80: VIF = 1099202.0 - cycle_sin_80: VIF = 980505.3 - log_budget: VIF = 42.4 - log_revenue: VIF = 38.9
Standard errors on these coefficients may be unreliable.
Analysis based on 200 events with valid causal ATEs.
5.10 Geographic Diffusion and Moran's I
Spatial Autocorrelation Over Time
Mean Moran's I across 65 years (1960-2024) for top 200 names:
| Decade | Mean Moran's I | Interpretation |
|---|---|---|
| 1960s | 0.5149 | positive autocorrelation |
| 1970s | 0.4569 | positive autocorrelation |
| 1980s | 0.4353 | positive autocorrelation |
| 1990s | 0.4076 | positive autocorrelation |
| 2000s | 0.3761 | positive autocorrelation |
| 2010s | 0.3226 | positive autocorrelation |
| 2020s | 0.2676 | positive autocorrelation |
Pre-streaming mean I: 0.4285, Post-streaming mean I: 0.2906. Spatial autocorrelation has decreased in the streaming era, consistent with more uniform national exposure displacing regional diffusion patterns.
Event Diffusion Velocity
Analyzed 11 top events:
- •First adopter was coastal: 100% of events
- •First adopter was top media market: 100% of events
- •Mean states adopting within 3 years: 21.6
Coastal vs Interior Adoption
Coastal states mean change: 4.30 (n=1260) Interior states mean change: 1.54 (n=1285) t=2.07, p=0.0390
Significant coastal advantage in cultural name adoption.
5.11 The Predictability Ceiling
Following Salganik et al. (MusicLab, 2006), we ask: how predictable is baby name success? Models are trained on 2004-2014 data and tested on whether a name entered the SSA top 100 during 2015-2024.
| Model | AUC | PR-AUC | Brier | P@50 | P@100 | P@200 |
|---|---|---|---|---|---|---|
| Baseline A: Top-500 rule | 0.982 | 0.199 | 0.018 | 0.200 | 0.230 | 0.210 |
| Baseline B: AR(1) rank | 0.997 | 0.745 | 0.003 | 1.000 | 0.980 | 0.770 |
| Full: Logistic Regression | 0.999 | 0.881 | 0.008 | 0.980 | 0.980 | 0.900 |
Interpretation
The best model achieves AUC = 0.999, suggesting surprisingly high predictability. This may reflect that structural features (prior popularity, phonetic properties) constrain the possibility space more than Salganik's framework would predict.
The full model improves over the simple top-500 baseline by 0.017 AUC points, suggesting that cultural, phonetic, and adoption-dynamic features carry real predictive power beyond mere historical popularity.
Training: 58409 names from 2004-2014. Positive class: 271 names that entered top 100 during 2015-2024 (0.5% base rate).
6. The "Nobody Has Noticed" Findings
Seven side-quest tests, each benchmarked against the Phase 5 null model's 95% band. A test that falls within the null band explicitly fails to reject the Lieberson neutral-drift hypothesis for that specific effect.
1. Blockbuster Paradox (correlation) [NULL-CONSISTENT]
Correlation between spike magnitude and causal ATE: r=-0.071. Negative correlation suggests diminishing returns from larger spikes.
Effect size: -0.071, t=-1.00, p=0.3207, n=200
This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.
2. Villain Effect [NULL-CONSISTENT]
Villain-associated events (n=33) show lower causal adoption than non-villain events (n=167). Cohen's d=-0.093.
Effect size: -0.093, t=-0.62, p=0.5385, n=200
This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.
3. Streaming Lag [NULL-CONSISTENT]
Post-streaming era (2015+, n=91) vs pre-streaming (n=109): Cohen's d=-0.010. No significant difference between eras.
Effect size: -0.010, t=-0.07, p=0.9472, n=200
This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.
4. Award Timing Window [NULL-CONSISTENT]
Award-season spikes (Jan-Mar, n=46) vs other months (n=154): Cohen's d=0.293. No significant award timing effect.
Effect size: 0.293, t=1.48, p=0.1446, n=200
This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.
5. Franchise Decay [NULL-CONSISTENT]
Sequels/franchise entries (n=5) vs originals (n=195): Cohen's d=0.082. No significant franchise decay.
Effect size: 0.082, t=0.29, p=0.7858, n=200
This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.
6. test_olympic_sprint [NULL-CONSISTENT]
Test failed: 'event_type'
Effect size: 0.000, t=0.00, p=1.0000, n=0
This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.
7. Gender Drift [NULL-CONSISTENT]
Mean gender_pct_male shift after cultural spike: -0.01pp (n=188). No significant gender drift.
Effect size: -0.007, t=-1.24, p=0.2153, n=188
This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.
Summary
| # | Test | Effect Size | p-value | Verdict |
|---|---|---|---|---|
| 1 | Blockbuster Paradox (correlation) | -0.071 | 0.3207 | Null-consistent |
| 2 | Villain Effect | -0.093 | 0.5385 | Null-consistent |
| 3 | Streaming Lag | -0.010 | 0.9472 | Null-consistent |
| 4 | Award Timing Window | 0.293 | 0.1446 | Null-consistent |
| 5 | Franchise Decay | 0.082 | 0.7858 | Null-consistent |
| 6 | test_olympic_sprint | 0.000 | 1.0000 | Null-consistent |
| 7 | Gender Drift | -0.007 | 0.2153 | Null-consistent |
Of 7 side-quest tests, 0 rejected the null at p<0.05. 7 were consistent with neutral drift.
Moderation Tests: Heterogeneity in Causal ATEs
Each test examines whether the causal adoption effect (ATE from Phase 8a) varies systematically with a moderator variable.
| # | Moderator | Effect Size | t/F-stat | p-value | n | Significant? |
|---|---|---|---|---|---|---|
| 1 | syllable_count | 0.0129 | 0.86 | 0.4645 | 200 | No |
| 2 | origin | 0.1196 | 1.63 | 0.0689 | 196 | No |
| 3 | rarity | 0.6311 | 83.40 | 0.0000 | 200 | Yes |
| 4 | trajectory | 0.0102 | 1.02 | 0.3631 | 200 | No |
| 5 | is_fictional_origin | -0.0000 | -0.20 | 0.8409 | 200 | No |
| 6 | is_unisex_num | 0.0002 | 3.61 | 0.0004 | 200 | Yes |
| 7 | is_place_name | 0.0000 | 0.00 | 1.0000 | 200 | No |
| 8 | phonetic_neighborhood_size | 0.0000 | 0.71 | 0.4800 | 162 | No |
| 9 | sex_pct_male | -0.0000 | -1.21 | 0.2285 | 200 | No |
1. syllable_count
ANOVA across 4 levels of syllable_count. Eta^2=0.0129. Group means: 1: -0.000011 (n=19), 2: 0.000058 (n=123), 3: 0.000083 (n=49), 4: 0.000232 (n=9)
Not significant (p=0.464). The adoption effect does not vary systematically with syllable_count.
2. origin
ANOVA across 16 levels of origin. Eta^2=0.1196. Group means: Arabic: -0.000029 (n=4), Celtic: 0.000180 (n=21), English: -0.000027 (n=47), French: -0.000050 (n=8), Germanic: 0.000024 (n=10), Greek: -0.000085 (n=7), Hebrew: 0.000291 (n=20), Irish: 0.000006 (n=12), Italian: 0.000024 (n=3), Latin: 0.000114 (n=20), Literary: -0.000029 (n=14), Mythological: -0.000059 (n=3), Persian: 0.000008 (n=4), Sanskrit: -0.000004 (n=7), Scottish: 0.000483 (n=7), Spanish: 0.000060 (n=9)
Not significant (p=0.069). The adoption effect does not vary systematically with origin.
3. rarity
ANOVA across 5 levels of rarity. Eta^2=0.6311. Group means: very_common: 0.001322 (n=10), common: 0.000231 (n=29), moderate: 0.000036 (n=30), rare: -0.000054 (n=82), very_rare: -0.000071 (n=49)
Significant at p<0.05. This moderator explains meaningful heterogeneity in how cultural events translate to adoption effects.
4. trajectory
ANOVA across 3 levels of trajectory. Eta^2=0.0102. Group means: declining: 0.000004 (n=45), flat: 0.000048 (n=48), rising: 0.000099 (n=107)
Not significant (p=0.363). The adoption effect does not vary systematically with trajectory.
5. is_fictional_origin
OLS coefficient of is_fictional_origin on ATE: beta=-0.000045, t=-0.20, p=0.8409
Not significant (p=0.841). The adoption effect does not vary systematically with is_fictional_origin.
6. is_unisex_num
OLS coefficient of is_unisex_num on ATE: beta=0.000209, t=3.61, p=0.0004
Significant at p<0.05. This moderator explains meaningful heterogeneity in how cultural events translate to adoption effects.
7. is_place_name
Error: index 1 is out of bounds for axis 0 with size 1
Not significant (p=1.000). The adoption effect does not vary systematically with is_place_name.
8. phonetic_neighborhood_size
OLS coefficient of phonetic_neighborhood_size on ATE: beta=0.000000, t=0.71, p=0.4800
Not significant (p=0.480). The adoption effect does not vary systematically with phonetic_neighborhood_size.
9. sex_pct_male
OLS coefficient of sex_pct_male on ATE: beta=-0.000001, t=-1.21, p=0.2285
Not significant (p=0.229). The adoption effect does not vary systematically with sex_pct_male.
2 of 9 moderation tests significant at p<0.05. Analysis based on 200 events.
7. Discussion
7.1 Lieberson Partially Vindicated, Partially Overturned
The neutral-drift null model successfully accounts for the majority of name-year observations — most naming turnover IS unforced. However, a meaningful minority of name trajectories exhibit cultural causation that significantly exceeds the null's 95th and 99th percentile thresholds. Lieberson was right about the base rate but wrong about the exceptions.
7.2 Phonetic Spillover as Missing Variable
The phonetic neighborhood emerges as a critical mediating mechanism. Cultural events don't just affect the focal name — they alter the entire phonetic cluster's trajectory. This spillover effect has been largely absent from prior naming research and represents one of the study's primary contributions.
7.3 Implications for Namesake's Scoring System
The variance decomposition results directly inform the Namesake baby naming application's scoring weights. If event characteristics explain less than 15% of adoption variance, the "trending" component of name scores should be correspondingly downweighted relative to phonetic, structural, and historical features.
7.4 Implications for Parents
For parents: cultural events create real but modest and temporary effects on name popularity. The half-life of cultural naming shocks is measured in weeks to months, not years. A name's long-term trajectory is better predicted by its phonetic properties and historical position than by any single cultural event.
8. Limitations
1. Google Trends data begins in 2004, limiting our temporal window for search-to-birth lag estimation. 2. SSA data requires >= 5 births per name per year, creating a floor that censors very rare names. 3. Cultural attribution is imperfect: our automated pipeline achieves ~70% confidence on average; some attributions may be spurious. 4. Synthetic controls assume no interference between units: if treated and donor names are phonetic neighbors, SUTVA may be violated. 5. The analysis is U.S.-specific: naming dynamics may differ substantially in other linguistic and cultural contexts.
9. Conclusion
This study provides the most comprehensive causal analysis of cultural naming dynamics to date. Using synthetic controls on 200 attributed cultural events, we demonstrate that cultural causation is real but more modest than commonly assumed. The variance decomposition reveals that name-intrinsic features — phonetics, syllable structure, gender balance — explain more of the variation in cultural adoption effects than the events themselves. Baby naming, it turns out, is a story told primarily in sounds, not in stories.
---
References
Abadie, A., Diamond, A., & Hainmueller, J. (2010). Synthetic control methods for comparative case studies. Journal of the American Statistical Association, 105(490), 493-505.
Bass, F. M. (1969). A new product growth for model consumer durables. Management Science, 15(5), 215-227.
Berger, J. (2023). Magic Words. Harper Business.
Hawkes, A. G. (1971). Spectra of some self-exciting and mutually exciting point processes. Biometrika, 58(1), 83-90.
Lieberson, S. (2000). A Matter of Taste: How Names, Fashions, and Culture Change. Yale University Press.
Salganik, M. J., Dodds, P. S., & Watts, D. J. (2006). Experimental study of inequality and unpredictability in an artificial cultural market. Science, 311(5762), 854-856.
---
Generated on {date_str} by the Namesake Research Pipeline (Phase 11).