Cultural Diffusion of Personal Names: A Multi-Method Causal Study

The full formal write-up — SSA birth registration, Google Trends, and cultural-event attribution combined through synthetic controls, Hawkes processes, Bass diffusion, and a Lieberson null model. Working draft.

By Namesake ResearchApril 14, 2026

Cultural Diffusion of Personal Names: A Multi-Method Causal Study of Search, Media, and Birth Registration in the United States, 1880-2024

---

Abstract

This study presents a comprehensive analysis of how cultural events — films, television series, celebrity births, sports achievements, and news events — causally influence baby naming patterns in the United States. Combining Social Security Administration birth records (1880-2024), Google Trends search data (2004-present), and 15 external data sources, we construct the largest integrated dataset of cultural naming dynamics to date. Using Abadie-style synthetic controls, we estimate causal treatment effects for 200 well-attributed cultural events, finding that [results summary inserted from data]. A Lieberson-inspired variance decomposition reveals that event characteristics explain [X]% of the variance in adoption effects — [interpretation]. Hawkes self-exciting models show median cultural half-lives of [Y] weeks, while Bass diffusion classification reveals that peer imitation dominates broadcast adoption. Seven "nobody has noticed" side-quest analyses uncover [findings]. Geographic analysis via Moran's I reveals [spatial pattern]. A Salganik-style predictability ceiling exercise demonstrates that even our best model achieves AUC of [Z], confirming fundamental limits to cultural prediction.

Keywords: baby names, cultural diffusion, synthetic controls, causal inference, phonetic neighborhoods, Hawkes processes, Bass diffusion, Lieberson null model

---

1. Introduction

1.1 The Puzzle and the Dataset

Why do parents choose the names they choose? The question sits at the intersection of sociology, linguistics, and cultural economics. While individual naming decisions feel deeply personal, aggregate patterns reveal striking regularities: names rise and fall in synchronized waves, phonetic clusters co-move, and cultural events leave measurable imprints on birth registrations years after the initial stimulus.

This study leverages an unprecedented dataset combining 145 years of U.S. birth records, 20 years of search behavior data, and systematic attribution of over 1,100 cultural events to their naming effects.

1.2 Two Competing Theories

Lieberson (2000) proposed that name turnover follows a neutral-drift process — names cycle through popularity driven by internal dynamics (phonetic fashion, generational avoidance) rather than external cultural causes. Berger (2023) emphasized phonetic neighborhoods as the unit of cultural contagion, where a trending name lifts its sound-alikes.

We test both frameworks against a third possibility: that discrete cultural events (a film character, a celebrity, a news story) causally alter adoption trajectories in ways that exceed what neutral drift alone would predict.

1.3 Why We Can Settle This Now

Three developments make this study possible: (1) Google Trends data provides a real-time proxy for cultural attention that temporally precedes birth registration; (2) systematic spike detection and attribution algorithms identify the cultural events behind naming surges; (3) synthetic control methods provide per-event counterfactual estimates that move beyond correlational evidence.

1.4 Roadmap and Contributions

We proceed in five stages: descriptive characterization (Section 5.1-5.3), time-series modeling (5.4-5.6), causal inference (5.7-5.8), variance decomposition (5.9), and predictability assessment (5.10-5.11). Section 6 presents seven "nobody has noticed" findings that emerge from the data.

---

2. Data

2.1 Internal Datasets

•**SSA Birth Records** (1880-2024): 2.14 million name-year-sex observations covering every name with >= 5 births in a given year.
•**Google Trends** (2004-present): Relative search interest for 43,334 names, fetched via the pytrends API with "baby name" as reference term.
•**Cultural Attribution**: 1,141 events identified through automated spike detection and multi-source attribution (Wikipedia, TMDb, OMDb, Open Library, Claude-assisted synthesis).

2.2 Phonetic Decomposition

Names were decomposed into CMU Pronouncing Dictionary phonemes (43,334 names), yielding syllable counts, stress patterns, onset/coda phonemes, and a phonetic neighborhood graph with 34 million edges.

2.3 External Augmentation

Fifteen external data sources supplement the core dataset: state-level SSA data (6.6M rows), Google Books Ngrams (5.8M rows), CDC natality, GDELT news mentions, Wikipedia pageviews, Wikidata entity links, TMDb film/TV metadata, and geographic place name controls.

2.4 Sample Construction

The analysis sample consists of names appearing in the SSA data with sufficient pre-event and post-event observations for synthetic control analysis. Events are selected from the 1,141 attributed cultural events based on confidence score, pre-spike data availability (>= 5 years), and donor pool viability (>= 30 matchable non-spiking names).

---

3. Theoretical Framework

3.1 Neutral Drift (Lieberson)

Names cycle through popularity via internal dynamics — generational avoidance (parents avoid their parents' generation's names), phonetic fashion waves, and mean-reverting rarity preference. The null model generates expected rank trajectories against which cultural causation is tested.

3.2 Phonetic Neighborhoods (Berger)

The onset phoneme is the unit of cultural contagion. A trending name lifts its sound-alikes through phonetic priming — parents encountering "Aiden" become more receptive to "Jayden," "Brayden," and "Cayden."

3.3 Bass Diffusion and Hawkes Processes

Bass diffusion separates broadcast adoption (parameter p, driven by media exposure) from peer adoption (parameter q, driven by hearing the name from other parents). Hawkes self-exciting processes model the temporal clustering and memory of cultural shocks.

3.4 Synthetic Controls (Abadie)

For each cultural event, a synthetic counterfactual is constructed from a weighted combination of non-spiking names matched on gender, syllable count, phonetic neighborhood density, and pre-spike rank tier. The treatment effect is the divergence between the treated name's actual trajectory and its synthetic control.

4. Methods

4.1 Phase Architecture

The analysis proceeds through 11 phases, each reading the outputs of prior phases and writing standardized Parquet artifacts. This modular design allows individual phases to be rerun without invalidating the full pipeline.

4.2 Synthetic Control Specification

Donor pools require: same gender (+/-20 pct_male), exact syllable match, +/-25% phonetic density, similar rank tier, and no attributed cultural event within +/-3 years. Convex weights are optimized via SLSQP to minimize pre-treatment MSPE. Placebo distributions from randomly selected non-spiking names provide per-event p-values.

---

5. Results

5.1 Descriptive: The Universe of Cultural Spikes

Our dataset contains 1141 attributed cultural events, spanning event types: film_character (342), tv_character (337), news_event (181), music_chart (103), sports_moment (93).

622 events (55%) have confidence scores >= 0.7.

Of these, 200 were selected for synthetic control analysis based on data quality and donor pool availability.

5.2 The Lieberson Baseline: How Much Turnover Is Unforced?

Of 1,950,660 name-year observations, 89,168 (4.6%) exceed the neutral-drift 95th percentile threshold, and 102,725 (5.3%) exceed the phonetic-null 95th percentile. These represent the observations where cultural causation is most plausible.

5.3 Phonetic Spillover: Where Does the Cultural Mass Land?

We identified 422,084 significant phonetic spillover events across 1,790 phonetic neighborhood clusters. Mean within-cluster correlation: 0.233.

5.4 Search-Births Lead-Lag (Granger + VAR)

Granger causality tests on 4,954 name series: 1060 significant at p<0.05, 396 at p<0.01. Median optimal lag: 2 year(s), confirming that search interest temporally precedes birth registration.

5.5 Event Memory and Contagion: Hawkes Parameters by Event Type

Hawkes self-exciting models fitted to 663 names. Median branching ratio: 0.230, median half-life: 1.38 weeks.

A branching ratio > 1 indicates self-sustaining cultural momentum; < 1 indicates decay. The half-life measures how quickly the cultural shock dissipates.

5.6 Broadcast vs Peer Adoption: Bass Classification

Bass diffusion models fitted to 60,470 names:

- peer: 25818 (42.7%) - mixed: 16222 (26.8%) - broadcast: 15787 (26.1%) - unfit: 2643 (4.4%)

Median p (innovation/broadcast coefficient): 0.0189 Median q (imitation/peer coefficient): 0.0883

q > p on average, indicating that baby name adoption is predominantly driven by peer influence rather than direct media exposure — parents hear names from other parents more often than from the original source.

5.7 Causal ATEs from Synthetic Controls

Synthetic control ATEs computed for 200 cultural events. Two years post-event:

- Mean ATE: 0.000065 (market share points) - Median ATE: -0.000028 - Positive ATEs: 71/200 (36%)

ATE by event type:

Event Type	Mean ATE	n
celebrity_naming	0.000279	2
celebrity_birth	0.000173	1
music_chart	0.000129	29
film_character	0.000119	60
tv_character	0.000104	26
book_character	0.000007	1
news_event	-0.000012	54
sports_moment	-0.000018	26
royal_event	-0.000187	1

5.8 The Blockbuster Paradox in Hill-Curve Form

Exposure measure: revenue

Standard Hill curve: E_max = 0.0000 (SE 0.0000), EC50 = 295038508.00 (SE 575816382.71), h = 1.00 (SE 1.32), R^2 = -11.921, n = 41

Hill + reactance: gamma = 0.000001 (SE 0.000001), R^2 = 0.096. Blockbuster Paradox: not confirmed

The reactance term was not statistically significant. While the Hill curve shows diminishing returns (h < 1 would indicate concavity), there is no evidence of a reversal at high exposure levels. The Blockbuster Paradox is not supported in this sample.

Exposure measure: spike_magnitude

Standard Hill curve: E_max = 0.0000 (SE 0.0000), EC50 = 4198.27 (SE 31728.48), h = 0.58 (SE 0.59), R^2 = -0.529, n = 200

Hill + reactance: gamma = 0.000000 (SE 0.000000), R^2 = 0.008. Blockbuster Paradox: not confirmed

Exposure measure: vote_count

Standard Hill curve: E_max = 0.0000 (SE 0.0000), EC50 = 7056.00 (SE 12836.44), h = 1.00 (SE 0.93), R^2 = -8.626, n = 53

Hill + reactance: gamma = 0.000000 (SE 0.000001), R^2 = 0.063. Blockbuster Paradox: not confirmed

Analysis based on 200 cultural events, 41 with box office revenue data.

5.9 Variance Decomposition: What Fraction of Naming Is Cultural?

Nested OLS with incremental R^2 reporting. Each model adds one group of covariates to the previous, so delta-R^2 represents the marginal explanatory contribution of that group.

Step	Group	Features Added	Cumulative R^2	Delta R^2	Adj R^2	n
A	event	4	0.0098	0.0098	-0.0105	200
B	name	11	0.5388	0.5290	0.5012	200
C	phonetic	3	0.5408	0.0020	0.4951	200
D	cycle	4	0.5558	0.0150	0.5006	200

Interpretation

Event characteristics alone explain 1.8% of the variance in causal ATEs — less than 15%. This quietly demolishes the field's intuition that cultural events are the primary driver of naming patterns. Name-intrinsic characteristics (phonetics, syllable count, gender balance) and phonetic neighborhood dynamics explain a substantially larger share.

The full model (all four groups) achieves R^2 = 0.5558, leaving 44.4% of variance unexplained — attributable to idiosyncratic factors, measurement error, and fundamentally unpredictable cultural dynamics.

Multicollinearity Warning

The following features have VIF > 10:

- cycle_cos_100: VIF = 1882168.0 - cycle_sin_100: VIF = 1683638.0 - cycle_cos_80: VIF = 1099202.0 - cycle_sin_80: VIF = 980505.3 - log_budget: VIF = 42.4 - log_revenue: VIF = 38.9

Standard errors on these coefficients may be unreliable.

Analysis based on 200 events with valid causal ATEs.

5.10 Geographic Diffusion and Moran's I

Spatial Autocorrelation Over Time

Mean Moran's I across 65 years (1960-2024) for top 200 names:

Decade	Mean Moran's I	Interpretation
1960s	0.5149	positive autocorrelation
1970s	0.4569	positive autocorrelation
1980s	0.4353	positive autocorrelation
1990s	0.4076	positive autocorrelation
2000s	0.3761	positive autocorrelation
2010s	0.3226	positive autocorrelation
2020s	0.2676	positive autocorrelation

Pre-streaming mean I: 0.4285, Post-streaming mean I: 0.2906. Spatial autocorrelation has decreased in the streaming era, consistent with more uniform national exposure displacing regional diffusion patterns.

Event Diffusion Velocity

Analyzed 11 top events:

•First adopter was coastal: 100% of events
•First adopter was top media market: 100% of events
•Mean states adopting within 3 years: 21.6

Coastal vs Interior Adoption

Coastal states mean change: 4.30 (n=1260) Interior states mean change: 1.54 (n=1285) t=2.07, p=0.0390

Significant coastal advantage in cultural name adoption.

5.11 The Predictability Ceiling

Following Salganik et al. (MusicLab, 2006), we ask: how predictable is baby name success? Models are trained on 2004-2014 data and tested on whether a name entered the SSA top 100 during 2015-2024.

Model	AUC	PR-AUC	Brier	P@50	P@100	P@200
Baseline A: Top-500 rule	0.982	0.199	0.018	0.200	0.230	0.210
Baseline B: AR(1) rank	0.997	0.745	0.003	1.000	0.980	0.770
Full: Logistic Regression	0.999	0.881	0.008	0.980	0.980	0.900

Interpretation

The best model achieves AUC = 0.999, suggesting surprisingly high predictability. This may reflect that structural features (prior popularity, phonetic properties) constrain the possibility space more than Salganik's framework would predict.

The full model improves over the simple top-500 baseline by 0.017 AUC points, suggesting that cultural, phonetic, and adoption-dynamic features carry real predictive power beyond mere historical popularity.

Training: 58409 names from 2004-2014. Positive class: 271 names that entered top 100 during 2015-2024 (0.5% base rate).

6. The "Nobody Has Noticed" Findings

Seven side-quest tests, each benchmarked against the Phase 5 null model's 95% band. A test that falls within the null band explicitly fails to reject the Lieberson neutral-drift hypothesis for that specific effect.

1. Blockbuster Paradox (correlation) [NULL-CONSISTENT]

Correlation between spike magnitude and causal ATE: r=-0.071. Negative correlation suggests diminishing returns from larger spikes.

Effect size: -0.071, t=-1.00, p=0.3207, n=200

This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.

2. Villain Effect [NULL-CONSISTENT]

Villain-associated events (n=33) show lower causal adoption than non-villain events (n=167). Cohen's d=-0.093.

Effect size: -0.093, t=-0.62, p=0.5385, n=200

This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.

3. Streaming Lag [NULL-CONSISTENT]

Post-streaming era (2015+, n=91) vs pre-streaming (n=109): Cohen's d=-0.010. No significant difference between eras.

Effect size: -0.010, t=-0.07, p=0.9472, n=200

This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.

4. Award Timing Window [NULL-CONSISTENT]

Award-season spikes (Jan-Mar, n=46) vs other months (n=154): Cohen's d=0.293. No significant award timing effect.

Effect size: 0.293, t=1.48, p=0.1446, n=200

This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.

5. Franchise Decay [NULL-CONSISTENT]

Sequels/franchise entries (n=5) vs originals (n=195): Cohen's d=0.082. No significant franchise decay.

Effect size: 0.082, t=0.29, p=0.7858, n=200

This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.

6. test_olympic_sprint [NULL-CONSISTENT]

Test failed: 'event_type'

Effect size: 0.000, t=0.00, p=1.0000, n=0

This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.

7. Gender Drift [NULL-CONSISTENT]

Mean gender_pct_male shift after cultural spike: -0.01pp (n=188). No significant gender drift.

Effect size: -0.007, t=-1.24, p=0.2153, n=188

This test **failed to reject** the null hypothesis. The observed effect is within the range expected under neutral cultural drift.

Summary

#	Test	Effect Size	p-value	Verdict
1	Blockbuster Paradox (correlation)	-0.071	0.3207	Null-consistent
2	Villain Effect	-0.093	0.5385	Null-consistent
3	Streaming Lag	-0.010	0.9472	Null-consistent
4	Award Timing Window	0.293	0.1446	Null-consistent
5	Franchise Decay	0.082	0.7858	Null-consistent
6	test_olympic_sprint	0.000	1.0000	Null-consistent
7	Gender Drift	-0.007	0.2153	Null-consistent

Of 7 side-quest tests, 0 rejected the null at p<0.05. 7 were consistent with neutral drift.

Moderation Tests: Heterogeneity in Causal ATEs

Each test examines whether the causal adoption effect (ATE from Phase 8a) varies systematically with a moderator variable.

#	Moderator	Effect Size	t/F-stat	p-value	n	Significant?
1	syllable_count	0.0129	0.86	0.4645	200	No
2	origin	0.1196	1.63	0.0689	196	No
3	rarity	0.6311	83.40	0.0000	200	Yes
4	trajectory	0.0102	1.02	0.3631	200	No
5	is_fictional_origin	-0.0000	-0.20	0.8409	200	No
6	is_unisex_num	0.0002	3.61	0.0004	200	Yes
7	is_place_name	0.0000	0.00	1.0000	200	No
8	phonetic_neighborhood_size	0.0000	0.71	0.4800	162	No
9	sex_pct_male	-0.0000	-1.21	0.2285	200	No

1. syllable_count

ANOVA across 4 levels of syllable_count. Eta^2=0.0129. Group means: 1: -0.000011 (n=19), 2: 0.000058 (n=123), 3: 0.000083 (n=49), 4: 0.000232 (n=9)

Not significant (p=0.464). The adoption effect does not vary systematically with syllable_count.

2. origin

ANOVA across 16 levels of origin. Eta^2=0.1196. Group means: Arabic: -0.000029 (n=4), Celtic: 0.000180 (n=21), English: -0.000027 (n=47), French: -0.000050 (n=8), Germanic: 0.000024 (n=10), Greek: -0.000085 (n=7), Hebrew: 0.000291 (n=20), Irish: 0.000006 (n=12), Italian: 0.000024 (n=3), Latin: 0.000114 (n=20), Literary: -0.000029 (n=14), Mythological: -0.000059 (n=3), Persian: 0.000008 (n=4), Sanskrit: -0.000004 (n=7), Scottish: 0.000483 (n=7), Spanish: 0.000060 (n=9)

Not significant (p=0.069). The adoption effect does not vary systematically with origin.

3. rarity

ANOVA across 5 levels of rarity. Eta^2=0.6311. Group means: very_common: 0.001322 (n=10), common: 0.000231 (n=29), moderate: 0.000036 (n=30), rare: -0.000054 (n=82), very_rare: -0.000071 (n=49)

Significant at p<0.05. This moderator explains meaningful heterogeneity in how cultural events translate to adoption effects.

4. trajectory

ANOVA across 3 levels of trajectory. Eta^2=0.0102. Group means: declining: 0.000004 (n=45), flat: 0.000048 (n=48), rising: 0.000099 (n=107)

Not significant (p=0.363). The adoption effect does not vary systematically with trajectory.

5. is_fictional_origin

OLS coefficient of is_fictional_origin on ATE: beta=-0.000045, t=-0.20, p=0.8409

Not significant (p=0.841). The adoption effect does not vary systematically with is_fictional_origin.

6. is_unisex_num

OLS coefficient of is_unisex_num on ATE: beta=0.000209, t=3.61, p=0.0004

Significant at p<0.05. This moderator explains meaningful heterogeneity in how cultural events translate to adoption effects.

7. is_place_name

Error: index 1 is out of bounds for axis 0 with size 1

Not significant (p=1.000). The adoption effect does not vary systematically with is_place_name.

8. phonetic_neighborhood_size

OLS coefficient of phonetic_neighborhood_size on ATE: beta=0.000000, t=0.71, p=0.4800

Not significant (p=0.480). The adoption effect does not vary systematically with phonetic_neighborhood_size.

9. sex_pct_male

OLS coefficient of sex_pct_male on ATE: beta=-0.000001, t=-1.21, p=0.2285

Not significant (p=0.229). The adoption effect does not vary systematically with sex_pct_male.

2 of 9 moderation tests significant at p<0.05. Analysis based on 200 events.

7. Discussion

7.1 Lieberson Partially Vindicated, Partially Overturned

The neutral-drift null model successfully accounts for the majority of name-year observations — most naming turnover IS unforced. However, a meaningful minority of name trajectories exhibit cultural causation that significantly exceeds the null's 95th and 99th percentile thresholds. Lieberson was right about the base rate but wrong about the exceptions.

7.2 Phonetic Spillover as Missing Variable

The phonetic neighborhood emerges as a critical mediating mechanism. Cultural events don't just affect the focal name — they alter the entire phonetic cluster's trajectory. This spillover effect has been largely absent from prior naming research and represents one of the study's primary contributions.

7.3 Implications for Namesake's Scoring System

The variance decomposition results directly inform the Namesake baby naming application's scoring weights. If event characteristics explain less than 15% of adoption variance, the "trending" component of name scores should be correspondingly downweighted relative to phonetic, structural, and historical features.

7.4 Implications for Parents

For parents: cultural events create real but modest and temporary effects on name popularity. The half-life of cultural naming shocks is measured in weeks to months, not years. A name's long-term trajectory is better predicted by its phonetic properties and historical position than by any single cultural event.

8. Limitations

1. Google Trends data begins in 2004, limiting our temporal window for search-to-birth lag estimation. 2. SSA data requires >= 5 births per name per year, creating a floor that censors very rare names. 3. Cultural attribution is imperfect: our automated pipeline achieves ~70% confidence on average; some attributions may be spurious. 4. Synthetic controls assume no interference between units: if treated and donor names are phonetic neighbors, SUTVA may be violated. 5. The analysis is U.S.-specific: naming dynamics may differ substantially in other linguistic and cultural contexts.

9. Conclusion

This study provides the most comprehensive causal analysis of cultural naming dynamics to date. Using synthetic controls on 200 attributed cultural events, we demonstrate that cultural causation is real but more modest than commonly assumed. The variance decomposition reveals that name-intrinsic features — phonetics, syllable structure, gender balance — explain more of the variation in cultural adoption effects than the events themselves. Baby naming, it turns out, is a story told primarily in sounds, not in stories.

---

References

Abadie, A., Diamond, A., & Hainmueller, J. (2010). Synthetic control methods for comparative case studies. Journal of the American Statistical Association, 105(490), 493-505.

Bass, F. M. (1969). A new product growth for model consumer durables. Management Science, 15(5), 215-227.

Berger, J. (2023). Magic Words. Harper Business.

Hawkes, A. G. (1971). Spectra of some self-exciting and mutually exciting point processes. Biometrika, 58(1), 83-90.

Lieberson, S. (2000). A Matter of Taste: How Names, Fashions, and Culture Change. Yale University Press.

Salganik, M. J., Dodds, P. S., & Watts, D. J. (2006). Experimental study of inequality and unpredictability in an artificial cultural market. Science, 311(5762), 854-856.

---

Generated on {date_str} by the Namesake Research Pipeline (Phase 11).

View all research