Research
Research Focus
I study why creative products succeed and how creators can use innovation, borrowing, and generative AI to design content that resonates rather than disappears. My work combines economics, statistics, and multi-modal machine learning to understand diffusion and performance outcomes in music, online video, and video games.Details
Entertainment markets differ fundamentally from traditional product markets because creative goods are high-dimensional and audience preferences are largely implicit—people know they like a song or a video but cannot easily explain why. My research develops interpretable, theory-guided representations of creative content and embeds them into a historical style space to measure innovation and borrowing relative to past successful works. On the supply side, I study how creators strategically position themselves in style space based on goals (entry, ranking, persistence) and identity (incumbent vs. entrant). I use counterfactual experiments and generative AI pipelines to evaluate how creative decisions affect real outcomes, offering guidance at a moment when AI enables large-scale creation but amplifies noise and uncertainty.Job Market Paper
- Haihao Guo, P. B. (Seethu) Seetharaman, Yingkang Xie. "Echoes from the Past: Music Design Inspired by Past Hits and Its GenAI Application."
Abstract
A central challenge in music design is determining when a song should remain rooted in past musical styles and when it should innovate beyond them. This paper proposes a novel data-driven framework, grounded in canonical music theory, to guide the design of commercially successful music by identifying a song's optimal positioning relative to past musical styles. The framework provides actionable design guidance for both human musicians and generative AI systems. We implement it by 1) constructing an Interpretable Music Attribute Representation (IMAR) that extracts melody, harmony, and rhythm features from audio using deep learning models integrated with music-theoretic analysis, 2) clustering 24,618 Billboard Hot 100 songs from 1958--2017 to identify representative songs within each era and link recurring patterns across time into historical styles, and 3) positioning 29,331 songs from 2019--2025 relative to these styles using distance and concentration measures. We then train an XGBoost model of Billboard entry and show that only six interpretable style-positioning variables outperform high-dimensional audio embeddings with more than 4,000 variables in out-of-sample prediction, while also providing actionable guidance about how a song should be redesigned. We find that contemporary songs are more likely to enter the Billboard Hot 100 when they follow a close-fusion strategy: being close to multiple historical styles rather than concentrated around a single one, especially in melody and harmony. This suggests that successful songs tend to blend across multiple historical references. Finally, we validate the framework through a generative AI redesign application on five contemporary unpopular songs across five genres. We build a retrieval-augmented generation pipeline that translates historical style medoids and estimated optimal distances into musician-ready prompts and redesigned songs. In a lab experiment, the redesigned songs receive more favorable listener evaluations than the originals, especially in production-driven genres, although AI aversion and vocal familiarity dampen acceptance of these improvements. Together, our historical-style-based framework can make music design more measurable and actionable, providing concrete guidance for both human musicians and also being implementable in generative AI systems.
Working Papers
- Haihao Guo, P. B. (Seethu) Seetharaman. "Instant versus Sustained Diffusion of YouTube Videos: A Multimodal Analysis of Content Characteristics."
Abstract
Despite an emerging body of research on online content diffusion, there has been little to no investigation of how content characteristics drive distinct temporal diffusion patterns. This study addresses this gap by investigating two primary questions: (1) What are the main diffusion patterns of online videos? (2) Which specific content characteristics influence these patterns? We estimate the Bass diffusion model on a random sample of one million YouTube videos and identify 42,046 videos with highly predictable temporal diffusion patterns. Using K-means clustering, we find that these videos fall under two broad types: (1) instant diffusion (rapid spike followed by quick decline), (2) sustained diffusion (steady growth over time). Guided by Dual Process Theory, we apply state-of-the-art machine learning methods—(1) Spotify audio analysis, (2) Whisper transcription, (3) Sentence-BERT embeddings, and (4) Topic modeling—to extract audio, visual, and textual features corresponding to System 1 (fast, emotional) versus System 2 (slow, analytical) cognitive processes. Logistic regression analyses reveal that instant diffusion is associated with high-energy, emotionally engaging content (System 1), whereas sustained diffusion is associated with informational density and complexity (System 2). Viewer engagement metrics (comment complexity, delayed sharing, longer watch times) further validate these cognitive distinctions. Our findings can be used by YouTube content creators to design video content that is tailored to achieve specific temporal viewership objectives. - Haihao Guo, Baojun Jiang, P. B. (Seethu) Seetharaman. "When Should Developers Partner with a Publisher? Theory and Evidence from Steam."
Abstract
Independent game developers must choose whether to self-publish or partner with a publisher that can amplify demand through marketing. We build an analytical model in which the developer's effort is unobservable and the integration choice (partnering vs. self-publishing) is endogenous. Publisher marketing raises the marginal return to quality but weakens effort incentives under standard revenue sharing. We show that equilibrium price, effort, and mode are characterized by two cutoff values—one in the publisher's marketing capability and one in the developer's quality-to-cost ratio. The developer will self-publish only when both its quality-to-cost ratio and the publisher's marketing capability are below some threshold levels; otherwise it will partner with the publisher. We show that profit-sharing contracts expand the region where partnerships are feasible and efficient by better aligning incentives. We test the model's implications using the Steam PC games market (2015--2025), combining mode choice with price and performance outcomes and correcting for selection. The evidence is consistent with the theory: publisher-backed titles—especially when publisher capability is high—exhibit higher launch prices and stronger first-year performance, with patterns that map to the predicted thresholds. This paper offers actionable guidance on when developers should partner, how publishers should set terms, and how platforms should design policies that balance discovery with incentives.