The Algorithmic Advantage: How Reinforcement Learning Generates Rich Communication

dc.contributor.authorCalvano, Emilio
dc.contributor.authorPossnig, Clemens
dc.contributor.authorTolvanen, Juha
dc.date.accessioned2026-06-10T16:12:36Z
dc.date.available2026-06-10T16:12:36Z
dc.date.issued2026-02-12
dc.description.abstractWe analyze strategic communication when advice is generated by a reinforcement-learning algorithm rather than by a fully rational sender. Building on the cheap-talk framework of Crawford and Sobel (1982), an advisor adapts its messages based on payoff feedback, while a decision maker best-responds. We provide a theoretical analysis of the long-run communication outcomes induced by such reward-driven adaptation. With aligned preferences, we establish that learning robustly leads to informative communication even from uninformative initial policies. With misaligned preferences, no stable outcome exists; instead, learning generates cycles that sustain highly informative communication and payoffs exceeding those of any static equilibrium.
dc.identifier.urihttps://hdl.handle.net/10012/23582
dc.language.isoen
dc.publisherLuiss University, University of Waterloo, University of Rome Tor Vergata
dc.titleThe Algorithmic Advantage: How Reinforcement Learning Generates Rich Communication
dc.typePreprint
uws.contributor.affiliation1Faculty of Arts
uws.contributor.affiliation2Economics
uws.peerReviewStatusUnreviewed
uws.scholarLevelFaculty
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
AlgorithmicAdvantage_ReinforcementLearning_WorkingPaper.pdf
Size:
1.3 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
4.47 KB
Format:
Item-specific license agreed upon to submission
Description: