The Algorithmic Advantage: How Reinforcement Learning Generates Rich Communication

Calvano, Emilio; Possnig, Clemens; Tolvanen, Juha

The Algorithmic Advantage: How Reinforcement Learning Generates Rich Communication

Files

AlgorithmicAdvantage_ReinforcementLearning_WorkingPaper.pdf (1.3 MB)

Date

2026-02-12

Authors

Calvano, Emilio

Possnig, Clemens

Tolvanen, Juha

Publisher

Luiss University, University of Waterloo, University of Rome Tor Vergata

Abstract

We analyze strategic communication when advice is generated by a reinforcement-learning algorithm rather than by a fully rational sender. Building on the cheap-talk framework of Crawford and Sobel (1982), an advisor adapts its messages based on payoff feedback, while a decision maker best-responds. We provide a theoretical analysis of the long-run communication outcomes induced by such reward-driven adaptation. With aligned preferences, we establish that learning robustly leads to informative communication even from uninformative initial policies. With misaligned preferences, no stable outcome exists; instead, learning generates cycles that sustain highly informative communication and payoffs exceeding those of any static equilibrium.

URI

https://hdl.handle.net/10012/23582

Collections

Waterloo Research
Economics

Full item page

The Algorithmic Advantage: How Reinforcement Learning Generates Rich Communication

Files

Date

Authors

Advisor

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

LC Subject Headings

Citation

URI

Collections