Math Takes Two is a new benchmark that tests whether two agents can emergently invent numerical communication to solve visually grounded extrapolation problems without prior mathematical knowledge.
URL https://proceedings.neurips.cc/paper_ files/paper/2017/hash/70222949cc0db89ab32c9969754d4758-Abstract.html
2 Pith papers cite this work. Polarity classification is still indexing.
abstract
Learning to communicate through interaction, rather than relying on explicit supervision, is often considered a prerequisite for developing a general AI. We study a setting where two agents engage in playing a referential game and, from scratch, develop a communication protocol necessary to succeed in this game. Unlike previous work, we require that messages they exchange, both at train and test time, are in the form of a language (i.e. sequences of discrete symbols). We compare a reinforcement learning approach and one using a differentiable relaxation (straight-through Gumbel-softmax estimator) and observe that the latter is much faster to converge and it results in more effective protocols. Interestingly, we also observe that the protocol we induce by optimizing the communication success exhibits a degree of compositionality and variability (i.e. the same information can be phrased in different ways), both properties characteristic of natural languages. As the ultimate goal is to ensure that communication is accomplished in natural language, we also perform experiments where we inject prior information about natural language into our model and study properties of the resulting protocol.
citation-role summary
citation-polarity summary
years
2026 2verdicts
UNVERDICTED 2roles
method 1polarities
use method 1representative citing papers
Heterogeneous visual agents form shared symbols via decentralized Metropolis-Hastings captioning, where encoder similarity shapes the content and symmetry of the resulting language.
citing papers explorer
-
Math Takes Two: A test for emergent mathematical reasoning in communication
Math Takes Two is a new benchmark that tests whether two agents can emergently invent numerical communication to solve visually grounded extrapolation problems without prior mathematical knowledge.
-
Emergent Communication between Heterogeneous Visual Agents through Decentralized Learning
Heterogeneous visual agents form shared symbols via decentralized Metropolis-Hastings captioning, where encoder similarity shapes the content and symmetry of the resulting language.