How agents see things: On visual representations in an emergent language game

Diane Bouchacourt; Marco Baroni

arxiv: 1808.10696 · v2 · pith:2XHKQ4FBnew · submitted 2018-08-31 · 💻 cs.CL · cs.LG

How agents see things: On visual representations in an emergent language game

Diane Bouchacourt , Marco Baroni This is my paper

classification 💻 cs.CL cs.LG

keywords agentsvisualrepresentationscommunicationinputlanguagealignalmost

0 comments

read the original abstract

There is growing interest in the language developed by agents interacting in emergent-communication settings. Earlier studies have focused on the agents' symbol usage, rather than on their representation of visual input. In this paper, we consider the referential games of Lazaridou et al. (2017) and investigate the representations the agents develop during their evolving interaction. We find that the agents establish successful communication by inducing visual representations that almost perfectly align with each other, but, surprisingly, do not capture the conceptual properties of the objects depicted in the input images. We conclude that, if we are interested in developing language-like communication systems, we must pay more attention to the visual semantics agents associate to the symbols they use.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Lexical Consensus: Grounded Word Learning and Shared Meaning in Artificial Agents
cs.CL 2026-06 unverdicted novelty 7.0

Agents acquire lexical labels for visual concepts following a perceptual coherence gradient where perceptual distance predicts learning accuracy independently of semantic distance in a pre-registered CIFAR-100 experiment.