Towards Multi-Agent Communication-Based Language Learning
read the original abstract
We propose an interactive multimodal framework for language learning. Instead of being passively exposed to large amounts of natural text, our learners (implemented as feed-forward neural networks) engage in cooperative referential games starting from a tabula rasa setup, and thus develop their own language from the need to communicate in order to succeed at the game. Preliminary experiments provide promising results, but also suggest that it is important to ensure that agents trained in this way do not develop an adhoc communication code only effective for the game they are playing
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Aligned but Not Partner-Specific: Distinguishing How Multimodal LLM Agents Succeed in Reference Games Without Human-Like Conventions
Multimodal LLMs coordinate in reference games through high label overlap that does not depend on specific partner history, succeeding via verbose descriptions rather than compact conventions.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.