Pretrained Transformers as universal computation engines

Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch · 2021 · arXiv 2103.05247

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Perceiver IO: A General Architecture for Structured Inputs & Outputs

cs.LG · 2021-07-30 · unverdicted · novelty 7.0

Perceiver IO is a general architecture that processes arbitrary structured inputs and outputs with linear scaling and achieves strong results on GLUE, Sintel optical flow, multi-task reasoning, and StarCraft II without task-specific components.

PaLM-E: An Embodied Multimodal Language Model

cs.LG · 2023-03-06 · conditional · novelty 6.0

PaLM-E is a single 562B-parameter multimodal model that performs embodied reasoning tasks like robotic manipulation planning and visual question answering by interleaving vision, state, and text inputs with positive transfer from joint training on language and robotics data.

The Platonic Representation Hypothesis

cs.LG · 2024-05-13 · unverdicted · novelty 5.0

Representations learned by large AI models are converging toward a shared statistical model of reality.

citing papers explorer

Showing 3 of 3 citing papers.

Perceiver IO: A General Architecture for Structured Inputs & Outputs cs.LG · 2021-07-30 · unverdicted · none · ref 49
Perceiver IO is a general architecture that processes arbitrary structured inputs and outputs with linear scaling and achieves strong results on GLUE, Sintel optical flow, multi-task reasoning, and StarCraft II without task-specific components.
PaLM-E: An Embodied Multimodal Language Model cs.LG · 2023-03-06 · conditional · none · ref 24
PaLM-E is a single 562B-parameter multimodal model that performs embodied reasoning tasks like robotic manipulation planning and visual question answering by interleaving vision, state, and text inputs with positive transfer from joint training on language and robotics data.
The Platonic Representation Hypothesis cs.LG · 2024-05-13 · unverdicted · none · ref 279
Representations learned by large AI models are converging toward a shared statistical model of reality.

Pretrained Transformers as universal computation engines

fields

years

verdicts

representative citing papers

citing papers explorer