Brain Captioning: Decoding human brain activity into images and text, May 2023

Ferrante, M · 2023 · arXiv 2305.11560

7 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Mind-Omni: A Unified Multi-Task Framework for Brain-Vision-Language Modeling via Discrete Diffusion

cs.AI · 2026-05-28 · unverdicted · novelty 7.0

Mind-Omni unifies seven brain-vision-language tasks in one discrete-diffusion framework with a brain tokenizer and a new BQA dataset, claiming SOTA multi-task performance competitive with larger single-task models.

Brain-IT-VQA: From Brain Signals to Answers

cs.CV · 2026-05-28 · unverdicted · novelty 7.0

Brain-IT-VQA decodes visual question answers from fMRI using a transformer to extract language tokens and introduces the NSD-VQA benchmark with 20 controlled questions per image across 20 categories.

NeuroFlow: Toward Unified Visual Encoding and Decoding from Neural Activity

cs.LG · 2026-04-10 · unverdicted · novelty 7.0

NeuroFlow is presented as the first unified flow model for bidirectional visual encoding and decoding from neural activity, built on NeuroVAE and cross-modal flow matching.

MIRAGE: Robust multi-modal architectures translate fMRI-to-image models from vision to mental imagery

q-bio.NC · 2026-05-16 · unverdicted · novelty 6.0

MIRAGE achieves state-of-the-art mental image reconstruction from fMRI on the NSD-Imagery benchmark by using a linear backbone with multi-modal text and image features fed to a diffusion model.

Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding

cs.LG · 2026-04-09 · unverdicted · novelty 6.0

A meta-optimized in-context learning approach enables training-free cross-subject semantic visual decoding from fMRI by inferring individual neural encoding patterns via hierarchical inference on a few examples.

BrainJanus: A Unified Model for Understanding and Generation across Brain, Vision, and Language

cs.CV · 2026-06-29 · unverdicted · novelty 5.0

BrainJanus presents a unified autoregressive model with a brain tokenizer that maps between neural activity, vision, and language for encoding and decoding tasks.

FPED: A Functional-Network Prior-Guided Mixture-of-Experts Framework for Interpretable Brain Decoding

cs.CV · 2026-05-19 · unverdicted · novelty 5.0

FPED is a functional-network prior-guided MoE framework for fMRI visual reconstruction that claims competitive performance at 0.68B parameters and biologically meaningful routing interpretability.

citing papers explorer

Showing 7 of 7 citing papers after filters.

Mind-Omni: A Unified Multi-Task Framework for Brain-Vision-Language Modeling via Discrete Diffusion

cs.AI · 2026-05-28 · unverdicted · none · ref 22

Brain-IT-VQA: From Brain Signals to Answers

cs.CV · 2026-05-28 · unverdicted · none · ref 16

NeuroFlow: Toward Unified Visual Encoding and Decoding from Neural Activity

cs.LG · 2026-04-10 · unverdicted · none · ref 14

MIRAGE: Robust multi-modal architectures translate fMRI-to-image models from vision to mental imagery

q-bio.NC · 2026-05-16 · unverdicted · none · ref 74

Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding

cs.LG · 2026-04-09 · unverdicted · none · ref 28

BrainJanus: A Unified Model for Understanding and Generation across Brain, Vision, and Language

cs.CV · 2026-06-29 · unverdicted · none · ref 8

FPED: A Functional-Network Prior-Guided Mixture-of-Experts Framework for Interpretable Brain Decoding

cs.CV · 2026-05-19 · unverdicted · none · ref 6
