The algonauts project 2023 challenge: How the human brain makes sense of natural scenes

· 2023 · arXiv 2301.03198

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

read on arXiv browse 6 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Brain-IT-VQA: From Brain Signals to Answers

cs.CV · 2026-05-28 · unverdicted · novelty 7.0

Brain-IT-VQA decodes visual question answers from fMRI using a transformer to extract language tokens and introduces the NSD-VQA benchmark with 20 controlled questions per image across 20 categories.

NeuralBench: A Unifying Framework to Benchmark NeuroAI Models

cs.LG · 2026-05-08 · conditional · novelty 7.0

NeuralBench is a new benchmarking framework for neuroAI models on EEG data that finds foundation models only marginally outperform task-specific ones while many tasks like cognitive decoding stay highly challenging.

Gaslight, Gatekeep, V1-V3: Early Visual Cortex Alignment Shields Vision-Language Models from Sycophantic Manipulation

cs.CV · 2026-04-15 · conditional · novelty 7.0

Alignment of vision-language models with human V1-V3 early visual cortex negatively predicts resistance to sycophantic gaslighting attacks.

MIRAGE: Adaptive Multimodal Gating for Whole-Brain fMRI Encoding

cs.LG · 2026-05-28 · unverdicted · novelty 6.0

MIRAGE uses adaptive multimodal gating on native multimodal backbones plus a transformer encoder to achieve state-of-the-art whole-brain fMRI prediction for naturalistic audiovisual stimuli, outperforming post-hoc unimodal aggregation.

AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space

cs.CV · 2024-06-26 · unverdicted · novelty 6.0

AlignedCut uses brain fMRI prediction to create a universal channel alignment across deep networks, revealing recurring channel clusters that correspond to brain regions and produce semantically meaningful object segments from images.

ViBE: Visual-to-M/EEG Brain Encoding via Spatio-Temporal VAE and Distribution-Aligned Projection

cs.CV · 2026-04-29 · unverdicted · novelty 4.0

ViBE generates M/EEG signals from visual stimuli by reconstructing neural responses with a TSC-VAE and aligning CLIP image features to its latent space via Q-Former, MSE, and sliced Wasserstein losses.

citing papers explorer

Showing 4 of 4 citing papers after filters.

Brain-IT-VQA: From Brain Signals to Answers cs.CV · 2026-05-28 · unverdicted · none · ref 48
Brain-IT-VQA decodes visual question answers from fMRI using a transformer to extract language tokens and introduces the NSD-VQA benchmark with 20 controlled questions per image across 20 categories.
MIRAGE: Adaptive Multimodal Gating for Whole-Brain fMRI Encoding cs.LG · 2026-05-28 · unverdicted · none · ref 35
MIRAGE uses adaptive multimodal gating on native multimodal backbones plus a transformer encoder to achieve state-of-the-art whole-brain fMRI prediction for naturalistic audiovisual stimuli, outperforming post-hoc unimodal aggregation.
AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space cs.CV · 2024-06-26 · unverdicted · none · ref 1
AlignedCut uses brain fMRI prediction to create a universal channel alignment across deep networks, revealing recurring channel clusters that correspond to brain regions and produce semantically meaningful object segments from images.
ViBE: Visual-to-M/EEG Brain Encoding via Spatio-Temporal VAE and Distribution-Aligned Projection cs.CV · 2026-04-29 · unverdicted · none · ref 25
ViBE generates M/EEG signals from visual stimuli by reconstructing neural responses with a TSC-VAE and aligning CLIP image features to its latent space via Q-Former, MSE, and sliced Wasserstein losses.

The algonauts project 2023 challenge: How the human brain makes sense of natural scenes

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer