
ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

4 Pith papers cite this work. Polarity classification is still indexing.


Fields: cs.CV (4)

Years: 2026 (3), 2025 (1)

Verdicts: unverdicted (4)


representative citing papers

SceneParser: Hierarchical Scene Parsing for Visual Semantics Understanding

cs.CV · 2026-05-14 · unverdicted · novelty 6.0

SceneParser introduces hierarchical scene parsing as object-part-affordance chains, a VLM trained with pseudo labels and curriculum learning, and SceneParser-Bench with 1.74M affordance annotations, showing better structure-aware results than existing MLLMs.
