Title resolution pending

Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, Baining Guo · 2021

4 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries the DOI and OpenAlex lookups on its next pass.

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Rethinking Video Human-Object Interaction: Set Prediction over Time for Unified Detection and Anticipation

cs.CV · 2026-04-12 · unverdicted · novelty 6.0

A pair-centric set-prediction model unifies present HOI detection and multi-horizon anticipation in video by modeling future interactions as residual transitions from current pair states, backed by a temporally corrected benchmark.

Learning Emergent Modular Representations in Multi-modality Medical Vision Foundation Models

cs.CV · 2026-05-21 · unverdicted · novelty 5.0

DEX is a modular network using dynamically activated experts and a group-EMA director to learn emergent modular representations for multi-modality medical vision foundation models, evaluated on a new 4M-image benchmark across 10 modalities and 26 downstream tasks.

Observe Less, Understand More: Cost-aware Cross-scale Observation for Remote Sensing Understanding

cs.CV · 2026-04-13 · unverdicted · novelty 5.0

A unified cost-aware formulation couples fine-grained high-resolution sampling decisions with cross-patch representation prediction to achieve superior performance-cost trade-offs on remote sensing recognition and retrieval tasks using a new 10M-image benchmark.

Case-Aware Medical Image Classification with Multimodal Knowledge Graphs and Reliability-Guided Refinement

cs.CV · 2026-05-21

citing papers explorer

Showing 4 of 4 citing papers.

Rethinking Video Human-Object Interaction: Set Prediction over Time for Unified Detection and Anticipation
cs.CV · 2026-04-12 · unverdicted · none · ref 28

Learning Emergent Modular Representations in Multi-modality Medical Vision Foundation Models
cs.CV · 2026-05-21 · unverdicted · none · ref 69

Observe Less, Understand More: Cost-aware Cross-scale Observation for Remote Sensing Understanding
cs.CV · 2026-04-13 · unverdicted · none · ref 27

Case-Aware Medical Image Classification with Multimodal Knowledge Graphs and Reliability-Guided Refinement
cs.CV · 2026-05-21 · unreviewed · ref 21
