A hybrid pipeline of OSGNet candidate generation followed by MLLM reranking secured first place in both the Natural Language Queries and GoalStep tracks of the Ego4D Episodic Memory Challenge.
Exo2ego: Exocentric knowledge guided mllm for egocentric video understanding
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2years
2026 2representative citing papers
MARS converts long videos to captions and summaries, maintains modality-specific memories, and deploys an agent to select evidence or answer, placing second on the CASTLE Challenge leaderboard.
citing papers explorer
-
OSGNet with MLLM Reranking @ Ego4D Episodic Memory Challenge 2026
A hybrid pipeline of OSGNet candidate generation followed by MLLM reranking secured first place in both the Natural Language Queries and GoalStep tracks of the Ego4D Episodic Memory Challenge.
-
MARS: Technical Report for the CASTLE Challenge at EgoVis 2026
MARS converts long videos to captions and summaries, maintains modality-specific memories, and deploys an agent to select evidence or answer, placing second on the CASTLE Challenge leaderboard.