JFAA freezes a JEPA future-prediction model, adds a lightweight probe and ensemble, and wins the 2026 EK-100 action anticipation challenge.
Spatial understand- ing from videos: Structured prompts meet simulation data
3 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.CV 3years
2026 3representative citing papers
A hybrid pipeline of OSGNet candidate generation followed by MLLM reranking secured first place in both the Natural Language Queries and GoalStep tracks of the Ego4D Episodic Memory Challenge.
MARS converts long videos to captions and summaries, maintains modality-specific memories, and deploys an agent to select evidence or answer, placing second on the CASTLE Challenge leaderboard.
citing papers explorer
-
JFAA: Technical Report for the EPIC-KITCHENS-100 Action Anticipation Challenge at EgoVis 2026
JFAA freezes a JEPA future-prediction model, adds a lightweight probe and ensemble, and wins the 2026 EK-100 action anticipation challenge.
-
OSGNet with MLLM Reranking @ Ego4D Episodic Memory Challenge 2026
A hybrid pipeline of OSGNet candidate generation followed by MLLM reranking secured first place in both the Natural Language Queries and GoalStep tracks of the Ego4D Episodic Memory Challenge.
-
MARS: Technical Report for the CASTLE Challenge at EgoVis 2026
MARS converts long videos to captions and summaries, maintains modality-specific memories, and deploys an agent to select evidence or answer, placing second on the CASTLE Challenge leaderboard.