Action genome: Actions as compositions of spatio- temporal scene graphs

Jingwei Ji, Ranjay Krishna, Li Fei-Fei, Juan Carlos Niebles · 2020

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Agentic Video Generation: From Text to Executable Event Graphs via Tool-Constrained LLM Planning

cs.CV · 2026-04-11 · unverdicted · novelty 7.0

An LLM agentic system builds executable GEST specifications via a hierarchical Director-Scene Builder architecture with Relation Subagents, then runs them in a 3D engine, outperforming neural models on physical validity and semantic alignment in human and jury evaluations.

GraphThinker: Reinforcing Temporally Grounded Video Reasoning with Event Graph Thinking

cs.CV · 2026-02-19 · unverdicted · novelty 6.0

GraphThinker reduces temporal hallucinations in video reasoning by constructing event-based scene graphs and applying visual attention rewards in reinforcement finetuning.

citing papers explorer

Showing 2 of 2 citing papers.

Agentic Video Generation: From Text to Executable Event Graphs via Tool-Constrained LLM Planning cs.CV · 2026-04-11 · unverdicted · none · ref 8
An LLM agentic system builds executable GEST specifications via a hierarchical Director-Scene Builder architecture with Relation Subagents, then runs them in a 3D engine, outperforming neural models on physical validity and semantic alignment in human and jury evaluations.
GraphThinker: Reinforcing Temporally Grounded Video Reasoning with Event Graph Thinking cs.CV · 2026-02-19 · unverdicted · none · ref 28
GraphThinker reduces temporal hallucinations in video reasoning by constructing event-based scene graphs and applying visual attention rewards in reinforcement finetuning.

Action genome: Actions as compositions of spatio- temporal scene graphs

fields

years

verdicts

representative citing papers

citing papers explorer