pith. machine review for the scientific record.

MERLOT: Multimodal Neural Script Knowledge Models

1 Pith paper cites this work. Polarity classification is still being indexed.

1 Pith paper citing it

fields

cs.CV 1

years

2023 1

verdicts

CONDITIONAL 1

representative citing papers

VideoChat: Chat-Centric Video Understanding

cs.CV · 2023-05-10 · conditional · novelty 7.0

VideoChat integrates video models and LLMs via a learnable interface for chat-based spatiotemporal and causal video reasoning, trained on a new video-centric instruction dataset.

citing papers explorer

Showing 1 of 1 citing paper.

  • VideoChat: Chat-Centric Video Understanding cs.CV · 2023-05-10 · conditional · none · ref 55
