pith. sign in

Visual genome: Connecting language and vision using crowdsourced dense image annotations

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

citation-role summary

dataset 1

citation-polarity summary

fields

cs.CV 1

years

2023 1

verdicts

UNVERDICTED 1

roles

dataset 1

polarities

use dataset 1

representative citing papers

Otter: A Multi-Modal Model with In-Context Instruction Tuning

cs.CV · 2023-05-05 · unverdicted · novelty 6.0

Otter is a multi-modal model instruction-tuned on the MIMIC-IT dataset of over 3 million in-context instruction-response pairs to improve convergence and generalization on tasks with multiple images and videos.

citing papers explorer

Showing 1 of 1 citing paper.

  • Otter: A Multi-Modal Model with In-Context Instruction Tuning cs.CV · 2023-05-05 · unverdicted · none · ref 44

    Otter is a multi-modal model instruction-tuned on the MIMIC-IT dataset of over 3 million in-context instruction-response pairs to improve convergence and generalization on tasks with multiple images and videos.