Visual Relationship Detection with Language Priors

· 2016 · cs.CV · arXiv 1608.00187

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Visual relationships capture a wide variety of interactions between pairs of objects in images (e.g. "man riding bicycle" and "man pushing bicycle"). Consequently, the set of possible relationships is extremely large and it is difficult to obtain sufficient training examples for all possible relationships. Because of this limitation, previous work on visual relationship detection has concentrated on predicting only a handful of relationships. Though most relationships are infrequent, their objects (e.g. "man" and "bicycle") and predicates (e.g. "riding" and "pushing") independently occur more frequently. We propose a model that uses this insight to train visual models for objects and predicates individually and later combines them together to predict multiple relationships per image. We improve on prior work by leveraging language priors from semantic word embeddings to finetune the likelihood of a predicted relationship. Our model can scale to predict thousands of types of relationships from a few examples. Additionally, we localize the objects in the predicted relationships as bounding boxes in the image. We further demonstrate that understanding relationships can improve content based image retrieval.

representative citing papers

DiagramRAG: A Lightweight Framework to Retrieve Scientific Diagram for Figure Generation

cs.AI · 2026-05-27 · unverdicted · novelty 6.0

DiagramRAG is a retrieval-augmented framework that represents diagrams as knowledge graphs, synthesizes sketch variants, trains an embedding model for structure-aware retrieval, and uses retrieved references to guide sketch-based scientific diagram generation.

citing papers explorer

Showing 1 of 1 citing paper after filters.

DiagramRAG: A Lightweight Framework to Retrieve Scientific Diagram for Figure Generation cs.AI · 2026-05-27 · unverdicted · none · ref 20 · internal anchor
DiagramRAG is a retrieval-augmented framework that represents diagrams as knowledge graphs, synthesizes sketch variants, trains an embedding model for structure-aware retrieval, and uses retrieved references to guide sketch-based scientific diagram generation.

Visual Relationship Detection with Language Priors

fields

years

verdicts

representative citing papers

citing papers explorer