pith. sign in

arxiv: 1706.07365 · v2 · pith:N6MJLRGInew · submitted 2017-06-22 · 💻 cs.CV · cs.LG

Pixels to Graphs by Associative Embedding

classification 💻 cs.CV cs.LG
keywords graphgraphsassociativeimagenetworkobjectssceneabstraction
0
0 comments X
read the original abstract

Graphs are a useful abstraction of image content. Not only can graphs represent details about individual objects in a scene but they can capture the interactions between pairs of objects. We present a method for training a convolutional neural network such that it takes in an input image and produces a full graph definition. This is done end-to-end in a single stage with the use of associative embeddings. The network learns to simultaneously identify all of the elements that make up a graph and piece them together. We benchmark on the Visual Genome dataset, and demonstrate state-of-the-art performance on the challenging task of scene graph generation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Contrastive Multi-Modal Hypergraph Reasoning for 3D Crowd Mesh Recovery

    cs.CV 2026-04 unverdicted novelty 7.0

    Contrastive multi-modal hypergraph reasoning fuses semantic, geometric, and pose cues to achieve state-of-the-art 3D crowd mesh recovery under severe occlusions.