pith. machine review for the scientific record. sign in

arxiv: 1606.07493 · v5 · submitted 2016-06-23 · 💻 cs.CL · cs.AI· cs.CV· cs.LG

Recognition: unknown

Sort Story: Sorting Jumbled Images and Captions into Stories

Authors on Pith no claims yet
classification 💻 cs.CL cs.AIcs.CVcs.LG
keywords storytaskcommonjumbledsensesorttemporalachieving
0
0 comments X
read the original abstract

Temporal common sense has applications in AI tasks such as QA, multi-document summarization, and human-AI communication. We propose the task of sequencing -- given a jumbled set of aligned image-caption pairs that belong to a story, the task is to sort them such that the output sequence forms a coherent story. We present multiple approaches, via unary (position) and pairwise (order) predictions, and their ensemble-based combinations, achieving strong results on this task. We use both text-based and image-based features, which depict complementary improvements. Using qualitative examples, we demonstrate that our models have learnt interesting aspects of temporal common sense.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.