Visual Affordance and Function Understanding: A Survey

Mohammed Hassanin; Murat Tahtali; Salman Khan

arxiv: 1807.06775 · v1 · pith:ITYH5TCJnew · submitted 2018-07-18 · 💻 cs.CV · cs.RO

Visual Affordance and Function Understanding: A Survey

Mohammed Hassanin , Salman Khan , Murat Tahtali This is my paper

classification 💻 cs.CV cs.RO

keywords affordancesurveyvisualaffordancesfunctionalliteraturerobotsunderstand

0 comments

read the original abstract

Nowadays, robots are dominating the manufacturing, entertainment and healthcare industries. Robot vision aims to equip robots with the ability to discover information, understand it and interact with the environment. These capabilities require an agent to effectively understand object affordances and functionalities in complex visual domains. In this literature survey, we first focus on Visual affordances and summarize the state of the art as well as open problems and research gaps. Specifically, we discuss sub-problems such as affordance detection, categorization, segmentation and high-level reasoning. Furthermore, we cover functional scene understanding and the prevalent functional descriptors used in the literature. The survey also provides necessary background to the problem, sheds light on its significance and highlights the existing challenges for affordance and functionality learning.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

What Objects Enable, Not What They Are: Functional Latent Spaces for Affordance Reasoning
cs.LG 2026-06 unverdicted novelty 6.0

A4D creates functional latent spaces for affordance reasoning, reporting 94% accuracy on known affordances and over 90% on new ones with under 10% training data while enabling 100x faster inference.