pith. sign in

arxiv: 1208.0967 · v1 · pith:3PX4DKSYnew · submitted 2012-08-04 · 💻 cs.CV

Human Activity Learning using Object Affordances from RGB-D Videos

classification 💻 cs.CV
keywords affordancesobjectactivitiesactivityhumanlabelingproblemsub-activities
0
0 comments X
read the original abstract

Human activities comprise several sub-activities performed in a sequence and involve interactions with various objects. This makes reasoning about the object affordances a central task for activity recognition. In this work, we consider the problem of jointly labeling the object affordances and human activities from RGB-D videos. We frame the problem as a Markov Random Field where the nodes represent objects and sub-activities, and the edges represent the relationships between object affordances, their relations with sub-activities, and their evolution over time. We formulate the learning problem using a structural SVM approach, where labeling over various alternate temporal segmentations are considered as latent variables. We tested our method on a dataset comprising 120 activity videos collected from four subjects, and obtained an end-to-end precision of 81.8% and recall of 80.0% for labeling the activities.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.