Object Perception and Grasping in Open-Ended Domains

S. Hamidreza Kasaei

arxiv: 1907.10932 · v1 · pith:QSZGCK4Snew · submitted 2019-07-25 · 💻 cs.RO

Object Perception and Grasping in Open-Ended Domains

S. Hamidreza Kasaei This is my paper

Pith reviewed 2026-05-24 16:31 UTC · model grok-4.3

classification 💻 cs.RO

keywords open-ended learningobject recognitiongrasp affordancesservice robotsincremental learninginteractive learningvisual perceptioncognitive robotics

0 comments

The pith

Robots need open-ended learning to recognize unknown objects and their grasp affordances as categories and instances arrive gradually over time.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that service robots entering unstructured human environments require open-ended learning for object perception and grasping. This means the robot processes visual data and builds knowledge of object categories and grasp affordances without a predefined set of classes, instead drawing training instances incrementally from its own ongoing experiences and human interactions. The work examines why this matters for autonomy, how incremental learning from experience and interaction can work, the limits of deep learning in such settings, and suitable evaluation metrics. A sympathetic reader would see this as necessary for robots to adapt continuously rather than relying on static training data.

Core claim

An autonomous robot must have the ability to process visual information and conduct learning and recognition tasks in an open-ended fashion where the set of object categories is not known in advance and training instances become gradually available over time rather than being completely available at the beginning of the learning process. This capability, inspired by human ceaseless learning of object categories and grasp affordances, enables adaptation to new environments through accumulation of experiences and conceptualization of new categories.

What carries the argument

Interactive open-ended learning approaches that recognize multiple objects and their grasp affordances concurrently by accumulating experiences incrementally.

If this is right

Robots can adapt to new environments by enhancing knowledge from accumulated experiences rather than requiring all data upfront.
Robots can learn incrementally from their own experiences as well as from direct interaction with humans.
Deep learning approaches have specific limitations when applied in an open-ended manner with gradually available data.
Open-ended learning approaches require dedicated evaluation methods and metrics distinct from standard batch learning benchmarks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Such systems would allow service robots to operate long-term in homes or offices where novel items continue to appear without retraining from scratch.
Evaluation protocols might need to incorporate lifelong interaction logs rather than isolated test sets to measure adaptation over extended periods.
Alternative learning paradigms beyond current deep networks could become necessary if incremental updates prove unstable in practice.

Load-bearing premise

Cognitive science observations of how humans learn object categories and grasp affordances ceaselessly translate into a direct requirement that robots must use the same incremental, experience-driven process.

What would settle it

A controlled test in which a robot using non-incremental batch learning on a fixed dataset maintains or exceeds performance on new object categories that appear gradually through online robot experiences.

Figures

Figures reproduced from arXiv: 1907.10932 by S. Hamidreza Kasaei.

**Figure 2.** Figure 2: Overview of the proposed OrthographicNet. B. Grasp Affordance Learning and Recognition Robots are still not able to grasp all unforeseen objects and finding a proper grasp configuration, i.e. the position and orientation of the arm relative to the object, is still challenging [8, 23, 12]. One approach for grasping unforeseen objects is to recognize an appropriate grasp configuration from previous grasp dem… view at source ↗

read the original abstract

Nowadays service robots are leaving the structured and completely known environments and entering human-centric settings. For these robots, object perception and grasping are two challenging tasks due to the high demand for accurate and real-time responses. Although many problems have already been understood and solved successfully, many challenges still remain. Open-ended learning is one of these challenges waiting for many improvements. Cognitive science revealed that humans learn to recognize object categories and grasp affordances ceaselessly over time. This ability allows adapting to new environments by enhancing their knowledge from the accumulation of experiences and the conceptualization of new object categories. Inspired by this, an autonomous robot must have the ability to process visual information and conduct learning and recognition tasks in an open-ended fashion. In this context, "open-ended" implies that the set of object categories to be learned is not known in advance, and the training instances are extracted from online experiences of a robot, and become gradually available over time, rather than being completely available at the beginning of the learning process. In my research, I mainly focus on interactive open-ended learning approaches to recognize multiple objects and their grasp affordances concurrently. In particular, I try to address the following research questions: (i) What is the importance of open-ended learning for autonomous robots? (ii) How robots could learn incrementally from their own experiences as well as from interaction with humans? (iii) What are the limitations of Deep Learning approaches to be used in an open-ended manner? (iv) How to evaluate open-ended learning approaches and what are the right metrics to do so?

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This is a research proposal that states open questions on incremental object learning for robots but contains no new methods, experiments, or results.

read the letter

This document is a research proposal that poses four questions about open-ended object perception and grasping for service robots. It draws from cognitive science to argue that robots need to learn categories and grasp affordances incrementally as new experiences arrive online, rather than from a fixed upfront dataset. The questions cover why this matters, how to learn from robot experience plus human interaction, the limits of deep learning in open-ended settings, and what metrics make sense for evaluation. That framing is clear and directly tied to practical needs in unstructured human environments. The text does a reasonable job laying out the motivation without overclaiming. The main limitation is that it stops at identifying the research focus. No algorithms are described, no data or experiments are shown, and no comparisons to existing incremental learning work appear. The premise that robots must mirror human ceaseless learning is stated as inspiration rather than demonstrated or tested against other paradigms. This kind of outline is mainly useful for researchers planning long-term projects in lifelong robot learning or for groups reviewing PhD proposals. It could prompt a reading group discussion on open problems in the area. It does not have the technical content or evidence needed for peer review as a research paper; it reads as an extended abstract or thesis summary instead.

Referee Report

2 major / 0 minor

Summary. The manuscript is a research statement outlining the author's focus on interactive open-ended learning for concurrent object recognition and grasp affordance prediction in autonomous robots. It draws motivation from cognitive science findings on human incremental learning, asserts that robots must process visual information and learn in an open-ended manner (where object categories are unknown in advance and training data arrives gradually from online experiences), and lists four research questions to be addressed: the importance of open-ended learning, incremental learning from robot experiences and human interaction, limitations of deep learning in open-ended settings, and appropriate evaluation metrics.

Significance. The topic of open-ended robotic learning is relevant to service robotics in unstructured human environments. However, the manuscript contains no methods, algorithms, experiments, derivations, or results. If the posed questions were later answered with reproducible implementations and evaluations, the work could contribute to robotics; as presented, it offers no assessable advance.

major comments (2)

[Abstract] Abstract and research questions section: The manuscript poses four open research questions but provides no technical approach, algorithm, dataset, or evaluation to address any of them. This absence means the document functions as a statement of intent rather than a completed study with load-bearing claims or evidence.
[Abstract] Abstract: The assertion that 'an autonomous robot must have the ability to process visual information and conduct learning and recognition tasks in an open-ended fashion' is presented as a premise without supporting argument, comparison to alternative paradigms, or empirical grounding within the manuscript.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their review. This manuscript is a research statement that outlines a research agenda and poses open questions on interactive open-ended learning for robot object perception and grasping, motivated by cognitive science. We respond point by point to the major comments.

read point-by-point responses

Referee: [Abstract] Abstract and research questions section: The manuscript poses four open research questions but provides no technical approach, algorithm, dataset, or evaluation to address any of them. This absence means the document functions as a statement of intent rather than a completed study with load-bearing claims or evidence.

Authors: We agree that the manuscript contains no new algorithms, datasets, or experimental results. It is intentionally structured as a research statement to define the problem space and research questions rather than to present a completed empirical study. The contribution is in framing the open-ended learning challenge for service robots based on cognitive science insights and identifying directions for future work. revision: no
Referee: [Abstract] Abstract: The assertion that 'an autonomous robot must have the ability to process visual information and conduct learning and recognition tasks in an open-ended fashion' is presented as a premise without supporting argument, comparison to alternative paradigms, or empirical grounding within the manuscript.

Authors: The premise follows directly from the preceding sentences that reference cognitive science findings on human incremental learning of categories and affordances over time. The text contrasts this with the standard robotics assumption of complete upfront training sets. While the manuscript does not introduce new empirical comparisons, the argument is grounded in the cited cognitive science motivation. revision: no

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

This is a research proposal that poses four open research questions rather than presenting any derivation, equations, predictions, or fitted quantities. The central premise that robots must adopt incremental open-ended learning is stated as an inspiration drawn from cognitive science, not derived or fitted within the document. No self-citations, ansatzes, or renamings of results appear as load-bearing steps. The paper is self-contained as a statement of research intent with no internal reduction of claims to their own inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No free parameters, axioms, or invented entities are introduced because the text is a research agenda summary without technical derivations.

pith-pipeline@v0.9.0 · 5807 in / 964 out tokens · 18577 ms · 2026-05-24T16:31:37.581848+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

an autonomous robot must have the ability to process visual information and conduct learning and recognition tasks in an open-ended fashion where the set of object categories is not known in advance
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

OrthographicNet generates a rotation and scale invariant global feature... instance-based learning and a nearest neighbor classification rule

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

27 extracted references · 27 canonical work pages · 1 internal anchor

[1]

Robotic roommates making pancakes

Michael Beetz, Ulrich Klank, Ingo Kresse, Alexis Maldonado, L Mosenlechner, Dejan Pangercic, T Ruhr, and Moritz Tenorth. Robotic roommates making pancakes. In Humanoid Robots (Humanoids), 2011 11th IEEE-RAS International Conference on, pages 529–536. IEEE, 2011

work page 2011
[2]

Using simulation and domain adaptation to improve efﬁciency of deep robotic grasping

Konstantinos Bousmalis, Alex Irpan, Paul Wohlhart, Yunfei Bai, Matthew Kelcey, Mrinal Kalakrishnan, Laura Downs, Julian Ibarz, Peter Pastor, Kurt Konolige, et al. Using simulation and domain adaptation to improve efﬁciency of deep robotic grasping. In 2018 IEEE International Conference on Robotics and Automation (ICRA) , pages 4243–4250. IEEE, 2018

work page 2018
[3]

Imagenet: A large-scale hierarchical image database

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. 2009

work page 2009
[4]

Orthographicnet: A deep learning ap- proach for 3D object recognition in open-ended domains

S Hamidreza Kasaei. Orthographicnet: A deep learning ap- proach for 3D object recognition in open-ended domains. arXiv preprint arXiv:1902.03057, 2019

work page arXiv 1902
[5]

An adaptive ob- ject perception system based on environment exploration and bayesian learning

S Hamidreza Kasaei, Miguel Oliveira, Gi Hyun Lim, Luís Seabra Lopes, and Ana Maria Tomé. An adaptive ob- ject perception system based on environment exploration and bayesian learning. In 2015 IEEE International Conference on Autonomous Robot Systems and Competitions , pages 221–226. IEEE, 2015

work page 2015
[6]

Interactive open-ended learning for 3D object recognition: An approach and experi- ments

S Hamidreza Kasaei, Miguel Oliveira, Gi Hyun Lim, Luís Seabra Lopes, and Ana Maria Tomé. Interactive open-ended learning for 3D object recognition: An approach and experi- ments. Journal of Intelligent & Robotic Systems , 80(3-4):537– 553, 2015

work page 2015
[7]

An orthographic descriptor for 3d object learning and recognition

S Hamidreza Kasaei, Luís Seabra Lopes, Ana Maria Tomé, and Miguel Oliveira. An orthographic descriptor for 3d object learning and recognition. In 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) , pages 4158–4163. IEEE, 2016

work page 2016
[8]

Object learning and grasping capabilities for robotic home assistants

S Hamidreza Kasaei, Nima Shaﬁi, Luís Seabra Lopes, and Ana Maria Tomé. Object learning and grasping capabilities for robotic home assistants. In LectureNotes in Computer Science , volume 9776. Springer, 2016

work page 2016
[9]

GOOD: A global orthographic object descriptor for 3D object recognition and manipulation

S Hamidreza Kasaei, Ana Maria Tomé, Luís Seabra Lopes, and Miguel Oliveira. GOOD: A global orthographic object descriptor for 3D object recognition and manipulation. Pattern Recognition Letters, 2016

work page 2016
[10]

Coping with context change in open-ended object recognition without explicit context information

S Hamidreza Kasaei, Luís Seabra Lopes, and Ana Maria Tomé. Coping with context change in open-ended object recognition without explicit context information. In 2018 IEEE/RSJ Inter- national Conference on Intelligent Robots and Systems (IROS) . IEEE, 2018

work page 2018
[11]

Towards lifelong assis- tive robotics: A tight coupling between object perception and manipulation

S Hamidreza Kasaei, Miguel Oliveira, Gi Hyun Lim, Luís Seabra Lopes, and Ana Maria Tomé. Towards lifelong assis- tive robotics: A tight coupling between object perception and manipulation. Neurocomputing, 291:151–166, 2018

work page 2018
[12]

Perceiving, learning, and recognizing 3d objects: An approach to cognitive service robots

S Hamidreza Kasaei, Juil Sock, Luis Seabra Lopes, Ana Maria Tomé, and Tae-Kyun Kim. Perceiving, learning, and recognizing 3d objects: An approach to cognitive service robots. In Thirty- Second AAAI Conference on Artiﬁcial Intelligence , 2018

work page 2018
[13]

Interactive open-ended object, affordance and grasp learning for robotic manipulation

S Hamidreza Kasaei, Nima Shaﬁi, Luís Seabra Lopes, and Ana Maria Tomé. Interactive open-ended object, affordance and grasp learning for robotic manipulation. In 2019 IEEE/RSJ International Conference on Robotics and Automation (ICRA) . IEEE, 2019

work page 2019
[14]

Hierarchical object representation for open-ended object category learning and recognition

Seyed Hamidreza Kasaei, Ana Maria Tomé, and Luís Seabra Lopes. Hierarchical object representation for open-ended object category learning and recognition. In Advances in Neural Information Processing Systems , pages 1948–1956, 2016

work page 1948
[15]

Learning hand-eye coordination for robotic grasping with large-scale data collection

Sergey Levine, Peter Pastor, Alex Krizhevsky, and Deirdre Quillen. Learning hand-eye coordination for robotic grasping with large-scale data collection. In International symposium on experimental robotics, pages 173–184. Springer, 2016

work page 2016
[16]

Interactive teaching and experience extraction for learn- ing about objects and robot activities

Gi Hyun Lim, Miguel Oliveira, Vahid Mokhtari, S Hamidreza Kasaei, Aneesh Chauhan, Luís Seabra Lopes, and Ana Maria Tomé. Interactive teaching and experience extraction for learn- ing about objects and robot activities. In The 23rd IEEE International Symposium on Robot and Human Interactive Communication, pages 153–160. IEEE, 2014

work page 2014
[17]

Hierarchical nearest neighbor graphs for building perceptual hierarchies

Gi Hyun Lim, Miguel Oliveira, S Hamidreza Kasaei, and Luís Seabra Lopes. Hierarchical nearest neighbor graphs for building perceptual hierarchies. In International Conference on Neural Information Processing, pages 646–655. Springer, 2015

work page 2015
[18]

Dex-Net 2.0: Deep Learning to Plan Robust Grasps with Synthetic Point Clouds and Analytic Grasp Metrics

Jeffrey Mahler, Jacky Liang, Sherdil Niyaz, Michael Laskey, Richard Doan, Xinyu Liu, Juan Aparicio Ojea, and Ken Gold- berg. Dex-net 2.0: Deep learning to plan robust grasps with synthetic point clouds and analytic grasp metrics. arXiv preprint arXiv:1703.09312, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[19]

A percep- tual memory system for grounding semantic representations in intelligent service robots

Miguel Oliveira, Gi Hyun Lim, Luís Seabra Lopes, S Hamidreza Kasaei, Ana Maria Tomé, and Aneesh Chauhan. A percep- tual memory system for grounding semantic representations in intelligent service robots. In 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems , pages 2216–

work page 2014
[20]

Concurrent learning of visual codebooks and object categories in open- ended domains

Miguel Oliveira, Luís Seabra Lopes, Gi Hyun Lim, S Hamidreza Kasaei, Angel D Sappa, and Ana Maria Tomé. Concurrent learning of visual codebooks and object categories in open- ended domains. In Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on, pages 2488–2495. IEEE, 2015

work page 2015
[21]

3D object perception and perceptual learning in the race project

Miguel Oliveira, Luís Seabra Lopes, Gi Hyun Lim, S Hamidreza Kasaei, Ana Maria Tomé, and Aneesh Chauhan. 3D object perception and perceptual learning in the race project. Robotics and Autonomous Systems , 75:614–626, 2016

work page 2016
[22]

You only look once: Uniﬁed, real-time object detection

Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. You only look once: Uniﬁed, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 779–788, 2016

work page 2016
[23]

Learning to grasp familiar objects using object view recognition and template matching

Nima Shaﬁi, S Hamidreza Kasaei, and Luís Seabra Lopes. Learning to grasp familiar objects using object view recognition and template matching. In Intelligent Robots and Systems (IROS), 2016 IEEE/RSJ International Conference on , pages 2895–2900. IEEE, 2016

work page 2016
[24]

Hamidreza Kasaei, Luis Seabra Lopes, and Tae- Kyun Kim

Juil Sock, S. Hamidreza Kasaei, Luis Seabra Lopes, and Tae- Kyun Kim. Multi-view 6d object pose estimation and camera motion planning using rgbd images. In The IEEE International Conference on Computer Vision (ICCV) Workshops , Oct 2017

work page 2017
[25]

HERB: a home exploring robotic butler

Siddhartha S Srinivasa, Dave Ferguson, Casey J Helfrich, Dmitry Berenson, Alvaro Collet, Rosen Diankov, Garratt Gal- lagher, Geoffrey Hollinger, James Kuffner, and Michael Vande Weghe. HERB: a home exploring robotic butler. Autonomous Robots, 28(1):5–20, 2010

work page 2010
[26]

Walk-man: A high-performance humanoid platform for realistic environments

Nikos G Tsagarakis, Darwin G Caldwell, F Negrello, W Choi, L Baccelliere, VG Loc, J Noorden, L Muratore, A Margan, A Cardellino, et al. Walk-man: A high-performance humanoid platform for realistic environments. Journal of Field Robotics , 34(7):1225–1259, 2017

work page 2017
[27]

Integrated grasp and motion planning

Niko Vahrenkamp, Martin Do, Tamim Asfour, and Rüdiger Dillmann. Integrated grasp and motion planning. In Robotics and Automation (ICRA), 2010 IEEE International Conference on, pages 2883–2888. IEEE, 2010

work page 2010

[1] [1]

Robotic roommates making pancakes

Michael Beetz, Ulrich Klank, Ingo Kresse, Alexis Maldonado, L Mosenlechner, Dejan Pangercic, T Ruhr, and Moritz Tenorth. Robotic roommates making pancakes. In Humanoid Robots (Humanoids), 2011 11th IEEE-RAS International Conference on, pages 529–536. IEEE, 2011

work page 2011

[2] [2]

Using simulation and domain adaptation to improve efﬁciency of deep robotic grasping

Konstantinos Bousmalis, Alex Irpan, Paul Wohlhart, Yunfei Bai, Matthew Kelcey, Mrinal Kalakrishnan, Laura Downs, Julian Ibarz, Peter Pastor, Kurt Konolige, et al. Using simulation and domain adaptation to improve efﬁciency of deep robotic grasping. In 2018 IEEE International Conference on Robotics and Automation (ICRA) , pages 4243–4250. IEEE, 2018

work page 2018

[3] [3]

Imagenet: A large-scale hierarchical image database

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. 2009

work page 2009

[4] [4]

Orthographicnet: A deep learning ap- proach for 3D object recognition in open-ended domains

S Hamidreza Kasaei. Orthographicnet: A deep learning ap- proach for 3D object recognition in open-ended domains. arXiv preprint arXiv:1902.03057, 2019

work page arXiv 1902

[5] [5]

An adaptive ob- ject perception system based on environment exploration and bayesian learning

S Hamidreza Kasaei, Miguel Oliveira, Gi Hyun Lim, Luís Seabra Lopes, and Ana Maria Tomé. An adaptive ob- ject perception system based on environment exploration and bayesian learning. In 2015 IEEE International Conference on Autonomous Robot Systems and Competitions , pages 221–226. IEEE, 2015

work page 2015

[6] [6]

Interactive open-ended learning for 3D object recognition: An approach and experi- ments

S Hamidreza Kasaei, Miguel Oliveira, Gi Hyun Lim, Luís Seabra Lopes, and Ana Maria Tomé. Interactive open-ended learning for 3D object recognition: An approach and experi- ments. Journal of Intelligent & Robotic Systems , 80(3-4):537– 553, 2015

work page 2015

[7] [7]

An orthographic descriptor for 3d object learning and recognition

S Hamidreza Kasaei, Luís Seabra Lopes, Ana Maria Tomé, and Miguel Oliveira. An orthographic descriptor for 3d object learning and recognition. In 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) , pages 4158–4163. IEEE, 2016

work page 2016

[8] [8]

Object learning and grasping capabilities for robotic home assistants

S Hamidreza Kasaei, Nima Shaﬁi, Luís Seabra Lopes, and Ana Maria Tomé. Object learning and grasping capabilities for robotic home assistants. In LectureNotes in Computer Science , volume 9776. Springer, 2016

work page 2016

[9] [9]

GOOD: A global orthographic object descriptor for 3D object recognition and manipulation

S Hamidreza Kasaei, Ana Maria Tomé, Luís Seabra Lopes, and Miguel Oliveira. GOOD: A global orthographic object descriptor for 3D object recognition and manipulation. Pattern Recognition Letters, 2016

work page 2016

[10] [10]

Coping with context change in open-ended object recognition without explicit context information

S Hamidreza Kasaei, Luís Seabra Lopes, and Ana Maria Tomé. Coping with context change in open-ended object recognition without explicit context information. In 2018 IEEE/RSJ Inter- national Conference on Intelligent Robots and Systems (IROS) . IEEE, 2018

work page 2018

[11] [11]

Towards lifelong assis- tive robotics: A tight coupling between object perception and manipulation

S Hamidreza Kasaei, Miguel Oliveira, Gi Hyun Lim, Luís Seabra Lopes, and Ana Maria Tomé. Towards lifelong assis- tive robotics: A tight coupling between object perception and manipulation. Neurocomputing, 291:151–166, 2018

work page 2018

[12] [12]

Perceiving, learning, and recognizing 3d objects: An approach to cognitive service robots

S Hamidreza Kasaei, Juil Sock, Luis Seabra Lopes, Ana Maria Tomé, and Tae-Kyun Kim. Perceiving, learning, and recognizing 3d objects: An approach to cognitive service robots. In Thirty- Second AAAI Conference on Artiﬁcial Intelligence , 2018

work page 2018

[13] [13]

Interactive open-ended object, affordance and grasp learning for robotic manipulation

S Hamidreza Kasaei, Nima Shaﬁi, Luís Seabra Lopes, and Ana Maria Tomé. Interactive open-ended object, affordance and grasp learning for robotic manipulation. In 2019 IEEE/RSJ International Conference on Robotics and Automation (ICRA) . IEEE, 2019

work page 2019

[14] [14]

Hierarchical object representation for open-ended object category learning and recognition

Seyed Hamidreza Kasaei, Ana Maria Tomé, and Luís Seabra Lopes. Hierarchical object representation for open-ended object category learning and recognition. In Advances in Neural Information Processing Systems , pages 1948–1956, 2016

work page 1948

[15] [15]

Learning hand-eye coordination for robotic grasping with large-scale data collection

Sergey Levine, Peter Pastor, Alex Krizhevsky, and Deirdre Quillen. Learning hand-eye coordination for robotic grasping with large-scale data collection. In International symposium on experimental robotics, pages 173–184. Springer, 2016

work page 2016

[16] [16]

Interactive teaching and experience extraction for learn- ing about objects and robot activities

Gi Hyun Lim, Miguel Oliveira, Vahid Mokhtari, S Hamidreza Kasaei, Aneesh Chauhan, Luís Seabra Lopes, and Ana Maria Tomé. Interactive teaching and experience extraction for learn- ing about objects and robot activities. In The 23rd IEEE International Symposium on Robot and Human Interactive Communication, pages 153–160. IEEE, 2014

work page 2014

[17] [17]

Hierarchical nearest neighbor graphs for building perceptual hierarchies

Gi Hyun Lim, Miguel Oliveira, S Hamidreza Kasaei, and Luís Seabra Lopes. Hierarchical nearest neighbor graphs for building perceptual hierarchies. In International Conference on Neural Information Processing, pages 646–655. Springer, 2015

work page 2015

[18] [18]

Dex-Net 2.0: Deep Learning to Plan Robust Grasps with Synthetic Point Clouds and Analytic Grasp Metrics

Jeffrey Mahler, Jacky Liang, Sherdil Niyaz, Michael Laskey, Richard Doan, Xinyu Liu, Juan Aparicio Ojea, and Ken Gold- berg. Dex-net 2.0: Deep learning to plan robust grasps with synthetic point clouds and analytic grasp metrics. arXiv preprint arXiv:1703.09312, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[19] [19]

A percep- tual memory system for grounding semantic representations in intelligent service robots

Miguel Oliveira, Gi Hyun Lim, Luís Seabra Lopes, S Hamidreza Kasaei, Ana Maria Tomé, and Aneesh Chauhan. A percep- tual memory system for grounding semantic representations in intelligent service robots. In 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems , pages 2216–

work page 2014

[20] [20]

Concurrent learning of visual codebooks and object categories in open- ended domains

Miguel Oliveira, Luís Seabra Lopes, Gi Hyun Lim, S Hamidreza Kasaei, Angel D Sappa, and Ana Maria Tomé. Concurrent learning of visual codebooks and object categories in open- ended domains. In Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on, pages 2488–2495. IEEE, 2015

work page 2015

[21] [21]

3D object perception and perceptual learning in the race project

Miguel Oliveira, Luís Seabra Lopes, Gi Hyun Lim, S Hamidreza Kasaei, Ana Maria Tomé, and Aneesh Chauhan. 3D object perception and perceptual learning in the race project. Robotics and Autonomous Systems , 75:614–626, 2016

work page 2016

[22] [22]

You only look once: Uniﬁed, real-time object detection

Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. You only look once: Uniﬁed, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 779–788, 2016

work page 2016

[23] [23]

Learning to grasp familiar objects using object view recognition and template matching

Nima Shaﬁi, S Hamidreza Kasaei, and Luís Seabra Lopes. Learning to grasp familiar objects using object view recognition and template matching. In Intelligent Robots and Systems (IROS), 2016 IEEE/RSJ International Conference on , pages 2895–2900. IEEE, 2016

work page 2016

[24] [24]

Hamidreza Kasaei, Luis Seabra Lopes, and Tae- Kyun Kim

Juil Sock, S. Hamidreza Kasaei, Luis Seabra Lopes, and Tae- Kyun Kim. Multi-view 6d object pose estimation and camera motion planning using rgbd images. In The IEEE International Conference on Computer Vision (ICCV) Workshops , Oct 2017

work page 2017

[25] [25]

HERB: a home exploring robotic butler

Siddhartha S Srinivasa, Dave Ferguson, Casey J Helfrich, Dmitry Berenson, Alvaro Collet, Rosen Diankov, Garratt Gal- lagher, Geoffrey Hollinger, James Kuffner, and Michael Vande Weghe. HERB: a home exploring robotic butler. Autonomous Robots, 28(1):5–20, 2010

work page 2010

[26] [26]

Walk-man: A high-performance humanoid platform for realistic environments

Nikos G Tsagarakis, Darwin G Caldwell, F Negrello, W Choi, L Baccelliere, VG Loc, J Noorden, L Muratore, A Margan, A Cardellino, et al. Walk-man: A high-performance humanoid platform for realistic environments. Journal of Field Robotics , 34(7):1225–1259, 2017

work page 2017

[27] [27]

Integrated grasp and motion planning

Niko Vahrenkamp, Martin Do, Tamim Asfour, and Rüdiger Dillmann. Integrated grasp and motion planning. In Robotics and Automation (ICRA), 2010 IEEE International Conference on, pages 2883–2888. IEEE, 2010

work page 2010