3D Deep Object Recognition and Semantic Understanding for Visually-Guided Robotic Service

Autor:	Ahmed M. Naguib, Sukhan Lee, Naeem Ul Islam
Rok vydání:	2018
Předmět:	Feature engineering Hierarchy (mathematics) business.industry Computer science Deep learning Feature extraction Cognitive neuroscience of visual object recognition 020206 networking & telecommunications 02 engineering and technology Ontology (information science) Object (computer science) Machine learning computer.software_genre 0202 electrical engineering electronic engineering information engineering Dependability 020201 artificial intelligence & image processing Artificial intelligence business computer
Zdroj:	IROS
DOI:	10.1109/iros.2018.8593985
Popis:	For the success of visually-guided robotic errand service, it is critical to ensure dependability under various ill-conditioned visual environments. To this end, we have developed Adaptive Bayesian Recognition Framework in which in-situ selection of multiple sets of optimal features or evidences as well as proactive collection of sufficient evidences are proposed to implement the principle of dependability. The framework has shown excellent performance with a limited number of objects in a scene. However, there arises a need to extend the framework for handling a larger number of objects without performance degradation, while avoiding difficulty in feature engineering. To this end, a novel deep learning architecture, referred to here as FER-CNN, is introduced and integrated into the Adaptive Bayesian Recognition Framework. FER-CNN has capability of not only extracting but also reconstructing a hierarchy of features with the layer-wise independent feedback connections that can be trained. Reconstructed features representing parts of 3D objects then allow them to be semantically linked to ontology for exploring object categories and properties. Experiments are conducted in a home environment with real 3D daily-life objects as well as with the standard ModelNet dataset. In particular, it is shown that FER-CNN allows the number of objects and their categories to be extended by 10 and 5 times, respectively, while registering the recognition rate for ModelNet10 and ModelNet40 by 97% and 89.5%, respectively.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::a4508234929bdf660c43dd80e22eb88a https://doi.org/10.1109/iros.2018.8593985 Zobrazit plný text záznamu