A survey on joint object detection and pose estimation using monocular vision

Aniruddha V Patil; Pankaj Rabha

doi:10.1051/matecconf/201927702029

All issues

Volume 277 (2019)

MATEC Web Conf., 277 (2019) 02029

Abstract

Open Access

Issue		MATEC Web Conf. Volume 277, 2019 2018 International Joint Conference on Metallurgical and Materials Engineering (JCMME 2018)


Article Number		02029
Number of page(s)		11
Section		Data and Signal Processing
DOI		https://doi.org/10.1051/matecconf/201927702029
Published online		02 April 2019

MATEC Web of Conferences 277, 02029 (2019)

A survey on joint object detection and pose estimation using monocular vision

Aniruddha V Patil¹ and Pankaj Rabha²^*

¹ IIIT-Hyderabad, Gachibowli, Hyderabad, Telangana, India 500032
² Intel, Bellandur, Bangalore, Karnataka, India 560103

^* Corresponding author: pankaj.rabha@intel.com

Abstract

In this survey we present a complete landscape of joint object detection and pose estimation methods that use monocular vision. Descriptions of traditional approaches that involve descriptors or models and various estimation methods have been provided. These descriptors or models include chordiograms, shape-aware deformable parts model, bag of boundaries, distance transform templates, natural 3D markers and facet features whereas the estimation methods include iterative clustering estimation, probabilistic networks and iterative genetic matching. Hybrid approaches that use handcrafted feature extraction followed by estimation by deep learning methods have been outlined. We have investigated and compared, wherever possible, pure deep learning based approaches (single stage and multi stage) for this problem. Comprehensive details of the various accuracy measures and metrics have been illustrated. For the purpose of giving a clear overview, the characteristics of relevant datasets are discussed. The trends that prevailed from the infancy of this problem until now have also been highlighted.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.