Straight to Shapes: Real-time Detection of Encoded Shapes

Jetley, Saumya; Sapienza, Michael; Golodetz, Stuart; Torr, Philip H. S.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1611.07932 (cs)

[Submitted on 23 Nov 2016 (v1), last revised 5 Jul 2017 (this version, v2)]

Title:Straight to Shapes: Real-time Detection of Encoded Shapes

Authors:Saumya Jetley, Michael Sapienza, Stuart Golodetz, Philip H.S. Torr

View PDF

Abstract:Current object detection approaches predict bounding boxes, but these provide little instance-specific information beyond location, scale and aspect ratio. In this work, we propose to directly regress to objects' shapes in addition to their bounding boxes and categories. It is crucial to find an appropriate shape representation that is compact and decodable, and in which objects can be compared for higher-order concepts such as view similarity, pose variation and occlusion. To achieve this, we use a denoising convolutional auto-encoder to establish an embedding space, and place the decoder after a fast end-to-end network trained to regress directly to the encoded shape vectors. This yields what to the best of our knowledge is the first real-time shape prediction network, running at ~35 FPS on a high-end desktop. With higher-order shape reasoning well-integrated into the network pipeline, the network shows the useful practical quality of generalising to unseen categories similar to the ones in the training set, something that most existing approaches fail to handle.

Comments:	16 pages including appendix; Published at CVPR 2017
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1611.07932 [cs.CV]
	(or arXiv:1611.07932v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1611.07932

Submission history

From: Saumya Jetley [view email]
[v1] Wed, 23 Nov 2016 19:04:43 UTC (8,108 KB)
[v2] Wed, 5 Jul 2017 17:25:25 UTC (8,314 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 1611

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Saumya Jetley
Michael Sapienza
Stuart Golodetz
Philip H. S. Torr

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Straight to Shapes: Real-time Detection of Encoded Shapes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Straight to Shapes: Real-time Detection of Encoded Shapes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators