Multi-scale Orderless Pooling of Deep Convolutional Activation Features

Gong, Yunchao; Wang, Liwei; Guo, Ruiqi; Lazebnik, Svetlana

arXiv:1403.1840v1 (cs)

[Submitted on 7 Mar 2014 (this version), latest version 8 Sep 2014 (v3)]

Abstract: Deep convolutional neural networks (CNNs) have shown their promise as a universal representation for recognition. However, global CNN activations at present lack geometric invariance, which limits their robustness for tasks such as classification and matching of highly variable scenes. To improve the invariance of CNN activations without degrading their discriminative power, this paper presents a simple but effective scheme called multi-scale orderless pooling (MOP-CNN for short). The approach extracts CNN activations for local patches at multiple scales, applies orderless VLAD pooling to these activations at each scale level, and concatenates the results. This feature representation decisively outperforms global CNN activations and achieves state-of-the-art performance for scene classification on such challenging benchmarks as SUN397, MIT Indoor Scenes, and ILSVRC2012, as well as for instance-level retrieval on the Holidays dataset.
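
The pooling pipeline named in the abstract (per-scale VLAD pooling followed by concatenation) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `vlad_pool` and `multi_scale_pool` are hypothetical, random vectors stand in for CNN patch activations, the codebook centers are assumed to be given (in practice they would typically come from k-means), and the L2 normalization step is an assumption. Only the standard library is used to keep the example self-contained; an array library would be the usual choice for real features.

```python
import math
import random


def vlad_pool(descriptors, centers):
    """VLAD pooling: sum each descriptor's residual to its nearest center.

    The result does not depend on the order of `descriptors`.
    L2 normalization of the output is an assumption of this sketch.
    """
    k, dim = len(centers), len(centers[0])
    residual_sums = [[0.0] * dim for _ in range(k)]
    for x in descriptors:
        # Index of the nearest codebook center (squared Euclidean distance).
        nearest = min(
            range(k),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(x, centers[i])),
        )
        for j in range(dim):
            residual_sums[nearest][j] += x[j] - centers[nearest][j]
    flat = [v for row in residual_sums for v in row]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


def multi_scale_pool(descriptors_per_scale, centers_per_scale):
    """Pool each scale level separately, then concatenate the results."""
    pooled = []
    for descriptors, centers in zip(descriptors_per_scale, centers_per_scale):
        pooled.extend(vlad_pool(descriptors, centers))
    return pooled


# Toy stand-ins: random 4-D vectors replace real CNN patch activations.
rng = random.Random(0)


def random_vectors(n, dim):
    return [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n)]


scales = [random_vectors(9, 4), random_vectors(25, 4)]   # patches per scale
codebooks = [random_vectors(3, 4), random_vectors(3, 4)]  # assumed given

feature = multi_scale_pool(scales, codebooks)

# Shuffling the patches within each scale should leave the feature unchanged.
shuffled = [rng.sample(s, len(s)) for s in scales]
feature_shuffled = multi_scale_pool(shuffled, codebooks)
```

The final feature has one block of `k * dim` values per scale, and shuffling patches within a scale changes it only by floating-point rounding, which is what "orderless" refers to.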

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1403.1840 [cs.CV]
	(or arXiv:1403.1840v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1403.1840

Submission history

From: Yunchao Gong
[v1] Fri, 7 Mar 2014 19:03:15 UTC (16,947 KB)
[v2] Tue, 8 Jul 2014 17:38:52 UTC (35,955 KB)
[v3] Mon, 8 Sep 2014 22:03:21 UTC (17,980 KB)
