Image classification by visual bag-of-words refinement and reduction

Lu, Zhiwu; Wang, Liwei; Wen, Ji-Rong

doi:10.1016/j.neucom.2015.01.098

arXiv:1501.04292 (cs)

[Submitted on 18 Jan 2015]

Abstract: This paper presents a new framework for visual bag-of-words (BOW) refinement and reduction to overcome the drawbacks of the visual BOW model, which has been widely used for image classification. Although very influential in the literature, the traditional visual BOW model has two distinct drawbacks. First, for efficiency, the visual vocabulary is commonly constructed by directly clustering the low-level visual feature vectors extracted from local keypoints, without considering the high-level semantics of images. The visual BOW model therefore still suffers from the semantic gap, which may lead to significant performance degradation in more challenging tasks (e.g. social image classification). Second, thousands of visual words are typically generated to obtain better performance on a relatively large image dataset. With such a large vocabulary, the subsequent image classification may take a considerable amount of time. To overcome the first drawback, we develop a graph-based method for visual BOW refinement that exploits the tags (easy to access although noisy) of social images. More notably, to overcome the second drawback and make image classification efficient, we further reduce the refined visual BOW model to a much smaller size through semantic spectral clustering. Extensive experimental results show the promising performance of the proposed framework for visual BOW refinement and reduction.
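To make the baseline concrete, the following is a minimal sketch of the traditional visual BOW pipeline that the abstract criticizes: local descriptors are clustered into a vocabulary, and each image becomes a histogram over the resulting visual words. It is not the authors' refinement or reduction method. The function names, the toy 2-D descriptors, and the use of plain k-means are assumptions made for illustration; real descriptors would come from a keypoint extractor and have many more dimensions.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(centers, point):
    """Index of the center closest to point."""
    return min(range(len(centers)), key=lambda k: squared_distance(centers[k], point))


def kmeans(points, k, iterations=20, seed=0):
    """Plain k-means; the returned centers play the role of visual words."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(centers, p)].append(p)
        for j, members in enumerate(groups):
            if members:  # keep the old center if a cluster went empty
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return centers


def bow_histogram(descriptors, vocabulary):
    """Normalized count of descriptors assigned to each visual word."""
    counts = [0] * len(vocabulary)
    for d in descriptors:
        counts[nearest(vocabulary, d)] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else counts


# Toy data (hypothetical): each image is a list of 2-D "descriptors".
image_a = [[0.0, 0.1], [0.1, 0.0], [5.0, 5.1]]
image_b = [[5.1, 4.9], [4.9, 5.0], [0.05, 0.05]]

vocabulary = kmeans(image_a + image_b, k=2)
hist_a = bow_histogram(image_a, vocabulary)
hist_b = bow_histogram(image_b, vocabulary)
```

The histogram length equals the vocabulary size, so a vocabulary of thousands of words yields feature vectors with thousands of dimensions. This is the cost that the reduction step described in the abstract aims to cut, and the vocabulary here is built from low-level descriptors alone, with no use of image tags or semantics.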

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1501.04292 [cs.CV]
	(or arXiv:1501.04292v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1501.04292
Journal reference:	Neurocomputing 173: 373-384 (2016)
Related DOI:	https://doi.org/10.1016/j.neucom.2015.01.098

Submission history

From: Zhiwu Lu
[v1] Sun, 18 Jan 2015 12:46:11 UTC (641 KB)
