Self-labelling via simultaneous clustering and representation learning

Asano, Yuki Markus; Rupprecht, Christian; Vedaldi, Andrea

Computer Science > Computer Vision and Pattern Recognition

arXiv:1911.05371 (cs)

[Submitted on 13 Nov 2019 (v1), last revised 19 Feb 2020 (this version, v3)]

Title:Self-labelling via simultaneous clustering and representation learning

Authors:Yuki Markus Asano, Christian Rupprecht, Andrea Vedaldi

View PDF

Abstract:Combining clustering and representation learning is one of the most promising approaches for unsupervised learning of deep neural networks. However, doing so naively leads to ill posed learning problems with degenerate solutions. In this paper, we propose a novel and principled learning formulation that addresses these issues. The method is obtained by maximizing the information between labels and input data indices. We show that this criterion extends standard crossentropy minimization to an optimal transport problem, which we solve efficiently for millions of input images and thousands of labels using a fast variant of the Sinkhorn-Knopp algorithm. The resulting method is able to self-label visual data so as to train highly competitive image representations without manual labels. Our method achieves state of the art representation learning performance for AlexNet and ResNet-50 on SVHN, CIFAR-10, CIFAR-100 and ImageNet and yields the first self-supervised AlexNet that outperforms the supervised Pascal VOC detection baseline. Code and models are available.

Comments:	Accepted paper at the International Conference on Learning Representations (ICLR) 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1911.05371 [cs.CV]
	(or arXiv:1911.05371v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1911.05371

Submission history

From: Yuki Asano [view email]
[v1] Wed, 13 Nov 2019 09:47:49 UTC (6,413 KB)
[v2] Tue, 26 Nov 2019 13:22:38 UTC (4,449 KB)
[v3] Wed, 19 Feb 2020 18:03:39 UTC (9,325 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Self-labelling via simultaneous clustering and representation learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Self-labelling via simultaneous clustering and representation learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators