Benchmarking optimization algorithms for auto-tuning GPU kernels

Schoonhoven, Richard; van Werkhoven, Ben; Batenburg, Kees Joost

doi:10.1109/TEVC.2022.3210654

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2210.01465 (cs)

[Submitted on 4 Oct 2022]

Title:Benchmarking optimization algorithms for auto-tuning GPU kernels

Authors:Richard Schoonhoven, Ben van Werkhoven, Kees Joost Batenburg

View PDF

Abstract:Recent years have witnessed phenomenal growth in the application, and capabilities of Graphical Processing Units (GPUs) due to their high parallel computation power at relatively low cost. However, writing a computationally efficient GPU program (kernel) is challenging, and generally only certain specific kernel configurations lead to significant increases in performance. Auto-tuning is the process of automatically optimizing software for highly-efficient execution on a target hardware platform. Auto-tuning is particularly useful for GPU programming, as a single kernel requires re-tuning after code changes, for different input data, and for different architectures. However, the discrete, and non-convex nature of the search space creates a challenging optimization problem. In this work, we investigate which algorithm produces the fastest kernels if the time-budget for the tuning task is varied. We conduct a survey by performing experiments on 26 different kernel spaces, from 9 different GPUs, for 16 different evolutionary black-box optimization algorithms. We then analyze these results and introduce a novel metric based on the PageRank centrality concept as a tool for gaining insight into the difficulty of the optimization problem. We demonstrate that our metric correlates strongly with observed tuning performance.

Comments:	in IEEE Transactions on Evolutionary Computation, 2022
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Graphics (cs.GR); Performance (cs.PF)
Cite as:	arXiv:2210.01465 [cs.DC]
	(or arXiv:2210.01465v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2210.01465
Related DOI:	https://doi.org/10.1109/TEVC.2022.3210654

Submission history

From: Richard Schoonhoven [view email]
[v1] Tue, 4 Oct 2022 08:42:12 UTC (26,233 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Benchmarking optimization algorithms for auto-tuning GPU kernels

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Benchmarking optimization algorithms for auto-tuning GPU kernels

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators