Optimal Scalarizations for Sublinear Hypervolume Regret

Zhang, Qiuyi

arXiv:2307.03288 (cs)

This paper has been withdrawn by Qiuyi (Richard) Zhang

[Submitted on 6 Jul 2023 (v1), last revised 3 Dec 2023 (this version, v2)]

Authors: Qiuyi Zhang (Richard)

Abstract: Scalarization is a general technique that can be deployed in any multiobjective setting to reduce multiple objectives into one, as seen recently in RLHF, where reward models are trained to align with human preferences. Yet some have dismissed this classical approach because linear scalarizations are known to miss concave regions of the Pareto frontier. To that end, we aim to find simple non-linear scalarizations that can explore a diverse set of $k$ objectives on the Pareto frontier, as measured by the dominated hypervolume. We show that hypervolume scalarizations with uniformly random weights are surprisingly optimal for provably minimizing the hypervolume regret, achieving an optimal sublinear regret bound of $O(T^{-1/k})$, with matching lower bounds that preclude any algorithm from doing better asymptotically. As a theoretical case study, we consider the multiobjective stochastic linear bandits problem and demonstrate that, by exploiting the sublinear regret bounds of the hypervolume scalarizations, we can derive a novel non-Euclidean analysis that produces improved hypervolume regret bounds of $\tilde{O}(d T^{-1/2} + T^{-1/k})$. We support our theory with strong empirical results: simple hypervolume scalarizations consistently outperform both the linear and Chebyshev scalarizations, as well as standard multiobjective algorithms in Bayesian optimization, such as EHVI.
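
The abstract does not spell out the scalarization itself. As a minimal sketch, assuming the form used in earlier work on hypervolume scalarizations, $s_\lambda(y) = \min_i (\max(0, y_i/\lambda_i))^k$ with $\lambda$ drawn uniformly from the positive part of the unit sphere, the dominated hypervolume relative to a reference point $z$ can be estimated as $c_k \, \mathbb{E}_\lambda[\max_y s_\lambda(y - z)]$ with $c_k = \pi^{k/2} / (2^k \Gamma(k/2 + 1))$. The function names, the sample count, and the small clipping constant below are illustrative choices, not taken from the paper.

```python
import math
import numpy as np


def sample_weights(num_samples, k, rng):
    """Draw weights uniformly from the positive orthant of the unit sphere."""
    g = np.abs(rng.standard_normal((num_samples, k)))
    return g / np.linalg.norm(g, axis=1, keepdims=True)


def hypervolume_scalarization(y, lam):
    """Assumed form: s_lam(y) = min_i (max(0, y_i / lam_i))^k.

    y has shape (n, k), lam has shape (m, k); the result has shape (m, n).
    """
    k = y.shape[-1]
    # Clip lam away from zero to avoid division by zero (illustrative safeguard).
    ratios = np.maximum(y[None, :, :], 0.0) / np.maximum(lam[:, None, :], 1e-12)
    return ratios.min(axis=-1) ** k


def estimate_hypervolume(points, ref, num_samples=20000, seed=0):
    """Monte Carlo estimate of the hypervolume dominated by `points` above `ref`."""
    points = np.asarray(points, dtype=float)
    ref = np.asarray(ref, dtype=float)
    k = points.shape[1]
    rng = np.random.default_rng(seed)
    lam = sample_weights(num_samples, k, rng)
    # For each random weight, keep the best scalarized value over all points.
    best = hypervolume_scalarization(points - ref, lam).max(axis=1)
    c_k = math.pi ** (k / 2) / (2 ** k * math.gamma(k / 2 + 1))
    return c_k * best.mean()
```

For a single point $(1, 1)$ with the origin as reference, the estimate should be close to 1, the area of the unit square. For the two points $(2, 1)$ and $(1, 2)$ it should be close to 3, the area of the union of the two dominated rectangles.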

Comments:	New version coming
Subjects:	Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Optimization and Control (math.OC)
Cite as:	arXiv:2307.03288 [cs.LG]
	(or arXiv:2307.03288v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2307.03288

Submission history

From: Qiuyi (Richard) Zhang
[v1] Thu, 6 Jul 2023 20:49:42 UTC (11,894 KB)
[v2] Sun, 3 Dec 2023 00:08:15 UTC (1 KB) (withdrawn)
