Solving Games with Functional Regret Estimation

Waugh, Kevin; Morrill, Dustin; Bagnell, J. Andrew; Bowling, Michael

Computer Science > Artificial Intelligence

arXiv:1411.7974 (cs)

[Submitted on 28 Nov 2014 (v1), last revised 31 Dec 2014 (this version, v2)]

Title:Solving Games with Functional Regret Estimation

Authors:Kevin Waugh, Dustin Morrill, J. Andrew Bagnell, Michael Bowling

View PDF

Abstract:We propose a novel online learning method for minimizing regret in large extensive-form games. The approach learns a function approximator online to estimate the regret for choosing a particular action. A no-regret algorithm uses these estimates in place of the true regrets to define a sequence of policies.
We prove the approach sound by providing a bound relating the quality of the function approximation and regret of the algorithm. A corollary being that the method is guaranteed to converge to a Nash equilibrium in self-play so long as the regrets are ultimately realizable by the function approximator. Our technique can be understood as a principled generalization of existing work on abstraction in large games; in our work, both the abstraction as well as the equilibrium are learned during self-play. We demonstrate empirically the method achieves higher quality strategies than state-of-the-art abstraction techniques given the same resources.

Comments:	AAAI Conference on Artificial Intelligence 2015
Subjects:	Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)
Cite as:	arXiv:1411.7974 [cs.AI]
	(or arXiv:1411.7974v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1411.7974

Submission history

From: Kevin Waugh [view email]
[v1] Fri, 28 Nov 2014 18:45:50 UTC (292 KB)
[v2] Wed, 31 Dec 2014 23:45:22 UTC (113 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 1411

Change to browse by:

cs
cs.GT
cs.MA

References & Citations

DBLP - CS Bibliography

listing | bibtex

Kevin Waugh
Dustin Morrill
J. Andrew Bagnell
Michael Bowling

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Solving Games with Functional Regret Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Solving Games with Functional Regret Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators