Extreme State Aggregation Beyond MDPs

Hutter, Marcus

Computer Science > Artificial Intelligence

arXiv:1407.3341 (cs)

[Submitted on 12 Jul 2014]

Title:Extreme State Aggregation Beyond MDPs

Authors:Marcus Hutter

View PDF

Abstract:We consider a Reinforcement Learning setup where an agent interacts with an environment in observation-reward-action cycles without any (esp.\ MDP) assumptions on the environment. State aggregation and more generally feature reinforcement learning is concerned with mapping histories/raw-states to reduced/aggregated states. The idea behind both is that the resulting reduced process (approximately) forms a small stationary finite-state MDP, which can then be efficiently solved or learnt. We considerably generalize existing aggregation results by showing that even if the reduced process is not an MDP, the (q-)value functions and (optimal) policies of an associated MDP with same state-space size solve the original problem, as long as the solution can approximately be represented as a function of the reduced states. This implies an upper bound on the required state space size that holds uniformly for all RL problems. It may also explain why RL algorithms designed for MDPs sometimes perform well beyond MDPs.

Comments:	28 LaTeX pages. 8 Theorems
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1407.3341 [cs.AI]
	(or arXiv:1407.3341v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1407.3341

Submission history

From: Marcus Hutter [view email]
[v1] Sat, 12 Jul 2014 04:10:43 UTC (30 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 1407

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Marcus Hutter

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Extreme State Aggregation Beyond MDPs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Extreme State Aggregation Beyond MDPs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators