Asymptotic Bias of Stochastic Gradient Search

Tadic, Vladislav B.; Doucet, Arnaud

Mathematics > Statistics Theory

arXiv:1709.00291 (math)

[Submitted on 30 Aug 2017]

Title:Asymptotic Bias of Stochastic Gradient Search

Authors:Vladislav B. Tadic, Arnaud Doucet

View PDF

Abstract:The asymptotic behavior of the stochastic gradient algorithm with a biased gradient estimator is analyzed. Relying on arguments based on the dynamic system theory (chain-recurrence) and the differential geometry (Yomdin theorem and Lojasiewicz inequality), tight bounds on the asymptotic bias of the iterates generated by such an algorithm are derived. The obtained results hold under mild conditions and cover a broad class of high-dimensional nonlinear algorithms. Using these results, the asymptotic properties of the policy-gradient (reinforcement) learning and adaptive population Monte Carlo sampling are studied. Relying on the same results, the asymptotic behavior of the recursive maximum split-likelihood estimation in hidden Markov models is analyzed, too.

Comments:	arXiv admin note: text overlap with arXiv:0907.1020
Subjects:	Statistics Theory (math.ST); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1709.00291 [math.ST]
	(or arXiv:1709.00291v1 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.1709.00291

Submission history

From: Vladislav Tadić B [view email]
[v1] Wed, 30 Aug 2017 20:07:51 UTC (55 KB)

Full-text links:

Access Paper:

view license

Current browse context:

math.ST

< prev | next >

new | recent | 1709

Change to browse by:

math
math.OC
stat
stat.ML
stat.TH

References & Citations

export BibTeX citation

Mathematics > Statistics Theory

Title:Asymptotic Bias of Stochastic Gradient Search

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Statistics Theory

Title:Asymptotic Bias of Stochastic Gradient Search

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators