Semigroups of stochastic gradient descent and online principal component analysis: Properties and diffusion approximations

  • Yuanyuan Feng
  • , Lei Li*
  • , Jian Guo Liu
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

22 Scopus citations

Abstract

We study the Markov semigroups for two important algorithms from machine learning: stochastic gradient descent (SGD) and online principal component analysis (PCA). We investigate the effects of small jumps on the properties of the semigroups. Properties including regularity preserving, L contraction are discussed. These semigroups are the dual of the semigroups for evolution of probability, while the latter are L1 contracting and positivity preserving. Using these properties, we show that stochastic differential equations (SDEs) in Rd (on the sphere Sd-1) can be used to approximate SGD (online PCA) weakly. These SDEs may be used to provide some insights of the behaviors of these algorithms.

Original languageEnglish
Pages (from-to)777-789
Number of pages13
JournalCommunications in Mathematical Sciences
Volume16
Issue number3
DOIs
StatePublished - 2018
Externally publishedYes

Keywords

  • Markov chain
  • Online principle component analysis
  • Semigroup
  • Stochastic differential equations
  • Stochastic gradient descent

Fingerprint

Dive into the research topics of 'Semigroups of stochastic gradient descent and online principal component analysis: Properties and diffusion approximations'. Together they form a unique fingerprint.

Cite this