SGD in Gaussian Processes

Introduction

In this project, we apply stochastic gradient descent (SGD) algorithm and its variants to accelerate and improve Gaussian process (GP) inference.
We provide code for implementing sgGP described in Stochastic Gradient Descent in Correlated Settings: A Study on Gaussian Processes by Hao Chen, Lili Zheng, Raed Al Kontar, Garvesh, Raskutti.

Contributions

We prove minibatch SGD converges to a critical point of the empirical loss function and recovers model hyperparameters with rate 1/K (K is the number of iterations) up to a statistical error term depending on the minibatch size.
We prove that the conditional expectation of the loss function given covariates satisfies a relaxed property of strong convexity, which guarantees the 1/K optimization error bound.
Computationally, we are able to scale to dataset sizes previously unexplored in GPs in a fraction of time needed for competing methods. Meanwhile statistically, we find that the induced regularization imposed by SGD improves generalization in GPs, specifically in large data settings.

Problem Setup

Loss function

Theoretical Guarantee of Convergence

Assumptions

Exponential eigendecay. The eigenvalues of the kernel function decay exponentially.
Bounded iterates. The true parameters and SGD iterates lie within a bounded interval.
Bounded stochastic gradient. The norm of the stochastic gradient is upper bounded by a constant.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
Presentation		Presentation
figures		figures
poster		poster
.DS_Store		.DS_Store
.Rhistory		.Rhistory
README.md		README.md
functions2.R		functions2.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SGD in Gaussian Processes

Introduction

Contributions

Problem Setup

Loss function

Theoretical Guarantee of Convergence

Assumptions

Convergence of parameter iterates

Convergence of full gradient

Numerical Results

Comparison

Illustration

Prerequisite

Datasets

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SGD in Gaussian Processes

Introduction

Contributions

Problem Setup

Loss function

Theoretical Guarantee of Convergence

Assumptions

Convergence of parameter iterates

Convergence of full gradient

Numerical Results

Comparison

Illustration

Prerequisite

Datasets

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages