Abstract
In this paper, we investigate a continuous time version of the Stochastic Langevin Monte Carlo method, introduced in [39], that incorporates a stochastic sampling step inside the traditional overdamped Langevin diffusion. This method is popular in machine learning for sampling posterior distribution. We will pay specific attention in our work to the computational cost in terms of n (the number of observations that produces the posterior distribution), and d (the dimension of the ambient space where the parameter of interest is living). We derive our analysis in the weakly convex framework, which is parameterized with the help of the Kurdyka- Lojasiewicz (KL) inequality, that permits to handle a vanishing curvature settings, which is far less restrictive when compared to the simple strongly convex case. We establish that the final horizon of simulation to obtain an ε approximation (in terms of entropy) is of the order (d log(n)²)(1+r)² [log²(ε−1) + n²d²(1+r) log4(1+r)(n)] with a Poissonian subsampling of parameter n(d log²(n))1+r)−1, where the parameter r is involved in the KL inequality and varies between 0 (strongly convex case) and 1 (limiting Laplace situation).
Keywords
Log-concave models ,; Stochastic gradient Langevin dynamics; Weak convexity;
Reference
Marelys Crespo, Sébastien Gadat, and Xavier Gendre, “Stochastic Gradient Langevin Dynamics for (weakly) log-concave posterior distributions”, Electronic Journal of Probability, vol. 29, 2024, pp. 1–40.
See also
Published in
Electronic Journal of Probability, vol. 29, 2024, pp. 1–40