Distributed Gradient Descent: Nonconvergence to Saddle Points and the\n Stable-Manifold Theorem

Brian R. Swenson; Ryan Murray; H Vincent Vincent Poort; Soummya Kar

doi:10.48550/arxiv.1908.02747

Abstract

1 min read

The paper studies a distributed gradient descent (DGD) process and considers\nthe problem of showing that in nonconvex optimization problems, DGD typically\nconverges to local minima rather than saddle points. The paper considers\nunconstrained minimization of a smooth objective function. In centralized\nsettings, the problem of demonstrating nonconvergence to saddle points of\ngradient descent (and variants) is typically handled by way of the\nstable-manifold theorem from classical dynamical systems theory. However, the\nclassical stable-manifold theorem is not applicable in distributed settings.\nThe paper develops an appropriate stable-manifold theorem for DGD showing that\nconvergence to saddle points may only occur from a low-dimensional stable\nmanifold. Under appropriate assumptions (e.g., coercivity), this result implies\nthat DGD typically converges to local minima and not to saddle points.\n

Open reviews(0)

Public, signed peer feedback on this preprint.

No reviews yet.

Distributed Gradient Descent: Nonconvergence to Saddle Points and the\n Stable-Manifold Theorem

Abstract

Discussion(0)

Open reviews(0)

Related publications

Distributed Gradient Descent: Nonconvergence to Saddle Points and the Stable-Manifold Theorem

Distributed Gradient Flow: Nonsmoothness, Nonconvexity, and Saddle Point\n Evasion

Distributed Gradient Flow: Nonsmoothness, Nonconvexity, and Saddle Point Evasion

Distributed Stochastic Gradient Descent and Convergence to Local Minima

Distributed Stochastic Gradient Descent: Nonconvexity, Nonsmoothness, and Convergence to Local Minima

Related publications

Preprint2019
Distributed Gradient Descent: Nonconvergence to Saddle Points and the Stable-Manifold Theorem
Preprint2019

Preprint2020
Distributed Gradient Flow: Nonsmoothness, Nonconvexity, and Saddle Point\n Evasion
Preprint2020

Preprint2021
Distributed Gradient Flow: Nonsmoothness, Nonconvexity, and Saddle Point Evasion
Preprint2021

Preprint2020
Distributed Stochastic Gradient Descent and Convergence to Local Minima
Preprint2020

Preprint2020
Distributed Stochastic Gradient Descent: Nonconvexity, Nonsmoothness, and Convergence to Local Minima
Preprint2020