Adaptive Proximal Average Based Variance Reducing Stochastic Methods for Optimization with Composite Regularization

Jingchang Liu; Linli Xu; Junliang Guo; Xin Sheng

doi:10.1609/aaai.v33i01.33011552

Authors

Jingchang Liu University of Science and Technology of China
Linli Xu University of Science and Technology of China
Junliang Guo University of Science and Technology of China
Xin Sheng University of Science and Technology of China

DOI:

https://doi.org/10.1609/aaai.v33i01.33011552

Abstract

We focus on empirical risk minimization with a composite regulariser, which has been widely applied in various machine learning tasks to introduce important structural information regarding the problem or data. In general, it is challenging to calculate the proximal operator with the composite regulariser. Recently, proximal average (PA) which involves a feasible proximal operator calculation is proposed to approximate composite regularisers. Augmented with the prevailing variance reducing (VR) stochastic methods (e.g. SVRG, SAGA), PA based algorithms would achieve a better performance. However, existing works require a fixed stepsize, which needs to be rather small to ensure that the PA approximation is sufficiently accurate. In the meantime, the smaller stepsize would incur many more iterations for convergence. In this paper, we propose two fast PA based VR stochastic methods – APA-SVRG and APA-SAGA. By initializing the stepsize with a much larger value and adaptively decreasing it, both of the proposed methods are proved to enjoy the (ô n log 1/ε + mo 1/ε) iteration complexity to achieve the accurate solutions, where m₀ is the initial number of inner iterations and n is the number of samples. Moreover, experimental results demonstrate the superiority of the proposed algorithms.

Adaptive Proximal Average Based Variance Reducing Stochastic Methods for Optimization with Composite Regularization

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription