Developer Guide for Intel® Data Analytics Acceleration Library 2019 Update 4
The adaptive subgradient method (AdaGrad) [Duchi2011] follows the algorithmic framework of an iterative solver with the algorithm-specific transformation T, set of intrinsic parameters S t defined for the learning rate η, and algorithm-specific vector U and power d of Lebesgue space defined as follows:
Convergence check: