II.5.5.3: Convergence

Suppose is such that majorizes for all . The majorization algorithm is simply i.e. it is a gradient algorithm with constant step size. From Ostrowski's Theorem the linear convergence rate is Note that if in majorizes , then any of the same form with a larger also majorizes . But shows a smaller will generally lead to faster convergence.

For all we have Adding these inequalities gives and thus, with , The left hand side of is an increasing sequence which is bounded above, and consequently converges. This implies

Generalize to more variables, generalize to constraints.