I.6.5.2: Optimal Scaling with ORDINALS
In LINEALS (section x.x.x) we try to find quantifications of the variables that linearize all bivariate regressions. De Leeuw [1988] suggested finding standardized quantifications that minimize the loss function
$$f(y)=\sum\sum_{j\neq\ell}\left\{y_j'C_{j\ell}D_\ell^{-1}C_{\ell j}y_j-y_j'C_{j\ell}y_\ell\,y_\ell'C_{\ell j}y_j\right\}.\tag{1}$$
A more general loss function is
$$g(y,z)=\sum\sum_{j\neq\ell}(z_{j\ell}-D_j^{-1}C_{j\ell}y_\ell)'D_j(z_{j\ell}-D_j^{-1}C_{j\ell}y_\ell),\tag{2}$$
which must be minimized over both $y$ and $z$. The $z_{j\ell}$ are $m(m-1)$ vectors, called regression targets; target $z_{j\ell}$ has $k_j$ elements.
To see that this loss function generalizes (1), suppose we constrain $z$ by requiring that $z_{j\ell}$ is proportional to $y_j$, i.e. $z_{j\ell}=r_{j\ell}y_j$. Then, using $y_j'D_jy_j=1$,
$$g(y,R)=\sum\sum_{j\neq\ell}r_{j\ell}^2-2\sum\sum_{j\neq\ell}r_{j\ell}\,y_j'C_{j\ell}y_\ell+\sum\sum_{j\neq\ell}y_\ell'C_{\ell j}D_j^{-1}C_{j\ell}y_\ell.$$
This is minimized over $R$ by $r_{j\ell}=y_j'C_{j\ell}y_\ell$, and the minimum is precisely the loss function (1). Thus $f(y)=\min_R g(y,R)$, and $g$ is an augmentation of $f$. Block relaxation for $g$ alternates minimization over $R$ for fixed $y$, which we have shown to be easy, with minimization over $y$ for fixed $R$, which is a modified eigenvalue problem of the kind discussed in BRAS3, section x.x.x. This is not necessarily simpler than the direct minimum eigenvalue problem for minimizing $f$ in section x.x.x.
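The augmentation identity $f(y)=\min_R g(y,R)$ is easy to check numerically. The following is a minimal sketch in Python/NumPy (the actual implementation, ordinals.R, is in R; all data and variable names here are hypothetical): it builds indicator matrices for random categorical data, draws standardized quantifications, and verifies that the direct loss (1) equals the augmented loss (2) evaluated at the optimal targets $z_{j\ell}=r_{j\ell}y_j$ with $r_{j\ell}=y_j'C_{j\ell}y_\ell$.

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 100, (3, 4, 5)          # n objects; category counts for m = 3 variables
m = len(k)

# indicator (dummy) matrices G_j, cross tables C_jl = G_j' G_l, marginals D_j
G = [np.eye(kj)[rng.integers(0, kj, n)] for kj in k]
C = [[G[j].T @ G[l] for l in range(m)] for j in range(m)]
D = [G[j].T @ G[j] for j in range(m)]

# standardized random quantifications: y_j' D_j y_j = 1
y = []
for j in range(m):
    yj = rng.standard_normal(k[j])
    y.append(yj / np.sqrt(yj @ D[j] @ yj))

# direct loss f(y), equation (1)
f = sum(y[j] @ C[j][l] @ np.linalg.solve(D[l], C[l][j] @ y[j])
        - (y[j] @ C[j][l] @ y[l]) ** 2
        for j in range(m) for l in range(m) if j != l)

# augmented loss g(y, R) at the optimal targets z_jl = r_jl y_j
g = 0.0
for j in range(m):
    for l in range(m):
        if j != l:
            r = y[j] @ C[j][l] @ y[l]          # optimal r_jl
            e = r * y[j] - np.linalg.solve(D[j], C[j][l] @ y[l])
            g += e @ D[j] @ e

print(np.allclose(f, g))   # the two losses agree
```

Since each term of $g$ is a quadratic form in the positive semidefinite $D_j$, the minimum over $R$ (and hence $f$) is nonnegative.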
The major advantage of augmenting $f$ is that it becomes simple to incorporate quite general restrictions on the $z_{j\ell}$. For example, they can be required to be monotone with the original data, a spline transformation, a monotone spline, or a mixture of these options. Thus we can constrain each individual regression function $D_j^{-1}C_{j\ell}y_\ell$ to have one of a predetermined number of shapes.
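For the monotone option, the constrained target is the weighted monotone projection of the unconstrained one, computable by pool-adjacent-violators. The sketch below (Python, with hypothetical example numbers, not taken from ordinals.R) minimizes $(z-t)'D_j(z-t)$ over nondecreasing $z$, where $t=D_j^{-1}C_{j\ell}y_\ell$ and $D_j$ is diagonal with the category counts.

```python
import numpy as np

def pava(t, w):
    """Weighted isotonic regression: minimize sum_i w_i (z_i - t_i)^2
    over nondecreasing z, by pooling adjacent violators."""
    vals, wts, sizes = [], [], []            # current blocks
    for ti, wi in zip(t, w):
        vals.append(ti); wts.append(wi); sizes.append(1)
        # merge blocks while monotonicity is violated
        while len(vals) > 1 and vals[-2] > vals[-1]:
            wtot = wts[-2] + wts[-1]
            vals[-2] = (wts[-2] * vals[-2] + wts[-1] * vals[-1]) / wtot
            wts[-2] = wtot; sizes[-2] += sizes[-1]
            vals.pop(); wts.pop(); sizes.pop()
    return np.repeat(vals, sizes)

t = np.array([0.3, -0.1, 0.2, 0.5])   # hypothetical target D_j^{-1} C_jl y_l
w = np.array([10., 25., 40., 25.])    # hypothetical diagonal of D_j
z = pava(t, w)
print(z)   # nondecreasing projection of t
```

The first two entries, which violate monotonicity, are pooled into their weighted mean; the rest of the target is already monotone and is left untouched.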
In ordinals.R we implement the three standard options of the Gifi system: a vector $y_j$ is treated as nominal, ordinal, or numerical.
If it is nominal then it is unconstrained, except for the normalization.
In that case the $z_{j\ell}$ are also unconstrained for all $\ell$.
If $y_j$ is treated as ordinal it must be monotone with the data, and so must all $z_{j\ell}$. A numerical $y_j$ must be linear with the data, together with its targets $z_{j\ell}$. Of course, if all variables are numerical there is nothing to optimize, and we just compute correlations. If all variables are nominal there is nothing to optimize either, because we immediately get zero loss from any starting point.
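The zero-loss claim for all-nominal variables follows because an unconstrained target can always be set equal to the regression function itself, making every residual vanish. A minimal Python sketch (hypothetical random data, not from ordinals.R):

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 200, (3, 4, 5)
m = len(k)
G = [np.eye(kj)[rng.integers(0, kj, n)] for kj in k]
C = [[G[j].T @ G[l] for l in range(m)] for j in range(m)]
D = [G[j].T @ G[j] for j in range(m)]

y = [rng.standard_normal(kj) for kj in k]   # any quantifications at all

# all variables nominal: every z_jl is free, so set it equal to the
# regression function D_j^{-1} C_jl y_l; every residual is then zero
g = 0.0
for j in range(m):
    for l in range(m):
        if j != l:
            t = np.linalg.solve(D[j], C[j][l] @ y[l])
            z = t                  # unconstrained optimum: the target itself
            e = z - t
            g += e @ D[j] @ e
print(g)   # zero loss from any starting point
```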