Next: The image distortion model
Up: Experiments with an Extended
Previous: Introduction
In 1993, SIMARD et al. proposed an invariant distance measure called tangent distance, which
proved to be especially effective in the domain of OCR [1]. The authors
observed that reasonably small
transformations of certain image objects does not
affect class membership.
Simple distance measures like the Euclidean distance do not account for this,
instead they are
very sensitive to affine transformations like scaling, translation, rotation or axis deformation.
When an image
is transformed (e.g. scaled and rotated) by a transformation
which depends on
parameters
(e.g. the scaling factor and rotation angle), the set of all transformed patterns
 |
(1) |
is a manifold of at most dimension
in pattern space.
The distance between
two patterns can now be defined as the minimum distance between their respective manifolds, being
truly invariant with respect to the
regarded transformations. Unfortunately,
computation of this distance is a hard non-linear
optimization problem and the manifolds concerned generally do
not have an analytic expression. Therefore,
small transformations
of the pattern
are approximated by a tangent subspace
to the manifold
at
the point
.
This subspace is obtained by adding to
a linear combination of the vectors
that
span the tangent subspace and are the partial derivatives of
with respect to
.
We obtain a first-order approximation of
 |
(2) |
The single-sided (SS) TD
is defined as
 |
(3) |
The tangent vectors
can be computed using finite differences between the original image
and a reasonably small transformation of
[1]. Example images that were computed using (2) are shown
in Fig. 1 (with the original image on the left).
Figure 1:
Examples for tangent approximation
![\includegraphics [width=8.0cm,angle=0]{/u/keysers/Paper/ICPR2000/Tangentenbild_2.ps}](img17.png) |
A double-sided (DS) TD can also be defined,
where both manifolds are
approximated and the distance is minimized over possible combinations of the respective parameters.
It is possible to achieve a better approximation of the manifolds using an iterative procedure based
on Newton's method, which is computationally more expensive.
Next: The image distortion model
Up: Experiments with an Extended
Previous: Introduction
Daniel Keysers
2000-11-16