We incorporated TD into a KD
based classifier with Bayesian decision rule
| (8) |
![]() |
(9) |
![]() |
(10) |
In order to obtain a better approximation of
the domain knowledge about invariance
can be used to
enrich the training set with shifted copies of
the given training data. In the experiments displacements of one pixel in
eight directions were used. Although the tangent distance should already
compensate for shifts of that amount, this approach still leads to improvements.
This is due to the fact that the two approaches model invariance differently (compare Fig. 3).
As few large differences in pixel values can mislead
classifiers based on squared error distances (see e.g. [9]), it can be advisable
to introduce a local threshold which limits
the maximum contribution of a single pixel to the distance between two images.
This is justified
by a priori domain knowledge, e.g. when it is known that the patterns may be subject to artifacts that
do not affect class-membership, like
noise or changing scribor position in radiographs.
On the other hand when
looking at relatively small images of digits, one notices that e.g. changing only a few pixels
can be significant for discriminating between the handwritten digits `4' and `9'. Here it
can be useful to enlarge the contribution of a single pixel difference generalizing the used norm to
and investigating also
.
The extension of TD with an iterative Newton-type approximation was proposed in [1] and successfully used for face-recognition [9]. In our experiments we used a similar algorithm inspired by the Euler-Cauchy method used in the context of differential equations. In contrast to the Newton procedure it does not require the calculation of the actual transformation but uses the tangent approximation instead. It consists of iteratively calculating the closest point in the tangent subspace, ``moving'' into the corresponding direction and recalculating the tangents until convergence.