Automatic image annotation or image classification can be an important step when searching for images from a database. Base on the IRMA project a database of 9,000 fully classified radiographs taken randomly from medical routine is made available and can be used to train a classification system. 1,000 radiographis for which classification labels are not available to the participants have to be classified. The aim is to find out how well current techniques can identify image modality, body orientation, body region, and biological system examined based on the images. The results of the classification step can be used for multilingual image annotations as well as for DICOM header corrections.
Although only 57 simple class numbers will be provided for ImageCLEFmed 2005. The images are annotated with complete IRMA code, a multi-axial code for image annotation. The code is currently available in English and German. It is planned to use the results of such automatic image annotation tasks for further, textual image retrieval tasks in the future.
<imageno> <confidence for class 1> <confidence for class 2>... <confidence for class 56> <confidence for class 57>
1876 1.0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
1895 0 0.1 0.9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
1919 0 0.1 0.2 0.3 0.4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
These three lines would lead to a classification of the image
with no 1876 into class 1, image 1895 is classified to be
class 3, and image 1919 is classified to be class 5.
./readconfidencefile.py -c imageclef05-testname-necessary-autoannot confidences.txt
this prints out the classification result for each file
classified and complains about files that are not expected to
be classified. Most important is the last line which can give
error rates (when correct classes are known) but which will
also print whether there are files missing or files in the
list which are unknown.Example (what is not desireable): only 11 files could be classified, one of these is unknown to the system and 990 are missing:
ER: 0.0 classified: 11 wrong: 0 correct: 0 illegal: 1 missing: 990
Example (what is desireable): All files are known to the system and all 1000 files were classified:
ER: 0.0 classified: 1000 wrong: 0 correct: 0 illegal: 0 missing: 0
Explanation:
| ER | is error rate if classification is known, in this case irrelevant |
|---|---|
| classified | how many lines could be used for classification (should be 1000) |
| wrong | how many of the valid classifications where wrong (irrelevant for the moment) |
| correct | how many of the valid classifications where correct (irrelevant for the moment) |
| illegal | how many lines contained classifications of files that are not expected (mainly: wrong number in first column) |
| missing | how many of the files we were expecting to be classified were missing in the file |