The provided files contain the images and their metadata used in the ImageCLEF wikipediaMM 2008 task, together with some additional information.

The collection is in:
        images          	        The full-size images. 
					There are 151,519 images in JPEG and PNG formats organised in 18 subdirectories.

	imagesIDs.txt  	        	List of all image identifiers.
	
        metadata_xml   		        The corresponding metadata.
					The metadata documents are XML files organized in 9 subdirectories. 
					Spaces in image names have been replaced by underscores.

					The image that corresponds to a metadata file (e.g., metadata_xml/0_xml/10.xml)
					is identified by the id and part attributes of that file's <image> tag,
					e.g., <image .... id="10" part="images-40000">.
					In this example the corresponding image is: images/images-40000/10.jpeg

	imagefile2metadatafile.txt 	A list mapping each image file to its corresponding metadata file.
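
The lookup from a metadata file to its image, as described for metadata_xml above, can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name image_path_for and the collection_root parameter are hypothetical, the <image> element is assumed to be either the root element or nested somewhere below it, and because the collection holds both JPEG and PNG files the sketch matches any file extension rather than assuming .jpeg.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def image_path_for(metadata_file, collection_root="."):
    """Return the image path named by a metadata file, or None if not found.

    Hypothetical helper: the name and parameters are not part of the collection.
    """
    root = ET.parse(metadata_file).getroot()
    # Assumption: <image> is either the root element or nested below it.
    image = root if root.tag == "image" else root.find(".//image")
    if image is None:
        return None
    image_id, part = image.get("id"), image.get("part")
    if image_id is None or part is None:
        return None
    folder = Path(collection_root) / "images" / part
    # The attributes do not carry the file extension, so accept any extension.
    matches = sorted(folder.glob(f"{image_id}.*"))
    return matches[0] if matches else None
```

For the example above, image_path_for("metadata_xml/0_xml/10.xml") would look for a file named 10.* under images/images-40000/. When imagefile2metadatafile.txt is available, it is likely the simpler way to obtain the same correspondence.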

