The University of Virginia Information Retrieval Group
About the Testbeds:
The testbed files describe the decomposition of documents into sites.
Each line in the file associates one document with its site.
The syntax is <document_id><site_id>.
A comment line is marked by <COMMENT> at the beginning of the line.
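As a rough illustration of the format described above, the following Python sketch groups documents by site. It rests on two assumptions that are not stated on this page: that the document id and the site id on each line are separated by whitespace, and that comment lines start with the literal text <COMMENT>. The function name parse_testbed and the sample ids are hypothetical.

```python
from collections import defaultdict

# Assumption: comment lines start with this literal text.
COMMENT_MARKER = "<COMMENT>"


def parse_testbed(lines):
    """Return a dict mapping each site id to the list of its document ids."""
    sites = defaultdict(list)
    for raw in lines:
        line = raw.strip()
        # Skip blank lines and comment lines.
        if not line or line.startswith(COMMENT_MARKER):
            continue
        # Assumption: the two fields are separated by whitespace.
        parts = line.split()
        if len(parts) != 2:
            raise ValueError(f"unexpected line: {line!r}")
        doc_id, site_id = parts
        sites[site_id].append(doc_id)
    return dict(sites)


if __name__ == "__main__":
    # Made-up sample data, for illustration only.
    sample = ["<COMMENT> sample", "doc1 siteA", "doc2 siteA", "doc3 siteB"]
    for site, docs in parse_testbed(sample).items():
        print(site, docs)
```

If the real files use a different separator, only the split step needs to change.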
The testbed files have all been compressed with gzip. On Unix, some
browsers drop the .gz suffix when the files are downloaded.
In that case, rename the file to restore the .gz suffix so that gunzip will
uncompress it. On Windows, WinZip will properly uncompress the gzipped file.
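As a possible alternative to renaming the file, Python's standard gzip module reads gzip data without regard to the file name, so a download that lost its .gz suffix can be decompressed directly. This is only a sketch: the function name gunzip_to and the file names in the usage comment are hypothetical.

```python
import gzip
import shutil


def gunzip_to(src_path, dst_path):
    """Decompress the gzip data in src_path into dst_path.

    The name of src_path does not matter; only its contents must be gzip data.
    """
    with gzip.open(src_path, "rb") as src, open(dst_path, "wb") as dst:
        shutil.copyfileobj(src, dst)


# Example call (hypothetical file names):
# gunzip_to("testbed", "testbed.txt")
```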