Training labels

For Track I we will use a set of training labels produced by a group of volunteers. These labels are available in the WEBSPAM-UK2006 collection.

Over 5,600 hosts have at two human assessments, or belong to the list of domain names that are trusted a priori (.ac.uk, .sch.uk, .gov.uk, .mod.uk, .nhs.uk and .police.uk).