Main.PhaseIIICorpus History

Hide minor edits - Show changes to markup

March 12, 2008, at 05:40 AM by 84.88.76.49 -
Changed lines 3-6 from:

The dataset can be downloaded from:

   http://www.yr-bcn.es/webspam/datasets/uk2007/
to:

The dataset (contents, links, and labels) can be downloaded from:

  • http://www.yr-bcn.es/webspam/datasets/uk2007/
March 12, 2008, at 05:37 AM by 84.88.76.49 -
Changed lines 5-6 from:
   * http://www.yr-bcn.es/webspam/datasets/uk2007/
to:
   http://www.yr-bcn.es/webspam/datasets/uk2007/
March 12, 2008, at 05:37 AM by 84.88.76.49 -
Changed lines 3-4 from:

The WEBSPAM-UK2007 dataset will be used for this version of the challenge. It is based on a crawl of .UK done on May 2007.

to:

The dataset can be downloaded from:

   * http://www.yr-bcn.es/webspam/datasets/uk2007/

It is based on a crawl of .UK done on May 2007.

January 30, 2008, at 04:41 AM by 84.88.76.49 -
Changed lines 1-2 from:

Corpus

to:

Web Spam Challenge 2008: Corpus

January 30, 2008, at 04:41 AM by 84.88.76.49 -
Added lines 1-5:

Corpus

The WEBSPAM-UK2007 dataset will be used for this version of the challenge. It is based on a crawl of .UK done on May 2007.

2/3 of the labels have been released for training, and 1/3 of the labels are being held for testing.