Monday, October 20

ICWSM 2009 Data Challenge

Ashkay announced that the data for the ICWSM 2009 data challenge is available.

The dataset consists of 44 million blog posts (27 GB compressed) crawled by Spinn3r between August 1st and October 1st 2008. The paper deadline is in January, so get to work!

2 comments:

  1. Teaming up with Spinn3r is an excellent idea. Thanks for the heads up.

    This could definitely be a good option for other venues to reduce dataset acquisition costs (e.g. TREC Blog Track).

    ReplyDelete
  2. Thanks for the post Jeff, Look forward to catching up with you at ICWSM.

    ReplyDelete