Ashkay announced that the data for the ICWSM 2009 data challenge is available.
The dataset consists of 44 million blog posts (27 GB compressed) crawled by Spinn3r between August 1st and October 1st 2008. The paper deadline is in January, so get to work!
2 comments: