Tuesday, March 9

Data-Intensive Text Processing with MapReduce Updated Book Draft

An updated draft of the upcoming book, Data-Intensive Text Processing with MapReduce by Jimmy Lin and Chris Dyer is available.

The book isn't finished, but it still has interesting material. It emphasizes algorithms for processing text with Mapreduce: co-occurrence analysis, inverted index construction, and the EM algorithm applied to estimating parameters in HMMs.

You can also see Jimmy's cloud computing course (spring 2010) and the Ivory search engine.

No comments:

Post a Comment