- Java Open source Text Mining and Information Extraction tools
(By far the most popular with almost half my traffic) - Current Open Source Search Engine Libraries
- Search at Ebay Part I: Faceted Search and Ebay Express
- HBase: Powerset's BigTable
- Open source collaborative filtering and recommendation systems
- Query Expansion: an alternative to static stemming
- Open Source Scraping (Wrapper Generation) Tools
- Octopart and SupplyFrame: Part Search Engines
- Integrating a Database of Everything with Web Search
- SIAM Data Mining Proceedings, LingPipe 3.0, and fun with Pig, Sawzall, and DryadLinq
I would be interested if anyone has feedback on what they would like to see done differently, improved upon in 2008.
0 comments:
Post a Comment