Tuesday, August 11

Google Unveils New 'Caffeine' Search Infrastructure Update

Caffeine is a top-secret project to rewrite Google's indexing system, and it's finally being released. According to this interview with Matt Cutts, infrastructure-wise, this compares with the BigDaddy update in 2006. There have been major changes under the hood to make indexing more flexible, faster, and more robust. According to the Google post:
For the last several months, a large team of Googlers has been working on a secret project: a next-generation architecture for Google's web search.
You can try an index served on the new architecture in the sandbox they set up to let people try it out. Notice anything different?

Matt Cutts has a post on his blog. The infrastructure team has been working hard:
...a few weeks ago, I joked that the half-life of code at Google is about six months. That means that you can write some code and when you circle back around in six months, about half of that code has been replaced with better abstractions or cleaner infrastructure...
Congratulations to the infrastructure team: I didn't notice a significant difference in the results. I expect this will help Google to significantly increase the size and freshness of their index.

You may remember Cuil. Although it got knocked pretty hard, Cuil was not about next-generation ranking; it was about infrastructure. Read my post for details. It's not clear, but perhaps the Caffeine update tackles some of the issues that Anna Patterson, former Google infrastructure architect, recounted in a Cuil interview:
If they [Google] wanted to triple size of their index, they'd have to triple the size of every server and cluster. It's not easy or fast...increasing the index size will be 'non-trivial' exercise.

Has Google tackled these architecture issues with 'Caffeine'? We may never know.

