<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/'><id>tag:blogger.com,1999:blog-18315968.post6830386365366163832..comments</id><updated>2008-06-17T08:48:11.926-04:00</updated><category term='lingpipe'/><category term='nlp'/><category term='information retrieval'/><category term='java'/><category term='information extraction'/><category term='stemming'/><category term='personalization'/><category term='software'/><category term='local community'/><category term='chandler'/><category term='open source'/><category term='local search'/><category term='google'/><title type='text'>Comments on Jeff's Search Engine Caffè: How Microsoft Live Search Plans to Differentiate I...</title><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://www.searchenginecaffe.com/feeds/6830386365366163832/comments/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/18315968/6830386365366163832/comments/default'/><link rel='alternate' type='text/html' href='http://www.searchenginecaffe.com/2008/06/how-will-microsoft-live-search.html'/><author><name>jeff.dalton</name><uri>http://www.blogger.com/profile/12887721174386884522</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='31' src='http://1.bp.blogspot.com/-BQPIreWshSg/Tf-6pG_XoCI/AAAAAAAAACs/0kJUPQH9tQI/s220/tw-32-sm.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>2</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-18315968.post-729414357453836533</id><published>2008-06-17T08:48:00.000-04:00</published><updated>2008-06-17T08:48:00.000-04:00</updated><title type='text'>Thanks for the pointer to the test.&lt;br&gt;&lt;br&gt;An inte...</title><content type='html'>Thanks for the pointer to the test.&lt;BR/&gt;&lt;BR/&gt;An interesting start, but I have more than a few issues with the way they did their test.  My biggest issue is with their evaluation metric. &lt;BR/&gt;&lt;BR/&gt;"For each engine, we count the number of queries that had at least one "Highly Relevant" result within the first five results the engine returned. This is a version of the "precision at 5" metric from information retrieval."&lt;BR/&gt;&lt;BR/&gt;They aren't using P@5, see the article on &lt;A HREF="http://en.wikipedia.org/wiki/Information_retrieval#Performance_measures" REL="nofollow"&gt;IR evaluation metrics&lt;/A&gt;.  P@5 is  relevant retrieved / retrieved.  &lt;BR/&gt;&lt;BR/&gt;Basically, the metric they are using isn't meaningful.  A better metric would be normalized discounted cumulative gain (NDCG)@10.  Perhaps even better, use pairwise preference judgments: &lt;A HREF="http://ciir.cs.umass.edu/~carteret/ecir08.pdf" REL="nofollow"&gt;Here or There: Preference Judgments for Relevance&lt;/A&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/18315968/6830386365366163832/comments/default/729414357453836533'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/18315968/6830386365366163832/comments/default/729414357453836533'/><link rel='alternate' type='text/html' href='http://www.searchenginecaffe.com/2008/06/how-will-microsoft-live-search.html?showComment=1213706880000#c729414357453836533' title=''/><author><name>jeff.dalton</name><uri>http://www.blogger.com/profile/12887721174386884522</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='30' height='32' src='http://photos1.blogger.com/img/267/8468/100/images.jpg'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.searchenginecaffe.com/2008/06/how-will-microsoft-live-search.html' ref='tag:blogger.com,1999:blog-18315968.post-6830386365366163832' source='http://www.blogger.com/feeds/18315968/posts/default/6830386365366163832' type='text/html'/><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='blogger.itemClass' value='pid-1997369634'/></entry><entry><id>tag:blogger.com,1999:blog-18315968.post-987505292564979563</id><published>2008-06-16T05:05:00.000-04:00</published><updated>2008-06-16T05:05:00.000-04:00</updated><title type='text'>At Dolores Lab they have done an empirical evaluat...</title><content type='html'>At Dolores Lab they have done an empirical evaluation of the 4 major search engines and found no statistical difference between Google, Yahoo and Live. Only Ask performs worst.&lt;BR/&gt;&lt;BR/&gt;They used Amazon's Mechanical Turk to conduct the tests. I was surprised with the tie in the top three.&lt;BR/&gt;&lt;BR/&gt;http://blog.doloreslabs.com/2008/04/search-engine-relevance-an-empirical-test/</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/18315968/6830386365366163832/comments/default/987505292564979563'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/18315968/6830386365366163832/comments/default/987505292564979563'/><link rel='alternate' type='text/html' href='http://www.searchenginecaffe.com/2008/06/how-will-microsoft-live-search.html?showComment=1213607100000#c987505292564979563' title=''/><author><name>Sérgio Nunes</name><uri>http://sergionunes.com</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img1.blogblog.com/img/blank.gif'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.searchenginecaffe.com/2008/06/how-will-microsoft-live-search.html' ref='tag:blogger.com,1999:blog-18315968.post-6830386365366163832' source='http://www.blogger.com/feeds/18315968/posts/default/6830386365366163832' type='text/html'/><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='blogger.itemClass' value='pid-1525011311'/></entry></feed>
