AOL has released several large search-engine-related datasets. This looks like a pretty impressive data release, and it is a big opportunity for people everywhere to worry about search-engine-related learning problems, if they want.
It seems like they’re probably going to take some of this down (http://www.ugcs.caltech.edu/~dangelo/aol-search-query-logs/)
so get your copies before it goes away.
See also “AOL Search Data Shows Users Planning to commit Murder.”
See this: “AOL apologizes for release of user search data.”
The New York Times has an article now.
There is no reasonable debate on the obvious topic here: AOL went too far in releasing this data to the public.
[...] It’s important to understand that this is a much harder problem than many people appreciate. The AOL data release is a good example. To those doing machine learning, the following strategies might be obvious: [...]