AOL’s data drop

AOL has released several large search-engine-related datasets. This looks like a pretty impressive data release, and it is a big opportunity for people everywhere to worry about search-engine-related learning problems, if they want.

    It seems like they’re probably going to take some of this down, so get your copies before it goes away.

    The New York Times has an article now.

    There is no reasonable debate on the obvious topic here: AOL went too far in releasing this data to the public.

  3. […] It’s important to understand that this is a much harder problem than many people appreciate. The AOL data release is a good example. To those doing machine learning, the following strategies might be obvious: […]

