{"id":138,"date":"2005-11-02T23:08:53","date_gmt":"2005-11-03T05:08:53","guid":{"rendered":"http:\/\/hunch.net\/?p=138"},"modified":"2005-11-04T22:26:22","modified_gmt":"2005-11-05T04:26:22","slug":"progress-in-active-learning","status":"publish","type":"post","link":"https:\/\/hunch.net\/?p=138","title":{"rendered":"Progress in Active Learning"},"content":{"rendered":"<p>Several bits of progress have been made since <a href=\"http:\/\/charlotte.ucsd.edu\/users\/dasgupta\/\">Sanjoy<\/a> pointed out the significant <a href=\"https:\/\/hunch.net\/index.php?cat=22\">lack of theoretical understanding of active learning<\/a>.  This is an update on the progress I know of.  As a refresher, active learning as meant here is:<\/p>\n<ol>\n<li>There is a source of unlabeled data.<\/li>\n<li>There is an oracle from which labels can be requested for unlabeled data produced by the source.<\/li>\n<li>The goal is to perform well with minimal use of the oracle.<\/li>\n<\/ol>\n<p>Here is what I&#8217;ve learned:<\/p>\n<ol>\n<li>Sanjoy has developed sufficient and semi-necessary conditions for active learning given the assumptions of IID data and &#8220;realizability&#8221; (that one of the classifiers is a correct classifier).<\/li>\n<li><a href=\"http:\/\/www.cs.cmu.edu\/~ninamf\/\">Nina<\/a>, <a href=\"https:\/\/hunch.net\/~beygel\/publications.html\">Alina<\/a>, and I developed an algorithm for active learning relying on only the assumption of IID data.  A draft is <a href=\"https:\/\/hunch.net\/~jl\/projects\/agnostic_active\/active.pdf\">here<\/a>.<\/li>\n<li><a href=\"http:\/\/mercurio.srv.dsi.unimi.it\/~cesabian\/papers.html\">Nicolo<\/a>, <a href=\"http:\/\/www.dicom.uninsubria.it\/~cgentile\/\">Claudio<\/a>, and <a href=\"http:\/\/www.dti.unimi.it\/~zaniboni\/\">Luca<\/a> showed that it is possible to do active learning in an entirely adversarial setting for linear threshold classifiers <a href=\"http:\/\/www.dti.unimi.it\/~zaniboni\/publications\/rl3.pdf\">here<\/a>.  This was published a year or two ago and I recently learned about it.<\/li>\n<\/ol>\n<p>All of these results are relatively &#8216;rough&#8217;: they don&#8217;t necessarily make good algorithms as stated (although the last one has a few experiments).  None of these results are directly comparable because the assumptions vary.  Comparing the assumptions and the results leads to a number of remaining questions:<\/p>\n<ol>\n<li>Do the sufficient and seminecessary conditions apply to the IID only case? The adversarial case?<\/li>\n<li>Is there a generic algorithm for any hypothesis space that works in the fully adversarial setting?<\/li>\n<li>What are special cases of these algorithms which are computationally tractable and useful?<\/li>\n<\/ol>\n<p>The <a href=\"http:\/\/www.cs.huji.ac.il\/~ranb\/aln05\/\">Foundations of Active Learning<\/a> workshop at NIPS should be a good place to discuss these questions.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Several bits of progress have been made since Sanjoy pointed out the significant lack of theoretical understanding of active learning. This is an update on the progress I know of. As a refresher, active learning as meant here is: There is a source of unlabeled data. There is an oracle from which labels can be &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/hunch.net\/?p=138\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Progress in Active Learning&#8221;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[22,19],"tags":[],"class_list":["post-138","post","type-post","status-publish","format-standard","hentry","category-active","category-solutions"],"_links":{"self":[{"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/posts\/138","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=138"}],"version-history":[{"count":0,"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/posts\/138\/revisions"}],"wp:attachment":[{"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=138"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=138"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=138"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}