{"id":192,"date":"2006-05-23T15:24:19","date_gmt":"2006-05-23T21:24:19","guid":{"rendered":"http:\/\/hunch.net\/?p=192"},"modified":"2006-05-23T15:25:01","modified_gmt":"2006-05-23T21:25:01","slug":"what-is-the-best-regret-transform-reduction-from-multiclass-to-binary","status":"publish","type":"post","link":"https:\/\/hunch.net\/?p=192","title":{"rendered":"What is the best regret transform reduction from multiclass to binary?"},"content":{"rendered":"<p>This post is about an open problem in learning reductions.<\/p>\n<p><strong>Background<\/strong> A reduction might transform a a multiclass prediction problem where there are <em>k<\/em> possible labels into a binary learning problem where there are only 2 possible labels.   On this induced binary problem we might learn a binary classifier with some error rate <em>e<\/em>.  After subtracting the minimum possible (Bayes) error rate <em>b<\/em>, we get a regret <em>r = e &#8211; b<\/em>.  The <a href=\"https:\/\/hunch.net\/~jl\/projects\/reductions\/pecoc\/final\/secoc.ps\">PECOC<\/a>(Probabilistic Error Correcting Output Code) reduction has the property that binary regret <em>r<\/em> implies multiclass regret at most <em>4r<sup>0.5<\/sup><\/em>.<\/p>\n<p><strong>The problem<\/strong> This is not the &#8220;rightest&#8221; answer.  Consider the <em>k=2<\/em> case, where we reduce binary to binary.  There exists a reduction (the identity) with the property that regret <em>r<\/em> implies regret <em>r<\/em>.  This is substantially superior to the transform given by the PECOC reduction, which suggests that a better reduction may exist for general <em>k<\/em>.  For example, we can not rule out the possibility that a reduction <em>R<\/em> exists with regret transform guaranteeing binary regret <em>r<\/em> implies at most multiclass regret <em>c(k) r<\/em> where <em>c(k)<\/em> is a <em>k<\/em> dependent constant between 1 and 4.<\/p>\n<p><strong>Difficulty<\/strong>  I believe this is a solvable problem, given some serious thought.<\/p>\n<p><strong>Impact<\/strong> The use of some reduction from multiclass to binary is common practice, so a good solution could be widely useful.  One thing to be aware of is that there is a common and reasonable concern about the &#8216;naturalness&#8217; of induced problems.  There seems to be no way to address this concern other than via empirical testing.  On the theoretical side, a better reduction may help us understand whether classification or <em>l<sub>2<\/sub><\/em> regression is the more natural primitive for reduction.  The PECOC reduction essentially first turns a binary classifier into an <em>l<sub>2<\/sub><\/em> regressor and then uses the regressor repeatedly to make multiclass predictions.<\/p>\n<p>Some background material which may help:<\/p>\n<ol>\n<li>Dietterich and Bakiri introduce <a href=\"http:\/\/www.cs.cmu.edu\/afs\/cs\/project\/jair\/pub\/volume2\/dietterich95a.pdf\">Error Correcting Output Codes<\/a>.<\/li>\n<li>Guruswami and Sahai <a href=\"http:\/\/www.cs.washington.edu\/homes\/venkat\/pubs\/papers\/colt99boost.ps\">analyze ECOC as an error transform reduction<\/a>. (see lemma 2)<\/li>\n<li>Allwein, Schapire, and Singer <a href=\"http:\/\/www.cs.princeton.edu\/~schapire\/uncompress-papers.cgi\/mult2bin.ps\">generalize ECOC to use loss-based decoding<\/a>.<\/li>\n<li>Beygelzimer and Langford showed that <a href=\"https:\/\/hunch.net\/~jl\/projects\/reductions\/pecoc\/final\/secoc.ps\">ECOC is not a regret transform and proved the PECOC regret transform<\/a>.<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>This post is about an open problem in learning reductions. Background A reduction might transform a a multiclass prediction problem where there are k possible labels into a binary learning problem where there are only 2 possible labels. On this induced binary problem we might learn a binary classifier with some error rate e. After &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/hunch.net\/?p=192\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;What is the best regret transform reduction from multiclass to binary?&#8221;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[29,16,12],"tags":[],"class_list":["post-192","post","type-post","status-publish","format-standard","hentry","category-machine-learning","category-problems","category-reductions"],"_links":{"self":[{"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/posts\/192","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=192"}],"version-history":[{"count":0,"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/posts\/192\/revisions"}],"wp:attachment":[{"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=192"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=192"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=192"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}