<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Machine Learning (Theory)</title>
	<atom:link href="http://hunch.net/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://hunch.net</link>
	<description>Machine learning and learning theory research</description>
	<lastBuildDate>Tue, 24 Aug 2010 23:44:59 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Alex Smola starts a blog</title>
		<link>http://hunch.net/?p=1470</link>
		<comments>http://hunch.net/?p=1470#comments</comments>
		<pubDate>Tue, 24 Aug 2010 23:44:59 +0000</pubDate>
		<dc:creator>jl</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Machine Learning]]></category>

		<guid isPermaLink="false">http://hunch.net/?p=1470</guid>
		<description><![CDATA[Adventures in Data Land.
]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.smola.org/">Adventures in Data Land</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://hunch.net/?feed=rss2&amp;p=1470</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Boosted Decision Trees for Deep Learning</title>
		<link>http://hunch.net/?p=1467</link>
		<comments>http://hunch.net/?p=1467#comments</comments>
		<pubDate>Mon, 23 Aug 2010 17:18:53 +0000</pubDate>
		<dc:creator>jl</dc:creator>
				<category><![CDATA[Deep]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Supervised]]></category>

		<guid isPermaLink="false">http://hunch.net/?p=1467</guid>
		<description><![CDATA[About 4 years ago, I speculated that decision trees qualify as a deep learning algorithm because they can make decisions which are substantially nonlinear in the input representation.  Ping Li has proved this correct, empirically at UAI by showing that boosted decision trees can beat deep belief networks on versions of Mnist which are [...]]]></description>
			<content:encoded><![CDATA[<p>About 4 years ago, I speculated that <a href="http://hunch.net/?p=219">decision trees qualify as a deep learning algorithm</a> because they can make decisions which are substantially nonlinear in the input representation.  <a href="http://www.stat.cornell.edu/~li/">Ping Li</a> has <a href="http://event.cwi.nl/uai2010/papers/UAI2010_0282.pdf">proved this correct, empirically</a> at <a href="http://event.cwi.nl/uai2010/">UAI</a> by showing that boosted decision trees can beat deep belief networks on versions of <a href="http://yann.lecun.com/exdb/mnist/">Mnist</a> which are artificially hardened so as to make them solvable only by deep learning algorithms.  </p>
<p>This is an important point, because the ability to solve these sorts of problems is probably the best objective definition of a deep learning algorithm we have.   I&#8217;m not that surprised.  In my experience, if you can accept the computational drawbacks of a boosted decision tree, they can achieve pretty good performance.</p>
<p><a href="http://www.cs.toronto.edu/~hinton/">Geoff Hinton</a> once told me that the great thing about deep belief networks is that they work.  I understand that Ping had very substantial difficulty in getting this published, so I hope some reviewers step up to the standard of valuing what works.</p>
]]></content:encoded>
			<wfw:commentRss>http://hunch.net/?feed=rss2&amp;p=1467</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>KDD 2010</title>
		<link>http://hunch.net/?p=1450</link>
		<comments>http://hunch.net/?p=1450#comments</comments>
		<pubDate>Mon, 23 Aug 2010 00:39:07 +0000</pubDate>
		<dc:creator>jl</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Machine Learning]]></category>

		<guid isPermaLink="false">http://hunch.net/?p=1450</guid>
		<description><![CDATA[There were several papers that seemed fairly interesting at KDD this year.  The ones that caught my attention are:

Xin Jin, Mingyang Zhang, Nan Zhang, and Gautam Das, Versatile Publishing For Privacy Preservation.  This paper provides a conservative method for safely determining which data is publishable from any complete source of information (for example, [...]]]></description>
			<content:encoded><![CDATA[<p>There were several papers that seemed fairly interesting at <a href="http://www.kdd.org/kdd2010/">KDD this year</a>.  The ones that caught my attention are:</p>
<ol>
<li><a href="http://home.gwu.edu/~xjin/">Xin Jin</a>, Mingyang Zhang, <a href="http://www.seas.gwu.edu/~nzhang10/index.html">Nan Zhang</a>, and <a href="http://ranger.uta.edu/~gdas/">Gautam Das</a>, <a href="http://home.gwu.edu/~xjin/pub/kdd2010.pdf">Versatile Publishing For Privacy Preservation</a>.  This paper provides a conservative method for safely determining which data is publishable from any complete source of information (for example, a hospital) such that it does not violate privacy rules in a natural language.  It is not differentially private, so no external sources of join information can exist.  However, it is a mechanism for <i>publishing</i> data rather than (say) the output of a learning algorithm.</li>
<li><a href="http://www.cs.technion.ac.il/~arikf/">Arik Friedman</a> <a href="http://www.cs.technion.ac.il/~assaf/">Assaf Schuster</a>, <a href="http://www.cs.technion.ac.il/~arikf/online-publications/DiffPDataMining10.pdf">Data Mining with Differential Privacy</a>.  This paper shows how to create effective differentially private decision trees.  Progress in differentially private datamining is pretty impressive, as it was <a href="http://research.microsoft.com/pubs/64346/dwork.pdf">defined in 2006</a>.</li>
<li>David Chan, Rong Ge, Ori Gershony, <a href="http://www.linkedin.com/in/timhesterberg">Tim Hesterberg</a>, <a href="http://research.google.com/pubs/author16666.html">Diane Lambert</a>, <a href="http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/pubs/archive/36552.pdf">Evaluating Online Ad Campaigns in a Pipeline: Causal Models At Scale</a> This paper is about automated estimation of ad campaign effectiveness.  The double robust estimation technique seems intuitively appealing and plausibly greatly enhances effectiveness.</li>
<li><a href="http://domino.research.ibm.com/comm/research_people.nsf/pages/nabe.index.html">Naoki Abe</a> et al. <a href="http://www.cs.wayne.edu/~reddy/Papers/KDD10.pdf">Optimizing Debt Collections Using Constrained Reinforcement Learning</a>  This is an application paper about optimizing the New York State income tax collection agency.  As you might expect, there are several cludgy aspects due to working within legal and organizational constraints.  They deal with them, and expect to end up making NY state around $10<sup>8</sup>/year.  Too bad I live in NY <img src='http://hunch.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </li>
<li><a href="http://www.umiacs.umd.edu/~vikas/publications/papers.html">Vikas C Raykar</a>, <a href="http://web.duke.edu/~balaji/">Balaji Krishnapuram</a>, and <a href="http://www.dbs.informatik.uni-muenchen.de/~spyu/">Shinpeng Yu</a> <a href="http://www.umiacs.umd.edu/~vikas/publications/raykar_kdd2010_cascade_v3.pdf">Designing Efficient Cascaded Classifiers: Tradeoff between Accuracy and Cost</a> This paper is about a <a href="http://hunch.net/?p=240">continuization based solution</a> to designing a cost-efficient yet accurate classifier cascade.  It&#8217;s a step beyond the <a href="http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.6.2036&#038;rep=rep1&#038;type=pdf">Viola Jones style boosting with cutouts</a>, but I suspect not yet a final solution.</li>
<li><a href="http://www.eecs.tufts.edu/~dsculley/">D. Sculley</a>, <a href="http://www.eecs.tufts.edu/~dsculley/papers/combined-ranking-and-regression.pdf">Combined Regression and Ranking</a>.  There are lots of applications where you want both a correct ordering and an estimated value of each item.  This paper shows a simple combined-loss approach to getting both which empirically improves on either metric.</li>
</ol>
<p>In addition, I enjoyed <a href="http://www.quantcast.com/docs/display/info/Management+Team">Konrad Feldman</a>&#8217;s invited talk on <a href="http://www.quantcast.com/">Quantcast</a>&#8217;s data and learning systems which sounded pretty slick.</p>
<p>In general, it seems like KDD is substantially maturing as a conference.  The work on empirically effective privacy-preserving algorithms and some of the stats-work is ahead of what I&#8217;ve seen at other machine learning conferences.  Presumably this is due to KDD being closer to the business side of machine learning and hence more aware of what are real problems there.   An annoying aspect of KDD as a publishing venue is that they don&#8217;t put the papers on the conference website, due to <a href="http://www.acm.org/">ACM</a> constraints.   A substantial compensation is that all talks are scheduled to appear on <a href="http://videolectures.net/">videolectures.net</a> and, as you can see, most papers can be found on author webpages.</p>
<p>KDD also experimented with <a href="http://kdd10.crowdvine.com/calendar">crowdvine</a> again this year so people could announce which talks they were interested in and setup meetings. My impression was that it worked a bit less well than last year, partly because it wasn&#8217;t pushed as much by the conference organizers.  Small changes in the interface might make a big difference&#8212;for example, just providing a ranking of papers by interest might make it pretty compelling.</p>
]]></content:encoded>
			<wfw:commentRss>http://hunch.net/?feed=rss2&amp;p=1450</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Rob Schapire at NYC ML Meetup</title>
		<link>http://hunch.net/?p=1448</link>
		<comments>http://hunch.net/?p=1448#comments</comments>
		<pubDate>Sun, 22 Aug 2010 02:10:09 +0000</pubDate>
		<dc:creator>jl</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Machine Learning]]></category>

		<guid isPermaLink="false">http://hunch.net/?p=1448</guid>
		<description><![CDATA[I&#8217;ve been wanting to attend the NYC ML Meetup for some time and hope to make it next week on the 25th.  Rob Schapire is talking about &#8220;Playing Repeated Games&#8221;, which in my experience is far more relevant to machine learning than the title might indicate.
]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been wanting to attend the <a href="http://www.meetup.com/NYC-Machine-Learning/">NYC ML Meetup</a> for some time and hope to make it <a href="http://nycml-repeated-games-algorithm.eventbrite.com/">next week on the 25th</a>.  <a href="http://www.cs.princeton.edu/~schapire/">Rob Schapire</a> is talking about &#8220;Playing Repeated Games&#8221;, which in my experience is far more relevant to machine learning than the title might indicate.</p>
]]></content:encoded>
			<wfw:commentRss>http://hunch.net/?feed=rss2&amp;p=1448</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Workshop on Cores, Clusters, and Clouds</title>
		<link>http://hunch.net/?p=1446</link>
		<comments>http://hunch.net/?p=1446#comments</comments>
		<pubDate>Fri, 20 Aug 2010 14:47:01 +0000</pubDate>
		<dc:creator>jl</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Workshop]]></category>

		<guid isPermaLink="false">http://hunch.net/?p=1446</guid>
		<description><![CDATA[Alekh, John, Ofer, and I are organizing a workshop at NIPS this year on learning in parallel and distributed environments.  The general interest level in parallel learning seems to be growing rapidly, so I expect quite a bit of attendance.  Please join us if you are parallel-interested.
And, if you are working in the [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.eecs.berkeley.edu/~alekh/">Alekh</a>, <a href="http://www.cs.berkeley.edu/~jduchi/">John</a>, <a href="http://www.oferdekel.com/">Ofer</a>, and I are organizing a <a href="http://lccc.eecs.berkeley.edu/">workshop</a> at <a href="http://nips.cc/">NIPS</a> this year on learning in parallel and distributed environments.  The general interest level in parallel learning seems to be growing rapidly, so I expect quite a bit of attendance.  Please join us if you are parallel-interested.</p>
<p>And, if you are working in the area of parallel learning, please consider <a href="http://lccc.eecs.berkeley.edu/submission.html">submitting an abstract</a> due Oct. 17 for presentation at the workshop.</p>
]]></content:encoded>
			<wfw:commentRss>http://hunch.net/?feed=rss2&amp;p=1446</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>ICML &amp; COLT 2010</title>
		<link>http://hunch.net/?p=1431</link>
		<comments>http://hunch.net/?p=1431#comments</comments>
		<pubDate>Sun, 18 Jul 2010 22:09:43 +0000</pubDate>
		<dc:creator>jl</dc:creator>
				<category><![CDATA[Competitions]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Workshop]]></category>

		<guid isPermaLink="false">http://hunch.net/?p=1431</guid>
		<description><![CDATA[The papers which interested me most at ICML and COLT 2010 were:

Thomas Walsh, Kaushik Subramanian, Michael Littman and Carlos Diuk Generalizing Apprenticeship Learning across Hypothesis Classes.  This paper formalizes and provides algorithms with guarantees for mixed-mode apprenticeship and traditional reinforcement learning algorithms, allowing RL algorithms that perform better than for either setting alone.
István Szita [...]]]></description>
			<content:encoded><![CDATA[<p>The papers which interested me most at <a href="http://www.icml2010.org/index.html">ICML</a> and <a href="http://www.colt2010.org/">COLT</a> 2010 were:</p>
<ol>
<li><a href="http://paul.rutgers.edu/~thomaswa/">Thomas Walsh</a>, <a href=http://videolectures.net/kaushik_subramanian/"">Kaushik Subramanian</a>, <a href="http://www.cs.rutgers.edu/~mlittman/">Michael Littman</a> and <a href="http://www.princeton.edu/~cdiuk/">Carlos Diuk</a> <a href="http://mldiscuss.appspot.com/discuss/475">Generalizing Apprenticeship Learning across Hypothesis Classes</a>.  This paper formalizes and provides algorithms with guarantees for mixed-mode apprenticeship and traditional reinforcement learning algorithms, allowing RL algorithms that perform better than for either setting alone.</li>
<li><a href="http://web.eotvos.elte.hu/szityu/">István Szita</a> and <a href="http://webdocs.cs.ualberta.ca/~szepesva/">Csaba Szepesvári</a> <a href="http://mldiscuss.appspot.com/discuss/546">Model-based reinforcement learning with nearly tight exploration complexity bounds</a>.  This paper and <a href="http://www.jmlr.org/papers/volume11/jaksch10a/jaksch10a.pdf">another</a>represent the frontier of best-known algorithm for Reinforcement Learning in a Markov Decision Process.</li>
<li><a href="http://www.cs.toronto.edu/~jmartens/">James Martens</a> <a href="http://mldiscuss.appspot.com/discuss/458">Deep learning via Hessian-free optimization</a>.  About a new not-quite-online second order gradient algorithm for learning deep functional structures.  Potentially this is very powerful because while people have often talked about end-to-end learning, it has rarely worked in practice.</li>
<li><a href="http://www.cs.uni-potsdam.de/~sawade/">Chrisoph Sawade</a>, <a href=http://www.cs.uni-potsdam.de/~landwehr/index.html"">Niels Landwehr</a>, <a href="http://www.mpi-inf.mpg.de/~bickel/">Steffen Bickel</a>. and <a href="http://mldiscuss.appspot.com/discuss/285">Tobias Scheffer</a> <a href="http://mldiscuss.appspot.com/discuss/285">Active Risk Estimation</a>.  When a test set is not known in advance, the model can be used to safely aid test set evaluation using importance weighting techniques.  Relative to the paper, placing a lower bound on p(y|x) is probably important in practice.</li>
<li><a href="http://www.cs.cmu.edu/~mcmahan/">H. Brendan McMahan</a> and <a href="http://www.cs.cmu.edu/~matts/">Matthew Streeter</a> <a href="http://www.colt2010.org/papers/104mcmahan.pdf">Adaptive Bound Optimization for Online Convex Optimization</a> and the almost-same paper <a href="http://www.cs.berkeley.edu/~jduchi/">John Duchi</a>, <a href="http://www.cs.princeton.edu/~ehazan/">Elad Hazan</a>, and <a href="http://www.magicbroom.info/About.html">Yoram Singer</a>, <a href="http://www.colt2010.org/papers/023Duchi.pdf">Adaptive Subgradient Methods for Online Learning and Stochastic Optimization</a>.  These papers provide tractable online algorithms with regret guarantees over a family of metrics rather than just euclidean metrics.  They look pretty useful in practice.</li>
<li><a href="http://www.cesa-bianchi.net/">Nicolò Cesa-Bianchi</a>, <a href="http://www.dicom.uninsubria.it/~cgentile/">Claudio Gentile</a>, <a href="http://pascallin2.ecs.soton.ac.uk/Network/Researchers/1099/">Fabio Vitale</a>, <a href="http://pascallin2.ecs.soton.ac.uk/Network/Researchers/1708/">Giovanni Zappella</a>, <a href="http://www.colt2010.org/papers/67Vitale.pdf">Active Learning on Trees and Graphs</a> Various subsets of these authors have other papers about actively learning graph-obeying functions which in total provide a good basis for understanding what&#8217;s possible and how to learn.</li>
</ol>
<p>The program chairs for ICML did a wide-ranging <a href="http://www.icml2010.org/ReportAndSurvey/ICML2010ProcessAndSurvey.html">survey</a> over participants.  The results seem to suggest that participants generally agree with the current ICML process.  I expect there is some amount of anchoring effect going on where participants have an apparent preference for the known status quo, although it&#8217;s difficult to judge the degree of that.  Some survey results which aren&#8217;t of that sort are: </p>
<ol>
<li>7.7% of reviewers say author feedback changed their mind.  It would be interesting to know for which fraction of accepted papers reviewers had their mind changed, but that isn&#8217;t there.</li>
<li>85.4% of authors don&#8217;t know if the reviewers read their response, believe they read and ignored it, or believe they didn&#8217;t read it.  Authors clearly don&#8217;t feel like they are communicating with reviewers.</li>
<li>58.6% support growing the conference with the largest fraction suggesting poster-only papers.</li>
<li>Other conferences attended by the ICML community in order are NIPS, ECML/PKDD, AAAI, IJCAI, AIStats, UAI, KDD, ICDM, COLT, SIGIR, ECAI, EMNLP, CoNLL.  This is pretty different from the standard colocation list for ICML.  Many possibilities are precluded by scheduling, but AAAI, IJCAI, UAI, KDD, COLT, SIGIR are all serious possibilities some of which haven&#8217;t been used much in the past.</li>
</ol>
<p>My experience with <a href="http://mark.reid.name/">Mark</a>&#8217;s new <a href="http://mldiscuss.appspot.com/">paper discussion site</a> is generally positive&#8212;having comments emailed to interested parties really helps the discussion.  There are a few comments that authors haven&#8217;t responded to, so if you are an author you might want to sign up to receive comments.</p>
<p>In addition, I was the workshop chair for ICML&#038;COLT this year.  My overall impression was that things went reasonably well, with the exception of internet connectivity at Dan Panorama which was a minidisaster courtesy of a broken per-machine authentication system.  One of the things I&#8217;m particularly happy about was the <a href="http://learningtorankchallenge.yahoo.com/#">Learning to Rank Challenge</a>  workshop.  I think it would be great if ICML can continue to attract new challenge workshops in the future.  If anyone else has comments about the workshops, I&#8217;d love to hear them.</p>
]]></content:encoded>
			<wfw:commentRss>http://hunch.net/?feed=rss2&amp;p=1431</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>MetaOptimize</title>
		<link>http://hunch.net/?p=1425</link>
		<comments>http://hunch.net/?p=1425#comments</comments>
		<pubDate>Fri, 02 Jul 2010 06:39:20 +0000</pubDate>
		<dc:creator>jl</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Machine Learning]]></category>

		<guid isPermaLink="false">http://hunch.net/?p=1425</guid>
		<description><![CDATA[Joseph Turian creates MetaOptimize for discussion of NLP and ML on big datasets.  This includes a blog, but perhaps more importantly a question and answer section.  I&#8217;m hopeful it will take off.
]]></description>
			<content:encoded><![CDATA[<p><a href="http://www-etud.iro.umontreal.ca/~turian/">Joseph Turian</a> creates <a href="http://metaoptimize.com/">MetaOptimize</a> for discussion of NLP and ML on big datasets.  This includes a <a href="http://metaoptimize.com/blog/">blog</a>, but perhaps more importantly a <a href="http://metaoptimize.com/qa">question and answer section</a>.  I&#8217;m hopeful it will take off.</p>
]]></content:encoded>
			<wfw:commentRss>http://hunch.net/?feed=rss2&amp;p=1425</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>2010 ICML discussion site</title>
		<link>http://hunch.net/?p=1419</link>
		<comments>http://hunch.net/?p=1419#comments</comments>
		<pubDate>Sun, 20 Jun 2010 21:00:34 +0000</pubDate>
		<dc:creator>jl</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Organization]]></category>

		<guid isPermaLink="false">http://hunch.net/?p=1419</guid>
		<description><![CDATA[A substantial difficulty with the 2009 and 2008 ICML discussion system was a communication vacuum, where authors were not informed of comments, and commenters were not informed of responses to their comments without explicit monitoring.  Mark Reid has setup a new discussion system for 2010 with the goal of addressing this.
Mark didn&#8217;t want to [...]]]></description>
			<content:encoded><![CDATA[<p>A substantial difficulty with the 2009 and 2008 <a href="http://www.conflate.net/icml/">ICML discussion system</a> was a communication vacuum, where authors were not informed of comments, and commenters were not informed of responses to their comments without explicit monitoring.  <a href="http://mark.reid.name/">Mark Reid</a> has setup a <a href="http://mldiscuss.appspot.com/">new discussion system for 2010</a> with the goal of addressing this.</p>
<p>Mark didn&#8217;t want to make it to intrusive, so you must opt-in.  As an author, <a href="http://mldiscuss.appspot.com/articles">find your paper</a> and &#8220;Subscribe by email&#8221; to the comments.  As a commenter, you have the option of providing an email for follow-up notification.  </p>
]]></content:encoded>
			<wfw:commentRss>http://hunch.net/?feed=rss2&amp;p=1419</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Good News on Exploration and Learning</title>
		<link>http://hunch.net/?p=1389</link>
		<comments>http://hunch.net/?p=1389#comments</comments>
		<pubDate>Sun, 13 Jun 2010 20:49:54 +0000</pubDate>
		<dc:creator>jl</dc:creator>
				<category><![CDATA[Exploration]]></category>
		<category><![CDATA[Interactive]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Online]]></category>

		<guid isPermaLink="false">http://hunch.net/?p=1389</guid>
		<description><![CDATA[Consider the contextual bandit setting where, repeatedly:

A context x is observed.
An action a is taken given the context x. 
A reward r is observed, dependent on x and a.

Where the goal of a learning agent is to find a policy for step 2 achieving a large expected reward.  
This setting is of obvious importance, [...]]]></description>
			<content:encoded><![CDATA[<p>Consider the contextual bandit setting where, repeatedly:</p>
<ol>
<li>A context <em>x</em> is observed.</li>
<li>An action <em>a</em> is taken given the context <em>x</em>. </li>
<li>A reward <em>r</em> is observed, dependent on <em>x</em> and <em>a</em>.</li>
</ol>
<p>Where the goal of a learning agent is to find a policy for step 2 achieving a large expected reward.  </p>
<p>This setting is of obvious importance, because in the real world we typically make decisions based on some set of information and then get feedback only about the single action taken.  It also fundamentally differs from supervised learning settings because knowing the value of one action is not equivalent to knowing the value of all actions.</p>
<p>A decade ago the best machine learning techniques for this setting where implausibly inefficient.  <a href="http://gosset.wharton.upenn.edu/~foster/index.pl">Dean Foster</a> once told me he thought the area was a research sinkhole with little progress to be expected.  Now we are on the verge of being able to routinely attack these problems, in almost exactly the same sense that we routinely attack bread and butter supervised learning problems.  Just as for supervised learning, we know how to create and reuse datasets, how to benchmark algorithms, how to reuse existing supervised learning algorithms in this setting, and how to achieve optimal rates of learning quantitatively similar to supervised learning.</p>
<p>This is also one of the times when understanding the basic theory can make a huge difference in your success.  There are many wrong ways to attack contextual bandit problems or prepare datasets, and taking a wrong turn can easily mean the difference between failure and success.  Understanding how contextual bandit problems differ from basic supervised learning problems is critical to routine success here.</p>
<p>All of the above is not meant to claim that everything is done research-wise here so we&#8217;ll try to outline where the current boundary of research lies as best we can.  However, we are surely at a point both in terms of application demand (especially for internet applications of search, advertising, page optimization, but also medical applications and surely others) and methodology supply (with basic reliable techniques now easily available or created) where these techniques are shifting from theory esoterica to required education.  </p>
<p>Given the above, <a href="http://hunch.net/~beygel/">Alina</a> and I decided to <a href="http://hunch.net/~exploration_learning/">prepare a tutorial</a> to be given at <a href="http://bangalore.yahoo.com/labs/summerschool.html">Yahoo! Labs summer school</a> (my first India trip!), <a href="http://www.icml2010.org/">ICML</a>, <a href="http://www.kdd.org/kdd2010/tutorials.shtml#t4">KDD</a>, and hopefully <a href="http://videolectures.net/"> videolectures.net</a>.  Please join us.  The subjects we plan to cover are essentially the keys to the kingdom of solving shallow interactive learning problems.  </p>
]]></content:encoded>
			<wfw:commentRss>http://hunch.net/?feed=rss2&amp;p=1389</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google Predict</title>
		<link>http://hunch.net/?p=1383</link>
		<comments>http://hunch.net/?p=1383#comments</comments>
		<pubDate>Thu, 20 May 2010 15:25:57 +0000</pubDate>
		<dc:creator>jl</dc:creator>
				<category><![CDATA[Machine Learning]]></category>

		<guid isPermaLink="false">http://hunch.net/?p=1383</guid>
		<description><![CDATA[Slashdot points out Google Predict.  I&#8217;m not privy to the details, but this has the potential to be extremely useful, as in many applications simply having an easy mechanism to apply existing learning algorithms can be extremely helpful.  This differs goalwise from MLcomp&#8212;instead of public comparisons for research purposes, it&#8217;s about private utilization [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://slashdot.org/">Slashdot</a> points out <a href="http://code.google.com/apis/predict/">Google Predict</a>.  I&#8217;m not privy to the details, but this has the potential to be extremely useful, as in many applications simply having an easy mechanism to apply existing learning algorithms can be extremely helpful.  This differs goalwise from <a href="http://mlcomp.org/">MLcomp</a>&#8212;instead of public comparisons for research purposes, it&#8217;s about private utilization of good existing algorithms.  It also differs infrastructurally, since a system designed to do this is much less awkward than using Amazon&#8217;s cloud computing.  The latter implies that datasets several order of magnitude larger can be handled up to limits imposed by network and storage.</p>
]]></content:encoded>
			<wfw:commentRss>http://hunch.net/?feed=rss2&amp;p=1383</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
