(Ab)Use of Bounds

NIPS 2004, Whistler, BC, Canada, December 18

Organizers: Shai Ben-David, John Langford, John Shawe-Taylor, and Bob Williamson

It's been over 30 years since the foundations of sample complexity based learning theory and now seems a good time to assess the program. Has this branch of learning theory been useful?

The purpose of this workshop is not merely progress assessment. The sample complexity bounds community has internal disagreements about what is (and is not) a useful bound, what is (and is not) a tight bound, how (and where) bounds might reasonably be used, and which bounds-related questions should be answered. One goal of this workshop is to debate the merits of these different issues in order to foster better understanding internally as well as externally.

It is not the purpose of the workshop to converge to the one right way to assess sample complexity or learning performance etc; rather we seek to understand the relative merits of diverse approaches and how they relate, recognising that it is very unlikely there is one true and best solution.

The workshop is generally focused on answers to the above questions. Some specific topics include:

Quantitatively tight bounds. (What are they, how are they useful, etc...)
Position statements and arguments about what bounds should deliver.
Bounds for clustering and other "non-standard" learning problems
The relationship between bounds and algorithms
When are bounds useless?
Issues in bound use (computational and informational complexities)
What quantities should bounds depend on? (a priori knowledge of the task? Unlabeled training data? All training data?)

Morning Session: (Ab)Use of older bounds

7:30	Introduction & Opening	John Langford and Bob Williamson
8:15	Discussion
8:20	Using Unlabeled Data in Generalization Error Bounds	Matti Kaariainen
8:40	Discussion
8:45	Break
8:55	Learning the prior for the PAC-Bayes bound	Amiran Ambroladze, Emilio Parrado-Hernandez, and John Shawe-Taylor
9:20	Discussion
9:25	Improved Risk-Tail Bounds for On-line Algorithms	Nicolo Cesa-Bianchi and Claudio Gentile	Paper
9:50	Discussion
9:55	Closing Debate

Afternoon Session: New (Ab)Uses of bounds

4:00	Generalization Bounds for Clustering	Shai Ben-David	presentation
4:10	Generalization Bounds for Clustering - Discussion
4:40	An Objective Evaluation Criterion for Clustering	Arindam Banerjee and John Langford
5:00	Discussion
5:05	Break
5:15	A Sample-Complexity Analysis of Learning from Labeled and Unlabeled Data	Maria Balcan and Avrim Blum
5:35	Discussion
5:40	Error Bounds for Correlation Clustering	Thorsten Joachims and John Hopcroft
6:00	Discussion
6:05	Bounds which exploit a-priori knowledge	Petra Philips
6:25	Discussion
6:30	closing debate