The Spock challenge for named entity recognition was won by Berno Stein, Sven Eissen, Tino Rub, Hagen Tonnies, Christof Braeutigam, and Martin Potthast.

For those interested, I’ve set up a page describing our approach.

That’s great—I couldn’t find it looking around the Spock site.

I find it interesting that, in contrast to the Netflix challenge, where the winner combined many different approaches, your approach is more coherent. Would averaging over the approaches of other contestants yield even better results?

Well, that’s of course a good question. The results of other approaches are also clusterings, and there are some interesting approaches to combining clusterings. Indeed, we tried a modification of the algorithm by Fred and Jain, which essentially counts, for each pair of items *i*, *j*, how often they appear together in a cluster (over all clusterings). The outcome is a matrix that can be interpreted as the adjacency matrix of a weighted graph. This graph can in turn be clustered to obtain the final ensemble clustering. This combination of “evidences” proved to be powerful: it delivered good results for the combination of the clusterings of the individual models. However, combining the models via logistic regression (before clustering) had some advantages in our case: the weights for the individual models could be estimated from the 25,000 training documents, leading to better F-measure values. Nevertheless, it would be worth trying to combine results.
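To make the pair-counting idea concrete, here is a minimal Python sketch of a co-association matrix built from several clusterings. It is an illustration under assumptions, not the winning team's code: the function names `co_association` and `threshold_components`, the toy clusterings, and the 0.5 threshold are all made up for this example, and keeping edges above a threshold and taking connected components is just one simple stand-in for clustering the weighted graph.

```python
from itertools import combinations


def co_association(clusterings, n_items):
    """Fraction of clusterings in which each pair of items shares a cluster."""
    counts = [[0.0] * n_items for _ in range(n_items)]
    for labels in clusterings:
        for i, j in combinations(range(n_items), 2):
            if labels[i] == labels[j]:
                counts[i][j] += 1
                counts[j][i] += 1
    m = len(clusterings)
    return [[c / m for c in row] for row in counts]


def threshold_components(weights, threshold=0.5):
    """Keep edges heavier than `threshold`; return connected-component labels."""
    n = len(weights)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            u = stack.pop()
            for v in range(n):
                if labels[v] == -1 and weights[u][v] > threshold:
                    labels[v] = current
                    stack.append(v)
        current += 1
    return labels


# Three hypothetical clusterings of five items (cluster ids are arbitrary).
clusterings = [
    [0, 0, 0, 1, 1],
    [0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0],
]
weights = co_association(clusterings, 5)
ensemble = threshold_components(weights)
print(ensemble)
```

Note that the counts depend only on whether two items share a cluster, so the arbitrary label ids used by each individual clustering do not need to be aligned before combining them.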