{"id":144,"date":"2005-12-01T10:47:30","date_gmt":"2005-12-01T16:47:30","guid":{"rendered":"http:\/\/hunch.net\/?p=144"},"modified":"2005-12-01T10:49:45","modified_gmt":"2005-12-01T16:49:45","slug":"the-webscience-future","status":"publish","type":"post","link":"https:\/\/hunch.net\/?p=144","title":{"rendered":"The Webscience Future"},"content":{"rendered":"<p>The internet has significantly effected the way we do research but it&#8217;s capabilities have not yet been fully realized.<\/p>\n<p>First, let&#8217;s acknowledge some known effects.<\/p>\n<ol>\n<li><strong>Self-publishing<\/strong> By default, all researchers in machine learning (and more generally computer science and physics) place their papers online for anyone to download.  The exact mechanism differs&#8212;physicists tend to use a central repository (<a href=\"http:\/\/arxiv.org\/\">Arxiv<\/a>) while computer scientists tend to place the papers on their webpage.  Arxiv has been slowly growing in subject breadth so it now sometimes used by computer scientists.<\/li>\n<li><strong>Collaboration<\/strong> Email has enabled working remotely with coauthors.  This has allowed collaborationis which would not otherwise have been possible and generally speeds research.<\/li>\n<\/ol>\n<p>Now, let&#8217;s look at attempts to go further.<\/p>\n<ol>\n<li><strong>Blogs<\/strong> (like this one) allow public discussion about topics which are not easily categorized as &#8220;a new idea in machine learning&#8221; (like this topic).<\/li>\n<li><strong>Organization<\/strong> of some subfield of research.   This includes <a href=\"http:\/\/neuromancer.eecs.umich.edu\/cgi-bin\/twiki\/view\/Main\/MythsofRL\">Satinder Singh&#8217;s Reinforcement Learning<\/a> pages, and, more generally books that have been placed online such as <a href=\"http:\/\/www.cs.ualberta.ca\/%7Esutton\/book\/ebook\/the-book.html\">this one<\/a>. <\/li>\n<li><strong>Discussion Groups<\/strong> The <a href=\"http:\/\/www.kernel-machines.org\/phpbb\/\">kernel machines discussions<\/a> provide a good example of some common format allowing discussion.<\/li>\n<li><strong>Class notes<\/strong> have been placed online such as <a href=\"http:\/\/www.cs.cmu.edu\/~avrim\/ML04\/index.html\">Avrim&#8217;s learning theory lecture notes<\/a>.<\/li>\n<li><strong>Wikipedia<\/strong> has an article on <a href=\"http:\/\/en.wikipedia.org\/wiki\/Machine_learning\">Machine Learning<\/a>.  The article gives a reasonable quick overview and is surely read by a very large number of people.<\/li>\n<li><strong>Online Proceedings<\/strong> are now being used by several conferences such as <a href=\"http:\/\/books.nips.cc\/nips18.html\">NIPS<\/a>.<\/li>\n<\/ol>\n<p>Now, let&#8217;s consider some futures.  <\/p>\n<ol>\n<li><strong>Wikifuture<\/strong> Wikipedia becomes better to the point where it is a completely comprehensive listing of current research in machine learning.  At some point, we-the-community realize this and begin to emphasize (and credit) information placed in wikipedia.  This process reinforces itself to the point where &#8220;if it&#8217;s not in wikipedia, it doesn&#8217;t exist&#8221;.\n<p>This future is significantly more probable than most people understand.  As evidence compare the machine learning page three years ago (yep, it didn&#8217;t exist), <a href=\"http:\/\/en.wikipedia.org\/w\/index.php?title=Machine_learning&#038;oldid=1914904\">two years ago<\/a>, <a href=\"http:\/\/en.wikipedia.org\/w\/index.php?title=Machine_learning&#038;oldid=8494988\">one year ago<\/a>, and <a href=\"http:\/\/en.wikipedia.org\/wiki\/Machine_learning\">today<\/a>.  That progression strongly suggests that wikipedia:machine learning will continue to grow into a significant information resource.<\/p>\n<p>There are fundamental obstacles to the success of the wikipedia future.  <\/p>\n<ol>\n<li><strong>credit<\/strong> Wikipedia has only very weak mechanisms for crediting editors.  A list of the changes done by <a href=\"http:\/\/en.wikipedia.org\/w\/index.php?title=Special:Contributions&#038;target=217.42.38.93&#038;offset=20050426212101&#038;limit=50\">one user account<\/a> is about as much credit as is available.  This is not enough to make career-deciding questions on.  We could hope for a stronger link between identity and editor along with tools to track the value of particular edits (Think of counting hyperlinks as an analogue for counting citations).<\/li>\n<li><strong>controversy<\/strong> Wikipedia has grown up in a nonmanipulative environment.  When it was little known, the incentive to fabricate entries was not great.  Now that it is becoming well known that incentive is growing.  Character assasination by <a href=\"http:\/\/www.usatoday.com\/news\/opinion\/editorials\/2005-11-29-wikipedia-edit_x.htm\">false article<\/a> exists.  In science, the thing to worry about is misplaced ideas of the importance of your topic of research since it is very difficult to be sufficiently interested in a research topic and simultaneously view it objectively.  Research is <em>about<\/em> creating new ideas, and the location of these ideas in some general organization is in dispute by default.<\/li>\n<\/ol>\n<\/li>\n<li><strong>Evolutionary Progression<\/strong>  Consider the following sequence of steps.\n<ol>\n<li><strong>Conference Organization<\/strong> We realize that having a list of online papers isn&#8217;t nearly as useful as having an organized list of online papers so the conferences which already have online proceedings create an explorable topic hierarchy.<\/li>\n<li><strong>Time Organization<\/strong> We realize that the organization at one particular year&#8217;s conference is sketchy&#8212;research is a multiyear endeavor.  Consequently, we start adding to last years topic hierarchy rather than creating a new one from scratch each year.<\/li>\n<li><strong>Transformation<\/strong> We realize that it is better if papers are done in the language of the web.  For example, it&#8217;s very handy to be able to hyperlink inside of a paper.  A good solution to the <a href=\"https:\/\/hunch.net\/?p=75\">math on the web<\/a> problem would greatly help here.<\/li>\n<li><strong>Consolidation<\/strong> We realize that there is a lot of redundancy in two papers on the same or a similar topic.  They share an introduction, motivation, and (often) definitions.  By joining the shared pieces, the contents of both papers can be made clearer.<\/li>\n<\/ol>\n<p>Each of these individual steps clearly yields something better.  At the end of these steps, creating a paper is simply the process of creating a webpage or altering an existing webpage.  We can imagine doing all of this while keeping the peer-review mechanisms of science intact, so the resulting process is simply better in all ways.  It&#8217;s easier to author because for most papers much of the &#8220;filler&#8221; introduction\/motivation\/definition can be reused from previous papers.  It&#8217;s easier to review, because reviewers can consider the result in context.  Much of the difficulty of reviewing is simply due to the author and reviewer not being &#8220;on the same page&#8221; in how they understand things.  An organized topic hierarchy greatly aids this.\n<\/li>\n<li><strong>The unknown<\/strong> It is difficult to anticipate everything.  What other futures might exist?<\/li>\n<\/ol>\n<p>Which future comes about is dependent on many things&#8212;the decisions of community leaders, enabling &#8216;math-on-the-web&#8217; technologies, etc&#8230;, so it is difficult to predict which future and when it will come about.  Nevertheless, a potential exists and there are several paths leading towards reaching that potential.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The internet has significantly effected the way we do research but it&#8217;s capabilities have not yet been fully realized. First, let&#8217;s acknowledge some known effects. Self-publishing By default, all researchers in machine learning (and more generally computer science and physics) place their papers online for anyone to download. The exact mechanism differs&#8212;physicists tend to use &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/hunch.net\/?p=144\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;The Webscience Future&#8221;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-144","post","type-post","status-publish","format-standard","hentry","category-general"],"_links":{"self":[{"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/posts\/144","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=144"}],"version-history":[{"count":0,"href":"https:\/\/hunch.net\/index.php?rest_route=\/wp\/v2\/posts\/144\/revisions"}],"wp:attachment":[{"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=144"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=144"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hunch.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=144"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}