<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>experiment, three &#187; information management</title>
	<atom:link href="http://experimenthree.wordpress.com/tag/information-management/feed/" rel="self" type="application/rss+xml" />
	<link>http://experimenthree.wordpress.com</link>
	<description>The blog you couldn't live without</description>
	<lastBuildDate>Thu, 05 Nov 2009 09:16:26 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='experimenthree.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/7dd08251b23684cd89b03d5604fc5953?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>experiment, three &#187; information management</title>
		<link>http://experimenthree.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://experimenthree.wordpress.com/osd.xml" title="experiment, three" />
		<item>
		<title>Web 2.0 and Databases, can the two worlds meet?</title>
		<link>http://experimenthree.wordpress.com/2008/07/02/web-20-and-databases-can-the-two-worlds-meet/</link>
		<comments>http://experimenthree.wordpress.com/2008/07/02/web-20-and-databases-can-the-two-worlds-meet/#comments</comments>
		<pubDate>Wed, 02 Jul 2008 22:27:00 +0000</pubDate>
		<dc:creator>alezzandro</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[information management]]></category>
		<category><![CDATA[internet]]></category>
		<category><![CDATA[vldb]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[web2.0]]></category>
		<category><![CDATA[wiki]]></category>

		<guid isPermaLink="false">http://experimenthree.wordpress.com/2008/07/02/web-20-and-databases-can-the-two-worlds-meet/</guid>
		<description><![CDATA[A few weeks ago, I had an interesting conversation with Paolo about why Web 2.0 tools are still struggling to find their way into the academic world. Back in September last year I attended the panel What Web 2.0 Has To Do With Databases?, which investigated the reasons why the database community has been left behind [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><span style="font-family:trebuchet ms;">A few weeks ago, </span><span style="font-family:trebuchet ms;">I had an interesting conversation with <a href="http://www.gnuband.org/">Paolo</a> about <a href="http://www.gnuband.org/2008/05/21/science20_and_the_scientific_bzaar_collective_brainstorming_for_better_research/">why Web 2.0 tools are still struggling</a> to find their way into the academic world. </span><span style="font-family:trebuchet ms;">Back in September last year I attended the panel <a href="http://www.vldb.org/conf/2007/papers/panels/p1443-ameryahia.pdf">What Web 2.0 Has To Do With Databases?</a>, which investigated why the database community has been left behind in Web 2.0 research.</p>
<p>Following Paolo&#8217;s suggestion, I am posting the notes I took at the time. Although the two topics are clearly different, I think they are related, because the people who consider blogs, wikis, etc. a &#8220;waste of time&#8221; are also the ones missing the opportunity to do research in such an interesting field.</span><br />***<br /><span style="font-family:trebuchet ms;">Panellists:</span>
<ul>
<li><span style="font-family:trebuchet ms;">Sihem Amer-Yahia (Yahoo!)</span></li>
<li><span style="font-family:trebuchet ms;">Alon Halevy (Google)</span></li>
<li><span style="font-family:trebuchet ms;">AnHai Doan (University of Wisconsin)</span></li>
<li><span style="font-family:trebuchet ms;">Gerhard Weikum (Max-Planck Institute for Informatics, Germany)</span></li>
<li><span style="font-family:trebuchet ms;">Gustavo Alonso (ETH, Zurich)</span></li>
</ul>
<p><span style="font-family:trebuchet ms;">The abstract can be found <a href="http://www.vldb.org/conf/2007/papers/panels/p1443-ameryahia.pdf">here</a>.<br /><a href="http://alonhalevy.blogspot.com/2007/09/web-20-panel.html">Here</a> is Alon Halevy&#8217;s post on the panel: read, in particular, these two comments (<a href="http://alonhalevy.blogspot.com/2007/09/web-20-panel.html?showComment=1190890200000#c5783847103239519041">1</a>, <a href="http://alonhalevy.blogspot.com/2007/09/web-20-panel.html?showComment=1190900100000#c5173875358441442986">2</a>), which, in my opinion, summarise the situation quite well.<br /></span><span style="font-family:trebuchet ms;">***<br /><span style="font-weight:bold;">PROBLEM</span><br /></span> <span style="font-family:trebuchet ms;"><span>Is the database community ready</span><span style="font-weight:bold;"> </span><span>to accept the new</span></span><span style="font-family:trebuchet ms;"> challenges that are coming from the Web 2.0 world? <span style="color:rgb(204, 0, 0);">The risk of &#8220;</span><span style="color:rgb(204, 0, 0);">missing the train</span><span style="color:rgb(204, 0, 0);">&#8221; is very high</span>, considering that <span style="color:rgb(204, 0, 0);">the commercial interest in these technologies is leaving academic research behind</span>.</p>
<p><span style="font-weight:bold;">INTRODUCTION</span></span>
<ul>
<li><span style="font-family:trebuchet ms;"><span style="font-weight:bold;">Web 2.0 is about </span><span style="font-style:italic;">people</span>, <span style="font-style:italic;">unstructured data</span>, <span style="font-style:italic;">imprecise queries</span>, <span style="font-style:italic;">information retrieval</span>. </span></li>
<li><span style="font-family:trebuchet ms;"><span style="font-weight:bold;">Web 2.0 is not</span> about <span style="font-style:italic;">structure</span> and <span style="font-style:italic;">quality</span>.</span></li>
</ul>
<p><span style="font-family:trebuchet ms;">Unstructured data and applications are pervasive: they are everywhere and companies exploit them heavily, but:<br /></span>
<ul>
<li><span style="font-family:trebuchet ms;">A “holistic approach” is lacking (all current solutions are ad hoc)</span></li>
<li><span style="font-family:trebuchet ms;">The “structured methodology”, typical of the database community, should be brought into Web 2.0.</span></li>
</ul>
<p><span style="font-family:trebuchet ms;"><span style="font-weight:bold;">WEB 2.0 IS </span><span style="font-weight:bold;">FASHION, DBMSs RULE</span><br />Database people were not fully convinced by Web 2.0 and <span style="color:rgb(204, 0, 0);">the two worlds seemed quite distant</span>. In general, they do not believe that databases as we know them (their structure, methodologies, best practices, etc.) will ever lose their centrality in any information management application. To them, even Web 2.0 is only a &#8220;cool application&#8221; that will eventually be replaced by something else, whereas databases will still be in place.</p>
<p>This is quite a conservative point of view, and even those who say that “traditional DBMSs are dead” (<a href="http://www.databasecolumn.com/">Michael Stonebraker</a> among others, but he’s not the only one) seem, in practice, to be a bit sceptical about databases losing their centrality.</p>
<p><span style="font-weight:bold;">SCHEMA INTEGRATION FAILED, WEB 2.0 MIGHT BE THE ALTERNATIVE</span><br /><span style="color:rgb(0, 0, 0);">Everybody seemed to </span><span style="color:rgb(204, 0, 0);"><span style="color:rgb(0, 0, 0);">agree that</span> tight schema integration</span><span style="color:rgb(204, 0, 0);"> is a buzzword</span> that <span style="color:rgb(204, 0, 0);">does not work in the real world</span>, despite having been studied for several years in both industry and academia.</p>
<p></span><span style="font-family:trebuchet ms;">Web 2.0 seems a good compromise for achieving &#8220;real&#8221; integration, though this happens at the data level (and should probably be called &#8220;data reconciliation&#8221; instead). From the schema point of view, s</span><span style="font-family:trebuchet ms;">omeone argued that real integration is not possible because there are no strong stakeholders demanding it (these will be neither the people on the street nor Google or Yahoo).<br /></span><br /><span style="font-family:trebuchet ms;"><span style="color:rgb(204, 0, 0);">Google pushes forward the concept of a dataspace</span> (btw, <a href="http://www.cs.washington.edu/homes/alon/files/dataspacesDec05.pdf">Halevy&#8217;s dataspace</a>) that includes all users’ data. The physical system is left in the background, almost a legacy from the past: data is what matters, while databases are needed for storage, reliability, etc. (are we talking about <a href="http://en.wikipedia.org/wiki/Cloud_computing">cloud computing</a>?).<br /></span><span style="font-family:trebuchet ms;"><br /><span style="font-weight:bold;">OTHER COMMENTS</span><br />Someone&#8217;s comment: companies are keen on groups that do research on Web 2.0 and even encourage them to do it. However, Web 2.0 is about people and data: <span style="color:rgb(204, 0, 0);">if the big companies do not release the data they have, how can the DB community do research on it</span> (and what should they analyse?)?</p>
<p></span><span style="font-family:trebuchet ms;">***<br /><span style="font-weight:bold;">SUMMARY</span><br />The two worlds seemed very distant, and the main reason probably lies in their different backgrounds: databases are about structure, methodology and algorithms. Web 2.0 is based on randomness (well, some form of it), no predefined schema and, above all, unpredictable social interactions that are kept away from databases. It is no surprise that communication between the two is particularly difficult.</span></p>
</div>]]></content:encoded>
			<wfw:commentRss>http://experimenthree.wordpress.com/2008/07/02/web-20-and-databases-can-the-two-worlds-meet/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/8204b303bd6f96dd46394785d131ad2e?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">alezzandro</media:title>
		</media:content>
	</item>
	</channel>
</rss>