<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Tim Berners-Lee&#8217;s missing star</title>
	<atom:link href="http://blog.datamarket.com/2012/05/25/tim-berners-lees-missing-star-2/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.datamarket.com/2012/05/25/tim-berners-lees-missing-star-2/</link>
	<description>Data, visualization and startup life</description>
	<lastBuildDate>Wed, 08 May 2013 04:22:32 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Kerstin Forsberg</title>
		<link>http://blog.datamarket.com/2012/05/25/tim-berners-lees-missing-star-2/#comment-886</link>
		<dc:creator><![CDATA[Kerstin Forsberg]]></dc:creator>
		<pubDate>Fri, 25 May 2012 20:07:39 +0000</pubDate>
		<guid isPermaLink="false">http://blog.datamarket.com/?p=1039#comment-886</guid>
		<description><![CDATA[Good point! So, how about making associated meta-data 4/5 star rated before going beyond 3 star for the many data sets?]]></description>
		<content:encoded><![CDATA[<p>Good point! So, how about making associated meta-data 4/5 star rated before going beyond 3 star for the many data sets?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hjalmar Gislason</title>
		<link>http://blog.datamarket.com/2012/05/25/tim-berners-lees-missing-star-2/#comment-885</link>
		<dc:creator><![CDATA[Hjalmar Gislason]]></dc:creator>
		<pubDate>Fri, 25 May 2012 19:31:42 +0000</pubDate>
		<guid isPermaLink="false">http://blog.datamarket.com/?p=1039#comment-885</guid>
		<description><![CDATA[I can agree with this, up to a point. TBL&#039;s rationale here is probably that when the GOVERNMENT is publishing data, it should do so in formats that are non-proprietary to ensure that there are absolutely no discriminating issues or &quot;favoritism&quot; involved. And I agree with him.

I will on the other hand agree with you that for most people, Excel files are more approachable than CSV files for human consumption. However, I&#039;d add the fact that there are other and even more approachable methods for that, e.g. using proper, web-based data publishing solutions such as - touting my own horn - &lt;a href=&quot;http://datamarket.com/p/data_providers/enterprise/&quot; rel=&quot;nofollow&quot;&gt;DataMarket&#039;s Enterprise solution&lt;/a&gt;.]]></description>
		<content:encoded><![CDATA[<p>I can agree with this, up to a point. TBL&#8217;s rationale here is probably that when the GOVERNMENT is publishing data, it should do so in formats that are non-proprietary to ensure that there are absolutely no discriminating issues or &#8220;favoritism&#8221; involved. And I agree with him.</p>
<p>I will on the other hand agree with you that for most people, Excel files are more approachable than CSV files for human consumption. However, I&#8217;d add the fact that there are other and even more approachable methods for that, e.g. using proper, web-based data publishing solutions such as &#8211; touting my own horn &#8211; <a href="http://datamarket.com/p/data_providers/enterprise/" rel="nofollow">DataMarket&#8217;s Enterprise solution</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom Schenk Jr. (@tomschenkjr)</title>
		<link>http://blog.datamarket.com/2012/05/25/tim-berners-lees-missing-star-2/#comment-884</link>
		<dc:creator><![CDATA[Tom Schenk Jr. (@tomschenkjr)]]></dc:creator>
		<pubDate>Fri, 25 May 2012 18:59:57 +0000</pubDate>
		<guid isPermaLink="false">http://blog.datamarket.com/?p=1039#comment-884</guid>
		<description><![CDATA[I like CSV and is my main storage mechanism for data sets for reasons stated before. But should that be such a goal for this listl? To me, open source tends to be about accessibility and understandable for many users.
Often, I see novice users more bewildered about CSV data than Excel. While CSV is better, I think the emphasis should be on users understanding. I rarely see formats like Excel being huge barriers. Besides CSV, merely stating &quot;non-proprietary&quot; can be much worse than Excel. To wit, xml structured data to a novice or even intermediate user can be absolute hell. Even in R, it can be a lot of work to get an XML document imported. Given the choice, Excel is much better option.
To me, it makes more sense for Excel and CSV (e.g., machine readable format) to be 2 stars and consistent format/dicoverability (3.5 stars) be placed at 3 stars. Although, &quot;consistent&quot; needs to be defined. It&#039;s very easy to say data needs to be consistent, but that can be a deep topic.]]></description>
		<content:encoded><![CDATA[<p>I like CSV and is my main storage mechanism for data sets for reasons stated before. But should that be such a goal for this listl? To me, open source tends to be about accessibility and understandable for many users.<br />
Often, I see novice users more bewildered about CSV data than Excel. While CSV is better, I think the emphasis should be on users understanding. I rarely see formats like Excel being huge barriers. Besides CSV, merely stating &#8220;non-proprietary&#8221; can be much worse than Excel. To wit, xml structured data to a novice or even intermediate user can be absolute hell. Even in R, it can be a lot of work to get an XML document imported. Given the choice, Excel is much better option.<br />
To me, it makes more sense for Excel and CSV (e.g., machine readable format) to be 2 stars and consistent format/dicoverability (3.5 stars) be placed at 3 stars. Although, &#8220;consistent&#8221; needs to be defined. It&#8217;s very easy to say data needs to be consistent, but that can be a deep topic.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jurgen</title>
		<link>http://blog.datamarket.com/2012/05/25/tim-berners-lees-missing-star-2/#comment-883</link>
		<dc:creator><![CDATA[jurgen]]></dc:creator>
		<pubDate>Fri, 25 May 2012 17:50:10 +0000</pubDate>
		<guid isPermaLink="false">http://blog.datamarket.com/?p=1039#comment-883</guid>
		<description><![CDATA[I think there are a few more things that could make it into the 3.5th star: data providers work with vintages and publish every vintage in a new directory, so you can&#039;t automatically find the latest content. I would also argue that csv&#039;s with time across columns is bad practice, especially if the rightmost columns are footnotes, not to talk about the unstandardized way of placing table/metric/series/entity/data point annotations that make things unnecessarily hard. 
Star 4 and 5 will gradually evolve, but I think the big step is getting from OPEN to USEABLE OPEN data.]]></description>
		<content:encoded><![CDATA[<p>I think there are a few more things that could make it into the 3.5th star: data providers work with vintages and publish every vintage in a new directory, so you can&#8217;t automatically find the latest content. I would also argue that csv&#8217;s with time across columns is bad practice, especially if the rightmost columns are footnotes, not to talk about the unstandardized way of placing table/metric/series/entity/data point annotations that make things unnecessarily hard.<br />
Star 4 and 5 will gradually evolve, but I think the big step is getting from OPEN to USEABLE OPEN data.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
