<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Should you use a file-system or a database</title>
	<atom:link href="http://punetech.com/should-you-use-a-file-system-or-a-database/feed/" rel="self" type="application/rss+xml" />
	<link>http://punetech.com/should-you-use-a-file-system-or-a-database/</link>
	<description>Connecting together Pune&#039;s Technologists</description>
	<lastBuildDate>Wed, 08 Feb 2012 09:03:24 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
	<item>
		<title>By: navin</title>
		<link>http://punetech.com/should-you-use-a-file-system-or-a-database/#comment-14877</link>
		<dc:creator>navin</dc:creator>
		<pubDate>Wed, 24 Mar 2010 04:14:54 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=960#comment-14877</guid>
		<description>@Swapan, 

As I said in an earlier comment, the question to be answered is this: &quot;When I have some data to store, what are the conditions under which I choose to store it in a file system, and when should I use a database&quot;. Now file-systems and databases start looking comparable. There are a very large number of applications that don&#039;t really need all the complexity, and overheads of a database, but use databases instead of file-systems because they need one feature (maybe indexing, or maybe transactions) that is missing from their file-system.

Why don&#039;t they use ZFS, VxFS etc? My guess is this: MySQL, Oracle, or Sybase will give me a way to get the feature I want without forcing the hardware/OS on me, or without forcing my customers&#039; admins to learn an obscure filesystem (come on, VxFS _is_ obscure compared to the databases I listed). Also, I can get the features I need from the database in a &quot;standard&quot; way, where as in a file-system I&#039;m stuck with proprietary interfaces for getting any of these features.

I think what Jaspreet is arguing for (and others have been saying since early 80s) is that having a &quot;standard&quot; way to do some of these things in file-systems will be a much better solution for many app vendors than being forced to use databases.</description>
		<content:encoded><![CDATA[<p>@Swapan, </p>
<p>As I said in an earlier comment, the question to be answered is this: &#8220;When I have some data to store, what are the conditions under which I choose to store it in a file system, and when should I use a database&#8221;. Now file-systems and databases start looking comparable. There are a very large number of applications that don&#8217;t really need all the complexity, and overheads of a database, but use databases instead of file-systems because they need one feature (maybe indexing, or maybe transactions) that is missing from their file-system.</p>
<p>Why don&#8217;t they use ZFS, VxFS etc? My guess is this: MySQL, Oracle, or Sybase will give me a way to get the feature I want without forcing the hardware/OS on me, or without forcing my customers&#8217; admins to learn an obscure filesystem (come on, VxFS _is_ obscure compared to the databases I listed). Also, I can get the features I need from the database in a &#8220;standard&#8221; way, where as in a file-system I&#8217;m stuck with proprietary interfaces for getting any of these features.</p>
<p>I think what Jaspreet is arguing for (and others have been saying since early 80s) is that having a &#8220;standard&#8221; way to do some of these things in file-systems will be a much better solution for many app vendors than being forced to use databases.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Swapan</title>
		<link>http://punetech.com/should-you-use-a-file-system-or-a-database/#comment-14841</link>
		<dc:creator>Swapan</dc:creator>
		<pubDate>Tue, 23 Mar 2010 21:05:31 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=960#comment-14841</guid>
		<description>Hi Navin and Jaspreet.. Nice article.. Quite informative ..

But, at the same time, I sense that you have missed out on the complete holistic picture from the FS perspective. Just adding my bits on it.

Lets have a look on the features on which you have compared FS and DB:

1. Transaction Support: As you yourself said that ZFS, VxFS, AdVFS are the industry standards which support this feature very well.

2. Snapshots: For a FS, it is mostly expected from the backup application running on top of it to do the snapshotting. At the same time, VxFS, AdVFS etc have taken it to the FS level.

3. Clustering: You have yourself given the examples of FS that support it.

4. Replication: 
    How about the different RAID levels.?
   

PS: Navin can you please explain the motivation behind trying to compare FS and Db when they have been exiting for serving altogether a different organization of data.

Just trying to understand if I am missing on some trivial point. Or, are you just concentrating on the limitation of most of the opensource OS. Check for AdVFS (tru64 UNIX) which is too opensource.  

And what are we trying to build actually siting the limitations of FS ?</description>
		<content:encoded><![CDATA[<p>Hi Navin and Jaspreet.. Nice article.. Quite informative ..</p>
<p>But, at the same time, I sense that you have missed out on the complete holistic picture from the FS perspective. Just adding my bits on it.</p>
<p>Lets have a look on the features on which you have compared FS and DB:</p>
<p>1. Transaction Support: As you yourself said that ZFS, VxFS, AdVFS are the industry standards which support this feature very well.</p>
<p>2. Snapshots: For a FS, it is mostly expected from the backup application running on top of it to do the snapshotting. At the same time, VxFS, AdVFS etc have taken it to the FS level.</p>
<p>3. Clustering: You have yourself given the examples of FS that support it.</p>
<p>4. Replication:<br />
    How about the different RAID levels.?</p>
<p>PS: Navin can you please explain the motivation behind trying to compare FS and Db when they have been exiting for serving altogether a different organization of data.</p>
<p>Just trying to understand if I am missing on some trivial point. Or, are you just concentrating on the limitation of most of the opensource OS. Check for AdVFS (tru64 UNIX) which is too opensource.  </p>
<p>And what are we trying to build actually siting the limitations of FS ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Farheen</title>
		<link>http://punetech.com/should-you-use-a-file-system-or-a-database/#comment-8481</link>
		<dc:creator>Farheen</dc:creator>
		<pubDate>Sun, 25 Oct 2009 20:24:28 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=960#comment-8481</guid>
		<description>I like the articles as well as every comments. I want to clarify that i myself is an engineering student and i want to know the pros and cons of flat file system and database system . Bcoz this article don&#039;t clear the pros and cons . So if somebody reading this blog know the answer plz write an article on this topic .</description>
		<content:encoded><![CDATA[<p>I like the articles as well as every comments. I want to clarify that i myself is an engineering student and i want to know the pros and cons of flat file system and database system . Bcoz this article don&#8217;t clear the pros and cons . So if somebody reading this blog know the answer plz write an article on this topic .</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Himanshu Ubale</title>
		<link>http://punetech.com/should-you-use-a-file-system-or-a-database/#comment-4948</link>
		<dc:creator>Himanshu Ubale</dc:creator>
		<pubDate>Fri, 06 Feb 2009 09:18:33 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=960#comment-4948</guid>
		<description>Most of the applications tend to use both file system as well as database system for various reasons. It is equally important to abstract operations. Consider using something similar to Apache Jackrabbit (JCR implementation); we had excellent results while accessing half-GB size images (on file system) along with typical application data from relational databases thru JCR. Just for clarification, we were not storing webpages; we were storing typical application data.</description>
		<content:encoded><![CDATA[<p>Most of the applications tend to use both file system as well as database system for various reasons. It is equally important to abstract operations. Consider using something similar to Apache Jackrabbit (JCR implementation); we had excellent results while accessing half-GB size images (on file system) along with typical application data from relational databases thru JCR. Just for clarification, we were not storing webpages; we were storing typical application data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mitul</title>
		<link>http://punetech.com/should-you-use-a-file-system-or-a-database/#comment-4940</link>
		<dc:creator>Mitul</dc:creator>
		<pubDate>Thu, 05 Feb 2009 08:35:25 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=960#comment-4940</guid>
		<description>@Navin: I sent a mail on your smriti.com mail id. Did you get it?</description>
		<content:encoded><![CDATA[<p>@Navin: I sent a mail on your smriti.com mail id. Did you get it?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mitul</title>
		<link>http://punetech.com/should-you-use-a-file-system-or-a-database/#comment-4933</link>
		<dc:creator>Mitul</dc:creator>
		<pubDate>Wed, 04 Feb 2009 07:18:27 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=960#comment-4933</guid>
		<description>@Dhananjay: GemFire has been there for more than 6 years as well and Tangosol&#039;s Coherence is our direct competitor. By the way, Tangosol was acquired by Oracle and it is now Oracle Coherence. You are right about TerraCotta, though it is more like a shared JVM as opposed to a typical distributed data management solution but definitely in a similar space.</description>
		<content:encoded><![CDATA[<p>@Dhananjay: GemFire has been there for more than 6 years as well and Tangosol&#8217;s Coherence is our direct competitor. By the way, Tangosol was acquired by Oracle and it is now Oracle Coherence. You are right about TerraCotta, though it is more like a shared JVM as opposed to a typical distributed data management solution but definitely in a similar space.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dhananjay Nene</title>
		<link>http://punetech.com/should-you-use-a-file-system-or-a-database/#comment-4932</link>
		<dc:creator>Dhananjay Nene</dc:creator>
		<pubDate>Wed, 04 Feb 2009 07:11:13 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=960#comment-4932</guid>
		<description>@mitul, you are right these solutions have been proprietary for long (hence no links to them). A distributed, transactional, in-memory caching solution I know has been around since 2004 at least is Tangosol. An open source solution I am aware of which is moving parts of data management from the database into a distributed in memory storage is TerraCotta. These may not serve exactly the same set of purposes as GemFire would, but the general class of distributed transactional in memory storage has been active for some time. 

Shall look forward to your post.</description>
		<content:encoded><![CDATA[<p>@mitul, you are right these solutions have been proprietary for long (hence no links to them). A distributed, transactional, in-memory caching solution I know has been around since 2004 at least is Tangosol. An open source solution I am aware of which is moving parts of data management from the database into a distributed in memory storage is TerraCotta. These may not serve exactly the same set of purposes as GemFire would, but the general class of distributed transactional in memory storage has been active for some time. </p>
<p>Shall look forward to your post.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nilesh Sane</title>
		<link>http://punetech.com/should-you-use-a-file-system-or-a-database/#comment-4931</link>
		<dc:creator>Nilesh Sane</dc:creator>
		<pubDate>Wed, 04 Feb 2009 07:08:39 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=960#comment-4931</guid>
		<description>Good article, always nice to refresh oneself on the &quot;classic&quot; db terms.
I&#039;d like to add my two cents to the discussion.
The author mentioned Replication as a means to have disaster recovery plans, but replication can and more often does go beyond it. Replication is a very essential indgredient for managing distributed databases. DoubleD&#039;s are common now a days and file system db&#039;s will find it hard to match this requirement.
I wonder how XML&#039;s would be rated then? Just another means of EDI?</description>
		<content:encoded><![CDATA[<p>Good article, always nice to refresh oneself on the &#8220;classic&#8221; db terms.<br />
I&#8217;d like to add my two cents to the discussion.<br />
The author mentioned Replication as a means to have disaster recovery plans, but replication can and more often does go beyond it. Replication is a very essential indgredient for managing distributed databases. DoubleD&#8217;s are common now a days and file system db&#8217;s will find it hard to match this requirement.<br />
I wonder how XML&#8217;s would be rated then? Just another means of EDI?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jaspreet</title>
		<link>http://punetech.com/should-you-use-a-file-system-or-a-database/#comment-4930</link>
		<dc:creator>Jaspreet</dc:creator>
		<pubDate>Wed, 04 Feb 2009 06:38:32 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=960#comment-4930</guid>
		<description>Navin,

There is a dark side to the cloud as well. We investigated a lot in this space. And I plan to write on this soon.

My reasons -

1. Its something which is critical to your business and you can&#039;t control - I know 3 startups which went out of business because of cloud downtimes and data losses.

Unlike hosted email, which is there for the past 20 years. This space would take a long time to mature. Besides,no one minds if the email server goes down for some time.

2. Applications need to be changed to adjust to cloud API.

3. Performance - Its a big question here.

4. Its a vendor lockin. You can&#039;t change options tomorrow.


We investigated on a lot of options and benchmarked - SQLite, MSSQL,  MySQL, PostgreSQL and Oracle. Then finally settled on PostgreSQL. Will share this investigation sometime.

@sashi, look at the disclosure.

Jaspreet</description>
		<content:encoded><![CDATA[<p>Navin,</p>
<p>There is a dark side to the cloud as well. We investigated a lot in this space. And I plan to write on this soon.</p>
<p>My reasons -</p>
<p>1. Its something which is critical to your business and you can&#8217;t control &#8211; I know 3 startups which went out of business because of cloud downtimes and data losses.</p>
<p>Unlike hosted email, which is there for the past 20 years. This space would take a long time to mature. Besides,no one minds if the email server goes down for some time.</p>
<p>2. Applications need to be changed to adjust to cloud API.</p>
<p>3. Performance &#8211; Its a big question here.</p>
<p>4. Its a vendor lockin. You can&#8217;t change options tomorrow.</p>
<p>We investigated on a lot of options and benchmarked &#8211; SQLite, MSSQL,  MySQL, PostgreSQL and Oracle. Then finally settled on PostgreSQL. Will share this investigation sometime.</p>
<p>@sashi, look at the disclosure.</p>
<p>Jaspreet</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: navin</title>
		<link>http://punetech.com/should-you-use-a-file-system-or-a-database/#comment-4929</link>
		<dc:creator>navin</dc:creator>
		<pubDate>Wed, 04 Feb 2009 06:37:32 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=960#comment-4929</guid>
		<description>@Dhananjay, Agreed that doing a good in-depth analysis is easily a one-semester course! However, I think there is still value in giving a 30,000-foot view of this area. I mean one paragraph on each of the major approaches that appear to be emerging.

@Mitul, what you are describing sounds fine - can you write an article for PuneTech on that topic?</description>
		<content:encoded><![CDATA[<p>@Dhananjay, Agreed that doing a good in-depth analysis is easily a one-semester course! However, I think there is still value in giving a 30,000-foot view of this area. I mean one paragraph on each of the major approaches that appear to be emerging.</p>
<p>@Mitul, what you are describing sounds fine &#8211; can you write an article for PuneTech on that topic?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

