<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: TechWeekend #3: Website Performance, Scalability and Availability: Sept 5</title>
	<atom:link href="http://punetech.com/techweekend-3-website-performance-scalability-and-availability-sept-5/feed/" rel="self" type="application/rss+xml" />
	<link>http://punetech.com/techweekend-3-website-performance-scalability-and-availability-sept-5/</link>
	<description>Connecting together Pune&#039;s Technologists</description>
	<lastBuildDate>Wed, 08 Feb 2012 09:03:24 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
	<item>
		<title>By: Abhijit Sharma</title>
		<link>http://punetech.com/techweekend-3-website-performance-scalability-and-availability-sept-5/#comment-7826</link>
		<dc:creator>Abhijit Sharma</dc:creator>
		<pubDate>Wed, 09 Sep 2009 13:18:28 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=1617#comment-7826</guid>
		<description>Thanks Mukul,

That certainly makes sense. I had a couple of follow-up questions:

* How do you get your data (the input data) onto the local storage of your large EC2 instances?

* The EC2 instances are not persistent, i.e. when they go down, any local storage disappears. How do you plan for such an eventuality, i.e. some of the EC2 instances in your Hadoop cluster going down?

Regards
Abhijit</description>
		<content:encoded><![CDATA[<p>Thanks Mukul,</p>
<p>That certainly makes sense. I had a couple of follow-up questions:</p>
<p>* How do you get your data (the input data) onto the local storage of your large EC2 instances?</p>
<p>* The EC2 instances are not persistent, i.e. when they go down, any local storage disappears. How do you plan for such an eventuality, i.e. some of the EC2 instances in your Hadoop cluster going down?</p>
<p>Regards<br />
Abhijit</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mukul Kumar</title>
		<link>http://punetech.com/techweekend-3-website-performance-scalability-and-availability-sept-5/#comment-7825</link>
		<dc:creator>Mukul Kumar</dc:creator>
		<pubDate>Wed, 09 Sep 2009 10:30:30 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=1617#comment-7825</guid>
		<description>Hi Abhijit - we have considered AWS; however, there were two problems: 1) storing in S3 would be too slow for very large data sets, and 2) storing on EBS would be very costly, since EBS is expensive per GB.

That said, it is possible to run Hadoop on EC2; many people are doing that. The way you would do the hardware configuration is to use a large EC2 instance and the local storage that comes with the server. Then you can run HDFS on top of the local storage.

I hope that helps.

Thanks,
 Mukul.</description>
		<content:encoded><![CDATA[<p>Hi Abhijit &#8211; we have considered AWS; however, there were two problems: 1) storing in S3 would be too slow for very large data sets, and 2) storing on EBS would be very costly, since EBS is expensive per GB.</p>
<p>That said, it is possible to run Hadoop on EC2; many people are doing that. The way you would do the hardware configuration is to use a large EC2 instance and the local storage that comes with the server. Then you can run HDFS on top of the local storage.</p>
<p>I hope that helps.</p>
<p>Thanks,<br />
 Mukul.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Abhijit Sharma</title>
		<link>http://punetech.com/techweekend-3-website-performance-scalability-and-availability-sept-5/#comment-7824</link>
		<dc:creator>Abhijit Sharma</dc:creator>
		<pubDate>Wed, 09 Sep 2009 06:19:51 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=1617#comment-7824</guid>
		<description>Hi Mukul,

It was indeed a very interesting session, particularly the interaction and the discussions around various points of interest. I seem to recollect that you were investigating the use of Hadoop to process the humongous amount of data that you get from the activity (clicks etc.) at the various websites where the ads get displayed. I was wondering whether you have considered using the Amazon Elastic MapReduce offering: http://aws.amazon.com/elasticmapreduce/

It seems that this requires the data to be in S3. Would loading the data into S3 come at a prohibitive cost? Are there any other concerns that you would have about using the cloud for this use case, as opposed to your own hosted servers?

Regards
Abhijit</description>
		<content:encoded><![CDATA[<p>Hi Mukul,</p>
<p>It was indeed a very interesting session, particularly the interaction and the discussions around various points of interest. I seem to recollect that you were investigating the use of Hadoop to process the humongous amount of data that you get from the activity (clicks etc.) at the various websites where the ads get displayed. I was wondering whether you have considered using the Amazon Elastic MapReduce offering: <a href="http://aws.amazon.com/elasticmapreduce/" rel="nofollow">http://aws.amazon.com/elasticmapreduce/</a></p>
<p>It seems that this requires the data to be in S3. Would loading the data into S3 come at a prohibitive cost? Are there any other concerns that you would have about using the cloud for this use case, as opposed to your own hosted servers?</p>
<p>Regards<br />
Abhijit</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Web Scalability and Performance &#8211; Real Life Lessons (Pune TechWeekend #3) &#124; PuneTech</title>
		<link>http://punetech.com/techweekend-3-website-performance-scalability-and-availability-sept-5/#comment-7822</link>
		<dc:creator>Web Scalability and Performance &#8211; Real Life Lessons (Pune TechWeekend #3) &#124; PuneTech</dc:creator>
		<pubDate>Wed, 09 Sep 2009 03:02:08 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=1617#comment-7822</guid>
		<description>[...] TechWeekend #3: Website Performance, Scalability and Availability: Sept 5  [...]</description>
		<content:encoded><![CDATA[<p>[...] TechWeekend #3: Website Performance, Scalability and Availability: Sept 5  [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mukul Kumar</title>
		<link>http://punetech.com/techweekend-3-website-performance-scalability-and-availability-sept-5/#comment-7800</link>
		<dc:creator>Mukul Kumar</dc:creator>
		<pubDate>Sun, 06 Sep 2009 17:24:35 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=1617#comment-7800</guid>
		<description>Hi,

I just posted my presentation on &#039;Web Scalability &amp; Performance&#039; at the following URL:

http://mukulblog.blogspot.com/2009/09/web-scalability-performance-real-life.html

Thanks again for the great response and questions. I will be happy to answer more questions. You can send me an email or a message on Twitter at @mukulneetika.

Thanks,
 Mukul.</description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>I just posted my presentation on &#8216;Web Scalability &amp; Performance&#8217; at the following URL:</p>
<p><a href="http://mukulblog.blogspot.com/2009/09/web-scalability-performance-real-life.html" rel="nofollow">http://mukulblog.blogspot.com/2009/09/web-scalability-performance-real-life.html</a></p>
<p>Thanks again for the great response and questions. I will be happy to answer more questions. You can send me an email or a message on Twitter at @mukulneetika.</p>
<p>Thanks,<br />
 Mukul.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: navin</title>
		<link>http://punetech.com/techweekend-3-website-performance-scalability-and-availability-sept-5/#comment-7744</link>
		<dc:creator>navin</dc:creator>
		<pubDate>Thu, 03 Sep 2009 11:29:04 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=1617#comment-7744</guid>
		<description>Yes, indeed. It&#039;s 5th September. Thanks for pointing that out - I&#039;ve corrected the post.</description>
		<content:encoded><![CDATA[<p>Yes, indeed. It&#8217;s 5th September. Thanks for pointing that out &#8211; I&#8217;ve corrected the post.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dipen</title>
		<link>http://punetech.com/techweekend-3-website-performance-scalability-and-availability-sept-5/#comment-7741</link>
		<dc:creator>Dipen</dc:creator>
		<pubDate>Thu, 03 Sep 2009 08:51:36 +0000</pubDate>
		<guid isPermaLink="false">http://punetech.com/?p=1617#comment-7741</guid>
		<description>5th August must be a typo :).</description>
		<content:encoded><![CDATA[<p>5th August must be a typo <img src='http://punetech.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .</p>
]]></content:encoded>
	</item>
</channel>
</rss>

