Category Archives: Featured

The best articles

Spread PuneTech – tell your friends why you like PuneTech

Please consider putting this (or a smaller badge) on your blog/website. See the bottom of this post for other sizes.
Please consider putting this (or a smaller badge) on your blog/website. See the bottom of this article for other sizes available

Today is PuneTech’s first birthday, and as a birthday gift, we are asking our readers to spread the word about PuneTech through emails, blogs, twitter and other means. (Update 1: Honorary Punekar Sushrut Bidwai has written a nice post giving the reasons why he likes PuneTech. I think he has managed to articulate very well a lot of the things we have been striving for with PuneTech. Please take a look at the article and comments there. Update 2: The community is doing a better job of articulating what PuneTech is than we are able to do ourselves. Dhananjay Nene, has a post giving insights into the role that PuneTech plays in the tech community in Pune. Update 3: Sahil Khan of The Tossed Salad called me up for a mini-interview to produce this article. Update 4: Rohit Srivastwa has a great post about what “PuneTech is…” according to google)

Why the blatant begging? Read on.

PuneTech is a non-commercial, by the community, for the community site. Which means that any and all publicity/marketing for PuneTech happens only one way: word of mouth. And that’s your mouth and your blog that we are talking about. So here’s what we are asking for:

If you don’t love PuneTech, let us know why. And give us some time to fix it – we are volunteering only part of our time on this. But if you do love PuneTech, help us spread the word. Send email to a few of your friends who are techies in Pune asking them to subscribe to our updates by email or via RSS (free!) That is the main thing we care about – more subscribers. Because, the value that a community provides to its members is directly proportional to the square of the number of members. Also, if someone visits the website directly, there would be days when they forget, or are too busy, and miss some updates. In contrast, a subscriber is forever.

The other thing that we care about is google rank. A search for “Pune technology” results in a PuneTech page showing up as the second link (at least for us – google search results are different for different people, so please don’t flame us if you see different results). Strangely, that link is to the PuneTech wiki page about the PuneTech linked-in group. And, sometimes there is a sub-link to Amit’s article about whether Pune can emulate Silicon Valley. While I don’t understand why the PuneTech front page is not ranked higher than these articles, I am happy that at least this showed up. (By the way, have you joined the PuneTech group on linked-in? Considering that it is the second search result, you probably should 🙂

But what I find extremely irritating is that the first link Google throws up for this search is a ridiculously outdated, ridiculously incomplete listing of IT companies from PuneDiary (http://www.punediary.com/itcomp.html). I cringe to think that this is the first impression about Pune for people all over the world searching for “pune technology.” I’ve tried contacting the PuneDiary people, and suggested that they update that page with PuneTech content (all our content is CC-licensed). I even offered to update the page myself and keep it updated without asking for anything in return. But no response.

So now I’d like to go on the warpath. I would like to knock that page off the top of the google search results. I want “pune technology” to be linked with the PuneTech front page. I believe that, while not perfect, it is a much better indicator of the activity and energy of the software technology community in Pune.

So who is with me? If you agree, please link to the PuneTech front page from whereever it is appropriate. And let others know why. If possible, put up an “I love PuneTech” badge. (If you don’t agree, please let us know why in the comments below, and suggest an alternative. Because the current number #1 search result is completely unacceptable.)

Here are the “I love PuneTech” badges that for you to put up somewhere. If you would like a different size, please let us know. If you are a graphic designer and can come up with better badges, please, please do so and let us know.

I love PuneTech medium

http://punetech.com/files/I_love_punetech_logo-135x80.gif

I love PuneTech medium

http://punetech.com/files/I_love_punetech_logo-200x119.gif

I love PuneTech medium

http://punetech.com/files/I_love_punetech_logo-300x178.gif

For larger images (i.e. if you want to make a T-shirt or a mug) check out

http://punetech.com/files/I_love_punetech_logo-506x300.gif

and

http://punetech.com/files/I_love_punetech_logo_large.gif

Dhanesh Shivanandan of Pune IT Labs has created some more badges for us.
I love PuneTech

http://punetech.com/files/I_love_punetech_logo2-120x120.jpg

I love PuneTech

http://punetech.com/files/I_love_punetech_logo2-150x150.jpg

I love PuneTech

http://punetech.com/files/I_love_punetech_logo2-200x200.jpg

PuneTech Comment Policy

PuneTech is a for-the-community, by-the-community site, and comments by our readers play an important part of the content. However, to ensure that the discussion always stays healthy, constructive and safe, we occasionally have to delete some of the comments. This note lays out our comments policy to help the community understand what kinds of comments we delete, and why.

PuneTech comment policy – Short version

  if (the comment is not relevant to the article)
     We will delete it;
       /* take your irrelevant rambling elsewhere */

  else if (the comment is a personal attack)
     we will delete it;
       /* rude people not welcome here */

  else if (the comment has abusive language)
     we will delete it;
       /* we are trying to have a civil discussion here */

  else if (the comment exposes PuneTech to legal liability)
     we will delete it;
       /* we don't want to get sued
          that distracts from the purpose of this website
          more details below */

  else
     your comment is welcome;

In select cases, we might allow a comment in spite of violating one of the above rules, if it has other redeeming qualities. Also, if we delete a comment, and you really, really want your voice heard, we suggest a workaround that will allow the world to still see the comment.

PuneTech comment policy – Long version

Relevance

Our primary objective is to provide PuneTech readers with focused, relevant articles and discussions. Anything that distracts from this reduces the value of PuneTech for our readers. Hence, any comment that has nothing to do with the article (and trust me, we get a bunch of these), will be deleted (unless we find it very interesting in its own right). If you want a job, please post your resume on naukri.com – don’t post a comment here. If you find yourself compelled to beg for jobs on PuneTech, seriously consider changing careers.

Personal Attacks

We are trying to build a community here, not poison it. Something about the internet makes people more rude than they would be in real life. Please resist the temptation. We love a good argument, we are after all argumentative Indians. But please argue the issues. You might be surprised to discover that it can be done without attacking the character of the other person.

Abusive Language

If it is worth saying, it can be said in polite language. If you have abusive language in a boring comment, we’ll delete it. If you have abusive language in an interesting comment, we will, at our discretion, remove the offending words, or sentences. If you don’t want your comment mangled like this by us, use polite language.

Defamation

PuneTech is a non-commercial website that is run by us on a part-time basis. We make no money from PuneTech. Which means that we have neither the time, nor the money to get involved in legal issues. We cannot afford to retain lawyers to get accurate legal advice. In the absense of that, we have to make a guess based on our understanding of the law. And anything that we think exposes us to legal liability, will be deleted.

Here is our limited understanding of the law:

If something can hurt the reputation of another person or company, legally, we can publish it if and only if it is true.  This is tricky because we need to be sure of the truth before we feel safe. Just because it is on wikipedia, does not make it true. Just because Times of India published it, does not make it necessarily true. And we can be sued even if we are simply relaying info published by someone else.

Even if the damaging statements are contained in a comment made by a third-party commenter (i.e. somebody other than us) we are still obligated to remove the comment. Otherwise PuneTech can be held liable.

So it boils down to this: if we cannot verify the truth of a damaging claim in a comment, we will delete the comment.

Please note, just because it is true, does not necessarily mean that we will allow a comment. The earlier filters of relevance, rudeness, etc. still apply. If we are unsure about the “public good” of a true but damaging statement, we will delete the comment.

Other Objectionable Content

Other reasons why comments might fall afoul of the laws are: obscenity; hurting religious sentiments; promoting violence; against security of the state; or infringing of someone’s right to privacy. In most cases, these will get deleted for violating one of our earlier policies (e.g. irrelevance, personal attack, etc.) . In the rare case that the comment somehow manages to not violate any of the earlier policies, it can still get deleted for being against the law.

Workaround

When we delete one of your comments, we are not really preventing you from expressing yourself. Please feel free to go ahead and post it on your own blog. If for some reason, you are ashamed of putting your own comment on your own blog, go ahead and create a brand new blog on blogger.com just for holding this one comment. It’s easy, it’s free, and anybody can do it. Then post a link in the comments on PuneTech. If it is relevant to the post, we’ll probably allow the link to remain.

Suggestions

If you have any feedback for us, please leave a comment below, or send us an email. The comment is subject to the same policies (ha! ha!) unless we decide to change the policy based on your suggestion. In any case, we promise to read everything, even if we delete it.

Moderation

Comments on PuneTech are moderated. Which means that one of us might have to take a look and approve the comment before it appears on the site. Sometimes, it takes us a while to get around to doing this. Please be patient. Don’t post the comment multiple times. If you are unsure of whether your comment has reached our moderation queue, send us an email.

Further Reading

What I’ve learned from Hacker News by Paul Graham. A good introduction to the issues to be considered when deciding why and how to moderate comments on a site.

Free to blog but accountable you are. The Supreme Court of India weighs in on blogging and online expression. – Dhananjay Nene

Of blogs, bloggers and freedom of expression – Mutiny.in

Bloggers Legal Guide from EFF – Note this applies to US law, but still worth reading, as it does a great job of explaining the issues.

Acknowledgements

Thanks to Dhananjay Nene, Rohit Srivastwa, Amit Kumar Singh, Unmesh Mayekar, Manas Garg, Rohas Nagpal, and Debasis Nayak for discussions that helped us clarify our thinking and craft this policy. Note: this comment policy does not necessarily reflect the views of these people – it is just that they helped us while we were struggling to figure out what the comment policy should be.

Reblog this post [with Zemanta]

One Laptop Per Child (OLPC) Project comes to Pune – Please join

100 dollar laptop: production prototype
Image via Wikipedia

The One Laptop per Child association develops a low-cost laptop—the “XO Laptop”—to revolutionize how we educate the world’s children. Their mission is “to provide educational opportunities for the world’s poorest children by providing each child with a rugged, low-cost, low-power, connected laptop with content and software designed for collaborative, joyful, self-empowered learning.”

Now a bunch of volunteers have banded together to take the OLPC initiative forward in Pune. If you believe in this mission, you should join them. They will have a kickoff meeting on Sunday (1st March 2009) to decide how to take this forward. This would be a good place to find out more about the project and see how you can contribute. You don’t need to be a technical person to contribute.

If you are unable to attend this event, but would still like to contribute, join the OLPC India mailing list. For more details, see the OLPC Pune wiki.

Details: OLPC Camp Pune

What: Kickoff meeting for volunteers interested in furthering the cause of the One Laptop Per Child (OLPC) group in Pune.
When: 9am-5pm, Sunday, 1st March, 2009
Where: Room No. 207, Symbiosis Institute of Computer Studies and Research (SICSR), 1st Floor, Atur Centre, Gokhale Cross Road, Model Colony, Pune. Map.
Registration and Fees: This event is free for anyone to attend. You are requested to register here.

Tentative Schedule:

Time Room No.207 Session conducted by
0845 Coffee and registration
0900 Introduction and Agenda Review
0915 XO World – Why, What, Where To Sankarshan Mukhopadhyay from The Fedora Project
1215 Break for Lunch
1300 Localization of Sugar -Introduction and Getting Started Sankarshan Mukhopadhyay from The Fedora Project
1400 Digital Bridge Foundation (DBF)- Introduction Amit Gogna from Digital Bridge Foundation
1500 Starting OLPC Pune Deployment – Road Ahead Open Discussion
1600 Break for Tea
1615 To be Scheduled …
Reblog this post [with Zemanta]

TouchMagix: Convert any wall or floor into an interactive touchscreen

TouchMagix is a Pune-based startup with a product that can convert any wall of floor into an interactive touch screen. Targeted towards “high footfall” areas like malls, conferences and lobbys, it is a technology with a lot of potential, and frankly, something that we don’t expect to see out of Indian companies. After being very impressed with the demo at their Bhonsale Nagar office, we interviewed founder Anup Tapadia for PuneTech. TouchMagix technology will be on display on both days of the Hi Tech Pune Maharashtra 2009 conference in Pune on 26th and 27th – it’s definitely worth checking out. Anup will also be speaking at the conference on Friday. Read on for the interview.


Q: Can you give an overview of TouchMagix?

A: TouchMagix is a technology that allows any projection screen to be made interactive. Imagine a picture being projected on a regular wall, and a person can walk up to the wall and start interacting with the objects in the picture, and they actually react to the touch. TouchMagix technology allows adding such multi-point interactivity to any projected surface i.e walls, floors or screens. The projected surface reacts to the human gestures. It reacts to a touch, or it can react to gestures from a distance. And this can be done with any existing surface.

It is a medium that attracts crowds of all ages and helps brands “Get Noticed” with fun and interaction. It has extensive applications in retail, malls, advertising, events, promotions and gaming. We believe that this is the way brands would communicate to their consumers in future. At TouchMagix we are focussed towards building technologies and services which will aid brand owners to create an everlasting impression in consumers mind. Right now we have launched this product, and our team is overwhelmed with the response from all over the world. Now the real challenge we are facing is to accelerate the process of deployment across the globe in short amount of time. We are looking for awesome people to join this wagon!

Demo of TouchMagix. Click here if you can’t see the video.
Q: Can you dig a little into the technology that goes into making TouchMagix?

As off now we have 2 products, MagixFloor™ and MagixWall™ which are like large multi-point touch-screens. Both the technologies use same set of hardware and software. The hardware comprises of a high-end PC, a projector and our proprietary TouchMagix sensor. The sensor feeds the signals to the PC software, which recognizes human gestures and generates an XML feed on loopback interface. This feed can be integrated with any designing software like Adobe Flash, Adobe Director, 3DS Max etc… to build games and content. We offer an Open SDK interface which can be used by creative agencies or game designers to build content for deployment on TouchMagix. We also provide a remote controlled media scheduler for rotating content on the media. We are also in the process of designing audience measurement tools that will give web based interaction statistics to an advertiser.

Q: There are existing systems in US, Israel and elsewhere which do similar things. For example, Reactrix Systems. How do you differentiate yourself from them, both from an IPR perspective and a marketing perspective.

We have a superior and cost effective technology than most of the competition around. Technology, is just one aspect of the media. The possibilities with the content that one can build on this media would establish our position. We are offering an Open SDK interface for developers that no other competitor is offering. We are also introducing a feature of audience measurement, and web management which will set us apart. Reactrix systems was a close competitor for us, but they recently shutdown despite investment of $75 Million. Any good idea badly executed can also lead to failure. Being in India, we have an cost advantage along with the technology edge to deliver this product to the world. We have a team with diverse experience in technology, advertising and gaming and we believe we can make this happen.

Q: What were the unexpected challenges that you faced in making this happen.

A: Loads of challenges surfaced during the process of development of this technology. It was like playing the game of snakes and ladders for last 2 years day-in and day-out. What we thought was working in the labs and we said hurray, was a basket of new challenges on the field. Elimination of pseudo sensing was one of the big challenges that we had to overcome.

Pseudo sensing is a problem in which the movements of the objects that are in the picture being projected need to be distinguished from the motion of the human. Earlier the system used to do false detections on the projected image thinking it was the human motion. Now we have overcome that problem.

Q: Unlike most other “IT” and “web” startups, your technology has a significant hardware component. This is not something common in India. Has that been a challenge for you?

A: I was fortunate to get an experience in both hardware and software during my journey of learning. My dad always says, “You are a jack of all trades, and master of some!”. This trait helped me in integrating hardware and software required to create the product. Having our own hardware solved lot of problems, which would have been difficult otherwise. It also helped in building the intellectual property around it.

Q: What conventional (and unconventional) means, you are trying out for marketing your product?

A: Basically we have 3 types of customers.

  1. Private companies advertising their own product in private spaces. e.g – An mobile company putting it in their every showroom.
  2. Public place owners where advertisers would like to advertise their products. e.g – A mall who would lease out their space to advertisers.
  3. Short term installations for events, exhibitions, activations, promotions, parties etc.

We would be handling 1st type of customers by direct marketing initiatives. 2nd and 3rd type of customers would be handled through channel partners who have local presence. In addition to this, we will also be doing online direct marketing.

Q: What funding options have you considered?

A: The first phase was funded by Family and Friends. Along with the business plan, the next phase funding plan also would be frozen.

Q: At a very young age, you have done a lot of interesting and varied things? Can you tell us a bit about your background and the various areas you’ve worked in?

A: I started exploring computers at a very young age. At the age of 14 I became the world’s youngest Microsoft Certified professional to achieve MCSD, MCSE, MCDBA. I was fortunate to receive written compliments and motivation from Bill Gates, Azim Premji, Dr. Raghunath Mashalkar, Dr.Irwin Jacobs and many others for this achievement. At the age of 16 I started off my company TechnoKarma Labs with a vision to build innovative indian products for the global market. Our first product was 802.11b based mesh router. This router was a low cost and more efficient alternative to the way wireless networks were being deployed. During this time, I was also associated with International Institute of Information Technology, Pune (I2IT) for research in High Performance Computing under mentoring of Dr.Vijay Bhatkar. After my 12th standard at Fergusson College, I was offered to join in as a full time MS student in Networks & Telecom at I2IT skipping my engineering degree. At the age of 19 I received my 1st Masters degree. As part of my masters thesis, I had worked on a low level distributed memory management system for virtual machine clusters. After my Masters, I worked for 1 year with Qualcomm at San Diego, USA. I developed 2 products in distributed mobile computing at Qualcomm’s Corp R&D which were patented. I received an offer to join University of California, San Diego (UCSD) for Ms-PhD program in Communication Theory and Systems. I decided to join UCSD for PhD. My topic of research was Ad-Hoc networks and large scale distributed systems. During my stay in San Diego, I got a chance to work with a Venture Capital firm and get a view of several start-ups that they had invested in, or were planning to invest-in. This experience was invaluable. After completing the Master’s courses and some MBA courses I decided that I would like to come back to India, and start off TouchMagix. Presently I am on leave of absence from UCSD. I have been consulting to several start-ups in India and US on technology and product development.

Q: You could have chosen to do your startup in San Diego, or in Silicon Valley. Why did you choose to come back to India?

A: I see India as a place with ample opportunities for an entrepreneur. My dream always has been to see India as a product innovation hub rather than a BPO hub. I decided to setup my company in Pune to pursue this dream. TouchMagix Media Pvt. Ltd. is the first entity of the TechnoKarma Labs. I dream to have many more product companies like this in times to come. I love to interact with like minded people who share this vision.

Reblog this post [with Zemanta]

Can Pune Emulate The Silicon Valley Technology Startup Ecosystem?

Shaniwarwada, Pune
Image via Wikipedia

This article is cross-posted from Amit Paranjape‘s blog.

I have always wondered about the parallels between Silicon Valley and Pune. Some might call this farfetched, but there are some really interesting and some coincidental similarities. Pune (as does any other Indian City) has a long way to go to even get closer, but that should not stop us from brainstorming on leveraging these similarities as building blocks, and strive towards emulating the leader. Silicon Valley is unique. Maybe one day, we can indeed see a vibrant Pune Technology Startup Ecosystem, thriving and prospering – on the lines of the Great Technology Entrepreneurship Capital of the World!

I started this self-brainstorming exercise by simply listing those Pune characteristics that parallel Silicon Valley (The degrees of similarities might vary …):

¡ Good quality of life (Compared to other Indian Metros).

¡ Temperate Weather.

¡ Technology Entrepreneurship Culture.

¡ Fledgling Startup Ecosystem.

¡ Education Hub:

o Many Good Colleges

o Leading Research Institutes

¡ Magnet for people from all over India/World:

o Pune has the highest number of foreign students (amongst Indian cities)

o Large number of foreign expats, and visiting researchers

o Recently, Pune has also benefited from the ‘Reverse Brain Drain’: Many highly qualified professionals and successful entrepreneurs (of Indian origin) from the US/Silicon Valley have moved back to Pune.

¡ Proximity to a Financial Capital.

¡ Large IT talent pool (Thanks to the many IT Outsourcing Companies).

· Leading ‘Green’ Technology Hub in India. (Headquarters of companies like Praj and Suzlon).

¡ Hub of Hi-tech manufacturing (Note: San Francisco Bay Area has quite a few high-end manufacturing companies as well).

And some coincidental ones …

¡ Leading Wine Producing Region in India.

¡ Developing into a key hub for specialty fruits, vegetables and Flori-culture.

¡ Open (and tolerant) culture.

Some of the obvious things that are lacking in Pune include – Infrastructure and Sustainable Development. The other ones include greater focus on pure research, and venture funding. [I will discuss these and more in further detail in following articles in this series.]

As I was continuing with this brainstorming and gap-analysis, I stumbled upon a great article written by the well-known Silicon Valley Venture Capitalist and Essayist, Paul Graham on ‘How To Be Silicon Valley’, in May 2006. He describes the key characteristics of the San Francisco Bay Area that led to the development of this amazing ecosystem over the past few decades. He also explores how similar Ecosystem Development could happen in other cities/towns anywhere in the world.

In my article, I make an attempt to use the key characteristics identified by Paul Graham, and try to map them to Pune. I believe that Pune is amongst the best places in India where a Silicon Valley like Ecosystem can take shape. Obviously, there are many challenges. I am not going to use sobriquets such as ‘Pune – The Silicon Valley of India’. Silicon Valley is unique – there can be only one.

I am also not doing a comparison (either/or) between different Indian cities here. Many writers routinely refer to Bangalore as the Silicon Valley of India. While to me, Bangalore is definitely not there at this point, it might very well be on the right track. To me, if places like Bangalore, Pune and others introspect and strive for the key characteristics that are described by Paul Graham, then all of them have a shot (tough, it might be!) at creating Silicon Valley like Ecosystems someday.

Note – I recommend that you do read Paul Graham’s article: “How To Be Silicon Valley”, before reading further. This will enable you to better relate to my observations below.

I am going to follow a format, similar to that used by Paul Graham. Listed below are the key characteristics and how Pune fares with respect to these.

Presence of Rich People who are not Bureaucrats

Paul Graham argues that a technology ecosystem needs rich people who can take risks, and invest the necessary seed capital. These investors shouldn’t be purely financial investors who don’t understand the domain that they are investing in. Nor should they be bureaucrats who simply evaluate short and medium term financial returns and are risk averse.

Paul Graham describes the example of how the money made on the risks taken in the 1980s (e.g. Sun Microsystems) was then re-invested again in the 1990s (e.g. Google, Amazon) and now being re-invested again. Essentially, startups create startups – this how this ecosystem starts and grows.

Does Pune have rich people, who are technocrats? The answer is yes. Maybe nowhere enough, but the successful technology entrepreneurs of the 1980s and 1990s have built up some good reserves and have started looking for interesting technology ideas to fund.

Not Just Buildings

Large buildings and nice campuses don’t make the Silicon Valley! The massive new IT Parks we see today in many Indian cities, don’t equate to Silicon Valley. We need the DNA of a startup that was founded here and grew.

Unfortunately, Pune doesn’t have many IT firms that became very big (like Infosys – though one could argue that Infosys in fact started in Pune, and moved on…). However, there are a few good examples of non-IT technology companies (Manufacturing, Industrial Automation, Green Energy, etc.) that made it big.

Universities & Research Institutions

Pune has a strong education culture and some excellent engineering and science & technology colleges, including the 2nd oldest engineering college in India. A new Indian Institute of Science Campus is also being planned. Distance wise one can argue that IIT Bombay is less than 3 hours (150 km) from Pune.

In addition to the universities, Pune has many leading research institutions in a variety of technology areas – National Chemical Laboratory, Institute of Virology, Indian Meteorological Office, Inter-University Center for Astronomy & Astro-Physics, Agricultural Research Institute, Various Defense Research Organizations, etc.

Personality

Paul Graham talks about a city/town having a ‘personality’. He further states that you don’t build such a personality – you let it grow. I believe that Pune has grown a strong personality over the past many decades. In fact, this is one of the attributes that Pune is quite famous for. A ‘Punekar’ (resident of Pune) can be identified by many interesting traits!

Pune has a personality of a small city/town; a personality of knowledge & learning; a personality of creativity (not just in technology, but in other areas such as arts and music); a personality of a distinctive life-style; a personality of tolerance & openness to new ideas. Historically, it has embraced and assimilated people from different parts of India (and the world).

And while the new Pune is morphing into a cookie-cutter solution of suburban development seen in other metros, the old – new Pune combination still maintains a distinct identity.

Nerds

According to Paul Graham, ‘Nerds’ constitute one of the most critical building blocks of such an ecosystem. You can call them anything – But Pune is increasingly a preferred destination for many techies (or nerds, or whatever you want to call them!). Historically, Pune has always been a center of attraction for the learned – not only in technology areas, but in other areas such as History, Sociology, Arts, Music and Languages.

Many of these people find Mumbai and other Metros to be too big, too fast, and too glamorous. Pune is compact, liberal and relatively quiet in comparison to most Indian cities. These nerds don’t mind paying a lot more to live in such a place with it’s unique identity (see earlier section). Quality of life is important for them. Note – Pune real estate is quite expensive, and the overall cost of living is amongst the highest in India.

Youth

Given Pune’s strong education ecosystem, Pune is a ‘young’ city. It is vibrant with fresh energy and drive. Culturally, it is less conservative/more liberal – whichever way you want to look at it. There are also quite a few people here who are ‘young at heart’.

Time

Even after having all the right mix of the above key characteristics, you need to provide ‘Time’ for a Silicon Valley to be built. While Pune has many of the desirable ingredients, it still needs more time. And it is critical that these characteristics don’t degrade/disintegrate over that time period. I will discuss this further in future articles in this series.

Competing with the ‘Original Silicon Valley’

Paul Graham’s last point relates to competing with the ‘original Silicon Valley’. Any new challenger will definitely face competition from the original one!

I do think that there is room for more challengers. Speaking about Pune/India, we have an advantage of having more generalists (engineers who are more application oriented that theory oriented; and can quickly span interdisciplinary boundaries). The costs in India have risen significantly this decade, but still remain low compared to the west. Thus, if planned and used correctly, the same capital can stretch longer here. Pune and India have a large and growing young population. Many innovative ideas are driven by young people – both as innovators, as well as consumers. It is here where India in general and Pune in particular has a strong credential.

In future articles in this series, I will explore specific steps that I think Pune needs to take towards its goal of emulating a Silicon Valley like technology ecosystem.

Please provide your feedback, other ideas and comments on this article. I will try to incorporate these in the future articles in this series.

This is a very important topic for all people interested in and/or working in the technology area in Pune!

Reblog this post [with Zemanta]

Should you use a file-system or a database

Whether to use a file-system or a database to store the data of your application has been a contentious issue since the 80s. It was something we worried about even when I was doing my Ph.D. in Databases in the 90s. Now Jaspreet Singh, of Pune-based startup Druvaa has weighed in on this issue on Druvaa’s blog. His post is republished here with permission.

This topic has been on my plate for some time now. It’s interesting to see how databases have come a long way and have clearly out-shadowed file-systems for storing structured or unstructured information.

Technically, both of them support the basic features necessary for data access. For example both of them ensure  –

  • Data is managed to ensure its integrity and quality
  • Allow shared access by a community of users
  • Use of well defined schema for data-access
  • Support a query language

But, file-systems seriously lack some of the critical features necessary for managing data. Lets take a look at some of these feature.

Transaction support
Atomic transactions guarantee complete failure or success of an operation. This is especially needed when there is concurrent access to same data-set. This is one of the basic features provided by all databases.

But, most file-systems don’t have this features. Only the lesser known file-systems – Transactional NTFS(TxF), Sun ZFS, Veritas VxFS support this feature. Most of the popular opensource file-systems (including ext3, xfs, reiserfs) are not even POSIX compliant.

Fast Indexing
Databases allow indexing based on any attribute or data-property (i.e. SQL columns). This helps fast retrieval of data, based on the indexed attribute. This functionality is not offered by most file-systems i.e. you can’t quickly access “all files created after 2PM today”.

The desktop search tools like Google desktop or MAC spotlight offer this functionality. But for this, they have to scan and index the complete file-system and store the information in a internal relational-database.

Snapshots
Snapshot is a point-in-time copy/view of the data. Snapshots are needed for backup applications, which need consistent point-in-time copies of data.

The transactional and journaling capabilities enable most of the databases to offer snapshots without shopping access to the data. Most file-systems however, don’t provide this feature (ZFS and VxFS being only exceptions). The backup softwares have to either depend on running application or underlying storage for snapshots.

Clustering
Advanced databases like Oracle (and now MySQL) also offer clustering capabilities. The “g” in “Oracle 11g” actually stands for “grid” or clustering capability. MySQL offers shared-nothing clusters using synchronous replication. This helps the databases scale up and support larger & more-fault tolerant production environments.

File systems still don’t support this option 🙁  The only exceptions are Veritas CFS and GFS (Open Source).

Replication
Replication is commodity with databases and form the basis for disaster-recovery plans. File-systems still have to evolve to handle it.

Relational View of Data
File systems store files and other objects only as a stream of bytes, and have little or no information about the data stored in the files. Such file systems also provide only a single way of organizing the files, namely via directories and file names. The associated attributes are also limited in number e.g. – type, size, author, creation time etc. This does not help in managing related data, as disparate items do not have any relationships defined.

Databases on the other hand offer easy means to relate stored data. It also offers a flexible query language (SQL) to retrieve the data. For example, it is possible to query a database for “contacts of all persons who live in Acapulco and sent emails yesterday”, but impossible in case of a file system.

File-systems need to evolve and provide capabilities to relate different data-sets. This will help the application writers to make use of native file-system capabilities to relate data. A good effort in this direction was Microsoft WinFS.

Conclusion

The only disadvantage with using the databases as primary storage option, seems to be the additional cost associated. But, I see no reason why file-systems in future will borrow features from databases.

Disclosure

Druvaa inSync uses a proprietary file-system to store and index the backed up data. The meta-data for the file-system is stored in an embedded PostgreSQL database. The database driven model was chosen to store additional identifiers withe each block – size, hash and time. This helps the filesystem to –

  1. Divide files into variable sized blocks
  2. Data deduplication – Store single copy of duplicate blocks
  3. Temporal File-system – Store time information with each block. This enables faster time-based restores.
Reblog this post [with Zemanta]

Are you a misfit in your company?

This article was written for PuneTech readers by Dr. Basant Rajan, who has played various engineering/management/research roles over 18 years in the Indian Software industry. Most recently he was CTO of Symantec India.

This article is primarily targeted at software professionals (in India) and attempts to introduce a few concepts that’ll help you understand your aspirations and your work environment better, so you can make some career choices that could empower you to realize your true potential. If the section below on employee types, does not strike a chord, the rest of this article will likely not either, be forewarned.

Your long term career prospects depends a lot on the expectations of your manager and organization matching your aspirations. Frustration, especially when linked to growth prospects is often a tell tale sign that something’s amiss.

Before we can narrow down on what is amiss and fix it, we need to cover some ground related to what drives these expectations in the first place. Next we’ll revisit frustrations at the work place in the context of what we learned about organizations and employees. We will then go over some simple tests to help classify your manager and organization and finally use the information we’ve gleaned to chart a possible course of action that will let you positively influence your destiny at the work place.

Employee types – cooks & chefs

One typically uses the term talent interchangeably with employee. In the context of the Indian software industry however, we see two distinct kinds of employees – one that brings skills to the work place and another that also bring long some talent.

Think of it more like the difference between a chef and a cook. Given a recipe, both can make a delicacy you’ll relish. But you’d expect the chef to be able to surprise you with something he conjured up. Put another way, they both can cook, but have different limits of capability.

We see both the cook and the chef in the software industry too. There are a large number of knowledge workers, cooks, who armed with programming skills, can deliver on complicated software components to a specified design. Then there are some, chefs, who with their deep understanding of the domain to augment their skills, can architect solutions to problems, small and large.

Note, there is nothing the matter with being either one of them …

Engagement models

There are two main reasons why a company opens an offshore branch in India – to save costs, and/or to augment availability of specialized talent.

Correspondingly, there are predominantly two engagement models in play at off-shored operations – the cost leverage model and the talent leverage model. In reality, a single company can have both these models operating in different parts.

Off-shoring under the cost leverage model (CLM) is undertaken primarily to benefit from the cost differential of skilled labour between the two sites. For example, cheap labour is pretty much what drives the growth of call centers in India. For CLM to succeed, the organization needs to be able to source skilled people who can be relatively easily trained for the specific work at hand, in significant numbers. Process driven work can typically be executed well this way.

The talent leverage model (TLM) on the other hand prioritizes the availability of specialized talent over the cost differential. The parent company is setting up their operation in India because they cannot source the required number of specialists locally. That is not to say that the TLM precludes benefits from the cost differential of the two sites.

Note, there is nothing inherently wrong with either model.

Mismatches and frustration

So much for setting up the necessary context and a common vocabulary. From the discussion so far it should be relatively easy to infer that to succeed, organizations should take care to hire the right mix of cooks and chefs.

The cost leverage model expects skilled executors of strategy (cooks) for success while the talent leverage model also depends on the supply of people who can drive the strategy (chefs). Consequently, the cost leverage model (CLM) tends to attract/need managers who are cooks while the talent leverage model (TLM) tends to attract/need managers who are chefs.

Unfortunately organizations often fail to make a conscious identification of the model they are operating under and often end up recruiting the wrong employee type for a given engagement model resulting in a frustrating work environment for the employee. While exploring why organizations slip up on this front is interesting in itself, let’s move on for now.

If you happen to be a chef and find yourself in (part of) an organization that has embraced the cost leverage model, you are likely to get disillusioned as the novelty of the job wears off. Your aspirations for growth aren’t likely to be satisfied in such an environment. Changing roles/jobs will become necessary for growth.

If you happen to be a cook and find yourself in (part of) an organization that’s operating in the talent leverage model, you aren’t that badly off, especially if you are an individual contributor. No organization can function effectively without the cooks and they are therefore valued. However, organizations operating under talent leverage tend to favour chefs when it comes to promotions.

If you are a chef and your manager happens to be a cook, your work environment is likely to be frustrating. Career growth is very unlikely to happen in such situations unless you can successfully work around your manager. Changing roles/jobs is an option worth considering seriously.

Is your manager a cook or a chef?

Knowing one’s manager is certainly a good thing. But for our purposes, let’s just limit ourselves to classifying one’s manager as a cook or a chef … Exactly how does one go about doing that? Fortunately, external behavior you are bound to notice can help you make an informed decision. Here’s how.

  • Would you consider him a supervisor (cook) or a real manager (chef)?
  • Does/can he exercise his discretion in addressing issues in your environment? (yes- chef)/(no – cook)
  • Does he regularly challenge status quo to effect changes for the better? (yes – chef) / (no – cook)
  • Does he seem to value growth in size (cook) or growth in impact (chef) of his organization?
  • Do you associate him more with your team (chef) or his management (cook)?
  • Can he work with influence (chef) or does he always need authority (cook)?
  • Caution : be sure not to confuse an isolated incident with a behavioral trait

Which engagement model is in play?

Now that we have the manager nicely squared away, let’s focus on the organization. To determine whether the your organization is really interested in the cost leverage model or the talent leverage model, you could simply ask your manager, but on second thoughts, don‘t bother. What management claims and what they really value can be two different things, so one needs to figure this out for oneself, and here’s a simple test to do it.

  • Classify the managers around as either cooks or chefs.
  • Next, check to see which ones are getting more promotions, more power and more visibility.
  • If the cooks have the upper hand and the chefs are running into roadblocks or leaving, you are looking at the cost leverage model in operation. If the reverse is true, then you have an organization that values talent leverage.
  • Again a word of caution : take care to distinguish between isolated incidents and a trend.

Don’t surprised to see significant numbers of successful cooks in an organization that has embraced the talent leverage model. Look a little closer and you’ll see that most positions of influence in such an organization are held by chefs. Remember, that while one can’t have an army without soldiers, to be successful they need to be led in battle by officers of calibre.

Empower and be free…

Hardest for last … now what can that be?

Given that you’ve just finished judging your manager and your organization, now might be a good time to introspect and make an honest assessment of what you bring to the table, just skills or talent as well?

As the road to self realization is still uncharted territory, you’ll have to figure this part out yourself 🙂

Assuming you got back from the wilderness, it’s about time you got down ensuring a sustainable. mutually rewarding, relationship at the work place and signing up to making it happen.

Now that we’ve decided to act, it’s perhaps a good time to arm ourselves with a little prayer (ok, indulge me …, for now, let’s assume there is a God) … and here it is.

The serenity prayer (excerpt) …

God grant me the serenity
to accept the things I cannot change;
courage to change the things I can;
and wisdom to know the difference.

… and the industrial strength version (snippet)

if (type(self, wisdom) == CHEF) { /* you sure?, just kidding ... */
    if (model(self.org, wisdom) == CLM)) { /* in the wrong type of org */
        change_org(courage);
    } else {
        if (type(self.manager, wisdom) == COOK) { /* unfortunate ... */
            wait timeout { /* the organization might correct it */
                manager_might_change(hope);
            } change_role(courage); /* else, you have to do the honours */
        } /* else: lucky you! */
    }
} else { /* you happen to be a cook ... */
    if (model(self.org) == TLM) { /* wrong place for fast growth */
        accept_it(serenity); /* but it ain’t so bad ... */
    } /* else: lucky you! */
}

Note : God has been coded away in the industrial version, so you’ll have to make do with friends and family.

The bad news is that typically, one cannot hope to change the engagement model or one’s nature in short order.

The good news is that given the wide variety of needs and opportunities that exist, finding a work environment that suits your style is not likely to be hard once you know what you are looking for.

Let’s wrap up with a toast to “marriages made in heaven!” (else, just so you know, my wife’s a counselor).

Disclaimer

The views expressed here are solely mine and are not intended to expressly hurt the feelings of any particular minority group or organization.

This disclosure was made under duress brought to bear by Navin Kabra.

Credits

These thoughts have been shaped by a lot many colleagues and for that I am forever grateful.

About the Author – Basant Rajan

Basant Rajan holds a Doctorate in Computer Science from TIFR, Bombay and is currently an independent consultant.

He has 18 years of industry experience, and previous positions held include CTO, Symantec India heading up Symantec Research Lab (India) and Sr. Director, Symantec heading the Storage Foundations group in Pune and has been actively involved in promoting innovation at various levels within the engineering community at Symantec. He also concurrently held a Visiting Member position at TIFR, Bombay for a few years.

With a number of publications and patent applications to his name, Basant’s areas of interest include organizational change, storage technology, distributed systems, formal languages and logic.

He can be reached at basant+web [at] gmail.com.

An overview of Computer Aided Design (CAD)

Pune is a hotbed of activity for CAD Software – both, for users as well as developers. We asked Yogesh Kulkarni, who has more than a decade of experience in this industry to team up with Amit Paranjape to give PuneTech readers an overview of this area.

What is CAD?

Nonlinear statistics analysis of a 3D structure subjected to plastic deformations. Image by JoĂŤl Cugnoni courtesy the Wikimedia Project
Nonlinear statistics analysis of a 3D structure subjected to plastic deformations. Image by JoĂŤl Cugnoni courtesy the Wikimedia Project

CAD is defined as the use of computer technology to aid in the design of a part, a sub-assembly, or an entire product. Design can include Technical Drawings with Symbol based Representations, Visualization, 3D Rendering, and Simulation. Note, the term ‘Product’ could range from a small Widget, to an iPOD, too a large Building. Components of CAD technologies have also found widespread use in somewhat unrelated fields such Animation & Gaming.

Consider a World-War II era vintage B-17 Flying Fortress bomber; probably the only bomber ever to manufactured on an assembly line. How was it designed? Each and every part was painstakingly drawn on a drafting board. The various components and sub-assemblies were represented through various engineering drawing conventions. Yet the true visualization of how all these complex pieces fit and worked together, was left to that of the top engineers’ minds. And what about the complex 3-D shapes such as the wings? How were they designed and tested? Actual wooden models had to be created for this to visualize their shapes and test out their air-flow characteristics in wind tunnels. You can think of an army of literally hundreds of Draftsmen working on various pieces of this complex machine. Cars were designed the same way. ‘Machine Designing’ had elements of ‘Art’ in it. This style of designing was with us until recently. It’s only in the past 2-3 decades (even more recent in many SMEs in India) that computers have started replacing these ubiquitous ‘A1’ sized drawing boards that ruled the designers shop for so many decades.

Fast forward to today, and now let’s look at how Boeing’s latest 787 Dreamliner is being designed. This truly 21st century aircraft is built with composites instead of the traditional aluminum structures, and a whole bunch of other interesting innovations. All put together, Boeing claims to improve fuel efficiency by over 20% compared to other modern day commercial airplanes. All the designs of the Dreamliner are done using CAD. From the smallest widget to the entire airframe, the drawing, designing, assembling, and visualization is done on computer monitors. These designs are also evaluated for their validity and performance via Computer Aided Engineering (CAE). CAE works in conjunction with CAD to simulate and analyze various mechanical and other aspects of the design. Similarly, Computer Aided Manufacturing (CAM) works closely with CAD to help manufacture the complex parts on Computer Numerically Controlled (CNC) machines.

History

CAD has evolved a great deal over the past few decades along with the rise in computing and graphics power. Earlier CAD solutions were simple 2-Dimensional solutions for drawing machines and structures. These still represented a big step forward over drawing boards in terms of ability to save, edit and reuse drawings. Initial 3-Dimensional solutions were based on ‘wireframe models’ and ‘surface modeling’. Loosely speaking, these represented the outer edges and the external surfaces of a solid object in mathematical terms. Real 3-D capability involves representing the real object as a solid model. Mathematically, this involves a series of complex equations and data points. This had to wait for computing power to catch-up. Only in the late 1980s did this power become available to a wider engineering community via desktop workstations.

At a high level, you can think of a CAD package to have 2 important pieces: 1) The backend mathematical engine and 2) The front-end graphical rendering service.

Earlier CAD programs were primarily written in FORTRAN. Present day, CAD packages are typically developed in C or C++. Rendering was not a strong point of the earlier solutions. However over the past 2 decades, life-like rendering and simulation (rotation, motion, etc.) have become a reality. This capability has also taken this technology into the Animation & Gaming fields.

Associated Areas

CAD works closely with other allied areas such as CAE (Computer Aided Engineering), CAM (Computer Aided Manufacturing), as well as PLM (Product Lifecycle Management). In fact, CAD/CAM or CAD/CAE are often used together to describe the entire workflow. In this section we will take a brief look at these allied areas. In future, PuneTech will feature more detailed and specific articles about each of these areas.

Computer Aided Engineering is the use of computer technology to support engineering tasks such as analysis, simulation and optimization. These tasks are often performed by the engineer in close synchronization of the actual CAD activities. An example of ‘Analysis’ could be leveraging mathematical techniques such as ‘FEA/FEM’ (Finite-Element-Analysis/Finite-Element-Method) for designing a safe Bridge. ‘Simulation’ can be used to study how a mechanical assembly with various moving parts work together, on a computer screen, before actually building it. ‘Optimization’ can build on top of Analysis and Simulation to come up with the ‘most optimal’ design that meets the designer’s requirements. ‘Most Optimal’ could mean least weight, smallest number of parts, least friction, highest reliability, etc. depending on the designer’s primary objective.

Computer Aided Manufacturing is the use of computer technology to manufacture complex parts on automated machine tools. These machine tools are commonly referred to as ‘CNC’ or ‘Computer Numerically Controlled’ machines. Here’s a simple example. Let’s say an engineer has created a complex 3-D shape consisting of various contours for a new car’s exterior. The exterior parts are made by die-stamping in huge presses. The ‘dies’ are essentially molds made of hard metal. Principally, they are similar to a clay mold that is used to create various artifacts out of Plaster-of-Paris. These metal dies themselves have to be created by machining a ‘die-block’ to create a solid mirror image of the final part. This complex 3-D shape needs a sophisticated machine tool that can machine (cut/drill/shape) metal across multiple (3 or more) dimensions.

Controlling the motion of these machine tools is similar to controlling a robotic arm. CAM packages convert the solid designs in CAD packages into a set of coordinates and path instructions, along with desired speeds & acceleration/deceleration profiles for the machine tools, and communicate these instructions to the CNC machines.

PLM or ‘Product Lifecycle Management’ is not directly related to CAD like CAE or CAM. Instead, PLM as the name suggests focuses on managing the entire lifecycle of designing activity across multiple groups and departments in a company. A complete design is not limited to the machine designer. Various other players come into the picture. These include Purchasing Managers who have to source design components and sub-assemblies from vendors; Cost Accountants who want to keep a tab on the overall material and manufacturing costs of a design, Compliance Experts who want to review the design for various agency compliance requirements, etc. Similarly there are requirements to maintain the design as it goes through various versions/upgrades through its life-cycle. PLM enables collaboration across different departments on the key aspects of the design. PLM also enables collaboration between designers in terms of sharing parts data, etc.

Major Players

AutoCADÂŽ by Autodesk is one of the most popular CAD packages out there. It focuses more on 2-D drawings such as part drawings, architect plans, electronic circuit designs, etc.

Packages like CatiaÂŽ by Dassault, NXÂŽ by Siemens-Unigraphics, Pro/EÂŽ by Parametric Corporation are popular 3-D Solid Modeling Solutions. These solutions find wide use in Automotive, Aerospace and Other Manufacturing Industry Segments.

CAD in Pune

Due to the strong industrial and manufacturing base, Pune not only contains some of the biggest users of CAD/CAM software, but it also hosts some of the biggest developers of CAD/CAM software in the world.

Leaving the sobriquets such as ‘Detroit of East’ aside, it is safe to say that Pune is indeed the primary automotive hub of India. Pioneering Indian automotive companies such as Tata Motors, Bajaj Auto, Bharat Forge and Kirloskar Oil Engines are headquartered here. Along with these, a number of top multi-nationals such as Mercedes-Benz, General Motors and Volkswagen are also based here.

These big auto-majors, along with other industrial powerhouses such as Cummins Diesel have created a strong industrial manufacturing ecosystem in Pune. These OEM (Original Equipment Manufacturers) in turn drive requirements for sub-assemblies and components to Tier-1 and Tier-2 vendors.

A large number of small and med-sized industrial automation companies have also sprung up in Pune. These companies design and develop various factory automation and material handling solutions for automotive as well as other industries.

Designing activity is important at all levels, in all these companies – big or small. As a result, Pune has become probably the biggest user of various 2-D and 3-D CAD applications and other associated CAE/CAM applications, in India.

However, the ecosystem for CAD doesn’t stop here! Given Pune’s dominance in Information Technology and the huge CAD users’ base, many CAD/CAM/CAE software companies worldwide have found Pune to be the natural choice for their R&D and Service Centers. All CAD majors described in the earlier section have some development presence in Pune. Pune also has software service companies focusing on this area, such as Geometric Systems.

About the Authors

Yogesh Kulkarni has more than a decade’s experience with CAD Software Development (PTC, SDRC, UGS and now Autodesk). He is based in Pune and can be contacted at yogeshkulkarni@yahoo.com. More details are available at: http://www.linkedin.com/in/yogeshkulkarni

Amit Paranjape is one of the driving forces behind PuneTech. He has been in the supply chain management area for over 12 years, most of it with i2 in Dallas, USA. For more details, see his PuneTech profile.

Reblog this post [with Zemanta]

Enrich your website (with content & money) – An interview with Hover.in

hover_logoHover.in is a Pune startup that provides a service for web publishers (i.e. website/blog owners) to automatically insert extra content into the webpages, in the form of a bubble that appears when the mouse is hovered over underlined words. The bubble can be informational (like a map appearing wherever a name of a place appears, or a background information about a company appearing wherever a name of a company appears), or it can be contextual, in-text, advertisement from hover’s network of partners, and most importantly it is fully under the publisher’s control. While services like this have been around in other forms, hover.in believes that its ability to handle any language, and the focus on Indian market sets it apart from the competition. See the PuneTech profile of hover.in to get a better idea of what hover.in provides.

Hover.in was one of the startups chosen to be showcased at proto.in’s Jan ’08 edition. Earlier this, week, they announced that they have received seed funding from Media2Win, and will soon be seen in action on some large Indian portals.

PuneTech interviewed Arun Prabhudesai, CEO of Hover.in, (he also runs popular Indian Business blog trak.in) to get a deeper look at hover.in. To be true to the “tech” part of PuneTech, we also asked some technical questions that were answered by Bhasker V. Kode (Bosky), CTO of Hover.

Q: Congratulations on getting funded – especially under these economic conditions. How do you plan on using this funding – what will be the focus areas?

The seed funding was finalized few months back before the whole “recession” thing started constantly ringing in our ears.

Actually, from hover.in perspective we feel this funding as more of strategic investment where Media2Win – being a leading digital media agency – will help us to go to the market. We have immensely benefitted from the experience Me2W brings on table.

The funding is being mostly used to ramp up our technical resources and infrastructure.

Q: Your main “customers” are website publishers. Are you targeting any specific geography, like India (as the .in domain name would suggest)?

Hover.in in-text platform is global and open for web publishers and bloggers from all geographies. However, we are actively targeting Indian market first. India currently does not have any in-text platform and that’s puts us in a great position to capture this market. Infact, hover.in is world’s first in-text platform that is also language agnostic, which opens up a large chunk of regional language websites.

Q: I keep hearing that “there isn’t enough money to be made from online advertisements alone in India, except for a few specific verticals.” And you seem to be going squarely after this market. What is your take on this issue?

You know, this people have started talking about because there are too many ad networks that have come up in last couple of years…more than 15 odd I can count on my fingers !

But if you look at the larger picture, online advertisements are the only ones that are growing year on year. Traditional advertising is hardest hit…

For us the advantage is, we DO NOT compete with traditional ad networks as they are 99% display advertising. We are in-text and this market has not even tapped. From publisher perspective, it is an additional channel for content and monetization.

From Advertisers, this is the most targeted way of displaying their advertisement. Also, as we follow CPA / CPC kind of model, advertisers have full ROI on investment.

Co-founders of Hover.in - Bhasker V. Kode, CTO (left) and Arun Prabhudesai, CEO
Co-founders of Hover.in - Bhasker V. Kode, CTO (left) and Arun Prabhudesai, CEO

Q: If I remember right, you are using Erlang for your development – a very non-standard choice. Can you share the reasons behind the choice? Isn’t it difficult to get Erlang developers? In retrospect are you happy with this decision?

(by Bosky)

Erlang has been used to build fault-tolerant and distributed applications for quite some time in areas like telecom, especially for allowing highly granular choices in networking. Off-late projects like ejabberd, mnesia, yaws and tsung have shown how products spanning several hundred nodes can be implemented with the erlang stack and in particular – web technologies.

It most definitely is a paradigm shift courtesy of it’s functional programming concepts, and we are glad we took that decision because of its inherent focus on distributed systems, and although the erlang developer community in India is non-existent, with the right attitude towards learning now a day’s it does’nt matter. Moreover it only took a couple of months for our developers to get used to the semantics, following which as with any stack – it’s about what you do with that expertise.

Erlang gives you that power, but at the same time – there are areas where it might not seem a right fit and perhaps look to perl or ruby for tasks that suit them. For example, we use python wherever it seems required as well. The good part is erlang open-source community has quite a closely-knit presence online, which does help quite a lot. We ourselves are now looking at contributing and opening up internal projects.

Q: One of the important challenges for hover.in will be scalability. How are you gearing up to handle that?

(By Bosky)

Right from day one, erlang based systems like ours are designed built for horizontal scaling – which allows plug-n-play addition to our growing cluster. Regardless of the development stack you work on – some things need to be built in early and that’s something we spend a whole lot of time during our first year fine tuning.

Especially for us – where practically every page hit – for every one of our users – reflects a page visit to us where we need to compute and render hovers in a matter of milliseconds. To this end – before starting out application-logic, we first built out our own distributed priority-queuing systems, our own distributed crawler and various indexing mechanisms, a time-splicing based cpu allocation for various tasks, which made things like adding jobs, executing them a more controlled operation regardless of what the actual job is and has been handling burst mode quite well.

Moreover, we can also add workers on-the-fly to almost all major tasks much like an Amazon ec2 instance where each work itself is supervised for crash recovery thanks to erlang’s open telecom platform libraries and guidelines. Caching is something else we have and continue to work on consistently. No matter how many papers, algorithms or implementations there are out there – every system needs to fine tune their own unique set of optimizations vs compromises that reflect their infrastructure, traffic, memory & space needs,etc ..

Having granular control of this is something that is a real challenge as well as a pleasure with the stack (Linux, Yaws, Mnesia, Erlang). We ‘ve also been quick to adopt cloud-computing initiatives like Amazon s3, and more recently cloudfront for our static content delivery needs.

We’re also working on a parallel map-reduce implementation, exploring options with xmpp, and better logging for our developers to find and fix glitches or bottlenecks, eventually translating to a faster and better user experience for our users.

Q: You moved to Pune specifically to start hover.in. What made you choose Pune?

Yes, I did move to Pune to start hover.in, however, it would not be fair to say that is the only reason why I moved here. I have lived most of my formative years here in Pune, before going to USA. And as you know, once a Puneite, always a Puneite!

Actually we had to choose from 2 cities – Chennai (Our Co-founder, Bhasker VK, is from Chennai) and Pune. Few important aspects tilted the balance in favour of latter. Better weather, Pune’s proximity to Mumbai where majority of big publishers, investors and advertisers have their offices. To add to it all Pune has great startup & tech community.

Q: In the journey so far, have you made any significant mistakes that you’d like to share with others, so they can learn from your experience?

Absolutely… Mistakes are important aspect of learning process and especially for first generation entrepreneurs like Bosky and Me. I think “attention to detail” is one of the most important aspects that an entrepreneur should watch for. You need to have clear in-depth blueprint in your mind about the direction your startup is going to take, otherwise it’s very easy to fall off the cliff!

Optimizing, especially during these tough times – be it resources, infrastructure or even your time. Optimize everything. Startups can’t afford any leaks.

The third thing and the one which I don’t see very often. Partner with other startups; see if there are any synergies between you. In most cases it is a win-win situation for both of them

Q: Are you partnering with other startups? At this stage, would it be possible for you to share info about any of these partnerships?

Yes, we are…one example would be Alabot (another Pune startup -ed.). Where we have got their NLP application (Travel bot) inside our hoverlet. So for any travel related publishers, it becomes a boon. So a win-win situation for both of us.

Another example would be – Before we got our own office, 2 other startups were kind enough to accommodate us for few weeks – These kind of partnerships in any way possible go a long way !

Q: What would your advice for struggling Pune entrepreneurs be?

Entrepreneurship is a roller coaster ride … It ain’t easy, but the thrills along the way make it all more than worth it!

Just jump into the rough waters and by the time you reach the other side, you will be glad you did it….

Reblog this post [with Zemanta]

Understanding Data De-duplication

Druvaa is a Pune-based startup that sells fast, efficient, and cheap backup (Update: see the comments section for Druvaa’s comments on my use of the word “cheap” here – apparently they sell even in cases where their product is priced above the competing offerings) software for enterprises and SMEs. It makes heavy use of data de-duplication technology to deliver on the promise of speed and low-bandwidth consumption. In this article, reproduced with permission from their blog, they explain what exactly data de-duplication is and how it works.

Definition of Data De-duplication

Data deduplication or Single Instancing essentially refers to the elimination of redundant data. In the deduplication process, duplicate data is deleted, leaving only one copy (single instance) of the data to be stored. However, indexing of all data is still retained should that data ever be required.

Example
A typical email system might contain 100 instances of the same 1 MB file attachment. If the email platform is backed up or archived, all 100 instances are saved, requiring 100 MB storage space. With data deduplication, only one instance of the attachment is actually stored; each subsequent instance is just referenced back to the one saved copy reducing storage and bandwidth demand to only 1 MB.

Technological Classification

The practical benefits of this technology depend upon various factors like –

  1. Point of Application – Source Vs Target
  2. Time of Application – Inline vs Post-Process
  3. Granularity – File vs Sub-File level
  4. Algorithm – Fixed size blocks Vs Variable length data segments

A simple relation between these factors can be explained using the diagram below –

Deduplication Technological Classification

Target Vs Source based Deduplication

Target based deduplication acts on the target data storage media. In this case the client is unmodified and not aware of any deduplication. The deduplication engine can embedded in the hardware array, which can be used as NAS/SAN device with deduplication capabilities. Alternatively it can also be offered as an independent software or hardware appliance which acts as intermediary between backup server and storage arrays. In both cases it improves only the storage utilization.

Target Vs Source Deduplication

On the contrary Source based deduplication acts on the data at the source before it’s moved. A deduplication aware backup agent is installed on the client which backs up only unique data. The result is improved bandwidth and storage utilization. But, this imposes additional computational load on the backup client.

Inline Vs Post-process Deduplication

In target based deduplication, the deduplication engine can either process data for duplicates in real time (i.e. as and when its send to target) or after its been stored in the target storage.

The former is called inline deduplication. The obvious advantages are –

  1. Increase in overall efficiency as data is only passed and processed once
  2. The processed data is instantaneously available for post storage processes like recovery and replication reducing the RPO and RTO window.

the disadvantages are –

  1. Decrease in write throughput
  2. Extent of deduplication is less – Only fixed-length block deduplication approach can be use

The inline deduplication only processed incoming raw blocks and does not have any knowledge of the files or file-structure. This forces it to use the fixed-length block approach (discussed in details later).

Inline Vs Post Process Deduplication

The post-process deduplication asynchronously acts on the stored data. And has an exact opposite effect on advantages and disadvantages of the inline deduplication listed above.

File vs Sub-file Level Deduplication

The duplicate removal algorithm can be applied on full file or sub-file levels. Full file level duplicates can be easily eliminated by calculating single checksum of the complete file data and comparing it against existing checksums of already backed up files. It’s simple and fast, but the extent of deduplication is very less, as it does not address the problem of duplicate content found inside different files or data-sets (e.g. emails).

The sub-file level deduplication technique breaks the file into smaller fixed or variable size blocks, and then uses standard hash based algorithm to find similar blocks.

Fixed-Length Blocks v/s Variable-Length Data Segments

Fixed-length block approach, as the name suggests, divides the files into fixed size length blocks and uses simple checksum (MD5/SHA etc.) based approach to find duplicates. Although it’s possible to look for repeated blocks, the approach provides very limited effectiveness. The reason is that the primary opportunity for data reduction is in finding duplicate blocks in two transmitted datasets that are made up mostly – but not completely – of the same data segments.

Data Sets and Block Allignment

For example, similar data blocks may be present at different offsets in two different datasets. In other words the block boundary of similar data may be different. This is very common when some bytes are inserted in a file, and when the changed file processes again and divides into fixed-length blocks, all blocks appear to have changed.

Therefore, two datasets with a small amount of difference are likely to have very few identical fixed length blocks.

Variable-Length Data Segment technology divides the data stream into variable length data segments using a methodology that can find the same block boundaries in different locations and contexts. This allows the boundaries to “float” within the data stream so that changes in one part of the dataset have little or no impact on the boundaries in other locations of the dataset.

ROI Benefits

Each organization has a capacity to generate data. The extent of savings depends upon – but not directly proportional to – the number of applications or end users generating data. Overall the deduplication savings depend upon following parameters –

  1. No. of applications or end users generating data
  2. Total data
  3. Daily change in data
  4. Type of data (emails/ documents/ media etc.)
  5. Backup policy (weekly-full – daily-incremental or daily-full)
  6. Retention period (90 days, 1 year etc.)
  7. Deduplication technology in place

The actual benefits of deduplication are realized once the same dataset is processed multiple times over a span of time for weekly/daily backups. This is especially true for variable length data segment technology which has a much better capability for dealing with arbitrary byte insertions.

Numbers
While some vendors claim 1:300 ratios of bandwidth/storage saving. Our customer statistics show that, the results are between 1:4 to 1:50 for source based deduplication.

Reblog this post [with Zemanta]