For enterprises, what comes first?
Adopting a solution that is not highly available and then try to make it enterprise ready or start with high availability as core feature then add the power of analytics?
In my past few articles I wrote about the challenges to adopting Hadoop in the enterprise and what it would take to make it enterprise ready. One of the points I highlighted is the NameNode high availability or the lack of it. In Hadoop Distributed File System (HDFS), NameNode is the metadata server that has the location information for data blocks distributed across DataNodes. If NameNode fails, the cluster would be unavailable to analytics applications.
The Hadoop community has been working on a solution to add High Availability to HDFS. The solution entails adding another NameNode with shared storage and changing DataNodes to send...