Copy of Understanding Hadoop 2 0 Architechture

preview_player
Показать описание

Apache Software foundation (ASF), the open source group which manages the Hadoop Development has recently announced that Hadoop 2 is now Generally Available (GA) and is now ready for Production deployment.

Hadoop 2 and YARN: What's the Fuss about?
With its enterprise class features, the Hadoop Release 2.0 is a major milestone towards increased Hadoop adoption among businesses. The new release provides many enterprise-class features in Hadoop, for example:

YARN framework and Next Generation MapReduce (MRv2): YARN provides better resource management in Hadoop, resulting in improved cluster efficiency and application performance.

HDFS High Availability (aka NameNode HA): In Hadoop 1.0 NameNode was the single point of failure in a Cluster, resulting in data loss in case of a NameNode failure. Hadoop 2.0 Architecture supports multiple Name Nodes to remove this bottleneck, making Hadoop attractive to enterprises.

HDFS Federation: This feature allows horizontal scalability for Hadoop file system. Again one of the many sought after feature by enterprise class Hadoop users such as Amazon and eBay.

There are additional features such as Data Snapshot, Support for Windows, NFS access which will increase Hadoop adoption in the Industry to solve Big Data problems.

Рекомендации по теме