The document provides an overview of the Hadoop ecosystem, focusing on components like HDFS, YARN, and NoSQL databases. It discusses the high availability feature in Hadoop 2.x, which addresses the single point of failure issue by allowing multiple NameNodes, and introduces HDFS Federation for improved scalability. Additionally, it contrasts NoSQL databases with traditional RDBMS, highlighting their advantages and disadvantages in terms of scalability, availability, and data structure.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
0 ratings0% found this document useful (0 votes)
8 views54 pages
Big Data Unit-4
The document provides an overview of the Hadoop ecosystem, focusing on components like HDFS, YARN, and NoSQL databases. It discusses the high availability feature in Hadoop 2.x, which addresses the single point of failure issue by allowing multiple NameNodes, and introduces HDFS Federation for improved scalability. Additionally, it contrasts NoSQL databases with traditional RDBMS, highlighting their advantages and disadvantages in terms of scalability, availability, and data structure.