
3-Day Big Data Crash Course Content

The 3-day big data crash course covers Hadoop, Spark, NoSQL databases, and related tools. Day 1 focuses on Hadoop fundamentals such as HDFS, MapReduce, and YARN, as well as Hive data warehousing. Day 2 covers NoSQL databases such as HBase, along with Spark and its components. Day 3 examines tools for data ingestion (Sqoop, Flume, Kafka) and workflow scheduling (Oozie), the HUE query interface, and the integration of big data with BI and visualization tools on platforms such as Cloudera and HDInsight. Each day includes hands-on exercises that give practical experience with the concepts and technologies.

Uploaded by geoinsys

Duration of the course: 3 days

Program Name: Big data crash course (the number after each topic is the day on which it is taught)

Hadoop, Spark, NoSQL

Module 1


Introduction to Big Data 1
Characteristics 1
The why, how, and what of big data 1
Existing OLTP, ETL, DWH, OLAP 1

Module 2 1
Introduction to Hadoop Ecosystem 1
Architecture: HDFS
Sharding, distribution, and replication factor (SDR) 1
Daemons 1
Hadoop FS shell commands 1
Writing data to HDFS 1
Reading data from HDFS 1
MapReduce and YARN 1
Hands on 1

Module 3 1
Introduction to the Hive data warehouse 1
HiveQL commands 1
Manipulation and analytical functions in Hive 1
Managed table and external tables 2
Partitioning and Bucketing 2

Day 2

Module 4 2
NoSQL databases 2
CAP theorem/BASE 2
The HBase Data Model
The HBase Shell 2
HBase Architecture 2
Schema Design 2
Module 5 2
Spark Core and components 2
Spark shell 2
RDD 2
DataFrame/Dataset 2
Spark SQL 2

Day 3
Module 6
Sqoop 3
Flume 3
Kafka 3
Oozie 3
HUE 3
Hands-on each module 3
Big data sources as source and target 3
ETL integration with Hadoop (Informatica or SSIS) 3
OLAP/data visualization integration with Hadoop (MicroStrategy o 3
Distribution: Cloudera/Hortonworks (HDInsight)/MapR or A 3