0% found this document useful (0 votes)
127 views3 pages

Big Data Analytics With Lab

This document outlines a course on big data and Hadoop. The course objectives are to learn about big data, Hadoop, NoSQL databases, HDFS concepts, data processing operators, and various visualization techniques. The course contains 5 units that cover introduction to big data and Hadoop, NoSQL databases, HDFS and data analysis, the Hadoop ecosystem, and data visualization. It also lists experiments and assignments including installing Hadoop, MapReduce programs, Hive, HBase, and data importing/exporting. The outcomes are for students to understand big data technologies, analytical approaches, Hadoop YARN, NoSQL, develop big data solutions, and apply Hadoop tools to problems.

Uploaded by

Keerthana K
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
127 views3 pages

Big Data Analytics With Lab

This document outlines a course on big data and Hadoop. The course objectives are to learn about big data, Hadoop, NoSQL databases, HDFS concepts, data processing operators, and various visualization techniques. The course contains 5 units that cover introduction to big data and Hadoop, NoSQL databases, HDFS and data analysis, the Hadoop ecosystem, and data visualization. It also lists experiments and assignments including installing Hadoop, MapReduce programs, Hive, HBase, and data importing/exporting. The outcomes are for students to understand big data technologies, analytical approaches, Hadoop YARN, NoSQL, develop big data solutions, and apply Hadoop tools to problems.

Uploaded by

Keerthana K
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

OBJECTIVES:

● To learn big data and hadoop platform


● To provide an overview of No SQL databases
● To understand HDFS concepts and interfacing with HDFS
● To examine data processing operators and compare with traditional databases
● To gain knowledge on various visualization techniques

UNIT I INTRODUCTION TO BIG DATA AND HADOOP 9


Analytics – DescriptiveAnalytics – Diagnostic Analytics – Predictive Analytics –
Prescriptive Analytics –Types of Digital Data - Introduction to Big Data - Big Data Analytics - History of
Hadoop - Apache Hadoop -Analyzing Data with Unix tools -Analyzing Data with Hadoop - Hadoop
Streaming - Hadoop Ecosystem -IBM Big Data Strategy.

UNIT II BIG DATA PATTERNS & NOSQL 9


No SQL databases: MongoDB: Introduction – Features – Data types – MongoDB Query language –
CRUD operations – Arrays – Functions: Count – Sort – Limit – Skip – Aggregate – Map Reduce. Cursors
– Indexes – Mongo Import – Mongo Export. Cassandra: Introduction – Features – Data types – CQLSH –
Key spaces – CRUD operations – Collections – Counter – TTL – Alter commands – Import andExport
–QueryingSystemtables.

UNIT III BIG DATA STORAGE AND ANALYSIS 9


Design of HDFS- HDFS Concepts - Command Line Interface - Hadoop file system interfaces - Data flow
- Hadoop I/O: Compression, Serialization, Avro - File-Based Data structures, Mapreduce Model with
example – Hadoop YARN – HadoopSchedulers.

UNIT IV HADOOP ECOSYSTEM 9


Introduction to PIG, Execution Modes of PigComparison of Pig with Databases, Grunt, Pig Latin, User
Defined Functions, Data Processing operators - Hive : Hive Shell, Hive Services, Hive Metastore -
Comparison with Traditional Databases,HiveQLBig SQL:Introduction

UNIT V CASE STUDY AND DATA VISUALIZATION 9


Data Visualisation – Frameworks & Libraries – Types - Line Chart – Scatter Plot - Bar Chart - Box Plot -
Pie Chart - Dot Chart - Map Chart - Gauge Chart - Radar Chart - Matrix Chart - Spatial Graph -
Distribution Plot - Violin Plot - Count Plot – Case Study: Installation of Hive along with practice
examples - Implement of MatrixMultiplicationwithHadoopMapReduce.
LIST OF EXPERIMENTS:
1. Downloading and installing Hadoop; Understanding different Hadoop modes. Startup scripts,
Configuration files.
2. Hadoop Implementation of file management tasks, such as Adding files and directories, Retrieving files
and Deleting files
3. Implement of Matrix Multiplication with Hadoop Map Reduce
4. Run a basic Word Count Map Reduce program to understand Map Reduce Paradigm.
5. Implementation of K-means clustering using Map Reduce
6. Installation of Hive along with practice examples.
7. Installation of HBase, Installing thrift along with Practice examples
8. Practice importing and exporting data from various databases .
TOTAL : 60 PERIODS

TEXT BOOK:
1. Seema Acharya, Subhashini Chellappan,“Big Data and Analytics”, Wiley Publication, 2015.
2. Arshdeep Bahga, Vijay Madisettai,“Big Data Science & Analytics”, Vpt Publisher, 2016
3. David Loshin, "Big Data Analytics: From Strategic Planning to Enterprise Integration with Tools,
Techniques, NoSQL, and Graph", Morgan Kaufmann/Elsevier Publishers, 2013.
4. TomWhite,“Hadoop: TheDefinitive Guide”, O’Reilly, 4th Edition, 2015.
5. Bart Baesens,“Analytics in a Big Data World: The Essential Guide to Data Science and its
Applications”, Wiley, 2014.

REFERENCES:
1. Jure Leskovec, Anand Rajaraman and Jeffrey David Ullman,“Mining of Massive Datasets”, Cambridge
University Press, 2012.
2. Michael Berthold, David J.Hand,“Intelligent Data Analysis”, Springer, 2007.
3. “Data Science and Big Data Analytics”, EMC2 Education Services, 2013.
4. Seema Acharya, SubhashiniChellappan,“Big Data and Analytics”, Wiley Publications, First Edition,
2015

WEB REFERENCES:
1. https://nptel.ac.in/courses/110106072
2. https://archive.nptel.ac.in/courses/106/104/106104189/

ONLINE RESOURCES:
1. https://nptel.ac.in/courses/110106072
OUTCOMES:
Upon completion of the course, the student should be able to:
1. Understand the Technologies for Handling Big Data and Hadoop Ecosystem (K1)
2. Identify the Analytical Approaches and Tools to analyze the data (K2)
3. Acquire clear understanding of Hadoop YARN and NoSQL Data Management (K2)
4. Develop Big Data Solutions using Hadoop EcoSystem (K2)
5. Illustrate the distribution of numerical data and various visualization methods(K3)
6. Apply Hadoop, Map Reduce, Hive and HBase to solve sample and real time problems(K3)

CO – PO, PSO MAPPING:

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy