Introduction to Business Analytics

Outline
- Big Data Drivers
- Challenges of Analytics
- Types of Business Analytics
- Benefits of Business Analytics
- Careers in Big Data Analytics
- Benefits of Big Data Analytics
What's driving Big Data

The shift is away from traditional analytics:
- Ad-hoc querying and reporting
- Data mining techniques
- Structured data, typical sources
- Small to mid-size datasets

and toward big data analytics:
- Optimizations and predictive analytics
- Complex statistical analysis
- All types of data, and many sources
- Very large datasets
- More real-time processing
The Big-Data Challenge

"Everywhere you look, the quantity of information in the world is soaring. Merely keeping up with this flood, and storing the bits that might be useful, is difficult enough. Analyzing it, to spot patterns and extract useful information, is harder still."
The Economist, "The Data Deluge", 2/10/2010

Gartner says:
- "The Big Data Challenge Involves More Than Just Managing Volumes of Data"
- "The real issue is making sense out of the data and (...) helping organizations make better decisions."
Definition of Insights

"Insights are thoughts, facts, data, or analysis of facts and data that induce meaning and further understanding of a business challenge and answer essential questions and create an urgency to act or rethink a business challenge in terms of its problems or solutions."
Value of Big Data Analytics

- Big data is more real-time in nature than traditional DW applications
- Traditional DW architectures (e.g. Exadata, Teradata) are not well-suited for big data apps
- Shared-nothing, massively parallel processing, scale-out architectures are well-suited for big data apps
Predictive Analytics

Prescriptive Analytics
- Big problem: understanding the output

What Technology Do We Have for Big Data?
Why is Hadoop able to compete?

Hadoop:
- Scalability (petabytes of data, thousands of machines)
- Flexibility in accepting all data formats (no schema)
- Efficient and simple fault-tolerance mechanism
- Commodity, inexpensive hardware

vs. Database:
- Performance (tons of indexing, tuning, and data-organization techniques)
- Features:
  - Provenance tracking
  - Annotation management
  - ...
What is Hadoop

- Hadoop is a software framework for distributed processing of large datasets across large clusters of computers
  - Large datasets: terabytes or petabytes of data
  - Large clusters: hundreds or thousands of nodes
- Hadoop is an open-source implementation of Google's MapReduce
- Hadoop is based on a simple programming model called MapReduce (see the sketch after this list)
- Hadoop is based on a simple data model: any data will fit
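As the MapReduce bullet above suggests, the programming model boils down to writing a map function and a reduce function. Below is a minimal word-count sketch in Java against the standard org.apache.hadoop.mapreduce API; the class names (WordCount, TokenizerMapper, IntSumReducer) and the choice of word count as the job are illustrative, not taken from these slides.

```java
import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;

public class WordCount {

  // Map phase: called once per input record (here, one line of text).
  // Emits (word, 1) for every token in the line.
  public static class TokenizerMapper
      extends Mapper<Object, Text, Text, IntWritable> {

    private final static IntWritable one = new IntWritable(1);
    private final Text word = new Text();

    @Override
    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, one);   // the framework shuffles these pairs by key
      }
    }
  }

  // Reduce phase: called once per distinct key with all of its values.
  // Sums the counts emitted by the mappers.
  public static class IntSumReducer
      extends Reducer<Text, IntWritable, Text, IntWritable> {

    private final IntWritable result = new IntWritable();

    @Override
    public void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result);   // emits (word, total count)
    }
  }
}
```

Each mapper runs on one split of the input in parallel; the framework groups the emitted (word, 1) pairs by key and hands each group to a reducer, which is what makes the model automatically parallelizable.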
What is Hadoop (Cont'd)

- The Hadoop framework consists of two main layers (a small HDFS usage sketch follows this list):
  - Distributed file system (HDFS)
  - Execution engine (MapReduce)
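As a small illustration of the HDFS layer, the sketch below writes and reads one file through the standard org.apache.hadoop.fs.FileSystem API. It assumes a reachable cluster configured via the usual core-site.xml/hdfs-site.xml files; the path /user/demo/sample.txt and the class name HdfsHello are made up for the example.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsHello {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();       // reads core-site.xml / hdfs-site.xml
    FileSystem fs = FileSystem.get(conf);           // connects to the configured file system

    Path path = new Path("/user/demo/sample.txt");  // hypothetical path, for illustration only

    // Write a small file; HDFS splits files into blocks and replicates
    // each block across several slave (DataNode) machines.
    try (FSDataOutputStream out = fs.create(path, true)) {
      out.writeUTF("hello hadoop");
    }

    // Read it back; the client fetches each block from whichever node stores it.
    try (FSDataInputStream in = fs.open(path)) {
      System.out.println(in.readUTF());
    }

    fs.close();
  }
}
```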
Hadoop Master/Slave Architecture

- Hadoop is designed as a master-slave, shared-nothing architecture
  - Master node (single node)
  - Many slave nodes

Design Principles of Hadoop

- Need to process big data
- Need to parallelize computation across thousands of nodes
- Commodity hardware
  - Large number of low-end, cheap machines working in parallel to solve a computing problem
  - This is in contrast to parallel DBs: a small number of high-end, expensive machines
Design Principles of Hadoop (Cont'd)

- Automatic parallelization and distribution
  - Hidden from the end user
- Fault tolerance and automatic recovery
  - Nodes/tasks will fail and will recover automatically
- Clean and simple programming abstraction
  - Users only provide two functions, "map" and "reduce" (see the driver sketch after this list)
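To make the last bullet concrete, here is a sketch of a job driver that supplies only the map and reduce classes from the earlier word-count sketch; parallelization, distribution, and recovery from failed tasks are left to the framework, as the slides describe. The command-line input and output paths are illustrative.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCountDriver {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    Job job = Job.getInstance(conf, "word count");
    job.setJarByClass(WordCountDriver.class);

    // The user-supplied pieces: just the map and reduce functions.
    job.setMapperClass(WordCount.TokenizerMapper.class);
    job.setReducerClass(WordCount.IntSumReducer.class);

    // Types of the final (key, value) pairs written by the reducer.
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);

    // Illustrative paths: args[0] = HDFS input dir, args[1] = output dir (must not exist yet).
    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));

    // The framework parallelizes, distributes, and retries failed tasks automatically.
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}
```

Assuming the classes are packaged into a jar, a run would look roughly like `hadoop jar wordcount.jar WordCountDriver /input /output`, with the output directory not existing beforehand.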
