0% found this document useful (0 votes)
124 views6 pages

Zoya Parasher - 2152916 - Big Data

This document summarizes an assignment submitted by Zoya Parasher to Dr. Sapna Arora on using big data to improve customer retention for organization ABC. The assignment addresses two questions: 1) Identifying business strategies related to big data that can be applied to improve customer retention and optimize customer behavior; 2) Describing a "Big Data" application including the application domain, big data problems, and existing solutions. Specific examples are provided of how Netflix uses big data for content development, recommendations, streaming optimization, and other purposes. The response also details several big data applications in the telecommunications sector including network improvement, predictive churn analysis, price reduction, and avoiding fraud.

Uploaded by

Zoya Parasher
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
124 views6 pages

Zoya Parasher - 2152916 - Big Data

This document summarizes an assignment submitted by Zoya Parasher to Dr. Sapna Arora on using big data to improve customer retention for organization ABC. The assignment addresses two questions: 1) Identifying business strategies related to big data that can be applied to improve customer retention and optimize customer behavior; 2) Describing a "Big Data" application including the application domain, big data problems, and existing solutions. Specific examples are provided of how Netflix uses big data for content development, recommendations, streaming optimization, and other purposes. The response also details several big data applications in the telecommunications sector including network improvement, predictive churn analysis, price reduction, and avoiding fraud.

Uploaded by

Zoya Parasher
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Assignment#1

On

Big Data

SUBMITTED BY:

Zoya Parasher

Enroll No.: 2152916

SUBMITTED TO:

Dr. Sapna Arora

Associate Professor

MBA PROGRAM

IILM UNIVERSITY, GURUGRAM

August,2022
Ques 1: Identify a key business initiative for organization ABC. What are the different business strategies
related to big data can be applied to improve customer retention, optimizing customer behavior?
Big data is a tool that Netflix employs to create popular content. Netflix was able to readily establish what kind
of material people desired because of the direct interaction it has with its users and the wealth of data it has
on how viewers interact with its content.
Netflix wants to give content producers the possibility to realize their most inventive ideas and to amuse via
creative storytelling (Smith et al., 2019). According to Brennan (2018), Netflix's intention is to concentrate on
local content both for locals and for people around the world. It carefully selects content and fosters artists'
inventiveness. Determining titles to stream, content that should be licensed or produced, customer journey
analytics from prospect to member and ongoing, marketing to existing customer base and attracting new
members, portfolio mix effectiveness, and insights to implement various strategies in various markets are
areas where data is used for decision-making and predictive analytics.
Bigdata analytics are used by Netflix to better understand its consumer base. They can provide the customer a
better service or product by utilizing these facts. Large volumes of data are gathered by Netflix from a diverse
subscriber base. It gathers information about a user's location, the content they watch, their interests, the
information they search for, and the time they watch it.
Netflix uses big data in the fields of global content development, content mix, user interface, licensing,
marketing, payment partnerships, and device support, according to Purkayastha, et al. (2013) and Brennan
(2018). In order to provide pertinent, individualized suggestions and produce spectacular, successful
experiences, Netflix analyses data on individual viewing patterns, including the subtleties of when members
pause, skip, stop, or switch. Netflix has confidently entered the production of original, inventive content by
utilizing insights into viewing behavior, hand-picking directors, writers, and performers, and acquiring license
rights (Brennan, 2018; McCranken, 2013). The article goes on to say that as part of its plan for exponential
globalization, Netflix is adjusting to local customs and preferences, forming relationships with local businesses,
and attracting viewers from abroad.
Currently, Netflix is very popular, and its popularity will increase during the Pandemic since people will truly
live by the slogan "Netflix and Chill." During the lockdown, it gained increased significance. The corporation
has significantly increased its subscriber base—it now has 195 million users worldwide—and has made AI and
data the cornerstones of their corporate strategy.
How are they using Big Data?

• They are using this to create content in the first place because it makes it evident what viewers enjoy
and dislike, where they spend more time watching, and which genre is popular among different age
groups. They were able to develop more content and comprehend specific markets as a result.
• The second area, which is rather large, is the automatic recommendations we receive after finishing a
series. The interesting conclusion is that since suggestions account for 80% of what we watch, Netflix is
able to predict our preferences almost as well as our mothers! They accomplished this by optimizing
their algorithms.
• The third section, which has automatically created thumbnails that we saw on the screen, is the most
fascinating section. In the five minutes it takes us to go through 30 titles, each thumbnail must be
interesting, succinct, and represent the main idea of the series or film. This thumbnail might be from
any scene, but the data makes it more likely that it will come from one that I particularly enjoy!
• The fourth area, which is their streaming optimization, is quite large. When a murderer is about to be
revealed, no one wants their movie to get stuck. They put the movies and shows you wish to view on a
local server using AI.
Netflix uses data processing software and traditional business intelligence tools such as Hadoop and Teradata,
as well as its own open-source solutions such as Lipstick and Genie, to gather, store, and process massive
amounts of information. (Sadeh, ClickZ, 2019).
Ques 2: Identify a "Big Data" application and write a report with a detailed description of the following:

• Application domain
• Big data-related problems (computing, storage, database, analytics, transfer, etc.)
• Existing solutions
Also, please mention the 10-vs associated with the term “Big Data”.
According to Valuates Reports, the market for big data analytics is expected to rise at a CAGR of 13.5% from
2020 to 2030.
The expansion of the big data market is being driven by the rising use of data analytics by various industries in
order to lower costs and deliver quicker and better decision-making by evaluating and acting on information in
a timely way.
Big data may help telecom firms become more profitable by enhancing customer experience, enhancing
network utilization and services, and improving security.
The telecommunications sector has access to new prospects because to big data. It can enhance service
quality and enable more efficient traffic routing. Tele businesses can also spot fraud and take prompt action
on it by monitoring call data records in real-time. In the end, this gives them a market advantage and helps
them discover untapped possibilities.
Big data is now crucial for advancing the telecommunications sector. Telecommunications providers may
significantly enhance their services and make their users happier with the correct data analytics strategy.
Big data analytics may help businesses and organizations make more informed decisions, provide better
customer service, and run more smoothly.

• Network Improvement:
The telecom sector is beginning to make use of big data analytics to efficiently monitor and manage network
capacity, create predictive capacity models, and use it for decision-making on network growth.
The telecom service providers can identify extremely congested locations where network usage is approaching
its capacity thresholds to prioritize expansion for new capacity roll out using real-time data analytics.
They can create predictive capacity forecasting models based on real-time analytics and prepare for extra
capacity in case of disruptions.
Data analytics for telecom can also assist in the detection of abnormalities and the maintenance of the secure,
dependable, and effective operation of the network systems.

• Predictive churn Analysis:


Long-term client engagement requires a lot of work. Numerous US customers cease using their telecom
provider's services each year for reasons like subpar customer service.
To avoid customer turnover, it is essential to analyze consumer behavior and respond appropriately. Data
analytics can be used to predict network behavior, map future demands, and continuously monitor and
manage any reduction in service performance.
By thoroughly examining hundreds of data points and millions of network usage patterns, it also aids in
understanding consumer preferences and locating problems like churn concerns. Advanced data analytics can
help the telecom sector predict customer turnover and lower it by 15%, according to McKinsey & Company.
For instance, data analytics in the telecom sector can assist operators in actively contacting high-value clients
who have reported a number of service-related problems or complaints on social media.
This would make it easier for service providers to address the problems and provide discounts or service
credits to entice customers to stay with them.

• Price Reduction
It is now essential for telecom operators to set the best prices for their goods and services due to the
increased market rivalry for customers.
By examining consumers' responses to various pricing strategies, purchase histories, and competition pricing,
telecom operators can use data analytics to gather precise data insights and develop the best pricing plans.
Additionally, telecom providers can increase their return on investment, determine the perceived worth of
their goods or services, and boost the efficiency of their sales force.
Sales may be increased, new consumers can be attracted, and most importantly, devoted clients can be kept
by optimizing the price plan based on profit and revenue earned.

• Avoiding Fraud
According to industry estimates, telecoms lose 2.8% of their revenue each year to fraud and leakage, costing
the sector $40 billion yearly.
The telecommunications sector can be shielded against this scam by using big data analytics. It can capture
spam calls and mailings and recognize terms used frequently by online fraudsters. As an illustration, a Chinese
mobile operator recently released an app called Sky Shield that uses big data and AI to stop fraud in the
telecom industry.
The police gave the programmers a database of fraud cases, and Sky Shield was able to identify fraudulent
communication patterns, tell them apart from legitimate communications, and block spam calls and texts as a
result.

• Engine Recommendations
The customer's behavior is indicated by the recommendation engine, which is a collection of clever
algorithms. It forecasts the wants of customers in the future based on that behavior. Both content-based and
collaborative filtering techniques are used by recommendation engines.
The attributes that demonstrate the connection between the customer profile and the product or service a
customer selects are used in content-based filtering. On the other hand, collaborative filtering is based on
data analysis in accordance with user preferences and behavior.

• Making preventative diagnoses


Telcos can recognize patterns of system behavior that appear before errors occur and pinpoint the reasons for
those failures by using data analytics.
Early diagnosis aids in planning preventative maintenance, equipment replacement, and equipment repair for
operators.
By using data from their social networks, big data-based predictive analytics can also assist operators in
analyzing the intentions of their clients. Big data enables telecom companies to identify influencers within
their clientele.
Volume

•Data that is already in an organisation and has to be processed.


•Making sense of the data is now crucial for making the best business decisions.
•When the data's size itself contributes to the issue, this is called "volume."
•The volume of data we have. how fast it is expanding Different storage, management, and processing
requirements result from this.

Velocity

•Data in motion is referred to as having a velocity.


•The speed with which data enters an organisation.
•The time needed for processing by the organisation.
•The speed at which data is generated is implied by the term velocity.
•It alludes to the moving data.

Variety

•Data in a variety of formats


•Data of various kinds
•Different types of data, data in diverse forms, and data from many sources.
•Varieties can affect how uniformly data is collected inside an organisation.

Veracity

•The Data in Doubt is another name for veracity. It speaks to the calibre of the information.
•The validity of the source data affects how accurate the analysis is.
•Veracity raises the question of whether the information is true and thorough, as well as from a reputable
source.
•Biases, noise, and irregularity are all examples of veracity in data.
•If the data is inaccurate, having a lot of it arriving quickly in diverse amounts is useless.

Validity

•The term "validity" describes the data's veracity, responsibility, correctness, appropriateness, precision, and
accuracy with regard to time.

Value

•Value refers to the usefulness of the data.


•Raw data itself is not valuable at all.
•Value does not characterize data as “Big Data”.

Variability

•Data that has a constantly shifting meaning is referred to as variable.


•Variety and variability are not the same thing.
•Having an understanding of the data's context might help you manage variability.

Venue

•Different types of data came from various sources via various platforms, including personnel systems and
private and public clouds.
•Work in data science occurs locally, on consumer workstations, and in the cloud, among other varied settings.

Vocabulary

•Data science offers a language for solving many issues. Different modelling strategies address various problem
domains, and various validation methods strengthen these strategies in various applications.

Vagueness

•Regardless of the quantity of data accessible, the interpretation of found data is frequently quite ambiguous.
•Information that was vague regarding the truth or showed little to no consideration for what it would mean
Ques 3: Can daily transaction be considered a part of HDFS? (Y/N?) Illustrate the difference with the existing
systems.
The data kept in Hadoop's file is not accessible at random. Hadoop cannot be used as an OLTP database, which
is defined by INSERT, UPDATE, and DELETE operations. Hadoop makes it possible to access previous data for
analysis. MapReduce is an algorithm that runs on the Hadoop distributed file system.

S.No. RDBMS Hadoop


1 Traditional row-column based databases, An open-source software used for storing data
basically used for data storage, manipulation and running applications or processes
and retrieval. concurrently.
2 In this structured data is mostly processed. In this both structured and unstructured data
is processed.
3 It is best suited for OLTP environment. It is best suited for BIG data.
4 It is less scalable than Hadoop. It is highly scalable.
5 Data normalization is required in RDBMS. Data normalization is not required in Hadoop.
6 It stores transformed and aggregated data. It stores huge volume of data.
7 It has no latency in response. It has some latency in response.
8 The data schema of RDBMS is static type. The data schema of Hadoop is dynamic type.
9 High data integrity available. Low data integrity available than RDBMS.
10 Cost is applicable for licensed software. Free of cost, as it is an open source software.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy