Data Sciences in Telecommunication-Chapitre-1
Data Sciences in Telecommunication-Chapitre-1
FGE
3 -ème Ingénieur ATT
Course
Date Science in Telecommunication
Industry : use Cases
Presented by Pr. Krobba.A
hkrobba@gmail.com
Course
Date Science in Telecommunication Industry : use Cases
What is Data
Data is a collection of discrete or continuous values that convey information,
describing
Quantity, Quality, Fact, Statistics,
Chapiter 1 : Introduction to Data science
What is Data
What is Data
Data acquisition
Data analysis
Data analysis is the process of inspecting, cleansing, transforming, and modeling data with the
goal of discovering useful information, informing conclusions, and supporting decision-making
Data analysis
Data requirements
Data collection
Data processing
Data cleaning
Data modeling
Chapiter 1 : Introduction to Data science
Data protection.
Data protection is a strategy set to secure privacy, availability, & integrity of data. Data privacy
is a guideline of how data should be collected
Data Science is a field that gives insights from structured and unstructured data, using
different scientific methods and algorithms, and consequently helps in generating insights,
making predictions and devising data driver solutions.
It uses a large amount of data to get meaningful insights using statistics and computation for
decision making.
Chapiter 1 : Introduction to Data science
The data used in Data Science is usually collected from different sources, such as e-
commerce sites, surveys, social media, and internet searches. All this access to data has
become possible due to the advanced technologies for data collection.
This data helps in making predictions and providing profits to the businesses accordingly.
Data Science is the most discussed topic in today’s time and is a hot career option due to the
great opportunities it has to offer.
Chapiter 1 : Introduction to Data science
Chapiter 1 : Introduction to Data science
Data science is not a one-step process such that you will get to learn it in a short time
and call ourselves a Data Scientist. It’s passes from many stages and every element is
important.
One should always follow the proper steps to reach the ladder. Every step has its value
and it counts in your model. Buckle up in your seats and get ready to learn about those
steps.
Chapiter 1 : Introduction to Data science
Data Collection:
After defining the problem statement, the next obvious step is to go in search of data that
you might require for your model. You must do good research, find all that you need. Data
can be in any form i.e unstructured or structured. It might be in various forms like videos,
spreadsheets, coded forms, etc. You must collect all these kinds of sources.
Chapiter 1 : Introduction to Data science
Data Cleaning:
As you have formulated your motive and also you did collect your data, the next step to
do is cleaning. Yes, it is! Data cleaning is the most favorite thing for data scientists to
do.
Data cleaning is all about the removal of missing, redundant, unnecessary and
duplicate data from your collection. There are various tools to do so with the help of
programming in either R or Python. It’s totally on you to choose one of them. Various
scientist have their opinion on which to choose.
When it comes to the statistical part, R is preferred over Python, as it has the privilege
of more than 12,000 packages. While python is used as it is fast, easily accessible and
we can perform the same things as we can in R with the help of various packages.
Chapiter 1 : Introduction to Data science
Data Modelling:
Once you are done with your study that you have formed from data visualization, you must start
building a hypothesis model such that it may yield you a good prediction in future. Here, you
must choose a good algorithm that best fit to your model.
There different kinds of algorithms from regression to classification, SVM( Support vector
machines), Clustering, etc. Your model can be of a Machine Learning algorithm.
You train your model with the train data and then test it with test data. There are various
methods to do so. One of them is the K-fold method where you split your whole data into two
parts, One is Train and the other is test data. On these bases, you train your model.
Chapiter 1 : Introduction to Data science
In short, you check the efficiency of the data model and thus try to optimize it for better
accurate prediction. Deployment deals with the launch of your model and let the people
outside there to benefit from that.
You can also obtain feedback from organizations and people to know their need and then to
work more on your model
Chapiter 1 : Introduction to Data science
Data quality: The accuracy and quality of the data used in data science can have a significant
impact on the results obtained.
Privacy concerns: The collection and use of data can raise privacy concerns, particularly if
the data is personal or sensitive.
Complexity: Data science can be a complex and technical field that requires specialized skills
and expertise.
Bias: Data science algorithms can be biased if the data used to train them is biased, which
can lead to inaccurate results.
Interpretation: Interpreting data science results can be challenging, particularly for non-
technical stakeholders who may not understand the underlying assumptions and methods
used.