0% found this document useful (0 votes)
17 views5 pages

X Ai SS CH4 Notes

The document provides notes on data science, including multiple-choice questions and answers, short and long questions, and explanations related to data collection, data privacy, and data mining. It emphasizes the importance of data science in extracting valuable insights for decision-making and highlights various skills and methods used in the field. Additionally, it discusses ethical considerations in data collection and the significance of reliable data sources.

Uploaded by

Laksin VJ
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views5 pages

X Ai SS CH4 Notes

The document provides notes on data science, including multiple-choice questions and answers, short and long questions, and explanations related to data collection, data privacy, and data mining. It emphasizes the importance of data science in extracting valuable insights for decision-making and highlights various skills and methods used in the field. Additionally, it discusses ethical considerations in data collection and the significance of reliable data sources.

Uploaded by

Laksin VJ
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

CLASS X – ARTIFICIAL INTELLIGENCE NOTES

CHAPTER 4 – DATA SCIENCE


QUESTIONS AND ANSWER: MCQ:
1. Data science is the process of diverse set of data through?
A. Organizing data
B. Processing data
C. Analysing data
D. All of the above
Explanation: Data science is the field which includes organizing data,
processing data and analysing data to extract valuable information from
data for business decision-making, strategic planning, etc. So, All of the
above is correct.
2.Point out the correct statement.
A. Raw data is original source of data
B. Preprocessed data is original source of data
C. Raw data is the data obtained after processing steps
D. None of the above
Explanation: Raw data is original source of data is the correct answer. So,
option A is correct.
3. How do we perform Bayesian classification when some features are
missing?
A. We integrate the posteriors probabilities over the missing features
B. We ignore the missing features
C. We assuming the missing values as the mean of all values
D. Drop the features completely
Answer: A
Explanation: When some features are missing, while performing Bayesian
classification we don’t use general methods of handling missing values but
we integrate the posteriors probabilities over the missing features for better
predictions. So, option A is correct

1
4.The modern conception of data science as an independent discipline is
sometimes attributed to?
A. John McCarthy B. Arthur Samuel C. William S. D. Dennis Ritchie
5. graph displays information as a series of data points connected by
straight line segments.
A. Bar B. Scatter C. Histogram D. Line
Explanation: A line graph displays information as a series of data points
connected by straight line segments.
6. Data fishing is sometimes referred to as
A. Data bagging
B. Data dredging
C. Data merging
D. None of the mentioned
Explanation: Data fishing is sometimes referred to as Data dredging so
option B is correct.
7. Which is one of the significant data science skills?
A. Statistics B. Data Visualization C. Machine Learning D. All of the above
Answer: D
8. A method used to make vector of repeated values?
A. read() B. data() C. rep() D. view()
Answer: B
Explanation: data() method used to make vector of repeated values.
9. Which of the following step is performed by the data scientist after
acquiring the data?
A. Data Replication B. Data Integration C. Data Cleansing
Answer : C
Explanation: Data cleansing, data cleaning or data scrubbing is the process
of detecting and correcting (or removing) corrupt or inaccurate records from
a database, table, or record set and it is generally performed by data
scientist after acquiring the data.
10. Which of the following is good way of performing experiments in data
science?
A. Measure variability

2
B. Generalize to the problem
C. Have Replication
D. All of the above
Answer: D
Explanation: Measure variability, generalize to the problem, have Replication
all of the mentioned is good way of performing experiments in data science
SHORT AND LONG QUESTION AND ANSWER:
1. All of us use Smartphone’s. When we install a new app, it asks us for
several permissions to access our phone’s data in different ways. Why do
apps collect such data?
1. To provide customized notifications and recommendations.
2. To improve the efficiency and accuracy of the app.

2. Sirisha and Divisha want to make a model which will organize the
unlabeled input data into groups based on features. Which learning model
should they use and why?
ANS: Clustering model/Unsupervised learning is used to organize the
unlabeled input data into groups based on features. Clustering is an
unsupervised learning algorithm which can cluster unknown data according
to the patterns or trends identified out of it. The patterns observed might be
the ones which are known to the developer or it might even come up with
some unique patterns out of it.
3. Ajay wants to access data from various sources. Suggest him any two
points that he needs to keep in mind while accessing data from any data
source.
ANS: While accessing data from any of the data sources, following points
should be kept in mind:
1.Data which is available for public usage only should be taken up.
2.Personal datasets should only be used with the consent of the owner.
3.One should never breach someone’s privacy to collect data.
4.Data should only be taken from reliable sources as the data collected from
random sources can be wrong or unusable.
5.Reliable sources of data ensure the authenticity of data which helps in the
proper training of the AI model.
6.Data should be relevant to the problem

3
4. What is Data science? Give an example of it.
ANS:Data sciences is a domain of AI related to data systems and processes,
in which the system collects numerous data, maintains data sets and
derives meaning/sense out of them. The information extracted through data
science can be used to make a decision about it.
OR
Data science is the field of study that combines domain expertise,
programming skills, and knowledge of mathematics and statistics to extract
meaningful insights from data
Data Sciences, it is a concept to unify statistics, data analysis, machine
learning and their related methods in order to understand and analyses
actual phenomena with data.
For example: a company that has petabytes of user data may use data
science to develop effective ways to store, manage, and analyze the data.
5. Where do we collect data from?
ANS: Data can be collected from various sources like –
a. Surveys
b. Sensors
c. Observations
d. Web scrapping (Internet)
e. Interviews
f. Documents and records.
g. Oral histories
6. Why do we need to collect data?
ANS: Data to a machine is similar to food for human being to function. The
world of Artificial Intelligence revolves around Data. Every company whether
small or big is collecting data from as many sources as possible. Data is
called the New Gold today. It is through data collection that a business or
management has the quality information they need to make informed
decisions from further analysis, study, and research. Data collection allows
them to stay on top of trends, provide answers to problems, and analyze new
insights to great effect.
7. What is data mining? Explain with example.
ANS: Data mining is the process of analyzing large data sets and extracting
the useful information from it. Data mining is used by companies to turn

4
raw data into useful information. It is an interdisciplinary subfield of
computer science and statistics with an overall goal to extract information
OR
Data mining is an automatic or semi-automatic technical process that
analyses large amounts of scattered information to make sense of it and
turn it into knowledge. It looks for anomalies, patterns or correlations
among millions of records to predict results, as indicated by the SAS
institute, a world leader in business analytics.
Example:
Price Comparison websites- They collect data about a product from different
sites and then analyze trends out of it and show up the most appropriate
results. Data mining is also known as Knowledge Discovery in Data (KDD)
8. What do you understand by Data Privacy?
ANS: The world of Artificial Intelligence revolves around Data. Proper and
ethical handling of own data or user data is called data privacy. It is all
about the rights of individuals with respect to their personal information.
Data privacy or information privacy is a branch of data security concerned
with the proper handling of data – consent, notice, and regulatory
obligations. More specifically, practical data privacy concerns often revolve
around: Whether or how data is shared with third parties

9. Is data which is collected by various applications ethical in nature?


Justify your answer.
ANS: Yes, most of the times, the data collected by various applications is
ethical in nature as the users agree to it by clicking on allow when the
application asks for various permissions. They ask for our data for various
facilities like - to show us personalized recommendations and
advertisements and to make their app more accurate and efficient.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy