0% found this document useful (0 votes)

18 views12 pages

Priority Questions

Fds priority Questions

Uploaded by

sainaresh2727

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views12 pages

Priority Questions

Fds priority Questions

Uploaded by

sainaresh2727

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

Mount Zion College of Engineering & Technology

To Make Man Whole!!

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS3352 – FOUNDATIONS OF DATA SCIENCE

UNIT I INTRODUCTION
P3 Data Science: Benefits and uses – facets of data
P1*Data Science Process: Overview
P2Defining research goals
P1*Retrieving data
P1*Data preparation
P1*Exploratory Data analysis
P1*Build the model
P2Presenting findings and building applications
P3Data Mining and Data Warehousing
P2Basic Statistical descriptions of Data
PART A - PRIORITY 1

1) Define Data science and its life cycle.

2) What is big data and list the V’s of big data?
3) List the categories of data used in data science.
4) State outliers with an example
5) Define data warehouse, data mart and data lake.
6) List the steps in data cleansing.
7) How will you handle the missing data?
8) What are the different ways of combining data?
9) Sketch the components of big data technologies
10) Define statistics and its types

PART A - PRIORITY 2

1) Identify the components of data science.

2) List the issues with real world data.
3) Identify the important contents of a project charter.
4) Mention the benefits of data preparation phase.
5) What is the implication of erroneous data for analysis?
6) What is confusion matrix?
Mount Zion College of Engineering & Technology
To Make Man Whole!!

PART A - PRIORITY 3

1) List the common evaluation metrics used to measure the performance of models.
2) How will you combine data from different data sources?
3) Define Euclidean distance.
4) What is machine learning?

PART B – PRIORITY 1

1) Give an overview of the data science process.

2) Explain the different stages of data preparation.
3) Describe the approaches for data exploration.
4) Explain the different types of retrieving data for analysis

PART B – PRIORITY 2

1) Discuss the significance setting research goal for the data science project.
2) Describe in brief about the tools for data science model building.
3) Explain the basic statistical descriptions of data.
4) Explain the benefits and uses of data science.

PART B – PRIORITY 3

1) Elaborate any 5 application domains of data science.

2) Describe the facets or categories of data for data mining.
Mount Zion College of Engineering & Technology
To Make Man Whole!!

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS3352 – FOUNDATIONS OF DATA SCIENCE

UNIT II DESCRIBING DATA

P3  Types of Data
P2  Types of Variables
P1* Describing Data with Tables and Graphs
P1* Describing Data with Averages
P1* Describing Variability
P2  Normal Distributions and Standard (z) Scores
PART A – PRIORITY 1

1) Define population and sample.

2) What is frequency distribution?
3) Write the guidelines for frequency distribution.
4) What is relative frequency distribution?
5) What is cumulative frequency distribution?
6) List the instructions to find the median.
7) Define range and variance.
8) Define standard deviation
9) What are degrees of freedom?
10) What is interquartile range?
11) Write the steps to calculate of the IQR.
12) Define Z score.
13) What is standard normal curve?

PART A – PRIORITY 2
1) What is causation?
2) What is positive relationship?
3) What is negative relationship?
4) Define linear relationship
5) What is curvilinear relationship?
6) What is standard error estimate?
7) When does regression fallacy occur?
Mount Zion College of Engineering & Technology
To Make Man Whole!!

PART A – PRIORITY 3

1) Differentiate constant and variable.

2) Write the steps to convert histogram to frequency histogram.
3) Give an example for stem and leaf display.
4) What are the typical shapes of graph?
5) Draw positively skewed distribution graph.
6) Draw negatively skewed distribution graph.
7) Define Bar graph and misleading graph.

PART B – PRIORITY 1

1) Explain the different types of frequency distribution with suitable examples and
diagrams
2) Construct the histogram and convert it to a frequency polygon for the following data
138, 139, 139, 145, 145, 150, 145, 136, 150, 152, 144, 138, 138, 150, 149, 133, 134,
152, 155, 151
3) Using the computation formula for the sum of squares, calculate the population
standard deviation for the scores in (a) and the sample standard deviation for the
scores in (b).
a) 1, 3, 7, 2, 0, 4, 7, 3
b) 10, 8, 5, 0, 1, 1, 7, 9, 2
6) Determine the values of the range and the IQR for the following sets of data.
a) Retirement ages: 60, 63, 45, 63, 65, 70, 55, 63, 60, 65, 63
b) Residence changes: 1, 3, 4, 1, 0, 2, 5, 8, 0, 2, 3, 4, 7, 11, 0, 2, 3, 4
7) Suppose that the burning times of electric light bulbs approximate a normal curve
with a mean of 1200 hours and a standard deviation of 120 hours. What proportion
of lights burn for
(a) less than 960 hours?
(b) more than 1500 hours?
(c) within 50 hours of the mean?
(d) between 1300 and 1400 hours?

PART B – PRIORITY 2

1) Discuss the methods to measure the variability for qualitative and ranked data.
2) Construct the frequency table and draw histogram, stem leaf displays for the
following data 139, 145, 150, 145, 136, 150, 152, 144, 138, 138
3) Compute the mean, median and mode for the following data sets
a) 45, 55, 60, 60, 63, 63, 63, 63, 65, 65, 70
b) 26.9, 26.3, 28.7, 27.4, 26.6, 27.4, 26.9, 26.9

PART B – PRIORITY 3

1) Explain the types of data and types of variables

Mount Zion College of Engineering & Technology
To Make Man Whole!!

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS3352 – FOUNDATIONS OF DATA SCIENCE

UNIT III DESCRIBING RELATIONSHIPS

Priority 1

 Correlation - Definition
 correlation coefficient for quantitative data
 computational formula for correlation coefficient
 Regression - regression line
 least squares regression line
 multiple regression equations

Priority 2

 Standard error of estimate

 interpretation of r2
Priority 3
 Scatter plots
 regression towards the mean
PART A – PRIORITY 1

1) Define scatter plot.

2) Define Pearson correlation coefficient
3) Define Regression.
4) Differentiate simple and multiple linear regressions?
5) Define least square regression equation.
6) Define interpretation of r^2
7) Define multiple regressions.
8) Define regression towards mean.
9) Differentiate correlation and regression.
10) Define correlation matrix.
11) What is standard error estimate?
PART A – PRIORITY 2
1) What are the types of correlation?
2) What is causation?
3) Differentiate linear and non linear relationship.
4) List the types of nonlinear relationship.
5) What is curvilinear relationship?
Mount Zion College of Engineering & Technology
To Make Man Whole!!

6) What is an outlier?

PART A – PRIORITY 3

1) When does regression fallacy occur?

2) Give the least square regression equation.
3) State the multiple regression equation.

PART B – PRIORITY 1

1) Calculate the coefficient of correlation between the expenditure on advertising and

sales of the company from the following data.
Advertising Expenditure 165 166 167 168 167 169 170 172
(in 000 rs)
Sales (in Lakhs ) 167 168 165 172 168 172 169 171

2) In an investigation into prediction using the stars and planets a celebrated astrologist
Horace Cope predicted the ages at which thirteen young people would first marry. The
complete data, of predicted and actual ages at first marriage, are now available and are
summarised in the table.
Person Predicte Actual
d Age Age(y
(x years)
years)

A 24 23
B 30 31
C 28 28
D 36 35
E 20 20
F 22 25
G 31 45
H 28 30
I 21 22
J 29 27
K 40 40
L 25 27
M 27 26

i. Draw a scatter diagram of these data.

Mount Zion College of Engineering & Technology
To Make Man Whole!!

ii. Calculate the equation of the regression line of y on x and draw this line on the
scatter

3) Conduct a multiple regression analysis by finding the regression model on the

following data set.

Y X1 X2
140 60 22
155 62 25
159 67 24
179 70 20
192 71 15
200 72 14
212 75 14
215 78 11

PART B – PRIORITY 2

1) Find the standard error of the estimate of the mean weight of high school football
players using the data given of weight of the players
Player 1 2 3 4 5 6 7 8 9 10
Number
Weight in 150 203 176 190 168 193 189 178 197 172
pounds

2) What is the significance of r2? Give a detailed interpretation of r2.

PART B – PRIORITY 3

1) What are scatter plots? Elaborate on the various types with suitable examples.
2) Explain regression towards the mean.
Mount Zion College of Engineering & Technology
To Make Man Whole!!

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS3352 – FOUNDATIONS OF DATA SCIENCE

UNIT IV- PYTHON LIBRARIES FOR DATA WRANGLING
Priority 1
 comparisons, masks, boolean logic
 Hierarchical indexing
 combining datasets
 aggregation and grouping
 pivot tables
Priority 2
 Data manipulation with Pandas
 data indexing and selection –
 operating on data
 missing data
Priority 3
 Basics of Numpy arrays
 aggregations
 computations on arrays
 fancy indexing
 structured arrays
PART A – PRIORITY 1
1. What is numpy in python used for?
2. Write a python program create an array?
3. Write the output of the following numpy copy.

• np.array([3,14,4,2,3])

• np,array([1,2,3,4],dtype=’float32’

• np.array([range (i,i+3) for i in [2,4,6]])

• np.zeros (1().dtype=int)

• np.ones((3,5),dtype=float)
Mount Zion College of Engineering & Technology
To Make Man Whole!!

• np.full((3,5),3.14)

• np.arange(0,20,2)

• np.linespace(0,1,5)

• np.random .random((3,3))

• np.random.normal(0,1(3,3))

4. What is data frame?

5. How a pandas data frame can be constructed?
6. What are indexers?
7. How missing data can be handled in python?
8. How the operations can be performed on null values in pandas data structures?
9. Define hierarchical indexing?
10. What is pivot table?
11. What is fancy indexing?
12. What is combined indexing?
13. What is the arithmetic operation implemented in numpy?
14. What is structured array in numpy?
15. Mention the purpose of iloc, loc, ix.

PART A – PRIORITY 2

1) What are index preservation and index alignment?

2) Map python operators and pandas methods.
3) Mention the function names that operate on null values in pandas.
4) How will you slice and index multi index?
5) What are the methods in data aggregations on multi indices?
PART A – PRIORITY 3
1) How will you concatenate arrays using numpy?
2) What are the categories of joins?
3) What are the methods used for groupby in pandas?
4) What are the basics of numpy arrays?
5) Define series object.

PART B – PRIORITY 1

1. Explain hierarchical indexing using pandas with an example.

2. Write a python program to explain Data indexing and selection using pandas.
3. How pandas libraries are used in data science for handling missing data ? Explain in
detail.
4. Explain combining datasets, aggregation and grouping in pandas.
Mount Zion College of Engineering & Technology
To Make Man Whole!!

5. Explain pivot tables in pandas.

PART B – PRIORITY 2

1. Extract from the array np.array([3,4,6,10,24,89,45,43,46,99,100]) with Boolean

masking all the number
a. Which are not divisible by 3
b. Which are divisible by 5
c. Which are divisible by 3 and 5
d. Which are divisible by 3 and set them to 42

PART B – PRIORITY 3

1) Explain about fancy indexing in detail using python program.

2) Write a python program to create structured arrays.
Mount Zion College of Engineering & Technology
To Make Man Whole!!

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS3352 – FOUNDATIONS OF DATA SCIENCE

UNIT V DATA VISUALIZATION
Priority 1
 Three Dimensional Plotting
 Geographic Data with Basemap
 Visualization with Seaborn.
Priority 2
 Density and contour plots
 Histograms – legends – colors – subplots
 Text and annotation
 Visualizing errors
 Customization colors
Priority 3
 Importing Matplotlib
 Line plots
 Scatter plots
PART A - PRIORITY 1

1) What is the purpose of matplotlib?

2) Write the dual interface of matplotlib
3) How to draw a simple line plot using matplotlib?
4) Write the syntax to draw scatter plot using matplolib?
5) Write the different between plot and scatter functions?
6) Define contour plot?
7) What are the functions can be used to draw the contour plots?
8) What is the purpose of using histogram?
9) Write the source code to draw a simple histogram?
10) How to create a three-dimensional wireframe plot?
11) Define surface plot?
12) What is the use of seaborn?
13) Write the significance of data visualization.
14) Define Kernel Distribution Estimation
Mount Zion College of Engineering & Technology
To Make Man Whole!!

PART A - PRIORITY 2

1. What is the purpose of error bar?

2. Enumerate the classes of colormaps in scatterplot
3. Comment on text transforms.
4. List the ways to customize Matplotlib
5. What is density plot?
6. What are pair plot?
7. What are factor plot and surface plot in seaborn?

PART A - PRIORITY 3

1. Write the features of seaborn modules.

2. Short notes on Basemap tool kit.

PART B - PRIORITY 1

1. Explain about the density and contour plots in matplotlib.

2. Describe text and annotation in detail with a python programming.
3. Illustrate customization and three dimensional plotting
4. Explain Geographic Data with Basemap Visualization.
5. Write a python program to visualize various plots using seaborn.

PART B - PRIORITY 2

1. How histogram can be implemented using matplotlib?

2. How will you customize the legends in matplotlib?
3. Describe about the subplots in matplotlib.
4. Illustrate the simple line plots with its attributes in matplotlib
5. Write python program to visualize the dataset using scatterplot and explain its
parameters
PART B - PRIORITY 3

1. Explain the concept of adding single and multiple legends to the plot.
2. Describe about customizing colors
3. Write a python program to draw histogram for any dataset.

CS3352 Foundations of Data Science Nov Dec 2022 Question Paper Download
No ratings yet
CS3352 Foundations of Data Science Nov Dec 2022 Question Paper Download
4 pages
Question Bank
No ratings yet
Question Bank
7 pages
Introduction To Statistics..Final
No ratings yet
Introduction To Statistics..Final
221 pages
April May 2023 FODS Arrear
No ratings yet
April May 2023 FODS Arrear
3 pages
FDS IMPORTANT QUESTIONS EduEngg
100% (1)
FDS IMPORTANT QUESTIONS EduEngg
7 pages
Question Bank
No ratings yet
Question Bank
7 pages
Fdsa Question-Bank
No ratings yet
Fdsa Question-Bank
7 pages
Ad3491 QB
No ratings yet
Ad3491 QB
17 pages
Ch03 - Predetermined OH Rates & Absorption-Variable Costing
No ratings yet
Ch03 - Predetermined OH Rates & Absorption-Variable Costing
10 pages
Mathematics For Intelligent Systems
No ratings yet
Mathematics For Intelligent Systems
7 pages
Fds UNIT 1
No ratings yet
Fds UNIT 1
38 pages
CS1 Mapping Syllabus PDF
No ratings yet
CS1 Mapping Syllabus PDF
9 pages
1152CS239-Intro. To Data Science-Syllabus
No ratings yet
1152CS239-Intro. To Data Science-Syllabus
6 pages
FDS - Ans Key 16.09 PDF
No ratings yet
FDS - Ans Key 16.09 PDF
12 pages
Punyashlok Ahilyadevi Holkar Solapur University, Solapur Final Year B.Tech. (Electronics & Telecommunication Engg.) (Part - II) CBCS Pattern
No ratings yet
Punyashlok Ahilyadevi Holkar Solapur University, Solapur Final Year B.Tech. (Electronics & Telecommunication Engg.) (Part - II) CBCS Pattern
6 pages
Andhra University
No ratings yet
Andhra University
51 pages
Data Science Question Bank Updated
No ratings yet
Data Science Question Bank Updated
15 pages
CS3352 QB
No ratings yet
CS3352 QB
9 pages
FODSQN
No ratings yet
FODSQN
9 pages
Question Bank FDS
No ratings yet
Question Bank FDS
4 pages
FDS QB
No ratings yet
FDS QB
3 pages
FDS QB
No ratings yet
FDS QB
3 pages
FDS QB New Format
No ratings yet
FDS QB New Format
10 pages
Graphical Displays of Data: Unit 1
No ratings yet
Graphical Displays of Data: Unit 1
45 pages
Course Syllabus For ICT 2020-2021 FV
No ratings yet
Course Syllabus For ICT 2020-2021 FV
53 pages
FDSA Unit 1
No ratings yet
FDSA Unit 1
34 pages
SYBScCS Statistics Sem IV Minor 2023 Pattern
No ratings yet
SYBScCS Statistics Sem IV Minor 2023 Pattern
16 pages
Unit I Introduction To Data Science: Dept. of IT 2024-2025
No ratings yet
Unit I Introduction To Data Science: Dept. of IT 2024-2025
9 pages
SEMIV AIML Final
No ratings yet
SEMIV AIML Final
28 pages
Foundations of Data Science Faq 5 Units
No ratings yet
Foundations of Data Science Faq 5 Units
13 pages
DS Tansche 03.06.2024
No ratings yet
DS Tansche 03.06.2024
23 pages
Homework Index: To See If The Questions Have Been Changed, or If You Are Required To Use Different Data or Examples
No ratings yet
Homework Index: To See If The Questions Have Been Changed, or If You Are Required To Use Different Data or Examples
86 pages
Question Bank
No ratings yet
Question Bank
7 pages
CS3552 - Fods - QB 2024
No ratings yet
CS3552 - Fods - QB 2024
11 pages
Dat QB
No ratings yet
Dat QB
8 pages
QB FDS
No ratings yet
QB FDS
5 pages
Question Bank
No ratings yet
Question Bank
7 pages
Fods Question Paper
No ratings yet
Fods Question Paper
4 pages
Updated Cs3352 - Foundations of Data Science - Duraimurugan
No ratings yet
Updated Cs3352 - Foundations of Data Science - Duraimurugan
16 pages
FDSA - Question Bank
No ratings yet
FDSA - Question Bank
5 pages
LP Stats
No ratings yet
LP Stats
34 pages
Quiz 1
No ratings yet
Quiz 1
2 pages
Nov Dec 2023
No ratings yet
Nov Dec 2023
3 pages
Practice Problems For Curve Fitting - Solution
No ratings yet
Practice Problems For Curve Fitting - Solution
8 pages
DSCTP 2022 1 ML Slides
No ratings yet
DSCTP 2022 1 ML Slides
351 pages
Mathematics As A Tool (Descriptive Statistics) (Midterm Period) Overview: This Module Tackles Mathematics As Applied To Different Areas Such As Data
No ratings yet
Mathematics As A Tool (Descriptive Statistics) (Midterm Period) Overview: This Module Tackles Mathematics As Applied To Different Areas Such As Data
33 pages
) LKK
No ratings yet
) LKK
3 pages
Ad3491 Foda Question Bank
No ratings yet
Ad3491 Foda Question Bank
7 pages
Review Unit3 New
No ratings yet
Review Unit3 New
10 pages
SEE5211 Chapter3-P2017
No ratings yet
SEE5211 Chapter3-P2017
58 pages
Fods Cia 1
No ratings yet
Fods Cia 1
2 pages
Cs3352 Foundations of Data Science
No ratings yet
Cs3352 Foundations of Data Science
4 pages
Operations Management: Chapter 4 - Forecasting
No ratings yet
Operations Management: Chapter 4 - Forecasting
110 pages
Outline (Tentative) Updated
No ratings yet
Outline (Tentative) Updated
7 pages
FDS Important Q
No ratings yet
FDS Important Q
5 pages
Stats
No ratings yet
Stats
16 pages
Business Stats
No ratings yet
Business Stats
11 pages
Module 3 Numericals
No ratings yet
Module 3 Numericals
3 pages
Fiches Machine Learning
No ratings yet
Fiches Machine Learning
21 pages
Ds Imp Qs
No ratings yet
Ds Imp Qs
4 pages
QT All Question
No ratings yet
QT All Question
2 pages
II PUC Statistics Paper 2 2020
No ratings yet
II PUC Statistics Paper 2 2020
4 pages
AP Classroom Unit 2 FRQ Scoring Guide
No ratings yet
AP Classroom Unit 2 FRQ Scoring Guide
13 pages
Assignment of Introduction To Statistics - Updated
No ratings yet
Assignment of Introduction To Statistics - Updated
3 pages
Regression - Part III - 2021
No ratings yet
Regression - Part III - 2021
55 pages
Data Science Syllabus
No ratings yet
Data Science Syllabus
4 pages
CW 2-3 Regression & Reexpresing 11 03 2024
No ratings yet
CW 2-3 Regression & Reexpresing 11 03 2024
36 pages
Comparison and Simulation of Building
No ratings yet
Comparison and Simulation of Building
19 pages
Chapter2 Regression SimpleLinearRegressionAnalysis
No ratings yet
Chapter2 Regression SimpleLinearRegressionAnalysis
41 pages
ECE 3040 Lecture 18: Curve Fitting by Least-Squares-Error Regression
No ratings yet
ECE 3040 Lecture 18: Curve Fitting by Least-Squares-Error Regression
38 pages
VCTest 1 BF09 Ans
No ratings yet
VCTest 1 BF09 Ans
9 pages
Inversion Techniques Applied To Resistivity Invers
No ratings yet
Inversion Techniques Applied To Resistivity Invers
19 pages
Nurse Saul
No ratings yet
Nurse Saul
10 pages
TPS1200 Quick Setup
No ratings yet
TPS1200 Quick Setup
50 pages
Gadzo 2019
No ratings yet
Gadzo 2019
16 pages
AP Stat Spring Pacing
No ratings yet
AP Stat Spring Pacing
4 pages
Non-Linear Curve Fit Proof
0% (1)
Non-Linear Curve Fit Proof
5 pages
Chelyshkov Least Squares Support Vector Regression For Nonl 2022 Chaos Soli
No ratings yet
Chelyshkov Least Squares Support Vector Regression For Nonl 2022 Chaos Soli
12 pages
Geophysical Research Letters - 2016 - Rousset - An Aseismic Slip Transient On The North Anatolian Fault
No ratings yet
Geophysical Research Letters - 2016 - Rousset - An Aseismic Slip Transient On The North Anatolian Fault
9 pages
1992 - A Method For Registration of 3-D Shapes - ICP - OK
No ratings yet
1992 - A Method For Registration of 3-D Shapes - ICP - OK
18 pages
Soft Reviewer Sa Finance by Totowable..: Activity Cost and Cost Analysis Theories
No ratings yet
Soft Reviewer Sa Finance by Totowable..: Activity Cost and Cost Analysis Theories
9 pages
MIT6 057IAP19 hw3
No ratings yet
MIT6 057IAP19 hw3
12 pages
IZhO 2021 Exp - Eng - Sol
No ratings yet
IZhO 2021 Exp - Eng - Sol
8 pages
Econ 471 Notes 1
No ratings yet
Econ 471 Notes 1
14 pages
1991 Barth Multivariate Analysis Organic Acids North Sea PDF
No ratings yet
1991 Barth Multivariate Analysis Organic Acids North Sea PDF
15 pages
An Introduction To Regression Analysis
No ratings yet
An Introduction To Regression Analysis
34 pages
Derivation of The Conjugate Gradient Method: 1 Goal
No ratings yet
Derivation of The Conjugate Gradient Method: 1 Goal
5 pages
Cuadrados Minimos Solver PDF
No ratings yet
Cuadrados Minimos Solver PDF
3 pages
Thematic Cartography, Cartography and the Impact of the Quantitative Revolution
From Everand
Thematic Cartography, Cartography and the Impact of the Quantitative Revolution
Colette Cauvin
No ratings yet
IGNOU BCA Computer Oriented Numerical Technique Previous Year Unsolved Papers BCS 054
From Everand
IGNOU BCA Computer Oriented Numerical Technique Previous Year Unsolved Papers BCS 054
Manish Soni
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Priority Questions

Uploaded by

Priority Questions

Uploaded by

Mount Zion College of Engineering & Technology

To Make Man Whole!!

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS3352 – FOUNDATIONS OF DATA SCIENCE

1) Define Data science and its life cycle.

1) Identify the components of data science.

1) Give an overview of the data science process.

1) Elaborate any 5 application domains of data science.

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS3352 – FOUNDATIONS OF DATA SCIENCE

UNIT II DESCRIBING DATA

1) Define population and sample.

1) Differentiate constant and variable.

1) Explain the types of data and types of variables

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS3352 – FOUNDATIONS OF DATA SCIENCE

UNIT III DESCRIBING RELATIONSHIPS

 Standard error of estimate

1) Define scatter plot.

1) When does regression fallacy occur?

1) Calculate the coefficient of correlation between the expenditure on advertising and

i. Draw a scatter diagram of these data.

3) Conduct a multiple regression analysis by finding the regression model on the

2) What is the significance of r2? Give a detailed interpretation of r2.

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS3352 – FOUNDATIONS OF DATA SCIENCE

• np.array([range (i,i+3) for i in [2,4,6]])

4. What is data frame?

1) What are index preservation and index alignment?

1. Explain hierarchical indexing using pandas with an example.

5. Explain pivot tables in pandas.

1. Extract from the array np.array([3,4,6,10,24,89,45,43,46,99,100]) with Boolean

1) Explain about fancy indexing in detail using python program.

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS3352 – FOUNDATIONS OF DATA SCIENCE

1) What is the purpose of matplotlib?

1. What is the purpose of error bar?

1. Write the features of seaborn modules.

1. Explain about the density and contour plots in matplotlib.

1. How histogram can be implemented using matplotlib?

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.