IV AI-DS AD3491 FDSA Unit2

The document provides notes for a course on Fundamentals of Data Science and Analytics at Grace College of Engineering, focusing on descriptive analytics. It covers key concepts such as frequency distribution, outliers, statistical tests like T-tests and F-tests, and correlation, along with practical applications and examples. Additionally, it includes exercises for students to apply these concepts in real-world scenarios.

4931_Grace College of Engineering,Thoothukudi.

B.Tech- Artificial Intelligence and Data Science

Anna University Regulation: 2021

AD3491 - FUNDAMENTALS OF DATA SCIENCE AND ANALYTICS

II Year/IV Semester

UNIT II DESCRIPTIVE ANALYTICS

NOTES

Prepared By,
Mrs. S. Porkodi, AP/AI&DS

AD3491_FDSA

UNIT-II
PART – A
1. What is Frequency Distribution?
A frequency distribution organizes collected data in table form. The data could be
marks scored by students, temperatures of different towns, points scored in a volleyball match,
etc. After data collection, the data must be presented in a meaningful manner for better
understanding, organized so that all its features are summarized in a table.
2. List the Types of Frequency Distribution.
* Ungrouped frequency distribution: It shows the frequency of each separate data
value rather than of groups of data values.
* Grouped frequency distribution: In this type, the data is arranged and separated into groups
called class intervals. The frequency of data belonging to each class interval is noted in a
frequency distribution table.
3. State the Frequency Distribution Table.
A frequency distribution table is a chart that shows the frequency of each of the items in a data
set. As an example of making a frequency distribution table using tally marks, consider a jar
containing beads of different colors: red, green, blue, black, red, green, blue,
yellow, red, red, green, green, green, yellow, red, green, yellow.
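The tally for the bead example can be sketched in a few lines of Python. This is only an illustration that counts the colors listed above; the variable names and the table layout are assumptions, not part of the original notes.

```python
from collections import Counter

# Bead colors exactly as listed in the example above.
beads = ["red", "green", "blue", "black", "red", "green", "blue",
         "yellow", "red", "red", "green", "green", "green", "yellow",
         "red", "green", "yellow"]

# Counter maps each distinct color to the number of times it occurs.
frequency = Counter(beads)

# Print a simple frequency distribution table with tally marks.
print(f"{'Color':<8}{'Tally':<10}{'Frequency'}")
for color, count in frequency.most_common():
    print(f"{color:<8}{'|' * count:<10}{count}")
print(f"{'Total':<18}{sum(frequency.values())}")
```

Each row gives one color, its tally marks, and its frequency; the frequencies add up to the 17 beads in the jar.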
4. Define an outlier.
Outliers are data points that are far from other data points. In other words, they're unusual values
in a dataset. Outliers are problematic for many statistical analyses because they can cause tests to
either miss significant findings or distort real results.
5. How do you use Z-scores to Detect Outliers?
Z-scores can quantify the unusualness of an observation when your data follow the normal
distribution. Z-scores are the number of standard deviations above or below the mean that each
value falls. For example, a Z-score of 2 indicates that an observation is two standard deviations
above the average, while a Z-score of -2 signifies it is two standard deviations below the mean. A
Z-score of zero represents a value that equals the mean.
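As a rough sketch of this idea, the code below computes z = (x - mean) / standard deviation for each value and flags values whose absolute Z-score exceeds a cutoff. The data set and the cutoff of 2.5 are hypothetical choices made for illustration; the notes do not prescribe a specific threshold.

```python
import statistics

def z_scores(values):
    """Return the Z-score of each value: (value - mean) / standard deviation."""
    mean = statistics.mean(values)
    sd = statistics.pstdev(values)  # population standard deviation of the data
    return [(v - mean) / sd for v in values]

# Hypothetical data with one unusually large value.
data = [10, 12, 11, 13, 12, 11, 10, 12, 11, 40]

# Assumed cutoff; other cutoffs (for example 3) are also used in practice.
threshold = 2.5

scores = z_scores(data)
outliers = [v for v, z in zip(data, scores) if abs(z) > threshold]
print(outliers)
```

With a very small sample the largest possible Z-score is limited, so a lower cutoff than 3 is assumed here.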
6. What is Data Interpretation?
Data interpretation refers to the process of using diverse analytical methods to review data and
arrive at relevant conclusions. The interpretation of data helps researchers to categorize,
manipulate, and summarize the information in order to answer critical questions. Before any
serious data analysis can begin, the scale of measurement must be decided for the data, as this
will have a long-term impact on data interpretation.
7. Define T-Test.
A t-test is a statistical method for comparing the means of two groups, where the
sample(s) are assumed to be normally distributed.
8. Define F-Test.
An F-test is any statistical test in which the test statistic has an F-distribution under the null
hypothesis. It is most often used when comparing statistical models that have been fitted to a
data set, in order to identify the model that best fits the population from which the data were
sampled.
9. What is analysis of variance?
Analysis of variance is a collection of statistical models and their associated estimation
procedures used to analyze the differences among means. ANOVA was developed by the
statistician Ronald Fisher.
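A minimal sketch of a one-factor ANOVA calculation is given below. It splits the total variability into a between-groups and a within-groups sum of squares and forms the F ratio from the two mean squares. The three groups of scores are hypothetical illustrative data, and the function name is an assumption.

```python
from statistics import mean

def one_way_anova(groups):
    """Return (F ratio, df between, df within) for a list of groups."""
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    # Variability of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variability of the scores around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return ms_between / ms_within, df_between, df_within

# Hypothetical scores for three groups of three subjects each.
groups = [[0, 4, 2], [3, 6, 6], [6, 8, 10]]
f_ratio, df_b, df_w = one_way_anova(groups)
print(f"F({df_b}, {df_w}) = {f_ratio:.2f}")
```

The computed F is compared with the critical F for the same degrees of freedom (5.14 for 2 and 6 degrees of freedom at the .05 level); a larger computed F leads to rejecting the null hypothesis of equal population means.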

10. Define effect size estimation.
Effect size estimates provide important information about the impact of a treatment on the
outcome of interest or on the association between variables. They also provide a
common metric to compare the direction and strength of the relationship between
variables across studies.
11. What is meant by multiple comparisons, multiplicity or multiple testing?
The multiple comparisons, multiplicity or multiple testing problem occurs when one
considers a set of statistical inferences simultaneously or infers a subset of parameters
selected based on the observed values.
12. What do you mean by two-factor factorial design?
A two-factor factorial design is an experimental design in which data is collected for all possible
combinations of the levels of the two factors of interest. If equal sample sizes are taken for each
of the possible factor combinations then the design is a balanced two-factor factorial design.
13. Define statistical test in F-test
An F-test is any statistical test in which the test statistic has an F-distribution under the null
hypothesis. It is most often used when comparing statistical models that have been fitted to a
data set, in order to identify the model that best fits the population from which the data were
sampled.
14. What is two-way analysis of variance?
The two-way analysis of variance is an extension of the one-way ANOVA that examines the
influence of two different categorical independent variables on one continuous dependent
variable.
15. What are the types of ANOVA?
There are two main types of ANOVA: one-way (or unidirectional) and two-way. There are also
variations of ANOVA. For example, MANOVA (multivariate ANOVA) differs from ANOVA
as the former tests for multiple dependent variables simultaneously while the latter assesses
only one dependent variable at a time.
16. What is the Trending Market Test?
In an up-trending market, previous resistance becomes support, while in a down-trending market,
past support becomes resistance. Once price breaks out to a new high or low, it often retraces to
test these levels before resuming in the direction of the trend. Momentum traders can use the test
of a previous swing high or swing low to enter a position at a more favorable price than if they
would have chased the initial breakout. A stop-loss order should be placed directly below the
test area to close the trade if the trend unexpectedly reverses.
17. State the term Correlation.
Correlation refers to a process for establishing the relationship between two variables. A simple
way to get a general idea of whether or not two variables are related is to plot them
on a "scatter plot". While there are many measures of association for variables
measured at the ordinal or higher level of measurement, correlation is the most commonly used
approach.
18. State in brief Correlation Coefficient.
The correlation coefficient, r, is a summary measure that describes the extent of the statistical
relationship between two interval or ratio level variables. The correlation coefficient is scaled so
that it is always between -1 and +1. When r is close to 0, there is little relationship
between the variables; the farther r is from 0, in either the positive or negative
direction, the greater the relationship between the two variables.
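The scaling of r between -1 and +1 can be illustrated with a short sketch of the Pearson correlation coefficient, computed as the sum of cross-products of deviations divided by the square root of the product of the two sums of squares. The data values below are hypothetical and the function name is an assumption.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient for two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products of deviations from the means.
    sp_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Sums of squared deviations for each variable.
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return sp_xy / math.sqrt(ss_x * ss_y)

# Hypothetical data sets.
x = [1, 2, 3, 4, 5]
r_perfect_positive = pearson_r(x, [2, 4, 6, 8, 10])   # y rises exactly with x
r_perfect_negative = pearson_r(x, [10, 8, 6, 4, 2])   # y falls exactly as x rises
r_strong_positive = pearson_r(x, [2, 1, 4, 3, 5])     # y mostly rises with x

print(r_perfect_positive, r_perfect_negative, r_strong_positive)
```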

19. List the Types of Correlation.
* Positive correlation: the values of the two variables move in the same direction, so that an
increase/decrease in the value of one variable is followed by an increase/decrease in the value of
the other variable.
* Negative correlation: the values of the two variables move in opposite directions, so that
an increase/decrease in the value of one variable is followed by a decrease/increase in the value of
the other variable.
* No correlation: there is no linear dependence or no relation between the two variables.

PART – B
1. A library system lends books for periods of 21 days. This policy is being
reevaluated in view of a possible new loan period that could be either longer or shorter than 21
days. To aid in making this decision, book-lending records were consulted to determine the
loan periods actually used by the patrons. A random sample of 8 records revealed the
following loan periods in days: 21, 15, 12, 24, 20, 21, 13 and 16. Test the null hypothesis with a
t-test, using the .05 level of significance.
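One way to work this problem is sketched below as a one-sample t test of the null hypothesis that the population mean loan period is 21 days, against a two-tailed alternative. The two-tailed framing follows from "either longer or shorter"; the critical value 2.365 is the tabled t for 7 degrees of freedom at the .05 level, and the variable names are assumptions.

```python
import math
import statistics

loan_periods = [21, 15, 12, 24, 20, 21, 13, 16]
hypothesized_mean = 21          # H0: population mean = 21 days

n = len(loan_periods)
sample_mean = statistics.mean(loan_periods)
sample_sd = statistics.stdev(loan_periods)      # uses n - 1 in the denominator
standard_error = sample_sd / math.sqrt(n)       # estimated standard error of the mean

t_ratio = (sample_mean - hypothesized_mean) / standard_error
critical_t = 2.365              # two-tailed, .05 level, df = n - 1 = 7 (from a t table)

reject_null = abs(t_ratio) > critical_t
print(f"mean = {sample_mean}, t = {t_ratio:.2f}, reject H0: {reject_null}")
```

Under these assumptions the observed t of about -2.12 does not exceed the critical value in absolute size, so the null hypothesis is retained at the .05 level.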
2. A consumers' group randomly samples 10 "one-pound" packages of ground wheat sold by a
supermarket. Calculate the mean and the estimated standard error of the mean for this sample,
given the following weights in ounces: 16, 15, 14, 15, 14, 15, 16, 14, 14, 14.
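The two quantities asked for can be sketched as follows: the sample mean, and the estimated standard error of the mean, taken as the sample standard deviation (with n - 1 in the denominator) divided by the square root of n. Variable names are assumptions.

```python
import math
import statistics

weights = [16, 15, 14, 15, 14, 15, 16, 14, 14, 14]   # ounces

n = len(weights)
sample_mean = statistics.mean(weights)
sample_sd = statistics.stdev(weights)            # sample standard deviation (n - 1)
standard_error = sample_sd / math.sqrt(n)        # estimated standard error of the mean

print(f"mean = {sample_mean:.2f} oz, standard error = {standard_error:.2f} oz")
```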
3. Illustrate one-factor ANOVA in detail with an example.
4. Carry out the calculations for the t test for the gas mileage investigation. Show the hypothesis
analysis and the t ratio calculation with three panels, along with the confidence interval.
5. Carry out the calculations for the t test using two independent samples for the EPO experiment.
Show the hypothesis analysis, sampling distribution, t ratio calculation with three
panels, and p-value estimation, along with the confidence interval.
6. State the use of counterbalancing and explain the EPO experiment with repeated measures.
Give a detailed summary table of t tests for population means for one sample, two
independent samples and two related samples.
7. Give the hypothesis test summary for the t test for a population correlation coefficient
for the case study on the Greeting Card Exchange.
8. Give the hypothesis test summary using the one-factor F test for the Sleep Deprivation
Experiment, along with the variance estimates, mean squares, and sums of squares with degrees of
freedom.
9. Blood pressure readings of 8 patients are recorded before and after: Before: 180, 200, 230,
240, 170, 190, 200 and 165. After: 140, 145, 150, 155, 120, 130, 140 and 130. Determine whether
there is a significant difference between the BP readings before and after by applying a two-sample t-test.
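Because the same 8 patients are measured twice, one reasonable reading is a t test for two related (paired) samples, which works on the before-minus-after differences. That reading, the two-tailed .05 level, and the variable names are assumptions; the critical value 2.365 is the tabled t for 7 degrees of freedom.

```python
import math
import statistics

before = [180, 200, 230, 240, 170, 190, 200, 165]
after = [140, 145, 150, 155, 120, 130, 140, 130]

# Difference score for each patient (before minus after).
differences = [b - a for b, a in zip(before, after)]

n = len(differences)
mean_diff = statistics.mean(differences)
sd_diff = statistics.stdev(differences)          # sample SD of the differences
standard_error = sd_diff / math.sqrt(n)

# H0: the population mean difference is zero.
t_ratio = mean_diff / standard_error
critical_t = 2.365        # assumed two-tailed test at the .05 level, df = 7

significant = abs(t_ratio) > critical_t
print(f"mean difference = {mean_diff}, t = {t_ratio:.2f}, significant: {significant}")
```

Under these assumptions the t ratio is far beyond the critical value, which indicates a significant difference between the two sets of readings.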
10. The marks of students are 10.5, 9, 7, 12, 8.5, 7.5, 6.5, 8, 11 and 9.5. The population mean score is 12
and the standard deviation is 1.80. Does the mean for the students differ significantly from the
population mean?
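If the stated standard deviation of 1.80 is taken to be the population standard deviation (an assumption, since the problem does not say so explicitly), the comparison can be sketched as a one-sample z test. The two-tailed .05 level with critical value 1.96 is also an assumption.

```python
import math
import statistics

marks = [10.5, 9, 7, 12, 8.5, 7.5, 6.5, 8, 11, 9.5]
population_mean = 12
population_sd = 1.80      # assumed to be the population standard deviation

n = len(marks)
sample_mean = statistics.mean(marks)
standard_error = population_sd / math.sqrt(n)    # standard error of the mean

z = (sample_mean - population_mean) / standard_error
critical_z = 1.96         # assumed two-tailed test at the .05 level

significant = abs(z) > critical_z
print(f"sample mean = {sample_mean:.2f}, z = {z:.2f}, significant: {significant}")
```

Under these assumptions the sample mean of 8.95 lies more than five standard errors below 12, so the difference would be judged significant.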
