0% found this document useful (0 votes)

75 views36 pages

Submitted To: Mrs. Geetika Vashisht College of Vocational Studies University of Delhi

This document provides a 3-page summary of key concepts in statistics, including: 1) Definitions of basic statistical terms like population, sample, variable, and parameter. 2) Descriptions of different variable types and distributions. 3) Explanations of measures of central tendency like mean, median, and mode. 4) Examples of statistical concepts like interval estimation and hypothesis testing. The document serves as an introduction to fundamental statistical topics for students.

Uploaded by

sanchit nagpal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

75 views36 pages

Submitted To: Mrs. Geetika Vashisht College of Vocational Studies University of Delhi

Uploaded by

sanchit nagpal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 36

PRESENTATION

ON
REVISION OF STATISTICS

SUBMITTED TO:
MRS. GEETIKA VASHISHT
SUBMITTED BY:
COLLEGE OF VOCATIONAL STUDIES
SANCHIT NAGPAL
UNIVERSITY OF DELHI
BSC(HONS) COMPUTER SCIENCE
STATISTICS

• The science of collectiong, organizing, presenting, analyzing, and interpreting data to

assist in making more effective decisions
• Statistical analysis – used to manipulate summarize, and investigate data, so that
useful decision-making information results.
WHY STUDY STATISTICS?

1. Data are everywhere

2. Statistical techniques are used to make many decisions that affect our
lives
3. No matter what your career, you will make professional decisions
that involve data. An understanding of statistical methods will help
you make these decisions efectively
1.2 INTRODUCTION TO BASIC TERMS

Population: A collection, or set, of individuals or objects or events whose properties are

to be analyzed.
Two kinds of populations: finite or infinite.

Sample: A subset of the population.

Variable: A characteristic about each individual element of a
population or sample.
Data (singular): The value of the variable associated with one
element of a population or sample. This value may be a number,
a word, or a symbol.
Data (plural): The set of values collected for the variable from
each of the elements belonging to the sample.
Experiment: A planned activity whose results yield a set of data.
Parameter: A numerical value summarizing all the data of an
entire population.
Statistic: A numerical value summarizing the sample data.
Example: A college dean is interested in learning about the average age of faculty. Identify the basic
terms in this situation.

The population is the age of all faculty members at the college.

A sample is any subset of that population. For example, we might select 10 faculty members and
determine their age.
The variable is the “age” of each faculty member.
One data would be the age of a specific faculty member.
The data would be the set of values in the sample.
The experiment would be the method used to select the ages forming the sample and determining the
actual age of each faculty member in the sample.
The parameter of interest is the “average” age of all faculty at the college.
The statistic is the “average” age for all faculty in the sample.
Variables may be further subdivided:

Nominal
Qualitative
Ordinal
Variable
Discrete
Quantitative
Continuous
• Nominal - Categorical variables with no inherent order or ranking sequence such as
names or classes (e.g., gender). Value may be a numerical, but without numerical value
(e.g., I, II, III). The only operation that can be applied to Nominal variables is
enumeration.

• Ordinal - Variables with an inherent rank or order, e.g. mild, moderate, severe. Can
be compared for equality, or greater or less, but not how much greater or less.

• Interval - Values of the variable are ordered as in Ordinal, and additionally,

differences between values are meaningful, however, the scale is not absolutely
anchored. Calendar dates and temperatures on the Fahrenheit scale are examples.
Addition and subtraction, but not multiplication and division are meaningful
operations.

• Ratio - Variables with all properties of Interval plus an absolute, non-arbitrary zero
point, e.g. age, weight, temperature (Kelvin). Addition, subtraction, multiplication, and
division are all meaningful operations.
DISTRIBUTION - (OF A VARIABLE) TELLS US WHAT VALUES THE VARIABLE TAKES
AND HOW OFTEN IT TAKES THESE VALUES

FREQUENCY DISTRIBUTION

Consider a data set of 26 children of ages 1-6 years. Then the frequency
distribution of variable ‘age’ can be tabulated as follows:

Frequency Distribution of Age

Age 1 2 3 4 5 6
Frequency 5 3 7 5 4 2

Grouped Frequency Distribution of Age:

Age Group 1-2 3-4 5-6

Frequency 8 12 6
CUMULATIVE FREQUENCY
Cumulative frequency of data in previous page

Age 1 2 3 4 5 6

Frequency 5 3 7 5 4 2

Cumulative Frequency 5 8 15 20 24 26

Age Group 1-2 3-4 5-6

Frequency 8 12 6

Cumulative Frequency 8 20 26
. MEASURES OF CENTRAL TENDENCY
(LOCATION)
Measures of location indicate where on the number line the data are to be found.
Common measures of location are:

(i) the Arithmetic Mean,

(ii) the Median, and
(iii) the Mode
MEAN
Mean: Summing up all the observation and dividing by number of
observations.
Mean of 20, 30, 40 is (20+30+40)/3 = 30.

Notation : Let x1 , x2, ...xn are n observatio ns of a variable

x. Then the mean of this variable,
n

x1  x2  ...  xn x i
x  i 1
n n
Example 2: The systolic blood pressure of seven middle aged men were as follows:

151, 124, 132, 170, 146, 124 and 113.

The mean is
x
 151  124  132  170  146  124  113 
7
 137.14
.
THE MEDIAN AND MODE

• If the sample data are arranged in increasing order, the median is

(i) the middle value if n is an odd number, or
(ii) midway between the two middle values if n is an even number
• The mode is the most commonly occurring value.
.
EXAMPLE 1 – N IS ODD

The reordered systolic blood pressure data seen earlier are:

113, 124, 124, 132, 146, 151, and 170.

The Median is the middle value of the ordered data, i.e. 132.

Two individuals have systolic blood pressure = 124 mm Hg, so the Mode is 124.
EXAMPLE 2 – N IS EVEN
.
Six men with high cholesterol participated in a study to investigate the
effects of diet on cholesterol level. At the beginning of the study, their
cholesterol levels (mg/dL) were as follows:

366, 327, 274, 292, 274 and 230.

Rearrange the data in numerical order as follows:

230, 274, 274, 292, 327 and 366.

The Median is half way between the middle two readings, i.e.
(274+292)  2 = 283.

Two men have the same cholesterol level- the Mode is 274.
GEOMETRIC PROBABILITY DISTRIBUTION

The geometric distribution is a special case of the negative binomial distribution. It deals
with the number of trials required for a single success. Thus, the geometric distribution is
a negative binomial distribution where the number of successes (r) is equal to 1.

P(X=x) = p*q*x−1

Where

•p = probability of success for single trial.

•q = probability of failure for a single trial (1-p)
•x = the number of failures before a success.
•P(X−x) = Probability of x successes in n trials.
EXAMPLE
• In an amusement fair, a competitor is entitled for a prize if he throws a ring on a peg from a certain distance.
It is observed that only 30% of the competitors are able to do this. If someone is given 5 chances, what is the
probability of his winning the prize when he has already missed 4 chances?
• Solution:
• If someone has already missed four chances and has to win in the fifth chance, then it is a probability
experiment of getting the first success in 5 trials. The problem statement also suggests the probability
distribution to be geometric. The probability of success is given by the geometric distribution formula:
P(X=x) = p*q*x−1

P(X=5)= 0.3×(1−0.3)5−1,
=0.3×(0.7)4,
≈0.072
≈7.2%
INTERVAL ESTIMATION

Interval estimation is the use of sample data to calculate an interval of possible (or probable)
values of an unknown population parameter, in contrast to point estimation, which is a single
number.

μ=x¯ ± Zα/2 σ/√n

Where

•x¯= mean
•Zα2 = the confidence coefficient
•α = confidence level
•σ = standard deviation
•n= sample size
EXAMPLE
Suppose a student measuring the boiling temperature of a certain liquid observes the readings
(in degrees Celsius) 102.5, 101.7, 103.1, 100.9, 100.5, and 102.2 on 6 different samples of the
liquid. He calculates the sample mean to be 101.82. If he knows that the standard deviation for
this procedure is 1.2 degrees, what is the interval estimation for the population mean at a 95%
confidence level?

Solution:
The student calculated the sample mean of the boiling temperatures to be 101.82, with standard
deviation
σ=0.49
. The critical value for a 95% confidence interval is 1.96, where
1−0.952=0.025

. A 95% confidence interval for the unknown mean.

=((101.82−(1.96×0.49)),
(101.82+(1.96×0.49))) =(101.82−0.96,101.82+0.96) =(100.86,102.78)
HYPOTHESIS TESTING

A statistical hypothesis is an assumption about a population which may or may

not be true. Hypothesis testing is a set of formal procedures used by
statisticians to either accept or reject statistical hypotheses. Statistical
hypotheses are of two types:

Null hypothesis(H0)
•- represents a hypothesis of chance basis.

Alternative hypothesis(Ha)
- represents a hypothesis of observations which are influenced by some non-
random cause.
EXAMPLE
.
suppose we wanted to check whether a coin was fair and balanced. A
null hypothesis might say, that half flips will be of head and half will of
tails whereas alternative hypothesis might say that flips of head and
tail may be very different.

H0: P=0.5
Ha: P≠0.5
For example if we flipped the coin 50 times, in which 40 Heads and 10
Tails results. Using result, we need to reject the null hypothesis and
would conclude, based on the evidence, that the coin was probably
not fair and balanced.
As the level of confidence decreases, the size of the corresponding interval
will decrease. Suppose the student was interested in a 90% confidence
interval for the boiling temperature. In this case,
σ=0.90
, and 1−0.902=0.05

The critical value for this level is equal to 1.645, so the 90% confidence
interval is

=((101.82−(1.645×0.49)),
(101.82+(1.645×0.49))) =(101.82−0.81,101.82+0.81) =(101.01,
102.63)

An increase in sample size will decrease the length of the confidence

interval without reducing the level of confidence. This is because the
standard deviation decreases as n increases.
TYPE II ERROR

Example
• Hypothesis - Floride added to a toothpaste protects teeth against cavities.
• Null Hypothesis - Floride added to a toothpaste has no effect against cavities.
Here Null hypothesis is to be tested against experimental data to nullify the effect of
floride and water on teeth's cavities.
Consider the Example . Here Null hypothesis is false i.e. Floride added to a toothpaste has effect
against cavities. But if using experimental data, we do not detect an effect of floride added on
cavities then we are accepting a false null hypothesis. This is a Type II error. It is also called a
False Positive condition (a situation which indicates that a given condition is not present but it
actually is present).

Type II error is denoted by β and is also called beta level.

Goal of a statistical test is to determine that a null hypothesis can be rejected or not. A statistical
test can reject or not be able to reject a null hypothesis. Following table illustrates the relationship
between truth or falseness of the null hypothesis and outcomes of the test in terms of Type I or
Type II error.
GOODNESS OF FIT
• The Goodness of Fit test is used to check the sample data whether it fits from a distribution
of a population. Population may have normal distribution or Weibull distribution. In simple
words, it signifies that sample data represents the data correctly that we are expecting to
find from actual population. Following tests are generally used by statisticians:
• Chi-square
• Kolmogorov-Smirnov
• Anderson-Darling
• Shipiro-Wilk
ANOVA (ANALYSIS OF VARIANCE)
• Analysis of Variance also termed as ANOVA. It is procedure followed by statisticans to check the
potential difference between scale-level dependent variable by a nominal-level variable having two or
more categories. It was developed by Ronald Fisher in 1918 and it extends t-test and z-test which
compares only nominal level variable to have just two categories.
• TYPES OF ANOVA
1. One-way ANOVA - One-way ANOVA have only one independent variable and refers to numbers in
this variable. For example, to assess differences in IQ by country, you can have 1, 2, and more
countries data to compare.

2. Two-way ANOVA - Two way ANOVA uses two independent variables. For example, to access
differences in IQ by country (variable 1) and gender(variable 2). Here you can examine the
interaction between two independent variables. Such Interactions may indicate that differences in
IQ is not uniform across a independent variable. For examples females may have higher IQ score
over males and have very high score over males in Europe than in America.
ANOVA TEST PROCEDURE

• Setup null and alternative hypothesis where null hypothesis states that there is no
significant difference among the groups. And alternative hypothesis assumes that there
is a significant difference among the groups.
• Calculate F-ratio and probability of F.
• Compare p-value of the F-ratio with the established alpha or significance level.
• If p-value of F is less than 0.5 then reject the null hypothesis.
• If null hypothesis is rejected, conclude that mean of groups are not equal.
LINEAR REGRESSION

Once the degree of relationship between variables has been established using co-relation
analysis, it is natural to delve into the nature of relationship. Regression analysis helps in
determining the cause and effect relationship between variables. It is possible to predict
the value of other variables (called dependent variable) if the values of independent
variables can be predicted using a graphical method or the algebraic method.
Algebraic method develops two regression equations of X on Y, and Y on X.
.

Regression equation of Y on X Regression equation of X on Y

EQN: Y=a+Bx EQN: X=a+By

where where
Y= Dependent variable X= Dependent variable
X= Independent variable Y= Independent variable
a= Constant showing Y-intercept a= Constant showing Y-intercept
b= Constant showing slope of line b= Constant showing slope of line

Values of a and b is obtained by the Values of a and b is obtained by the

following normal equations following normal equations

∑Y=Na+b∑X ∑X=Na+b∑Y

∑XY=a∑X+b∑X2 ∑XY=a∑Y+b∑Y2
PROBLEM STATEMENT:

A researcher has found that there is a co-relation between the weight tendencies of father
and son. He is now interested in developing regression equation on two variables from
the given data:
Develop Regression equation of Y on X.

Weight of
father (in 69 63 66 64 67 64 70 66 68 67 65 71
Kg)

Weight of
Son (in 70 65 68 65 69 66 68 65 71 67 64 72
Kg)
SOLUTION
• Y = a+bX
• Where , a and b are obtained by normal equations
∑Y=Na+b∑X
∑XY=a∑X+b∑X2
∑Y=810,∑X=800
∑X2=53,402,∑XY=54,049,
N=12
⇒

810 = 12a + 800b ... (i)

⇒
54049 = 800a + 53402 b ... (ii)

• Multiplying equation (i) with 800 and equation (ii) with 12, and subtracting them
-824 b = -588
⇒
b = -.0713
• Putting it in eq (i)
Y=19.96−0.713X
LOGISTICS REGRESSION
Logistic regression is a statistical method for analyzing a dataset in which there are one
or more independent variables that determine an outcome. The outcome is measured
with a dichotomous variable (in which there are only two possible outcomes).

π(x)=eα+βx/1+eα+βx

•Response - Presence/Absence of characteristic.

•Predictor - Numeric variable observed for each case
•β=0⇒ P (Presence) is the same at each level of x.
•β>0⇒P (Presence) increases as x increases
•β=0⇒P (Presence) decreases as x increases.
PROBLEM STATEMENT: SOLVE THE LOGISTIC REGRESSION OF THE FOLLOWING
PROBLEM RIZATRIPTAN FOR MIGRAINE
RESPONSE - COMPLETE PAIN RELIEF AT 2 HOURS (YES/NO).
PREDICTOR - DOSE (MG): PLACEBO (0), 2.5,5,10

SOLUTION: Having α=−2.490and beta = .165}, we've following data:

π(0)=eα+β×01/eα+β×0
DOSE PIE(x)
=e−2.490+01+e−2.490
=0.03π(2.5) 0 0.03
=eα+β×2.51+eα+β×2.5 2.5 0.09
)
=e−2.490+.165×2.51+e−2.490+.165×2.5 π(x)
5 0.23
=0.09π(5)=
eα+β×51+eα+β×5= 10 0.29
e−2.490+.165×51+e−2.490+.165×5
=0.23π(10)=eα+β×101+eα+β×10
=e−2.490+.165×101+e−2.490+.165×10
=0.29
THANK YOU

Maths Grade 10 Term 3 Topics
No ratings yet
Maths Grade 10 Term 3 Topics
6 pages
Appendix D Answers To Odd-Numbered Section Exercises
0% (1)
Appendix D Answers To Odd-Numbered Section Exercises
48 pages
Basics of Statistics: Definition: Science of Collection, Presentation, Analysis, and Reasonable
100% (1)
Basics of Statistics: Definition: Science of Collection, Presentation, Analysis, and Reasonable
33 pages
Resident Stipend Survey
No ratings yet
Resident Stipend Survey
18 pages
Applied Math Unit1 Summary and Useful Formulas
100% (1)
Applied Math Unit1 Summary and Useful Formulas
4 pages
Statistics 1232445944520487 1
No ratings yet
Statistics 1232445944520487 1
101 pages
Business Statistics NOtes
No ratings yet
Business Statistics NOtes
46 pages
DDDDDDDDDDDDDDDDDDDDDDDDD
No ratings yet
DDDDDDDDDDDDDDDDDDDDDDDDD
2 pages
Bus 173 - 1
No ratings yet
Bus 173 - 1
28 pages
What Is Statistic
No ratings yet
What Is Statistic
129 pages
Measure of Central Tendency
No ratings yet
Measure of Central Tendency
40 pages
Module 2 - Statistical Foundations
No ratings yet
Module 2 - Statistical Foundations
108 pages
Basics of Descriptive Ststistics
No ratings yet
Basics of Descriptive Ststistics
24 pages
Dynsim 5.3.2 Utilities: Simsci
No ratings yet
Dynsim 5.3.2 Utilities: Simsci
36 pages
Descriptive + Hypothesis
No ratings yet
Descriptive + Hypothesis
28 pages
Engineering Probability and Statistics
No ratings yet
Engineering Probability and Statistics
42 pages
Lecture 6 - Estimation Part A
No ratings yet
Lecture 6 - Estimation Part A
23 pages
Prelim Lec 2017
No ratings yet
Prelim Lec 2017
49 pages
Statistics 110, Lecture Notes - Cedar Crest College
No ratings yet
Statistics 110, Lecture Notes - Cedar Crest College
111 pages
PROBABILITY Lecture 1 - 2 - 3
No ratings yet
PROBABILITY Lecture 1 - 2 - 3
63 pages
Normal Distribution
No ratings yet
Normal Distribution
3 pages
CHAPTERS
No ratings yet
CHAPTERS
17 pages
CHAPTER-6 FORECASTING TECHNIQUES - Formatted PDF
No ratings yet
CHAPTER-6 FORECASTING TECHNIQUES - Formatted PDF
46 pages
Basic Statistics: Statistics: Is A Science That Analyzes Information Variables (For Instance
No ratings yet
Basic Statistics: Statistics: Is A Science That Analyzes Information Variables (For Instance
14 pages
M5L5 Population Forecast
No ratings yet
M5L5 Population Forecast
12 pages
NITKclass 1
No ratings yet
NITKclass 1
50 pages
Statistics 101
100% (1)
Statistics 101
20 pages
Location) .: Distribution Is The Purpose of Measure of Central
No ratings yet
Location) .: Distribution Is The Purpose of Measure of Central
13 pages
43hyrs Principles of Statistics 3
No ratings yet
43hyrs Principles of Statistics 3
56 pages
Mathematics: Quarter 4 - Module 35 Analyzing and Interpreting Research Data
No ratings yet
Mathematics: Quarter 4 - Module 35 Analyzing and Interpreting Research Data
26 pages
Statistical Techniques For Analyzing Quantitative Data
100% (1)
Statistical Techniques For Analyzing Quantitative Data
41 pages
Unit II: Basic Data Analytic Methods
No ratings yet
Unit II: Basic Data Analytic Methods
38 pages
Data Mining CSE-443: Ayesha Aziz Prova Lecturer, Dept. of CSE CWU
No ratings yet
Data Mining CSE-443: Ayesha Aziz Prova Lecturer, Dept. of CSE CWU
51 pages
Aaa Math
No ratings yet
Aaa Math
2 pages
Measures of Variability and Position
No ratings yet
Measures of Variability and Position
34 pages
Unit 2 Fod
No ratings yet
Unit 2 Fod
27 pages
Lec 1
No ratings yet
Lec 1
44 pages
Lesson 5 - Quantitative Analysis and Interpretation of Data
No ratings yet
Lesson 5 - Quantitative Analysis and Interpretation of Data
78 pages
Statistics MCT
No ratings yet
Statistics MCT
7 pages
Research Methodology
No ratings yet
Research Methodology
19 pages
Project Locus Grade 11
No ratings yet
Project Locus Grade 11
9 pages
The Myth of The Bell Curve
No ratings yet
The Myth of The Bell Curve
9 pages
1 Intro-Statistics
No ratings yet
1 Intro-Statistics
61 pages
Consumption in Kilowatt Hours No. of Consumers
No ratings yet
Consumption in Kilowatt Hours No. of Consumers
7 pages
Final Exam - ASUM
No ratings yet
Final Exam - ASUM
3 pages
HBTopic 4
No ratings yet
HBTopic 4
10 pages
Randomly Scattered Error Analysis of Data: Lab. Report Measurement
No ratings yet
Randomly Scattered Error Analysis of Data: Lab. Report Measurement
6 pages
Basics of Statistics: Descriptive Statistics Inferential Statistics
No ratings yet
Basics of Statistics: Descriptive Statistics Inferential Statistics
6 pages
Definition of Median
No ratings yet
Definition of Median
6 pages
Computational: Erwin L. Medina
No ratings yet
Computational: Erwin L. Medina
29 pages
Chapter 14 - Nonparametric Tests
No ratings yet
Chapter 14 - Nonparametric Tests
10 pages
And Dividing It by Total Number of Values
No ratings yet
And Dividing It by Total Number of Values
3 pages
Statistics
No ratings yet
Statistics
14 pages
STA301 Quiz-2 File by Vu Topper RM
No ratings yet
STA301 Quiz-2 File by Vu Topper RM
116 pages
Statistics SS2020
No ratings yet
Statistics SS2020
12 pages
How To Create Normal Probability Plot
No ratings yet
How To Create Normal Probability Plot
3 pages
MBA 105 Statistical Techniques
100% (1)
MBA 105 Statistical Techniques
107 pages
Statistics
No ratings yet
Statistics
46 pages
LQ1 Notes
No ratings yet
LQ1 Notes
15 pages
Batc602 Business Simulation All Questions
No ratings yet
Batc602 Business Simulation All Questions
29 pages
Weighted Mean Thesis Formula
100% (3)
Weighted Mean Thesis Formula
6 pages
Week 9+10+11
No ratings yet
Week 9+10+11
82 pages
COM 201 - Inferential Statistics - 18032022-1
No ratings yet
COM 201 - Inferential Statistics - 18032022-1
58 pages
Satistics
No ratings yet
Satistics
18 pages
SPTC 0404 q3 FPF
No ratings yet
SPTC 0404 q3 FPF
22 pages
Stats Reviewer
No ratings yet
Stats Reviewer
16 pages
1 - III YR, VII Unit Intro To Statistics
No ratings yet
1 - III YR, VII Unit Intro To Statistics
214 pages
Statistics Revised
No ratings yet
Statistics Revised
73 pages
Lesson 5 Statistics & Probability
No ratings yet
Lesson 5 Statistics & Probability
18 pages
Statistics Is A Branch of
No ratings yet
Statistics Is A Branch of
6 pages
Combinepdf
No ratings yet
Combinepdf
137 pages
Ppt01. A Review To Statistics and Probability
No ratings yet
Ppt01. A Review To Statistics and Probability
28 pages
L01. A Review To Statistics and Probability
No ratings yet
L01. A Review To Statistics and Probability
27 pages
Statistics and Probability
No ratings yet
Statistics and Probability
43 pages
Biostatistics Notes Part 1
No ratings yet
Biostatistics Notes Part 1
9 pages
Math
No ratings yet
Math
6 pages
Lecture-3&4 - Measure of Centeral
No ratings yet
Lecture-3&4 - Measure of Centeral
67 pages
Statatics Chapter 1
No ratings yet
Statatics Chapter 1
21 pages
2466939-EDA and STATISTICS NOTES
No ratings yet
2466939-EDA and STATISTICS NOTES
15 pages
ML Unit-3
No ratings yet
ML Unit-3
18 pages
Unit 2 Data Preprocessing
No ratings yet
Unit 2 Data Preprocessing
8 pages
Lecture Note On Biostatistics
No ratings yet
Lecture Note On Biostatistics
74 pages
ST Topic 3
No ratings yet
ST Topic 3
71 pages
TCS Phase 2 First 100 QA Clean
No ratings yet
TCS Phase 2 First 100 QA Clean
15 pages
Basic Statistics
No ratings yet
Basic Statistics
23 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
11 pages
Basics of Statistics MATH100N MIDTERMS
No ratings yet
Basics of Statistics MATH100N MIDTERMS
11 pages
STATISTICS FOR BUSINESS 1st Semester KPELA
No ratings yet
STATISTICS FOR BUSINESS 1st Semester KPELA
36 pages
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Statistics I Essentials
From Everand
Statistics I Essentials
Emil G. Milewski
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Submitted To: Mrs. Geetika Vashisht College of Vocational Studies University of Delhi

Uploaded by

Submitted To: Mrs. Geetika Vashisht College of Vocational Studies University of Delhi

Uploaded by

PRESENTATION

• The science of collectiong, organizing, presenting, analyzing, and interpreting data to

1. Data are everywhere

Population: A collection, or set, of individuals or objects or events whose properties are

Sample: A subset of the population.

The population is the age of all faculty members at the college.

• Interval - Values of the variable are ordered as in Ordinal, and additionally,

Frequency Distribution of Age

Grouped Frequency Distribution of Age:

Age Group 1-2 3-4 5-6

(i) the Arithmetic Mean,

Notation : Let x1 , x2, ...xn are n observatio ns of a variable

151, 124, 132, 170, 146, 124 and 113.

• If the sample data are arranged in increasing order, the median is

The reordered systolic blood pressure data seen earlier are:

113, 124, 124, 132, 146, 151, and 170.

366, 327, 274, 292, 274 and 230.

Rearrange the data in numerical order as follows:

230, 274, 274, 292, 327 and 366.

•p = probability of success for single trial.

μ=x¯ ± Zα/2 σ/√n

. A 95% confidence interval for the unknown mean.

A statistical hypothesis is an assumption about a population which may or may

An increase in sample size will decrease the length of the confidence

Type II error is denoted by β and is also called beta level.

Regression equation of Y on X Regression equation of X on Y

EQN: Y=a+Bx EQN: X=a+By

Values of a and b is obtained by the Values of a and b is obtained by the

810 = 12a + 800b ... (i)

•Response - Presence/Absence of characteristic.

SOLUTION: Having α=−2.490and beta = .165}, we've following data:

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.