Lecture - 1 Introduction To Statistics

The document outlines the IS 141 course on Statistics and Probability, detailing its contents, expected learning outcomes, and various statistical concepts such as descriptive statistics, probability theory, and regression analysis. Students will learn to apply statistical methodologies to analyze data and draw conclusions. Additionally, the course includes practical applications using statistical software tools.


IS 141: Statistics and Probability

Dr. Emmanuel-UDSM

March 27, 2024


Table of contents

Course Contents

Basic Concepts in Statistics


Introduction to Statistics

Descriptive Statistics
Describing data using graphs and tables
Grouped Data
A: Expected Learning Outcomes

Upon completion of this course, students will be able to:

(a) Use statistical methodology and tools in the problem-solving
process.
(b) Demonstrate understanding of basic concepts derived from
probability and statistics.
(c) Demonstrate the ability to manage and organize data, and
identify appropriate statistical analyses.
(d) Model and analyze information and arrive at reasonable
conclusions; estimate univariate confidence intervals and
multivariate confidence regions.
B: Content Coverage

I Basic Concepts in Statistics: Introduction to statistics,


Need and objectives of Statistical investigation. Divisions of
statistics, Importance and Limitations of statistics, Variables
and types, Collection of data, presentation of data, frequency
distributions.
II Measures of Central Tendency and Variation: Measuring
center: mean, mode and median. Measuring spread: range,
interquartile range, standard deviation and mean deviation.
Measuring position: quartiles, percentiles, z-scores. Moments
and their relationships, Sheppard's correction, skewness and
kurtosis, measuring kurtosis for a distribution, calculating
the coefficient of variation.
III Probability Theory: Introduction to probability, operations
with events, mutually exclusive events, sample space, the classical
definition of probability and the relative frequency definition,
rules of probability, total probability and Bayes'
Theorem. Random Variables and Probability Distributions:
Probability density functions for random variables (discrete
and continuous), discrete distributions (Bernoulli, Binomial,
Hypergeometric, Poisson and Geometric distributions),
continuous distributions (exponential and normal
distributions). Expected value, variance and moment
generating functions. Sampling: Sampling distributions of
means and of proportions, confidence intervals and hypothesis
testing.
IV Chi-squared Goodness-of-fit Test: Chi-squared goodness-of-fit
tests for various distributions.
V Regression and Correlation Analysis: Scatter diagrams,
correlation measures, simple regression analysis, estimation
of regression coefficients, forecasting with simple regression,
the relationship between correlation and regression, and multiple
regression analysis. Application of technology: Introduction to
statistical packages (e.g. R, Python, SPSS, MATLAB).
C: Reading List:
1. Ross, S. M. (2014). Introduction to probability and statistics
for engineers and scientists. Academic Press.
2. Mendenhall, W., Beaver, R. J., & Beaver, B. M. (2012).
Introduction to probability and statistics. Cengage Learning.
3. Balakrishnan, N., Voinov, V., & Nikulin, M. S. (2013).
Chi-squared goodness of fit tests with applications. Academic
Press.
Introduction to Statistics

Statistics is the art of learning from data.

It is concerned with the collection, description and analysis of data,
which often leads to the drawing of conclusions.

Statistics can be divided into two major categories:

▶ Descriptive statistics: dealing with the description and
summarization of data, including averages.
▶ Inferential statistics: dealing with drawing conclusions from
data, where we must take into account the possibility of
chance (probabilities).
For instance, suppose there are two groups of students in IS141, each
group taught using a different method, and suppose that the average
score of members of the first group is somewhat higher than that of
the second group.
Can we conclude that this increase is due to the teaching method
used?
Or is it possible that the teaching method was not responsible for
the increased scores, and that the higher scores of the first group
were just a chance occurrence?

To draw logical conclusions from data, we usually make some
assumptions about the chances (or probabilities) of obtaining the
different data values.
The totality of these assumptions is referred to as a probability
model for the data.

Sometimes the nature of the data suggests the form of the
probability model that is assumed.
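The role of chance in the two-group comparison above can be illustrated with a short simulation. This is a hypothetical sketch (the group sizes, the mean of 70 and the standard deviation of 10 are made-up numbers, not from the course): both groups are drawn from the same score distribution, so any difference in their sample averages is pure chance.

```python
import random
from statistics import mean

random.seed(0)  # fixed seed so the illustration is reproducible

# Two groups drawn from the SAME score distribution (mean 70, sd 10):
# any difference between their sample averages is chance alone.
group1 = [random.gauss(70, 10) for _ in range(100)]
group2 = [random.gauss(70, 10) for _ in range(100)]

diff = mean(group1) - mean(group2)
print(f"Group 1 average: {mean(group1):.2f}")
print(f"Group 2 average: {mean(group2):.2f}")
print(f"Difference due to chance alone: {diff:.2f}")
```

Even with identical teaching "methods", the two averages will rarely coincide exactly; inferential statistics asks whether an observed difference is too large to be explained this way.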
For instance, suppose that a scientist wants to find out what
proportion of water bottles produced by a new method will be
defective.

The scientist might select a group of these bottles, with the
resulting data being the number of defective bottles in this group.

Provided that the bottles selected were "randomly" chosen, it is
reasonable to suppose that each one of them is defective with
probability p, where p is the unknown proportion of all the bottles
produced by the new method that will be defective. The resulting
data can then be used to make inferences about p.

Population and Samples

A population is the total collection of elements/things under
consideration.

A subgroup of a population is called a sample.

Describing data

The numerical results/findings of a study should be presented
clearly, concisely, and in such a manner that someone can quickly
obtain the essential characteristics of the data.

Data are often described by using tables and graphs. They reveal
important features such as the range, the degree of concentration,
and the symmetry of the data.

Frequency Tables and Graphs

A data set having a relatively small number of distinct values can
be conveniently presented in a frequency table. For example,

Starting salary | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 56 | 57 | 60
Frequency       |  4 |  1 |  3 |  5 |  8 | 10 |  0 |  5 |  2 |  3 |  1

Table: Frequency table for starting yearly salaries (thousands USD) of 42
recently graduated students with a B.Sc. degree in environmental science.
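A frequency table like the one above can be computed directly from raw data, for example with Python's `collections.Counter` (Python being one of the packages listed in the course content). Here the 42 raw salaries are reconstructed from the table itself, purely for illustration.

```python
from collections import Counter

# Reconstruct the 42 starting salaries from the frequency table above.
given = {47: 4, 48: 1, 49: 3, 50: 5, 51: 8, 52: 10,
         53: 0, 54: 5, 56: 2, 57: 3, 60: 1}
salaries = [s for s, f in given.items() for _ in range(f)]

# A frequency table is just a count of each distinct value.
freq = Counter(salaries)
for salary in sorted(freq):
    print(f"{salary}: {freq[salary]}")
```

`Counter` only lists values that actually occur, so a value with frequency 0 (such as 53 here) simply does not appear among the counts.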
Line Graph
Data from a frequency table can be graphically represented by a
line graph, which plots each distinct data value against its frequency.

Figure: Line graph


Bar Graph
Also, data from a frequency table can be graphically represented
by a bar graph.

Figure: Bar graph


Frequency polygon

Another type of graph used to represent a frequency table is the
frequency polygon.

It plots the frequencies (vertical axis) against the data values
(horizontal axis), and then connects the plotted points with
straight lines.
Figure: Frequency polygon
Relative frequency tables and graphs

Consider a data set consisting of n values. If f is the frequency of a
particular value, then the ratio f/n is called its relative frequency.

The relative frequency of a data value is the proportion of the data
that have that value.

The relative frequencies can be represented graphically by a relative
frequency line or bar graph, or by a relative frequency polygon.

Starting salary    |   47 |   48 |   49 |   50 |   51 |    52 | 53 |   54 |   56 |   57 |   60
Relative frequency | 4/42 | 1/42 | 3/42 | 5/42 | 8/42 | 10/42 |  0 | 5/42 | 2/42 | 3/42 | 1/42

Table: Relative frequency table for starting yearly salaries (thousands
USD) of 42 graduated students.
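The relative frequencies f/n can be computed from the frequency counts in one step; by construction they sum to 1. This sketch reuses the salary counts from the table above.

```python
from collections import Counter

# Frequency counts from the starting-salary table above.
freq = Counter({47: 4, 48: 1, 49: 3, 50: 5, 51: 8, 52: 10,
                53: 0, 54: 5, 56: 2, 57: 3, 60: 1})
n = sum(freq.values())  # 42 salaries in total

# The relative frequency of each value is f/n; the ratios sum to 1.
rel_freq = {value: f / n for value, f in freq.items()}
for value in sorted(rel_freq):
    print(f"{value}: {freq[value]}/{n} = {rel_freq[value]:.3f}")
print("Total:", sum(rel_freq.values()))
```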
A pie chart

This is often used to indicate relative frequencies when the data
are not numerical in nature.

A circle is constructed and then sliced into different sectors, one
for each distinct type of data value.

The relative frequency of a data value is indicated by the area of
its sector, this area being equal to the total area of the circle
multiplied by the relative frequency of the data value.

Example: The following data relate to the different types of
cancers affecting the 200 most recent patients to enroll at a
clinic specializing in cancer. These data are represented in the pie
chart presented as follows:

Type of Cancer | Number of New Cases | Relative Frequency
Lung           | 42                  | 0.21
Breast         | 50                  | 0.25
Colon          | 32                  | 0.16
Prostate       | 55                  | 0.275
Melanoma       | 9                   | 0.045
Bladder        | 12                  | 0.06

Figure: Pie chart
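Since each sector's area is the circle's area times the relative frequency, each sector's central angle is the relative frequency times 360 degrees. A short sketch using the cancer counts from the table above:

```python
# Cancer-type counts from the table above (200 new cases in total).
cases = {"Lung": 42, "Breast": 50, "Colon": 32,
         "Prostate": 55, "Melanoma": 9, "Bladder": 12}
total = sum(cases.values())

# Each sector's central angle is its relative frequency times 360 degrees,
# so the angles across all sectors add up to a full circle.
for cancer_type, count in cases.items():
    rel = count / total
    angle = rel * 360
    print(f"{cancer_type:9s} rel. freq. {rel:.3f}  sector {angle:.1f} deg")
```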
Grouped data

When the number of distinct values in the data set is too large, it
is useful to divide the values into groupings or class intervals, and
then present the number of data values in each class interval.

The appropriate number of class intervals is a subjective choice,
though 5 to 10 are typical, depending on the number of values in
the data set.

It is common, although not essential, to choose class intervals of
equal length.

The endpoints of a class interval are called the class boundaries.

We will adopt the left-end inclusion convention, which stipulates
that a class interval contains its left-end but not its right-end
boundary point.

Thus, for instance, the class interval 20-30 contains all values that
are both greater than or equal to 20 and less than 30.
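The left-end inclusion convention makes assigning a value to a class interval a simple arithmetic rule. A minimal sketch (the helper function `class_interval` is ours, not from the course, and assumes equal-length intervals):

```python
# Assign a value to an equal-length class interval, using the left-end
# inclusion convention: [lower, upper) contains its left boundary but
# not its right one.
def class_interval(value, start, width):
    """Return the (lower, upper) boundaries of the interval holding value."""
    index = int((value - start) // width)
    lower = start + index * width
    return (lower, lower + width)

# With intervals 20-30, 30-40, ... the boundary value 30 belongs to
# 30-40, not to 20-30, exactly as the convention stipulates.
print(class_interval(29.9, 20, 10))  # (20, 30)
print(class_interval(30.0, 20, 10))  # (30, 40)
```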
Consider the following set of data for the lifetimes in hours of 200 lamps.
A grouped frequency table for the lifetimes of the 200 lamps is given by

The class intervals are of length 100, with the first one starting at
500.
A histogram for the grouped frequency is presented as

Figure: Histogram for the grouped frequency table of 200 lamps


Cumulative frequency graph

The cumulative frequency of a class interval is the total of the
frequencies of that interval and all class intervals below it.

A cumulative frequency graph plots the upper class boundaries of
the class intervals (horizontal axis) against the cumulative
frequencies (vertical axis).

The relative cumulative frequency graph (ogive) for the lifetimes of
the 200 lamps is given by
Figure: Relative cumulative frequency graph for lifetime in hours of 200
Lamps
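Cumulative frequencies are a running sum of the interval frequencies, which `itertools.accumulate` computes directly. The class boundaries below match the lamp example (length 100, starting at 500), but the frequency counts are made-up numbers for illustration, since the actual lamp table appears only as an image.

```python
from itertools import accumulate

# Class intervals of length 100 starting at 500, as in the lamp example.
boundaries = [600, 700, 800, 900, 1000]   # upper class boundaries
frequencies = [12, 40, 75, 50, 23]        # assumed counts (total 200)

# Cumulative frequency of an interval = its own frequency plus the
# frequencies of all class intervals below it.
cum = list(accumulate(frequencies))
n = cum[-1]
for b, c in zip(boundaries, cum):
    print(f"< {b}: {c}  (relative {c / n:.3f})")
```

Plotting the upper boundaries against `cum` gives the cumulative frequency graph; dividing each entry by n gives the relative version (the ogive).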
