0% found this document useful (0 votes)
25 views52 pages

Handouts 1 ENDATA130 Introduction To Data Analysis

This document provides an overview of an engineering data analysis class. It outlines class details like scheduling, grading, and requirements. It also presents the learning outcomes which are to identify class policies, understand topic coverage and timelines, distinguish relevant terminology, and summarize introductory concepts. Finally, it introduces engineering data analysis and some key definitions like population, sample, variables, and descriptive and inferential statistics.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views52 pages

Handouts 1 ENDATA130 Introduction To Data Analysis

This document provides an overview of an engineering data analysis class. It outlines class details like scheduling, grading, and requirements. It also presents the learning outcomes which are to identify class policies, understand topic coverage and timelines, distinguish relevant terminology, and summarize introductory concepts. Finally, it introduces engineering data analysis and some key definitions like population, sample, variables, and descriptive and inferential statistics.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 52

ENDATA130

HANDOUTS 1

Prepared by: Y.E. Fernandez


TOPIC COVERAGE

• Class Details, Guidelines and Requirements


• Timeline of Topic Coverage
• Introduction to Engineering Data Analysis
LEARNING OUTCOMES:
• At the end of the discussion and presentations, the student should
be able to
• Identify policies, guidelines and requirements for the class;
• Discover the topic contents and timeline of class activities;
• Distinguish the relevant terminologies for data analysis; and
• Summarize the introductory concepts in the presentation.
CLASS DETAILS AND REQUIREMENTS
• Class Schedule: Whole Semester
Thurs 1pm – 4pm
3 hours per week for the whole semester
• Grading Periods
Prelims = Phase 1 Schedule
Midterms = Phase 2 Schedule
Finals = Phase 3 Schedule
• Grading System, Textbook and References:
Refer to Syllabus
CLASS DETAILS AND
REQUIREMENTS
• Class meetings thru video conferencing:
To be done usually through MS Teams Meetings
corresponding to the class schedule.

• Notifications and Announcements:


Through Teams class account or alternatively
through Messenger gc.
CLASS DETAILS AND
REQUIREMENTS
◼ Requirements and Submission:
1.) Attend classes through video conferencing in MS
Teams.
2.) Worksheets as seatwork or assignment will be
required and channeled thru MS Teams ENDATA130 class
account.
3.) Two Long Quizzes per grading period will be
scheduled.
4.) Major departmentalized examination will be given in
the three grading periods:
Prelim Exam, Midterm Exam and Finals Exam.
NOTE: Finals Examination is Comprehensive!
SOME CLASS POLICIES

1.) Video-on during prayer time at the start of the class.


2.) No special submissions for missed worksheets, assignments
and other tasks assigned that are given several days for
submission with prior notice.
3.) Special quizzes will be given when a letter request with
reason for missing the quiz is submitted to me.
4.) Special examination will be given by requesting application
for special examination with the Dean of the College thru the
College secretary.
PROJECT PER GRADING PERIOD
• Video production showing yourself solving problem
application for topics discussed in the Grading Period.
a.) The video format should be in mp4 file.
b.) Contents of the video should be comparable to
doing class board work of solving problem.
c.) There should be two (2) problems to be solved
and presented in two (2) separate videos good up to a
maximum of 5 minutes per video.
Due Date for Submission: Friday after the Examination Week.
INTRODUCTION TO ENGINEERING
DATA ANALYSIS

• Introductory Topics for Discussion


1. Introduction to Engineering Data Analysis.
1.1 Kinds of Statistical Variables
1.2 Levels or Scales of Measurement
1.3 Population and Sample
1.4 Sample size Determination
1.5 Sampling Process
THE NEED FOR DATA
ANALYSIS:
• All Engineering profession involves experimental
tasks in areas of interest that concerns itself with
quantitative studies of the phenomena of interest,
i.e. (that is), conducting measurements.

• In addition to designing and carrying out the


experiment, an important task is the accurate
evaluation and complete exploitation of the data
obtained from the measurements conducted or
from available existing data (this is DATA
ANALYSIS).
SAMPLE SITUATIONS AND
SCENARIOS INVOLVING THE
NEED FOR DATA ANALYSIS
Sample 1: Mobile network operators used measurement to
optimize new network antennas, saving money and improving
energy efficiency in the process.
The background :
To support 3G, network operators had to install a new network
of antennas, capable of handling higher data volumes and
network transmission speeds. By moving from older fixed tilt
antennas to new electronically variable tilt models, reliability
and access to the network could be improved.
SAMPLE SITUATIONS AND
SCENARIOS INVOLVING THE
NEED FOR DATA ANALYSIS
The challenge : There were many different antennas available,
and it was difficult to evaluate the best option because
specifications were set by the manufacturers themselves,
rather than by an independent organization.

The measurement intervention: A variety of 3G clients,


including network operators and antenna makers, carried out
a series of tests at an independent measurement laboratory.
The test results provided full information about how the
antennas could be expected to behave, including their gain
and directivity.
SAMPLE SITUATIONS AND
SCENARIOS INVOLVING THE
NEED FOR DATA ANALYSIS
Mounting each antenna in a specialized test
chamber gave accurate and repeatable readings,
allowing comparison across the competing
products. Manufacturers’ measurement
uncertainties were found to be typically 5 %,
whilst the independent laboratory tests achieved
2 %.
SAMPLE SITUATIONS AND
SCENARIOS INVOLVING THE
NEED FOR DATA ANALYSIS
Sample 2: A mechanical engineer as the plant
supervisor of a processing company needs to know the
man-hour requirements, supplies needed and the total
time for repair and maintenance during offseason or
shutdown repair and maintenance.

Data from last year’s operation are available. How will


the mechanical engineer decide for the projections on
the requirements for upcoming repair and maintenance?
ENGINEERING DATA ANALYSIS

• The field of data analysis can be called engineering


analytics or applied mathematical statistics.
• Engineering Data Analysis – involves basic statistical
techniques, probability, risk analysis, and predictive
modeling, and how they impact engineering and
manufacturing activities in both analytical and forward-
looking activities (refer to projections, forecasting or
estimations).
DEFINITION OF TERMS:
1) Statistics - assumes a dual meaning
a. In Singular Form;
Statistics is a science which deals with the
collection, organization, presentation,
analysis, inferences and interpretation of data.

b. In plural Form;
Statistics refer to measures or data obtained
from studying the population or sample.
STATISTICS DEFINED:
Descriptive
Statistical Study
Statistical Study
(singular)
Inferential
Statistical Study
Statistics
(dual meaning)
Qualitative
Data
Statistical Data
(plural)
Quantitative
Data
DEFINITION OF TERMS:
2) Statistical Variables – a characteristic/attribute/
property which tend to vary from one element
to another in a population or sample and can be
measured in terms of categories, names or numerical
values.
- simply refers to characteristics or properties of
measurable quantities.
3) Population – any complete set of people, objects or
observations (events) being studied and having a
characteristic.
4) Sample – a representative portion or subset/
subgroup of a population.
DEFINITION OF TERMS:

5) Statistical Data or statistics – refers to non-


numeric (names, categories, codes) or numerical
values of a statistical variable.

6) Data set/ Data File – a collection of data values.

7) Data value or datum or element - a specific value


in a data set.
DEFINITION OF TERMS:
8) Probability - refers to the chance for an
occurrence of an event. A tool used in
Inferential Statistics.
9) Hypothesis – a tentative statement used as an
explanation of an observed event. A
conjecture about a population parameter.
This
conjecture may or may not be true and must
be tested.
10)Hypothesis-Testing – a decision-making process
for evaluating claims about a population.
BRANCHES OR KINDS OF
STATISTICS OR STATISTICAL
STUDY
A. Descriptive Statistics or Parametric Statistics -
involves describing a situation for a given
population or sample.
- consists of the collection, organization,
summarization and presentation of data.
- describes the behavior of the sample or
population.
BRANCHES OR KINDS OF
STATISTICS OR STATISTICAL
STUDY
B. Inferential Statistics or non-Parametric Statistics -
involves making inferences or generalizations or
predictions from samples to populations.
- consist of generalizing from samples to
populations, performing hypothesis-testing,
determining relationships among variables and
making predictions or decisions.
- uses principles in probability to arrive to
inferences, generalizations or predictions from
sample to population.
COMPARATIVE FEATURES OF DESCRIPTIVE
AND INFERENTIAL STATISTICS

Features Descriptive Statistics Inferential Statistics


1.) Main To describe the characteristics of Aims to establish a generalization or
a population or sample based on inference from the observed
objective collected data. behavior of a sample in relation to
Aims to describe the behavior of the overall behavior of the
the population or sample with population.
respect to a given parameter. Establishes predictions from sample
NOTE: Statistical Variable if for to population.
population is also known as
“Parameter”
2.) Tasks a.) Data sampling and a.) Determining relationships
collection. between sample to population.
Involved b.) Data organization and b.) Conducting hypothesis-testing.
presentation. c.) making predictions, inference
c.) Data description and generalizations from sample to
population usually using principles
of probability.
CLASSIFICATIONS OF
STATISTICAL VARIABLES (REFERS
TO CHARACTERISTICS BEING
STUDIED/MEASURED)

I. According to the data obtained when measured


1.) Qualitative Variables - assumes distinct categories, according to
some characteristics or attribute.
- variables which assume non-numeric data
Ex: Religious Affiliation, gender, educational attainment, attitude
scale or Likert scale.
Classifications of Statistical Variables (refers
to characteristics being studied/measured)
2.) Quantitative Variables – assume numerical values and can be
ordered or ranked.
• May be further classified as

• According to their nature:


a.) Discrete Quantitative Variables – assume values that are
counted. Refers to variables that are countable and cannot assume
all values between any two specific values.
Ex: No. of Children in the family, No. of calls per day, Population
in a locality.
b.) Continuous Quantitative Variables - assume all values
between any two specific values. They are obtained by
measurement.
Ex: Temperature, gasoline mileage, family income
DISCREET VS DISCRETE

• Which should you use?


Remember :
DISCRETE, the t separates the e's, so discrete means
"separate OR non-continuous.“

DISCREET , means "unobtrusive OR unnoticeable“.


Classifications of Statistical Variables (refers
to characteristics being studied/measured)

II. According to the measurement used to obtain its data:

Levels or Scales of Measurements - refer to how the data of a


variable is categorized, counted or measured.
LEVELS OR SCALES OF
MEASUREMENT OF STATISTICAL
VARIABLES OR DATA:
• 1.) Nominal Scale/level - classifies data into
names or categories that are mutually exclusive,
exhaustive but NON-RANKABLE.
Ex: gender, nationality, plate number, civil status

• 2.) Ordinal level – classifies data into names or


categories that are rankable although precise
difference between ranks may not exist.
Ex: educational attainment, attitude scale(Likert’s
Scale) , ranking system
LEVELS OR SCALES OF
MEASUREMENT OF STATISTICAL
VARIABLES OR DATA:

• 3.) Interval Level – classifies data into numerical


values and precise units of measure exist but does
not assume a value of zero (zero has no meaning for
the variable measured)
Ex: Grade point average, temperature, IQ
LEVELS OR SCALES OF
MEASUREMENT OF STATISTICAL
VARIABLES OR DATA:

• 4.) Ratio Interval Level or Ratio Level – classifies


data into numerical values with same characteristics
as interval level except that a true zero exists or has
meaning for the given variable.
Ex: no. of children, monthly income, gasoline
consumption.
CLASSIFICATIONS OF STATISTICAL VARIABLES
(REFERS TO CHARACTERISTICS BEING
STUDIED/MEASURED)

Qualitative Variable
According to the data
obtained when measured

Quantitative Variable

Statistical Nominal Scale


Variables
Classification
Ordinal Scale

According to the measurement


used to obtain its data Interval Scale
(Levels or Scales of
Measurement)
Ratio Interval or Ratio
Scale
CLASSIFICATION OF STATISTICAL
DATA OBTAINED FOR A GIVEN
STATISTICAL VARIABLE
Nominal Scale
Non-Numeric Data
(names, categories or codes)
Ordinal Scale

Statistical Data
Classification
Interval Scale

Numeric Data

Ratio Interval
or Ratio Scale
SPECIFIC TO QUALITATIVE
VARIABLES

Nominal Scale

Non-numeric Data
Qualitative
Variable (Names, Categories,
Codes)
Ordinal Scale
SPECIFIC TO QUANTITATIVE
VARIABLES
Numeric Data Discrete Data

Continuous
Data
Quantitative
Variable
Interval Scale

Numeric Data
Ratio interval
or Ratio Scale
EXERCISE 1:
Determine the type of variable , answer QL if it is
Qualitative or QN if it is Quantitative:
1.) Engineering course majoring
2.) Daily internet data expense
3.) Student’s ID number
4.) Educational Attainment
5.) Weight
6.) Total Pay Units enrolled for 2nd Sem, AY 2020-2021
7.) Number of bags of cement needed for the foundation of a
12-storey building with a floor area of 100m2
8.) Number of service hours of a solar-powered electric fan
EXERCISE 1:
Determine if the variable is Qualitative or Quantitative:
1.) Engineering course majoring = QL
2.) Daily internet data expense = QN
3.) Student’s ID number = QL (not meant amount but only as
code)
4.) Educational Attainment = QL
5.) Weight = QN
6.) Total Pay Units enrolled for 2nd Sem, AY 2020-2021 = QN
7.) Number of bags of cement needed for the foundation of a
12-storey building with a floor area of 100m2 = QN
8.) Number of service hours of a solar-powered electric fan
QN
EXERCISE 2:
Determine the Scale or Level of Measurement used for obtaining
the data of the following variables. Answer N = if nominal, O = if
Ordinal. I = if interval, R = if ratio :
1.) Engineering course majoring
2.) Daily internet data expense
3.) Student’s ID number
4.) Educational Attainment
5.) Weight
6.) Total Pay Units enrolled for 2nd Sem, AY 2020-2021
7.) Number of bags of cement needed for the foundation of a 12-
storey building with a floor area of 100m2
8.) Number of service hours of a solar-powered electric fan
EXERCISE 2:
Determine the Scale or Level of Measurement used for obtaining
the data of the following variables. Answer N = if nominal, O = if
Ordinal. I = if interval, R = if ratio :
1.) Engineering course majoring = N
2.) Daily internet data expense = R
3.) Student’s ID number = N
4.) Educational Attainment = O
5.) Weight = I
6.) Total Pay Units enrolled for 2nd Sem, AY 2020-2021 = R
7.) Number of bags of cement needed for the foundation of a 12-
storey building with a floor area of 100m2 = R
8.) Number of service hours of a solar-powered electric fan = R
EXERCISE 3:
Determine if the numerical data obtained is discrete
or continuous:
1.) Daily internet data expense
2.) Weight
3.) Total Pay Units enrolled for 2nd Sem, AY 2020-2021
4.) Number of bags of cement purchased for the
foundation of a 12-storey building with a floor area of
100m2
5.) Number of children in the family
EXERCISE 3:
Determine if the numerical data obtained is discrete
or continuous:
1.) Daily internet data expense = C
2.) Weight = C
3.) Total Pay Units enrolled for 2nd Sem, AY 2020-2021 = D
4.) Number of bags of cement purchased for the
foundation of a 12-storey building with a floor area of
100m2 = D
5.) Number of children in the family = D
WHY IS LEVEL OF MEASUREMENT
IMPORTANT?

• 1.) The level of measurement helps one decide how to


interpret the data for a given variable. When you know
that a measure is nominal (like the one just described),
then you know that the numerical values are just short
codes for the longer names.
• 2.) Knowing the level of measurement helps one
decide what statistical analysis is appropriate on the
values that were assigned. If a measure is nominal, then
you know that you would never average the data values
or do a t-test on the data.
WHY IS LEVEL OF MEASUREMENT
IMPORTANT?
It's important to recognize that
there is a hierarchy implied in
the level of measurement idea.
At lower levels of measurement,
assumptions tend to be less
restrictive and data analyses
tend to be less sensitive. At each
level up the hierarchy, the
current level includes all of the
qualities of the one below it and
adds something new. In general,
it is desirable to have a higher
level of measurement (e.g.,
interval or ratio) rather than a
lower one (nominal or ordinal).
CONDUCTING RESEARCH OR
STATISTICAL STUDY OR SOLVING
ENGINEERING OR TECHNICAL
PROBLEMS
May be done or applied to
1) Population Study = if resources and time will allow

2) Sample Study
SAMPLE SIZE DETERMINATION
Determination of Sample Size
- Can be done using several available sample size formula.
- The simplest formula for solving sample size is the “Sloven
Formula”
The Sloven Formula:
𝑁
𝑛=
1+ 𝑁 𝑒 2

Where: N = population size


n = sample size
e = level of significance = allowable fraction of error
commonly used values:
e = 0.10 (for 10% allowable error)
e = 0.05 (for 5% allowable error)
e = 0.01 ( for 1% allowable error)
SAMPLING
PROCESS/PROCEDURE:
- is the process of selecting units (e.g., people,
organizations, objects or events) from a population of
interest so that by studying the sample we may fairly
generalize our results back to the population from
which they were chosen.

NOTE: Study is conducted on the sample and not on


the population only when the population has number
of elements that is impossible for the researcher to
study or cover.
KINDS OF SAMPLING
PROCESS/PROCEDURE
• 1.) Probability Sampling or Random Sampling -
every element in a population is given an equal
chance to be included in the sample.
- applicable when a complete master list
of all elements in the population is available.

• 2.) Non-Probability Sampling - not every element


in the population is given an equal chance to
be included in the
sample.
- applicable when no master list of all
elements in the population is available.
TYPES OF SAMPLING
PROCESS/PROCEDURE
A. PROBABILITY OR
RANDOM SAMPLING -
1) Simple Lottery or Simple Random Sampling
• without replacement
• with replacement
• Random Sampling using Table of Random
numbers
2) Systematic Sampling – each element in the
population is numbered and a sampling interval, k is
chosen. The sample is chosen every kth element in
the population beginning with a random start.
PROBABILITY OR RANDOM
SAMPLING (CONT’D)
3) Stratified Sampling – applicable when the population is
classified into different groups or strata according to
some characteristics that is important to the study. A
proportional allocation of the number of elements in
each stratum or group is included in the sample.

4) Cluster or Area Sampling – elements in the sample is


chosen from intact group or clusters that is
representative of the population.

5) Multistage Sampling - elements from various levels of


categories in the population is chosen such as
according to provinces, regions, cities.
B. NON-PROBABILITY OR
JUDGMENT SAMPLING
1) Accidental Sampling
2) Quota Sampling
3) Purposive Sampling – choosing the sample
according to criteria determined by the research
objectives.
4) Panel - choice of 24 respondents
EXERCISE 4:
Proposed or recommend a sampling method used in the
following: Answer P = if probability sampling and N =
non-probability sampling
1.) Study of the course majoring preference of SHS of
Bacolod City
2.) Selecting 20 concrete hollow blocks for testing of
mechanical properties from a supplier
3.) Study on the hours spent by Senior citizens in reading
a book.
4.) Combined Monthly income of a Filipino family
EXERCISE 4:
Proposed or recommend a sampling method used in the
following: Answer P = if probability sampling and N =
non-probability sampling
1.) Study of the course majoring preference of SHS of
Bacolod City = P
2.) Selecting 20 concrete hollow blocks for testing of
mechanical properties from a supplier = N
3.) Study on the hours spent by Senior citizens in reading
a book. = P
4.) Combined Monthly income of a Filipino family = P or
N

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy