Types of Data

Uploaded by

akshatsharma1278


Data Processing: Introduction to Concepts, Variables, Attributes, Types of Data

Vocabulary of Statistics
- Those new to statistics sometimes find its terminology difficult, so at this stage it is important to understand some new words and concepts.
- This session provides an introduction to some of the basic terms that are essential in carrying out statistical data analysis.
Some Statistical Expressions

Data: Data refers to any group of measurements that helps in providing information.
Quantitative Data: Data that possess numerical properties are known as quantitative data.
Qualitative Data: Qualitative data reflect non-numeric features or qualities of experimental units. Ex: colour, gender, good, high, low.
Statistics: Statistics is the use of data to help decision-makers reach better decisions.
Variable: A variable is a characteristic that may take on different values at different times, places or situations. Ex: income, wages, population, no. of SHGs, political parties, voters.
Basic concepts:

Constants, Variables, Cases, Values

Discrete And Continuous Variables
Values
Categorizing And Coding The Variables
Nominal Scale
Ordinal Scale
Interval Scale
Grouped And Ungrouped Data
Constants
In mathematics and statistics, a constant is a number that is fixed and known, unlike a variable, which changes with the context.

A symbol that has a fixed numerical value is called a constant. For example, 2, 5, 0, -3 and -7 are constants.

A constant is a specific number, or a symbol that is assigned a fixed value. For example, in the equation below, "y" and "x" are variables, while the numbers 2 and 3 are constants.
y = 2x - 3
A few more examples: the number of days in a week is a constant. In the expression 5x + 10, the constant term is 10.
Variables, Cases, Values:

In any study the researcher is concerned with a particular population or universe.

Population refers to a specific group of people, institutions, occurrences or observations about which the researcher wishes to make descriptive or analytical statements.

When resources are limited, the researcher draws sample observations, either randomly or according to some agreed strategy, as the basis for investigation.
Cases: Units of observation

Within a population or sample, each individual unit is called a case or observation.
A case is the basic unit of analysis; it could be an individual, an organization or an occurrence of some event.
The population or sample then consists of all the available cases with which the study is concerned.
The unit of observation is the unit for which data are collected. Common examples include the individual, household, community or school.

Clearly identifying the unit of observation is important for a logical survey design, organized data collection and a sound data folder set-up.
Variables

We must define the characteristics of population or sample units in order to understand the sample or universe better. Each characteristic of a population is termed a variable because these are attributes which vary between cases.

By definition, a variable is any characteristic, number or quantity that can be measured or counted.
A variable is a characteristic that may take on different values for different cases at different times, places or situations. Ex: incomes of different individuals, votes obtained by different political parties, population of different small towns, no. of SHGs in each district etc.

A variable may also be called a data item. Some more examples of variables are age, sex, country of birth, class grades, eye colour etc.
Example:

Cases: Individuals such as X, Y, Z
Variable: Gender is a variable since it varies among individuals. It is coded as Male = 1, Female = 2.
Some variables can have a larger number of categories.
Example: Number of children in a family

Cases: Families X, Y, Z
Variable: The number of children in each family is the variable. It may be zero, one child, two children and so on.
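The two examples above can be pictured as small tables in which each row is a case and each column is a variable. The following is only a minimal Python sketch: the case labels X, Y, Z follow the slide, while the particular genders and child counts are made-up values, and the helper `values_of` is a hypothetical name introduced for illustration.

```python
# Each dictionary is one case (unit of observation); each key is a variable.
# The values below are illustrative assumptions, not data from the source.
individuals = [
    {"case": "X", "gender": "Male"},
    {"case": "Y", "gender": "Female"},
    {"case": "Z", "gender": "Female"},
]
families = [
    {"case": "X", "children": 0},
    {"case": "Y", "children": 2},
    {"case": "Z", "children": 1},
]


def values_of(variable, rows):
    """Return the distinct values a variable takes across the cases, sorted."""
    return sorted({row[variable] for row in rows})


gender_values = values_of("gender", individuals)
children_values = values_of("children", families)
print(gender_values)
print(children_values)
```

Gender takes only two values here, while the number of children can take many, which is the contrast the second example draws.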
Discrete and Continuous Variables

A variable may be either continuous or discrete. A continuous variable is capable of taking every conceivable fractional value within the range of possibilities, such as the height or weight of persons (Ex. 55.6, 60.4, 72.8 kg).

A discrete variable, on the other hand, can vary only by 'finite' jumps and cannot take every conceivable fractional value.

For some variables the values cannot logically be subdivided. For example, the number of children in a family, or the size of the family, can only take certain values such as 1, 2 or 3.
Values
These are the possible outcomes for a single variable. They differ between cases. Values can be numbers or named categories. For example, the variable Gender has two values, "male" and "female". Some people (cases) are men, and some are women.
Example:
Cases: Individuals such as X, Y, Z
Variable: Gender is a variable since it varies among individuals. Male = 1, Female = 2.
Values: values for the variable gender are 1 & 2 which
Types of variables
Categorizing and coding the Variables

An important stage of the research process is the allocation of a numerical value to each value of a variable.
This is called coding; for example, non-literates = 0 and literates = 1. The process of coding helps the researcher categorize the population or sample observations.
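Coding can be sketched in a few lines of Python. The codebook (non-literate = 0, literate = 1) comes from the slide; the five responses and the function name `encode` are assumptions made for this illustration.

```python
# Codebook from the slide: non-literate = 0, literate = 1.
LITERACY_CODES = {"non-literate": 0, "literate": 1}


def encode(responses, codebook):
    """Replace each category label with its numeric code."""
    return [codebook[label] for label in responses]


# Hypothetical raw responses from five cases.
responses = ["literate", "non-literate", "literate", "literate", "non-literate"]
coded = encode(responses, LITERACY_CODES)

# Counting the codes sorts the sample into the two categories.
counts = {code: coded.count(code) for code in LITERACY_CODES.values()}
print(coded)
print(counts)
```

Keeping the codebook as a separate dictionary means the mapping is documented in one place and can be reversed later if the labels are needed again.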

Categorical variables:
Categorical variables have values that describe a
'quality' or 'characteristic' of a data unit, like 'what
type' or 'which category'.

There are four levels of measurement, or scales, used to measure variables.
a) Nominal scale: Variables measured on the nominal scale are essentially qualitative rather than quantitative in form. The values of the variable are categories, not true numbers, and cannot be ordered in any mathematically meaningful way.
A nominal variable with only two possible values is referred to as a dichotomous variable. For example, if we ask a person whether they own a mobile phone, we may categorize mobile phone ownership as either "Yes" or "No".

A variable can also have more than two categories, such as religious belief. These are called polytomous variables.

Hindu      1
Muslim     2
Christian  3
Buddhist   4
On the nominal scale each value of the variable represents a category; the codes imply no particular order or relationship between the values.
b) Ordinal scale:
The nominal scale of measurement permits only the classification of observations into different categories, whereas the ordinal scale permits the ordering of those categories into ranks.
We can distinguish between the values in terms of degree but cannot measure the degree of difference between them.

Example: A group of workers' opinions about the work environment.
Very poor     0
Poor          1
Satisfactory  2
Good          3
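A short Python sketch may help show what an ordinal scale does and does not allow. The codes are the ones in the example above; the seven worker opinions are invented for illustration, and using the median as the summary is a design choice of this sketch, not something the slides prescribe.

```python
# Ordinal codes from the slide: a higher code means a better opinion.
OPINION_CODES = {"Very poor": 0, "Poor": 1, "Satisfactory": 2, "Good": 3}
LABELS = {code: label for label, code in OPINION_CODES.items()}

# Hypothetical opinions of seven workers.
opinions = ["Good", "Poor", "Satisfactory", "Very poor",
            "Good", "Satisfactory", "Poor"]

codes = sorted(OPINION_CODES[o] for o in opinions)
# The median relies only on rank order, so it is meaningful for ordinal data.
median_label = LABELS[codes[len(codes) // 2]]

# Ranking comparisons are valid on an ordinal scale ...
good_beats_poor = OPINION_CODES["Good"] > OPINION_CODES["Poor"]
# ... but the code difference (3 - 1 = 2) is not a measured distance.
print(median_label, good_beats_poor)
```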
c) Interval scale: The interval scale implies both an ordering of categories and a measure of the distance between them. The differences between points on the scale are measurable and exactly equal.

Example: Number of absences each employee had in an organization in a month.

No absences  0
One day      1
Two days     2
Three days   3 and so on.

The numbers of days here are ordered categories, and they allow us to measure exactly, in a standard unit, that three days is more than one day but less than six days. Four days' absence is twice as many as two days, and so on. (Statements such as "twice as many" are possible here because a count of absences also has a true zero, the extra property that defines the ratio scale described next.)
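The comparisons made in this example can be checked with a few lines of Python. This is a sketch only: the employee labels and their absence counts are hypothetical.

```python
# Hypothetical monthly absences (in days) for five employees.
absences = {"A": 1, "B": 3, "C": 6, "D": 2, "E": 4}

# Ordering: three days lies between one day and six days.
ordered = absences["A"] < absences["B"] < absences["C"]

# Distance: the gap between two employees is measured in a standard unit (days).
gap_days = absences["C"] - absences["B"]

# Ratio: four days is twice two days (valid because zero means no absence).
times_as_many = absences["E"] / absences["D"]
print(ordered, gap_days, times_as_many)
```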
d) Ratio scale:
A ratio scale is a quantitative scale where there is a true zero and equal intervals between neighboring points.
Unlike on an interval scale, a zero on a ratio scale means there is a total absence of the variable you are measuring.

Ex: Length, area and population are measured on ratio scales.
The term "ratio scale" is also used in map-making, in a different sense: it tells the map reader that one unit on the map is equal to a certain number of units in the real world. For example, 1:2500 means that 1 cm on the map represents 2500 cm (25 m) on the ground.
Age is typically considered to be measured on a ratio scale. This is because age has a true zero point, which means that a value of zero represents the absence of age.

In addition, it is possible to perform mathematical operations such as addition, subtraction, multiplication and division on age values.

The most common examples of ratio-scale variables are height, money, age and weight.
Attributes:
An attribute refers to the quality of a characteristic. The theory of attributes deals with qualitative characteristics that are analysed using quantitative measurements.
Attributes therefore need slightly different statistical treatment from variables.
For example, eye colour is an attribute of a person. Attributes refer to characteristics of the item under study, like the habit of smoking or drinking. So 'smoking' and 'drinking' are both examples of attributes.
In statistics, classifying data on the basis of attributes or characteristics is known as qualitative classification of data. Examples of attributes are region, caste etc.
Grouped and Ungrouped Data:

Ungrouped Data: Data obtained in their original form are called raw data or ungrouped data.
Example: The ranks obtained by 2500 students in a certain examination are given below:

25, 8, 37, 16, 45, 40, 29, 12, 42, 25, 14, 16, 16, 20, 10, 36, 33, 24, 25, 35, 11, 30, 45, 48….

This is ungrouped data, in its original form without any ordering or grouping.
Grouped Data:

To put the data in a more condensed form, we make groups of suitable size and record the frequency of each group. Such a table is called a grouped frequency distribution table. Here we aggregate or group the data into ordered categories.

Employees' age   No. of cases
16-20 years      470
21-30 years      950
31-40 years      670
41-50 years      710
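To see how raw values are condensed into a grouped frequency distribution, the sketch below groups the 24 values that are listed in the ungrouped example above. The class intervals of width 10 and the function name `grouped_frequencies` are assumptions chosen for illustration; the slides do not specify them.

```python
# The 24 raw (ungrouped) values listed in the ungrouped-data example.
raw = [25, 8, 37, 16, 45, 40, 29, 12, 42, 25, 14, 16,
       16, 20, 10, 36, 33, 24, 25, 35, 11, 30, 45, 48]

# Class intervals are an assumption; each is inclusive at both ends.
classes = [(1, 10), (11, 20), (21, 30), (31, 40), (41, 50)]


def grouped_frequencies(values, intervals):
    """Count how many values fall in each inclusive (low, high) interval."""
    return {
        f"{low}-{high}": sum(low <= v <= high for v in values)
        for low, high in intervals
    }


table = grouped_frequencies(raw, classes)
for label, frequency in table.items():
    print(f"{label:>6} {frequency:>3}")
```

The frequencies always add up to the number of raw observations, which is a quick check that the intervals neither overlap nor leave gaps.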
Inductive Statistics:

The branch of statistics dealing with generalizations, predictions, estimations and arriving at conclusions based on data from a sample is called inductive statistics.

When we do this we are inducing, or inferring, the characteristics of the population from the characteristics of the sample.

The purpose of inductive statistics is to help the researcher assess how well a sample represents the population. Inductive statistics is also commonly called inferential statistics.
Example: significance level alpha = 0.05
Under inductive statistics we discuss the following:

- Why we use a sample
- Various sampling procedures, such as random and non-random sampling methods
- Random sampling error, bias
- Estimating the population mean from the sample mean, normal distribution, standard error, confidence levels, testing of hypotheses etc.
Concepts of Distributions
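As a rough illustration of estimating a population mean from a sample mean, the Python sketch below computes the sample mean, its standard error and a 95% confidence interval (alpha = 0.05). The ten income figures are invented, and the use of the normal critical value 1.96 is a simplifying assumption of this sketch.

```python
import math
import statistics

# Hypothetical sample of monthly incomes (in thousands) drawn from a population.
sample = [12.0, 15.0, 11.0, 14.0, 13.0, 16.0, 12.0, 15.0, 14.0, 13.0]

n = len(sample)
sample_mean = statistics.mean(sample)       # point estimate of the population mean
sample_sd = statistics.stdev(sample)        # sample standard deviation (n - 1 divisor)
standard_error = sample_sd / math.sqrt(n)   # standard error of the mean

# 95% confidence level (alpha = 0.05) using the normal critical value 1.96.
# For a sample this small a t critical value would give a wider interval.
z = 1.96
ci_low = sample_mean - z * standard_error
ci_high = sample_mean + z * standard_error
print(f"mean={sample_mean:.2f}, SE={standard_error:.3f}, "
      f"95% CI=({ci_low:.2f}, {ci_high:.2f})")
```

A larger sample would shrink the standard error and therefore narrow the interval around the sample mean.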
Concepts of Cross-section, Time Series, Panel data
Cross-sectional data:

Definition: Cross-sectional data is information that is gathered at one point in time to reflect social conditions.

- Cross-sectional data, or a cross section of a population, is a type of data collected by observing individuals, firms, countries or regions at one point in time, or without regard to differences in time.
- Analysis of cross-sectional data usually consists of comparing the differences among the subjects (individuals, firms, countries or regions).

Example: Number of habitations in a region in 1996.

For example, if we want to measure current obesity levels in a population, we could draw a sample of 1,000 people randomly from that population. This is also known as a cross section of that population.

We then measure their weight and height and calculate what percentage of that sample is categorized as obese.

This cross-sectional sample provides us with a snapshot of that population at that point in time.

Note that we cannot tell from one cross-sectional sample whether obesity is increasing or decreasing; we can only describe the current proportion.
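The obesity example can be sketched in Python as follows. The slides do not say how a person is categorized as obese; this sketch assumes the commonly used body mass index (BMI) cut-off of 30, and the five weight and height pairs are made-up values standing in for the sample of 1,000.

```python
# Hypothetical cross-sectional sample: (weight in kg, height in m) per person,
# all measured at the same point in time.
sample = [(95.0, 1.70), (60.0, 1.65), (82.0, 1.80), (110.0, 1.75), (55.0, 1.60)]

OBESE_BMI = 30.0  # commonly used cut-off for obesity; an assumption of this sketch


def bmi(weight_kg, height_m):
    """Body mass index: weight divided by height squared."""
    return weight_kg / height_m ** 2


obese = [bmi(w, h) >= OBESE_BMI for w, h in sample]
percent_obese = 100 * sum(obese) / len(sample)
print(f"{percent_obese:.0f}% of the sample is categorized as obese")
```

The result describes only this snapshot; repeating the survey later would be needed to say anything about a trend.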
Time Series data:

Time series data differ from cross-sectional data in that the units of observation are observed at various points in time.

A time series is a collection of observations made sequentially through time. The interval between observations can be any time interval (hours within days, days, weeks, months, years, etc.).

Some areas of application:

Time series occur in a wide range of fields, from economics, sociology, meteorology and geography to financial investment.
Some examples of time series are:
- Malaria or Covid-19 incidence or deaths over calendar years
- Daily maximum temperatures
- Hourly records of babies born at a maternity hospital
Can you suggest other examples?
Air pollution etc.
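A time series such as daily maximum temperatures can be represented as observations paired with the time at which they were made. The following is a minimal Python sketch with invented dates and temperatures; it sorts the observations by time and computes the change from one day to the next.

```python
from datetime import date

# Hypothetical daily maximum temperatures (degrees Celsius), one per day.
series = [
    (date(2024, 1, 3), 24.0),
    (date(2024, 1, 1), 21.5),
    (date(2024, 1, 2), 23.0),
    (date(2024, 1, 4), 22.5),
]

# A time series is ordered by time, so sort by the date first.
series.sort(key=lambda point: point[0])

# Day-to-day change between consecutive observations.
changes = [
    (later[0], later[1] - earlier[1])
    for earlier, later in zip(series, series[1:])
]
for day, change in changes:
    print(day.isoformat(), f"{change:+.1f}")
```

Unlike a cross section, the order of the observations matters here: shuffling the rows would change the computed differences.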
Panel Data
Panel data (also called time-series cross-sectional (TSCS) data or longitudinal data) combine both cross-sectional and time series ideas and look at how multiple subjects (units of observation such as households, individuals or small towns) change over time.

Panel data examine changes in variables over time and differences in variables between the subjects.

Examples include estimating the effect of education on income, with data across time and individuals.
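The two dimensions of panel data can be sketched in Python with a small "long format" table in which every row is one unit observed in one period. The households, years and income figures below are hypothetical and only illustrate the structure.

```python
from collections import defaultdict

# Hypothetical panel in "long" form: (household, year, income).
panel = [
    ("H1", 2020, 100), ("H1", 2021, 110), ("H1", 2022, 125),
    ("H2", 2020, 80), ("H2", 2021, 82), ("H2", 2022, 90),
]

# Time-series view: follow each household across years.
by_unit = defaultdict(dict)
for unit, year, income in panel:
    by_unit[unit][year] = income
change_over_time = {
    unit: years[max(years)] - years[min(years)]
    for unit, years in by_unit.items()
}

# Cross-sectional view: compare households within a single year.
gap_2022 = by_unit["H1"][2022] - by_unit["H2"][2022]
print(change_over_time, gap_2022)
```

The same rows answer both kinds of question: how each unit changes over time, and how units differ from one another at a given time.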
- Panel data contain observations of multiple phenomena obtained over multiple time periods for the same units of observation or individuals.
- The term longitudinal data is often used for panel
Thank you
