Butohan, Wilfredo Jr. - Reflection
Butohan, Wilfredo Jr. - Reflection
GRADUATE SCHOOL
Pioneer Avenue, General Santos City, Philippines
Website: www.rmmcmain.edu.ph
REFLECTION PAPER
(MIDTERM)
As an innovative institution for the Graduate School, the RMMC vision is to produce leaders which
focus on culture excellence. Its goal is to increase leaders which provide fairness to the culture of
societies.
It anchors to their mission that committed to innovations and services providing quality education.
Their goals and objectives are to provide quality instructions, foster a sustainable research tradition,
produce proactive and committed educational leaders, cultivate professionalism and academic
excellence, collaborate with various sectors and stakeholders, sustain relevant community extension
These desires and targets are very relevant to create quality outputs, to have capacitated and high
2. Introduction to Statistics:
Statistics is a branch of mathematics that deals with the collection, tabulation, and interpretation of
numerical data. Statistics is common for us because we are doing basic statistics most of the day. It
As a teacher, it helps us maintain attendance, grade, and other required records for students. We
also use data to make our materials, such as syllabi, homework assignments, and handouts, that we
must prepare. Also, students' classwork, assignments, and papers should be evaluated and graded.
So, it is very vital for us as a teacher to gain knowledge on statistics to help us improve in our data
gathering as duties and for advancement of our research and teaching profession.
analyzing data in such a way that meaningful conclusions can be drawn from it. In general,
its investigations and analyses are divided into two broad categories: the descriptive
deals with the processing of data without making any inferences from it. And the
information is presented in the form of tables and graphs. The data characteristics are
described in simple terms. Also, in our everyday occurrences such as accidents, product
prices, business, incomes, epidemics, sports data, and population data are all dealt with it.
While the inferential statistics is a scientific discipline that employs mathematical tools to
make forecasts and projections based on data analysis. And this is useful for people
working in engineering, economics, biology, the social sciences, business, agriculture, and
communications.
Nominal, which can be name without order or ranking imposed on the data. We can use
words, letters, and even numbers on this level. Ordinal, where the data can be classified
and ranked. We can describe ranking or order on this level for example: champion, 1 st
runner-up and 2nd runner up. Interval, this indicates an actual amount(numerical). The
order and the difference between the variables can be known. The example for this is the
temperature measurements, but it has limitation as you cannot see “true zero”. And the
Ratio, of which the data can be classified, rank, evenly spaced and has natural true zero.
Kilogram is the right example for this level. It illustrates that Nominal and Ordinal are
researcher and Secondary data which is collected material from other publish data. And there are
two types of data, the qualitative, that the data is not given numerically and quantitative, the data
is numerical.
and the frequency with which they occur. It is a graphical or tabular representation of the
number of observations within a given interval. The size of the interval is determined by the
data being analyzed and the analyst's objectives. The intervals must be mutually exclusive and
exhaustive. It allows for the rapid analysis of large amounts of data at once in Graphical, which
can aid in making predictions and informed decisions. It is essential to understand histograms,
It can assist in emphasizing some important points in data in textual presentations. It also
allows the researcher to contextualize the data so that the reader can derive meaning from it.
To sum up, it focuses how to present the gathered data and to enables the researcher to
quickly scan the entire dataset. It indicates whether the number of observations is high or low,
as well as whether they are concentrated in one area or spread across the entire scale.
the central (center) position within that set of data. As a result, measures of central tendency are
sometimes referred to as measures of central location. They are also classified as summary
statistics. The mean (also known as the average) is the most common measure of central
tendency, but there are others, such as the median and the mode.
On this topic, I learned the ways to get the mean, median and mode for ungrouped data set and
grouped data set. These ways of data gathering are very useful to get scores of our students
3.1 Mean
Mean is the average of the data, it is represented with the symbol x-bar ( x̅ ) for the sample
data and mu ( μ ) for population parameters. There are two ways to get the mean from the
3.2 Median
The median is a statistical measure that determines the middle value of a list of ascending
values in a dataset from smallest to largest value. The measure divides the dataset into two
halves, the lower and upper. Median which has two formula the {(n + 1) ÷ 2}th for the ungrouped
data set. And Median = Lb + [(n/2−cfb)/f]cw for the grouped data set.
3.3 Mode
Mode is referring to the most frequently occurring number in a dataset. It is the value that has
a higher frequency in a set of data, or the number appears more often. And there are three
4. Variability
In this topic, Measures of variability is a summary statistic that represents the amount of
dispersion in a data set. It tells us how spread is the value are. It defines how far away that the
data tend to fall from the center. It talks about variability in the context of a distribution of
values. If the measure of dispersion is low dispersion it indicates that the data points tend to be
clustered tightly around the center. Low variability or low dispersion is ideal because it because
it means that you can predict better input location about the population based on the given
sample data. While in a high dispersion or high variability it signifies that they tend to fall
further away. High Variability means that the values are less consistent it is harder for us to
make predictions.
5. Range
Range is a rough estimation or dispersion. It is simply defined as the difference between the
6. Interquartile Range
In this topic, Interquartile range gives us the spread of the middle of the distribution. For any
distribution order from low to high or ascending order the interquartile range contains half of
the value while we all know the first quartile contains the first 25% of the values, the 4 th quartile
contains the last 25% of the values. To get the value of the interquartile range as we will use
7. Semi-Interquartile Deviation
Semi-quartile deviation is defined as the half of quartile range. The semi-interquartile range
is one-half the difference between the first and third quartiles. It is half the distance needed to
cover half the scores. The semi-interquartile range is affected very little by extreme scores.
And this makes it a good measure of spread for skewed distributions. We have the formula for
this as SIQR = Q3 – Q1 / 2 .
The mean absolute deviation of a dataset is the average distance between each data point
and the mean. It gives us an idea about the variability in a dataset. Mean absolute deviation is
a way to describe variation in a data set. Mean absolute deviation helps us get a sense of how
"spread out" the values in a data set are. The formula for this is MAD = ∑∣xi−xˉ∣ / n .
9. Variance
On this topic, variance is defined as the mean of the square of the deviations from the mean of
a frequency distribution. For large quantities, the variance is computed using frequency
distribution with columns for the midpoint value, the product of the frequency and midpoint
value for each interval, the deviation and its square, and the product of the frequency and the
squared deviation. Variance are symbolized with small sigma squared ( σ2) for population
variance and small s squared (s2) for sample variance. The formula for ungrouped data for
population is σ2 = ∑(x- μ)2 / n and s2 = ∑(x- x̅)2 / n-1 for the sample variance. For the
In this lesson, I learned that Standard Deviation is considered the best indicator of the degree
the distribution. Given the set of data, the smaller the range, the smaller the standard
deviation, the less spread is the distribution. The simplified formula for this is s = √ s 2 .
The coefficient of variation (CV) is the ratio of the standard deviation to the mean. The higher
the coefficient of variation, the greater the level of dispersion around the mean. It is generally
expressed as a percentage.
This I learned that coefficient of variation is an important concept that allows us to predict
variables within and outside data sets. While it has its roots in mathematics and statistics,
coefficient of variation can be applied in different contexts including population studies and
investments in the stock market. The formula for this is Coefficient of Variation (CV) =
12. Skewness
its left and right side are not mirror images. A distribution can have right (or positive), left (or
This I leaned that right-skewed distribution is longer on the right side of its peak, and a left-
skewed distribution is longer on the left side of its peak . The zero skew is symmetrical, its left
and right sides are mirror images where the mean and median are equal. The formula for this
13. Kurtosis
Kurtosis is a measure of the combined weight of a distribution's tails relative to the center of
the distribution. When a set of approximately normal data is graphed via a histogram, it
shows a bell peak and most data within three standard deviations (plus or minus) of the
mean.
I learned that Kurtosis is usually used as a measure to define the risk for an investment.
The nature of the investment to generate higher returns can also be predicted from the
value of the calculated kurtosis. The greater the excess for any investment data set, the
greater will be its deviation from the mean. There are three types of Kurtosis, the
Mesokurtic, the Leptokurtic and the platykurtic. Where the mesokurtic falls close to zero or
equal to zero. And The Leptokurtic is positive on in the other terms, more than zero. While
Platykurtic is less than zero. The formula for this is k = ∑(x- x̅)4 / ns4 for ungrouped