Take Home Exam Part 1
Take Home Exam Part 1
ANDA
SE 413 Advanced Educational Statistics
Take Home Exam
I. Conceptual
1) Why is it very important to know the nature of the data the researcher
collected?
Data is any set of characters that is gathered and translated for some purpose,
usually analysis. In research, it is any information that has been collected, observed,
verification, validation, and assessment of models for predicting the long-term structural
different. It might be in the form of a nominal, ordinal, interval, or ratio. Each level has its
own set of features that influence the sort of analysis that may be done. Furthermore, if
we understand the nature of the data, we may obtain reliable results. Maintaining the
integrity of research requires accurate data. If the data is obtained incorrectly, it might
lead to researchers following futile paths of exploration and distorted findings resulting
in wasted resources.
Data is used as the raw material in research to reach conclusion about a topic.
crucial for making constructive decisions. Similarly, researchers have depended mostly
on data to help them make decision about topic. Today, collecting data has become a
most priority for researchers in order to better understand their research papers.
2) For what purpose the assumptions are made or imposed for some statistical
tests such as the parametric?
All parametric tests in statistical analysis presume that the data has some
tests we perform. The outcome of the study and the interpretation of the findings are
altered if certain assumptions are violated. Those parametric assumptions are important
for a variety of reasons. They place constraints on how to interpret the results of the
data. For instance, If we have normality and homoscedasticity and get a significant
result, the only logical interpretation of a null hypothesis rejection is that the population
means vary. In addition, the assumptions include that we will make conclusions from the
know a lot about our sampled populations since we assume normalcy and
analysis helps us to see if we can accurately draw inferences from our findings.
before we can proceed with our investigation. Testing assumptions for the application of
parametric tests may appear time-consuming, yet it is a necessary step in data analysis.
As a result, any study, whether for a journal article, thesis, or dissertation, must adhere
because they are not reliant on a specific sort of population distribution, such as a
normal distribution. The nonparametric statistics are widely considered to involve fewer
or less severe assumptions about the nature of the population distribution being
analyzed. It is commonly used for researching populations that take on a ranked order,
and it requires due diligence on its limitations, strengths, and potential pitfalls.
alternate data analysis techniques in many situations when the parametric statistics are
the data distribution is not normal, nonparametric statistics are used. Its usage is also
appropriate when we do not know or have trouble determining the distribution of the
applied to a wide range of circumstances, the ability to work with lower sample sizes,
and the ability to work with different types of data. Nonparametric statistics have been
developed as an alternative to parametric tests such as the T-test or ANOVA, which can
only be used provided the underlying data meets particular requirements and
assumptions. Nonparametric statistics are popular because they are simple to apply.
The data may be used to a wider range of tests since the parameters are not required,
4) What are the effects of sample sizes on the analysis of the data?
consider when conducting research. For a variety of reasons, obtaining a sample size
that is acceptable in both respects is crucial. A high sample size is also more typical of
the population, reducing the impact of outliers or extreme data. In order to get findings
among variables that are significantly different, a sufficiently large sample size is also
insufficient sample size may generate inconclusive results and may also be unethical,
because exposing human subjects to the possible risks of research is only justified if
there is a reasonable possibility that the study will yield meaningful information.
Similarly, a study with an excessively large sample size will waste resources and
expose more people than necessary to any associated risk. As a result, determining the
right sample size for a study is an important part of the research design process.
Sample sizes should not be excessively large or small, as both have drawbacks that
could compromise the conclusions of the study. A limited sample size may hinder
extrapolation of findings, whereas a large sample size may increase the detection of
differences, emphasizing statistical differences that are not clinically relevant. Our
estimate is more precise since we have more data and consequently more information.
As our sample size increases, we gain more confidence in our estimate, our uncertainty
normal distribution accepts all real numbers and the binomial distribution only accepts
certain value with a normal distribution. As a result, for small sample sizes in some
nonparametric statistics, the continuity correction factor provides a better answer. The
chi square test continuity correction, often known as Yates' correction for continuity, is
one of the continuity corrections used in nonparametric statistics. Yate’s Correction was
table. It has the effect of preventing overestimation of statistical significance for small
data sets. Generally, we use continuity correction while estimating the binomial using
the normal distribution. While the original distribution was discrete, the normal
nonparametric statistics.
6) Compare and contrast parametric and nonparametric statistics.
about the population distribution from which the sample was taken, but nonparametric
statistics are not. This means that data can be acquired from a sample that does not
methods that deal with data with a known probability distribution or that have the
statistical method in which the data is not expected to come from predetermined models
information about a population distribution is uncertain and the parameters are not
defined. We should examine various criteria concerning the sample data and
and carefully analyze the validity of those assumptions. Parametric tests rely on
makes no assumptions and uses the median value to measure the central tendency. In
conclusion, parametric and nonparametric tests are both important components of data