0% found this document useful (0 votes)

159 views4 pages

Factor in R PDF

This document provides information on conducting factor analysis in R. It discusses the purpose of factor analysis, basic usage including data input/output and rotations, determining the appropriate number of factors, and checking the adequacy of the analysis. Functions for factor analysis in R are factanal, which uses maximum likelihood estimation, paf which uses principal axis method, and fa from the psych package, which allows specifying the estimation method and rotation type.

Uploaded by

rspecu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

159 views4 pages

Factor in R PDF

Uploaded by

rspecu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

R practice: Factor analysis

Minato Nakazawa (minato-nakazawa@umin.net)

27 June 2011

This material is just a rough draft to be read carefully with attention. Any suggestions and comments are welcome.

1 References
http://www.psy.ed.ac.uk/people/tbates/lectures/methodology/, which is provided by Prof. Timothy Bates,
psychologist in the University of Edinburgh, is very helpful.
http://aoki2.si.gunma-u.ac.jp/lecture/PFA/pfa6.html
*
1
is provided by Prof. Shigenobu Aoki, which are
very informative but unfortunately written in Japanese.
2 The purpose of factor analysis
Ocial explanation Find the hidden factors behind observed variables: The hidden factors cannot measured directly,
but should be natural groupings
*
2
of observed variables.
Practical explanation Reduction of the number of variables intercorrelated: In this meaning, it resembles the principal
component analysis.
3 Basic usage of factor analysis
Input Large numeric matrices, usually more than 300 cases with many variables. Subjects/Variables ratio
should usually range between 2:1 to 10:1. Variables should obey normal distribution. Outliers should be
omitted. Any variables uncorrelated with any other variables should be omitted. No variables correlated 1.0
with each other can be included: Remove one of each pair or take sum of them if appropriate.
Output (1) Factor loadings, which mean the correlation of each variable with the underlying factor (for that purpose,
various rotations will be applied
*
3
), (2) Factor scores, the summation of subjects responses factor loadings,
which mean the extent of each subject being explained by that factor.
Rotation There are two kinds of rotations: Orthogonal rotations keep independence among factors, but oblique
rotations allow correlations among factors. If the factors may theoretically allow interdependence, the latter
should be considered. The former includes the varimax rotation, which is most common and simple: to maximize
squared column variance). The latter includes promax and oblimin rotations.
Tools Screeplot, Bartletts sphericity test, Kaise-Meyer-Olkins sampling adequacy criteria, and Parallel analysis are
useful. After the successful factor extraction, Cronbachs can be calculated to check whether the variables
in each factor consist a uni-directional additive score or not (usually Cronbachs must be more than 0.7 to
consist a reliable scale).
When we interpret the extracted factors, adequate names (meaning) of factors are necessary. Well-dened factor
should have at least three high-loading variables (if only one or two high loading(s), factors may be overextracted or
multicolinearity may exist).
*
1
http://aoki2.si.gunma-u.ac.jp/R/kmo.html and http://aoki2.si.gunma-u.ac.jp/R/Bartlett.sphericity.test.html provide
function denitions for KMO, MSA and Bartletts sphericity test.
*
2
Subsets of variables that correlate strongly with each other and weakly with other variables in the dataset. Found factors should
correspond to underlying dimensions, which can be theoretically interpreted.
*
3
The initial factor loadings were calculated to maximize the loadings on the rst factor, so that most items have large loadings on
more than one factor, where the interpretation of the factor is dicult. Adequate rotation may solve this problem.
1
4 Basic model
Lets consider 10 variables X
1
, X
2
, ..., X
10
for 300 individuals. If we can assume 2 latent (hidden) factors F
1
and F
2
behind these 10 variables, each variable can be explained by these factors as follows.
X
1
=
11
F
1
+
21
F
2
+
1
X
2
=
12
F
1
+
22
F
2
+
2
.
.
.
X
10
=
110
F
1
+
210
F
2
+
10
Here means the correlation of each variable with the underlying factor: we call it Factor loadings. The
means error variance, which is, in other word, the uniqueness, which cannot be explained by extracted factors.
However, F
1
and F
2
are not measured. So we must estimate them by various method (principal axis method, minimum
residual method, maximum likelihood method, and so on.) with iteration
*
4
.
Before rotation, F
1
and F
2
are assumed to be independent. If we denote n
th
(n in [1,300]) individuals data of i
th
(i in [1,10]) variable as X
i
(n), the Factor scores (here FS
1
(n) and FS
2
(n)) can be obtained as follows (this is the
simplest method. There are some other methods to estimate factor scores). Used variables for calculation are limited
to have the absolute of being large enough (usually more than 0.3, 0.4 or 0.5).
FS
1
(n) =
10

i=1

1i
X
i
(n)
FS
2
(n) =
10

i=1

2i
X
i
(n)
5 How many number of factors should be extracted?
There are some criteria, but no 100% foolproof statistical test exists.
Drawing screeplot: Connect eigenvalues (as representing variances explained by each factor, so that sometimes
sums of squared factor loadings are used instead) for many possible factors from maximum to minimum. The
adequate number of factors is before the sudden downward inextion of the plot.
Parallel analysis: Compare actual screeplot with the possible screeplot based on randomly resampled data. The
adequate number of factors is at the crossing point of the two plots.
Eigenvalues > 1: Eigenvalues sum to the number of items, so an eigenvalue more than 1 is more informative
than a single average item.
6 Checking adequacy of factor analysis
There are some method to check the adequacy of the factor analysis.
Criteria of sample size adequacy: sample size 50 is very poor, 100 poor, 200 fair, 300 good, 500 very good, and
more than 1,000 excellent (Comfrey and Lee, 1992, p.217).
*
4
In principal component analysis, each components can be formulated as the linear function of measured variables, so that it doesnt
need iterative estimation.
2
Kaiser-Meyer-Olkins sampling adequacy criteria (usually abbreviated as KMO) with MSA (individual measures
of sampling adequacy for each item): Tests whether there are a signicant number of factors in the dataset:
Technically, tests the ratio of item-correlations to partial item correlations. If the partials are similar to the raw
correlations, it means the item doesnt share much variance with other items. The range of KMO is from 0.0
to 1.0 and desired values are > 0.5
*
5
. Variables with MSA being below 0.5 indicate that item does not belong
to a group and may be removed form the factor analysis.

Prof. Shigenobu Aoki provides the following function to calculate KMO and MSA at his web page:
kmo <- function(x)
{
x <- subset(x, complete.cases(x)) # Omit missing values
r <- cor(x) # Correlation matrix
r2 <- r^2 # Squared correlation coefficients
i <- solve(r) # Inverse matrix of correlation matrix
d <- diag(i) # Diagonal elements of inverse matrix
p2 <- (-i/sqrt(outer(d, d)))^2 # Squared partial correlation coefficients
diag(r2) <- diag(p2) <- 0 # Delete diagonal elements
KMO <- sum(r2)/(sum(r2)+sum(p2))
MSA <- colSums(r2)/(colSums(r2)+colSums(p2))
return(list(KMO=KMO, MSA=MSA))
}

Bartletts sphericity test: Tests the hypothesis that correlations between variables are greater than would be
expected by chance: Technically, tests if the matrix is an identity matrix. The p-value should be signicant:
i.e., the null hypothesis that all o-diagonal correlations are zero is falsied.

Prof. Shigenobu Aoki provides the following function to conduct Bartletts sphericity test at his web page:
Bartlett.sphericity.test <- function(x)
{
method <- "Bartletts test of sphericity"
data.name <- deparse(substitute(x))
x <- subset(x, complete.cases(x)) # Omit missing values
n <- nrow(x)
p <- ncol(x)
chisq <- (1-n+(2*p+5)/6)*log(det(cor(x)))
df <- p*(p-1)/2
p.value <- pchisq(chisq, df, lower.tail=FALSE)
names(chisq) <- "X-squared"
names(df) <- "df"
return(structure(list(statistic=chisq, parameter=df, p.value=p.value,
method=method, data.name=data.name), class="htest"))
}

7 Functions to conduct factor analysis in R
factanal This function is included in standard installation. It uses maximum likelihood estimation (mle) to nd
the factor loadings. The number of factors to be extracted must be explicitly specied. Varimax and promax
rotations are possible. Input data may be a matrix or a dataframe.
paf This function is included in rela package. It uses principal axis method to nd the factor loadings. The
*
5
According to the criteria suggested by Kaiser (1974), less than 0.5 is unacceptable, [0.5, 0.6) is miserable, [0.6, 0.7) is mediocre, [0.7,
0.8) is middling, [0.8, 0.9) is meritorious, [0.9, 1.0) is marvelous.
3
adequate number of factors will be automatically determined by the criteria of eigenvalues (you can specify its
criterion by eigencrit= option: default is 1). KMO and MSA are automatically calculated. Rotation is not
provided. Input data must be a matrix.
fa This function is included in psych package. The fm= option can specify the method of estimation ("minres"
for minimum residual, "ml" for maximum likelihood estimate, and "pa" for principal axis method). The
number of extracted factors must be specied by nfactors= option. Various rotation methods can be specied
by rotate= option ("none", "varimax", "quartimax", "bentlerT", "geominT", "oblimin", "simplimax",
"bentlerQ", "geominQ", and "cluster" will be possible).
alpha This function is included in psych package. This calculates Cronbachs .
cortest.bartlett This function is included in psych package. This conducts Bartletts sphericity test.
fa.parallel This function is included in psych package. Return the adequate number of extracted factors as $nfact.
8 Example 1
Lets analyze the variable p1-p40 in the factorexdata05.txt, which is converted from Prof. Timothy Bates SPSS
data
*
6
. Prof. Bates provides the pdf documents for undergraduate students
*
7
.
The easiest way is the following. Number of factors can be automatically determined. Factor loadings are saved as
res$Factor.Loadings.

library(foreign)
y <- read.spss("http://www.subjectpool.com/ed_teach/y3method/factorexdata05.sav")
x <- as.data.frame(y)
for (i in 1:length(x)) { x[,i] <- ifelse(x[,i]==999,NA,x[,i]) }
# The data \verb!x! consists of 538 cases with 102 variables.
# it can be saved as "factorexdata05.txt" by the following line
# write.table(x,"factorexdata05.txt",quote=FALSE,sep="\t",row.names=FALSE)
# if so, the data can be read by:
# x <- read.delim("factorexdata05.txt")
Ps <- x[,4:43] # Extract variables p1-p40
Ps <- subset(Ps, complete.cases(Ps)) # Omit missings (511 cases remain)
library(rela)
res <- paf(as.matrix(Ps))
summary(res) # Automatically calculate KMO with MSA, determine the number of factors,
# calculate chi-square of Bartletts sphericity test, communalities and
# factor loadings. Communalities are 1 minus uniquenesses.
barplot(res$Eigenvalues[,1]) # First column of eigenvalues.
resv <- varimax(res$Factor.Loadings) # Varimax rotation is possible later.
print(resv)
barplot(sort(colSums(loadings(resv)^2),decreasing=TRUE)) # screeplot using rotated SS loadings.
scores <- as.matrix(Ps) %*% as.matrix(resv$loadings) # Get factor scores in a simple manner.
library(psych)
cortest.bartlett(Ps) # Bartletts sphericity test.
res2 <- fa.parallel(Ps)
res3 <- fa(Ps, fm="minres", nfactors=8, rotate="oblimin")
print(res3) # Factor loadings as $loadings

*
6
http://www.subjectpool.com/ed_teach/y3method/factorexdata05.sav
*
7
http://www.subjectpool.com/ed_teach/y3method/factorex05.pdf and http://www.subjectpool.com/ed_teach/y3method/fa.pdf
4

LK Valves Product Catalogue Rev. 6
No ratings yet
LK Valves Product Catalogue Rev. 6
211 pages
Price-Rexroth Hydraulics Division
78% (9)
Price-Rexroth Hydraulics Division
512 pages
Sessions 21-24 Factor Analysis - Ppt-Rev
No ratings yet
Sessions 21-24 Factor Analysis - Ppt-Rev
61 pages
Slide Share Session 15 To 18 BRM
No ratings yet
Slide Share Session 15 To 18 BRM
105 pages
Factor Analysis
No ratings yet
Factor Analysis
3 pages
Analise Multi - Moodle
No ratings yet
Analise Multi - Moodle
127 pages
NJ Cse4261-4
No ratings yet
NJ Cse4261-4
43 pages
Factor Analysis
No ratings yet
Factor Analysis
49 pages
wk2 Factor-Analysis
No ratings yet
wk2 Factor-Analysis
35 pages
SSC Cpo
No ratings yet
SSC Cpo
1 page
Factor Analysis
No ratings yet
Factor Analysis
44 pages
On Job Annual Training Plan 2023
No ratings yet
On Job Annual Training Plan 2023
3 pages
2.6 Factor Analysis
No ratings yet
2.6 Factor Analysis
35 pages
Business Research Method: Factor Analysis
100% (1)
Business Research Method: Factor Analysis
52 pages
Factor Analysis: Nazia Qayyum SAP ID 48541
100% (1)
Factor Analysis: Nazia Qayyum SAP ID 48541
34 pages
Assignment No 3
No ratings yet
Assignment No 3
23 pages
Factor Analysis & Rotation
No ratings yet
Factor Analysis & Rotation
9 pages
Steps in Factor Analysis
No ratings yet
Steps in Factor Analysis
3 pages
Factor Analysis
No ratings yet
Factor Analysis
24 pages
Cambridge Advanced Practice Tests 2015
0% (1)
Cambridge Advanced Practice Tests 2015
17 pages
Factor Analysis
No ratings yet
Factor Analysis
31 pages
Session 1.4 Factor Analysis Notes
No ratings yet
Session 1.4 Factor Analysis Notes
23 pages
Week 5 Module Grade 9
No ratings yet
Week 5 Module Grade 9
7 pages
Hyperlipidemia 1
No ratings yet
Hyperlipidemia 1
54 pages
Factor Analysis
No ratings yet
Factor Analysis
27 pages
MTS3101 Appendices v1
No ratings yet
MTS3101 Appendices v1
35 pages
JML Regression
No ratings yet
JML Regression
36 pages
Session 13 - Factor Analysis
No ratings yet
Session 13 - Factor Analysis
22 pages
2b Factor Anaysis
No ratings yet
2b Factor Anaysis
24 pages
Exploratory Factor Analysis
No ratings yet
Exploratory Factor Analysis
19 pages
Motor, Filter, Kühlsystem Und Auspuff
No ratings yet
Motor, Filter, Kühlsystem Und Auspuff
18 pages
Semi Detailed LP 2
No ratings yet
Semi Detailed LP 2
3 pages
Lecture 8 - Transport Layer
No ratings yet
Lecture 8 - Transport Layer
50 pages
Factor Analysis
No ratings yet
Factor Analysis
20 pages
Exploratory Factor Analysis With SPSS Oct 2019
No ratings yet
Exploratory Factor Analysis With SPSS Oct 2019
26 pages
Cronbach's α (Reliability of data) and Factor Analysis (Construct Validity)
No ratings yet
Cronbach's α (Reliability of data) and Factor Analysis (Construct Validity)
55 pages
Factor Analysis
No ratings yet
Factor Analysis
18 pages
Lecture 11 Factor Analysis
No ratings yet
Lecture 11 Factor Analysis
21 pages
Ground Improvement Methods
No ratings yet
Ground Improvement Methods
32 pages
Factor Analysis Final
No ratings yet
Factor Analysis Final
13 pages
Đề thi minh họa số 16
No ratings yet
Đề thi minh họa số 16
6 pages
Sprockets
No ratings yet
Sprockets
16 pages
Keberhasilan Media Promosi Judi Online Dalam Menarik Minat Masyarakat-1
No ratings yet
Keberhasilan Media Promosi Judi Online Dalam Menarik Minat Masyarakat-1
10 pages
Factor Analysis
No ratings yet
Factor Analysis
42 pages
Chapter 19: Factor Analysis: Advance Marketing Research
No ratings yet
Chapter 19: Factor Analysis: Advance Marketing Research
37 pages
Tendernotice - 1 (5) - 1
No ratings yet
Tendernotice - 1 (5) - 1
4 pages
Factor Analysis Notes
No ratings yet
Factor Analysis Notes
11 pages
Factor Analysis Explained
No ratings yet
Factor Analysis Explained
4 pages
Principal Component Analysis
No ratings yet
Principal Component Analysis
10 pages
Sampling Procedure APEDA 1721269949
No ratings yet
Sampling Procedure APEDA 1721269949
5 pages
Dsur I Chapter 17 Efa
No ratings yet
Dsur I Chapter 17 Efa
47 pages
Dimensionality Reduction
No ratings yet
Dimensionality Reduction
48 pages
Factor Analysis
No ratings yet
Factor Analysis
11 pages
Mounting Procedure: Reference: C3131320010 A1
No ratings yet
Mounting Procedure: Reference: C3131320010 A1
16 pages
Exploratory Factor Analysis
100% (1)
Exploratory Factor Analysis
52 pages
Factor Analysis (FA)
No ratings yet
Factor Analysis (FA)
61 pages
Factor Analysis
No ratings yet
Factor Analysis
23 pages
Unit 5 CS
No ratings yet
Unit 5 CS
3 pages
Jithin Original
No ratings yet
Jithin Original
2 pages
GVI Seychelles Marine Report Jan 2017 - Dec 2017 (Cap Ternay)
No ratings yet
GVI Seychelles Marine Report Jan 2017 - Dec 2017 (Cap Ternay)
82 pages
Factor Handout
No ratings yet
Factor Handout
20 pages
Tense Changes in Reported Speech Rules, Examples, and Usage
No ratings yet
Tense Changes in Reported Speech Rules, Examples, and Usage
1 page
Borang
No ratings yet
Borang
1 page
Exploratory Factor Analysis: Prepared By: DR Gurjeet Kaur IIM, Amritsar
No ratings yet
Exploratory Factor Analysis: Prepared By: DR Gurjeet Kaur IIM, Amritsar
18 pages
Dr. Chinmoy Jana Iiswbm: Management House, Kolkata
No ratings yet
Dr. Chinmoy Jana Iiswbm: Management House, Kolkata
22 pages
Waiting For Santa - Barney Wiki - Fandom 44 58
No ratings yet
Waiting For Santa - Barney Wiki - Fandom 44 58
7 pages
KT Remote G PowerRemote en
No ratings yet
KT Remote G PowerRemote en
2 pages
Exploratory Factor Analysis
No ratings yet
Exploratory Factor Analysis
35 pages
Activity 1 BRS NSC Mar 2017 Cheques Out)
No ratings yet
Activity 1 BRS NSC Mar 2017 Cheques Out)
1 page
Exploratory Factor Analysis (EFA) : Welcome & Agenda
No ratings yet
Exploratory Factor Analysis (EFA) : Welcome & Agenda
45 pages
Session 7 Factor Analysis
No ratings yet
Session 7 Factor Analysis
24 pages
20 Questions 35 Minutes
No ratings yet
20 Questions 35 Minutes
7 pages
10 FactorAnalysis
No ratings yet
10 FactorAnalysis
15 pages
Quiz Ecology
No ratings yet
Quiz Ecology
9 pages
Partnership - Case Digests (Thyrz)
No ratings yet
Partnership - Case Digests (Thyrz)
15 pages
LCD TV: Service Manual
No ratings yet
LCD TV: Service Manual
51 pages
Annotated SPSS Output Factor Analysis
No ratings yet
Annotated SPSS Output Factor Analysis
20 pages
Introduction To Factor Analysis (Compatibility Mode) PDF
No ratings yet
Introduction To Factor Analysis (Compatibility Mode) PDF
20 pages
DSO Organizational Chart - by Michael W. Davis, DDS
No ratings yet
DSO Organizational Chart - by Michael W. Davis, DDS
1 page
Factor Analysis: KMO and Bartlett's Test
No ratings yet
Factor Analysis: KMO and Bartlett's Test
7 pages
Factor Analysis by Diagrams PDF
No ratings yet
Factor Analysis by Diagrams PDF
6 pages
HO Factor Analysis 5 Pages
No ratings yet
HO Factor Analysis 5 Pages
5 pages
Factor Analysis (DR See) : I I I Ik K I
No ratings yet
Factor Analysis (DR See) : I I I Ik K I
6 pages
Factor Analysis
No ratings yet
Factor Analysis
11 pages
Factor Analysis - Stata
No ratings yet
Factor Analysis - Stata
4 pages
Lecture 4 - Notes On Principal Components Analysis and Factor Analysis1
No ratings yet
Lecture 4 - Notes On Principal Components Analysis and Factor Analysis1
3 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Factor in R PDF

Uploaded by

Factor in R PDF

Uploaded by

R practice: Factor analysis

Minato Nakazawa (minato-nakazawa@umin.net)

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.