American Statistical Association American Society For Quality
American Statistical Association American Society For Quality
Split-Plot Designs and Estimation Methods for Mixture Experiments with Process Variables
Author(s): Scott M. Kowalski, John A. Cornell and G. Geoffrey Vining
Source: Technometrics, Vol. 44, No. 1 (Feb., 2002), pp. 72-79
Published by: American Statistical Association and American Society for Quality
Stable URL: http://www.jstor.org/stable/1270686 .
Accessed: 20/06/2014 13:13
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
American Statistical Association and American Society for Quality are collaborating with JSTOR to digitize,
preserve and extend access to Technometrics.
http://www.jstor.org
Virginia
PolytechnicInstitute
and StateUniversity
VA24061-0439
Blacksburg,
Am/ I r A^\m m
Table1. Breakdown
ofDegreesofFreedomforSourcesofVariation
in /n \0 I / I /
theANOVATableforFourDesignsWith
q = 3 and m= 4 [a, O]
Degreesoffreedom
n=2 n=3
Source r =2 r =3 r =2 r= 3
0,-a]
Reps 0 0 0 0
Wholeplottreatments 4 4 8 8
Error(whole) 1 2 1 2
Subplottreatments 6 6 6 6 Figure3. ProposedDesign forSplit-Plot Structure
to SupportFit-
Wholex sub treatments 6 6 18 18 ting the CombinedSecond-OrderMPV Model. The centerpoint
Error(sub) 6 9 6 9 [z1,z2] = [0,0] is replicatedr times,and each timethe centroid
Total 23 27 39 43 (x1,x2,X3)= (1/3,1/3,1/3)is replicatedm times.Also,the centroid
(x1,x2,X3) = (1/3,1/3,1/3) is replicatedm timesat each axial setting.
FEBRUARY
TECHNOMETRICS, 2002,VOL.44, NO. 1
thedeterminants was computedand its value was compared d is close to zero,thevariance-covariance matrixV is close
to thedeterminant of thetrueVar(/]). to r'2IN, which is what is assumed with OLS.
To comparethethreemethodsof estimation, a measureof Table 2 shows thatthe pure errorapproachand REML
relativeefficiency can be defined.For OLS, themeasureis are quitecomparable.Withoutthepurequadraticterms,the
pureerrorand REML methodsperformed equallywell when
Rel. Eff.= j(X'X)-X'VX(X'X)- ' there are three, four,or five of
replicates the centerof the
I(XV-lX)- 'I processvariables.Whenonlytworeplicatesare used,REML
wherein thedenominator thematrix(X'V-'X)-1 is thetrue performed slightlybetterthanthepureerrormethoddid when
d is large.Withthepurequadratictermsin themodel,REML
Var(fl)becauseV is thetruevariance-covariance matrix.For
the methodwhenonlytworeplicates
REML andthepureerrormethod, is defined outperformed pureerror
relativeefficiency
areusedbutbothmethods performed equallywellwhenthree,
as
four,or fivereplicates areused.Therefore, iftheexperimenter
avg. I (X'V 'X)-'X'V 'VVc-'X(X'Vc'X)-' has no priorknowledge ofthevaluesofthetwoerrorvariances
Rel. Eff. =
I(X'V- X)-I I and can affordtheextracenterruns,the pureerrormethod
is a simpleand effective way to estimatetheerrorvariances
wheretheaverageis overthe 10,000simulateddeterminants.
independent of themodel.To getestimates of thecoefficients
Listed in Table 2 are the relativeefficiencies when q = 3 of thetermsin the
model,Vc is calculated,afterwhichit can
and n = 2 forcombinedMPV modelswithand without pure be inputted as theweightmatrixand weightedleast squares
quadratictermsin theprocessvariables. performed. An addedbenefit ofincludingthereplicated
FromTable2, itis seenthatOLS performs points
poorlyrelativeto andusingthe
pureerrorapproachis theabilityto testforlack
REML and thepureerrorapproach,exceptwhend < 1. This of fitof themodel.
is expectedbecausewhend is smallerthanone,thevariance The resultsof thesimulation studyby Kowalski(1999) for
of f8is dominated bythesubploterror, &-. Furthermore,when fivedifferent scenariosinvolvingmixtureexperiments with
processvariablesin both the simplexregionand the con-
strainedregioncan be obtainedfromthe firstauthorupon
Table 2. RelativeEfficienciesforComparingEstimationMethods
request.
Withq = 3 and n = 2 forthe Combined MPV Model
Without
Pure Quadratic Termsin Process Variables
5. EXAMPLE
r=2 r=3 An exampletakenfromCornell(1990, pp. 377-383) is
d OLS REML PE OLS REML PE modifiedhereto illustrate the designand estimation of the
model in Equation(4). The experiment involvesproducing
.11 1.03 1.07 1.08 1.03 1.06 1.08
.43 1.27 1.13 1.10 1.27 1.12 1.07
vinylforautomobileseatcovers.The mixture components are
1.0 1.78 1.26 1.20 1.79 1.21 1.11 threeplasticizerswhoseproportions arerepresented byx,, x2,
2.3 3.03 1.38 1.40 3.07 1.24 1.18 and x3. Also to be studiedare twoprocessvariables,rateof
4.0 4.68 1.49 1.62 4.76 1.33 1.24 extrusion (z,) and temperatureof drying (z2), each at twolev-
r=4 r=5 els. The measuredresponseis vinylthickness scaledto integer
values.The experiment was carriedoutas a split-plot experi-
d OLS REML PE OLS REML PE mentwiththeprocessvariablesas thewhole-plot factors.
.11 1.03 1.07 1.08 1.03 1.06 1.08
The originaldesignwas a six-point {3, 2} simplexlattice(a
.43 1.27 1.11 1.06 1.27 1.09 1.05 simplex-centroid designwithoutthecentroid)at each of the
1.0 1.79 1.14 1.08 1.79 22 factorial
2.3 3.07 1.18 1.12
1.13 1.06 pointsin theprocessvariables.The 24 treatment
3.05 1.11 1.08
4.0 4.76 1.15 1.13 4.73 1.15 1.08
combinations werereplicatedto produce48 observations. To
use thisexamplewiththedesignpresented earlierin thepaper,
WithPure Quadratic Termsin Process Variables somemodifications arenecessary.
r=2
First,onlytheresponseval-
r=3 ues at theverticesand midpoints of thetriangle fromthefirst
d OLS REML PE OLS REML PE replicateare used. Then,theresponsesforthe mixturecen-
troidsat the 22 factorialpointsin theprocessvariablesand
.11 1.02 1.03 1.04 1.02 1.03 1.03
.43 1.22 1.07
all the replicatesat the centerof the processvariablesare
1.09 1.24 1.07 1.05
1.0 1.81 1.09 1.20 1.83 1.08 1.10 simulatedas describedearlierusinga d valueof 3 (based on
2.3 3.15 1.10 1.38 3.12 1.06 1.15 theoriginalanalysisin Cornell(1990) wheretheratioof the
4.0 4.83 1.08 1.41 4.85 1.05 1.17
whole-plot errorto thesubploterrorwas about3). The data,
r=4 r=5 roundedto integers, are shownin Table 3.
The combinedMPV modelis estimated usingOLS, REML,
d OLS REML PE OLS REML PE andthepureerrorapproach.The estimates andstandard errors
.11 1.02 1.02 1.01 1.02 1.03
for REML were found using PROC MIXED in SAS. For OLS
1.03
.43 1.21 1.06 1.03 1.21 1.04 1.03 and thepureerrormethod,theestimatesand standarderrors
1.0 1.79 1.07 1.05 1.78 1.04 1.04 werecalculateddirectly
2.3 3.12 1.04 1.08 3.14 1.03 1.04
usingmatricesin PROCIMLin SAS.
4.0 4.82 1.03 1.08 4.82 1.02 1.04
The coefficient
estimatesand theirestimatedstandarderrors
alongwitht valuesare shownin Table4.
TECHNOMETRICS, FEBRUARY2002, VOL. 44, NO. 1
Table3. Vinyl
Thickness
ValuesfortheExample The pureerrorapproachused the 12 totalobservationsat
thecenterof theprocessvariablesto estimatethetwocovari-
Plasticizer
proportion Processvariablesetting Thickness
values ance S2p= 2.44 and S2 = .97. REML estimated
parameters,
X, x2 X3 Z, Z2 y
these two parametersto be S2 = 2.58 and S2p= 1.90. Even
withthisdifference in theestimatesof thecovarianceparam-
1 0 0 -1 1 10
0 1 0 eters,thevaluesof theestimates of themodelcoefficients
and
8
0 0 1 3 theirstandarderrorsare verysimilarwiththetwo methods.
.33 .33 .33 8 The valuesof theOLS estimates of themodelcoefficients
are
1 0 0 1 -1 10 close to thevaluesforREML and thepureerrormethodbut
0 1 0 5 the estimatedOLS standarderrorsare underestimated.
0 0 1
The
9
.33 .33 .33 9 resultsin thisexampleare consistent withthosefromearlier
.5 .5 0 1 1 5 simulation workby Kowalski(1999).
.5 0 .5 4 Testsof hypotheses of thecoefficients
producedthet val-
0 .5 .5 7 ues givenin Table 4. Testsforthe linearblendingtermsin
.33 .33 .33 10
.5
themixture modelare generally notperformed sincetheyare
.5 0 -1 -1 7
.5 0 .5 8 not directmeasuresof the componenteffects.WithREML
0 .5 .5 4 and OLS, thereare fivedf for the whole-ploterrormean
.33 .33 .33 7
square(two pureand threepooled) and ninedf forthesub-
.33 .33 .33 0 0 8
.33 .33 .33 7 plot errormean square.For each of thesetwo methods,the
.33 .33 .33 7 coefficientestimatesthoughtto be significantly
different
from
.33 .33 .33 8 zero, at the a = 0.05 level and attainingItl > t5 025 = 2.571
.33 .33 .33 0 0 7 or Itl > 9,.025 = 2.262, are indicatedby an asteriskin Table 4.
.33 .33 .33 8
.33 .33 .33
OLS is shownonlyforcompleteness, and becauseit underes-
9
.33 .33 .33 9 timatesthestandarderrors,OLS indicatesmostof theterms
.33 .33 .33 0 0 12 are significant.
For thepure errormethod,thereare two df
.33 .33 .33 10 forthe estimateof the whole-ploterrorvarianceand nine
.33 .33 .33 9
.33 .33 .33
df fortheestimateof thesubploterrorvariance.REML and
11
thepureerrormethodlead to thesameconclusionsregarding
LeastSquares
Ordinary REML
Term Estimate Std.error t Significant Estimate Std.error t Significant
Z Z2 -1.33 .31 -4.27 * -1.13 .92 -1.23
x1 8.49 .73 11.64 NA 8.99 1.27 7.08 NA
X2 4.99 .73 6.84 NA 5.49 1.27 4.32 NA
x3 4.49 .73 6.15 NA 4.99 1.27 3.93 NA
XlX2 5.14 3.09 1.66 3.10 4.69 .66
X,X3 6.14 3.09 1.99 4.10 4.69 .87
X2X3 11.14 3.09 3.61 * 9.10 4.69 1.94
x,Z1 -2.00 .69 -2.89 * -2.00 1.25 -1.60
X2Z1 .75 .69 1.08 .75 1.25 .60
X3Z1 2.00 .69 2.89 * 2.00 1.25 1.60
xl Z2 -2.00 .69 -2.89 * -2.00 1.25 -1.60
x2z2 2.25 .69 3.26 * 2.25 1.25 1.80
x3z2 -1.00 .69 -1.45 -1.00 1.25 -0.80
PureError
Method
Term Estimate Std.Error t Significant
Z Z2 -1.11 .85 -1.31
x, 9.06 1.00 9.03 NA
X2 5.56 1.00 5.54 NA
X3 5.06 1.00 5.04 NA
x1X2 2.82 3.39 .67
x1x3 3.82 3.39 1.13
X2X3 8.82 3.39 2.59
X1Z -2.00 1.04 -1.93
X2Z, .75 1.04 .72
X3Z1 2.00 1.04 1.93
X Z2 -2.00 1.04 -1.93
x2z2 2.25 1.04 2.17
X3Z2 -1.00 1.04 -.96
*Indicatesstatisticalsignificance(at the a = .05 level).
testsof significance
on theinteractions
betweenthe mixture ACKNOWLEDGMENTS
componentsand the process variablesas well as the two- The authorsthankthe editor,associate editor,and the two
factorinteractionamongtheprocessvariables.The possible reviewers for their suggestions and comments,all of which
underestimateofthesubploterrorvarianceforcesus to declare
improvedthis paper.
significantly
the nonlinearblendingtermbetweenmixturecomponents 2
and 3 as significant. [ReceivedJune2000. RevisedApril2001.]
REFERENCES
Box, G. P.,andDraper,N. R. (1987),EmpiricalModel-Building andResponse
6. SUMMARY Surfaces,New York:Wiley.
Chipman,H., Hamada,H., and Wu, C. F. J. (1997), "A BayesianVariable-
Cost and/or timeconstraints can makemixture experiments SelectionApproachforAnalyzingDesignedExperiments WithComplex
withprocessvariablesdifficult to runcompletely at random. Aliasing,"Technometrics, 39, 372-381.
In thiscase,restrictingtherandomization schemebyfixing the Cornell, J.A. (1988), "AnalyzingData FromMixtureExperiments Containing
ProcessVariables:A Split-Plot Approach," Journalof QualityTechnology,
levelsof some factorsand thenrunning combinations of the 20, 2-23.
otherfactorsleads to an experiment witha split-plot
structure. .(1990), Experiments WithMixtures:Designs,Models,and theAnal-
We proposetwo designsthatconsiderthe processvariables ysisofMixtureData (2nd ed.), New York:Wiley.
Cornell,J.A., and Gorman,J.W. (1984), "FractionalDesignPlans forPro-
as thewholeplotfactorsand themixture components as the cess Variablesin MixtureExperiments," JournalofQualityTechnology, 16,
subplotfactors. 20-38.
The split-plot V. "Mixture
structureof theexperiment requirestheesti- Czitrom, (1988), Experiments WithProcessVariables:D-Optimal
mationof the whole-plotand subploterrorvariancecompo- and Orthogonal ExperimentalDesigns,"Communications in Statistics-Theory
Methods,17, 105-121.
nentsdeclaringOLS to be no longervalid.As a result,two (1989), "Experimental DesignsforFourMixtureComponents With
alternative estimationmethodsare presented:REML and a ProcessVariables," Communications inStatistics-Theoryand Methods,18,
4561-4581.
pure errorapproach.A simulationstudywas conductedin Draper,N. R., Prescott,P., Lewis,S. M., Dean, A. M., John,P. W. M., and
orderto estimatethevariancecomponents. The twomethods Tuck,M. G. (1993), "MixtureDesignsforFourComponents in Orthogonal
along with OLS were the
comparedby using determinant of Blocks,"Technometrics, 35, 268-276.
the varianceof if and forming Ganju,J.,and Lucas,J.M. (1997), "Bias in TestStatistics WhenRestrictions
a relativeefficiencyvalue in in Randomization AreCaused by Factors,"Communications in Statistics-
termsof the truevalue. The relativeefficiencies providea Theoryand Methods26, 47-63.
measureof theincreasein thesize of theconfidence ellipsoid Khuri,
A. I., and Cornell,J.A. (1996), ResponseSurfaces:Designand Anal-
the yses(2nd ed.), New York:MarcelDekker.
surrounding parameter vector ft. Kowalski, S. M. (1999), "The Design and Analysis of Split-Plot
CombinedMPV modelswithandwithout thepurequadratic Experimentsin Industry,"Ph.D. dissertation,Universityof Florida,
termsin theprocessvariableswereconsidered. The pureerror Gainesville,FL.
Kowalski,S. M., Cornell,J. A., and Vining,G. G. (2000), "A New
approach and REML are quitecomparable whenthreeor more Model and Class of Designs for Mixture ExperimentsWith Pro-
replicatesof the centerof the design in the process vari- cess Variables,"Communications in Statistics-Theoryand Methods,29,
ables areperformed. Whenonlytworeplicatesareperformed, 2255-2280.
R. H., andMontgomery, D. C. (1997),ResponseSurfaceMethodology:
REML tendsto perform betterthanthepureerrorapproach. Myers,Processand ProductOptimization UsingDesignedExperiments, New York:
Therefore, if the experimenter can afforda few extraruns Wiley.
(using r > 3), thenwe suggestusingthe simplerpureerror Patterson andThompson(1971). "Recoveryof Inter-block Information When
Block Sizes Are Unequal,"Biometrika, 58, 545-554.
approach which does not requirecorrectlyspecifyingtheform SAS Institute(1989), SAS/QCSoftware:
ReferenceGuide, Ver 6 (1st ed.),
of themodel. Cary,NC: SAS Institute.
FEBRUARY
TECHNOMETRICS, 2002,VOL.44, NO. 1