2010 RAMS Doe and Data Analysis
2010 RAMS Doe and Data Analysis
Table of Contents
1. Introduction ..........................................................................................................................................................................1
2. Statistical Background ..........................................................................................................................................................2
3. Two Level Factorial Design .................................................................................................................................................3
4. Response Surface Methods (RSM) ......................................................................................................................................6
5. DOE for Life Testing ...........................................................................................................................................................9
6. Conclusions ........................................................................................................................................................................10
7. References ..........................................................................................................................................................................11
8. Tutorial Visuals…………………………………………………………………………………….. .................................12
This design is orthogonal. This is because the sum of the Table 6 – Design Matrix for a 2 3 Design
product of A and B is zero, which is
(−1 × −1) + (1 × −1) + (−1 × 1) + (1 × 1) = 0 . An orthogonal If the effect of ABC can be ignored, the following 4
design will reduce the estimation uncertainty of the model treatments can be used in the experiment.
coefficients.
The following linear regression model is used for the Order A B AB C AC BC ABC
analysis: 2 1 -1 -1 -1 -1 1 1
Y = β 0 + β1 X 1 + β 2 X 2 + β12 X 1 X 2 + ε (12) 3 -1 1 -1 -1 1 -1 1
Where: X 1 is for factor A; X 2 is for factor B; and their 5 -1 -1 1 1 -1 -1 1
interaction is represented by X 1 X 2 . The effects of A and B 8 1 1 1 1 1 1 1
are called main effects. The effects of their interaction are
called two-way interaction effects. These three effects are the Table 7 –Fraction of the Design Matrix for a 2 3 Design
three sources for the variation of Y. Since equation (12) is a In Table 7, the effect of ABC cannot be estimated from the
linear regression model, the ANOVA method and the t-test experiment because it is always at the same level of 1. Since
given in Section 2 can be used to test whether or not one effect Table 7 uses only half of the treatments from the full factorial
is significant. design in Table 6, it is represented by 2 3−1 and read as “2 to
For a balanced design, a simple way to calculate the effect the power of 3 minus 1 design” or “2 to the 3 minus 1 design.”
of a factor is to calculate the difference of the mean values of From Table 7, you will also notice that some columns
the response at its high and low setting. For example, the have the same values. For example, column AB and C are the
effect of A can be calculated by: same. Using equation (13) to calculate the effect of AB and C,
we will end up with the same procedure and result. Therefore,
from this experiment, the effect of AB and C cannot be
Table 11 –ANOVA Table with All Effects Table 12 –ANOVA Table with Significant Effects
There are no F ratio and P values in the above table. This is From Table 12, we can see that effects A, B, C and AB are
because there are no replicates in this experiment when all the indeed significant because their P values are close to 0.
effects are considered. Therefore, there is no way to estimate Once the important factors have been identified, follow-
the error term in the regression model. This is why the SSE up experiments can be conducted to optimize the process.
(Sum of Squares of Error), labeled as Residual in Table 11, is Response Surface Methods are developed for this purpose.
0. Without SSE, the estimation of the random error, how can 4. RESPONSE SURFACE METHODS (RSM)
we test whether or not an effect is significant compared to
random error? Don’t panic. Statisticians have already Response surface methods (RSM) are used to estimate the
developed methods to deal with this situation. When there is transfer functions at the optimal region. The estimated
no error in a screening experiment, Lenth’s method can be function is then used to optimize the responses. The quadratic
used to identify significant effects. Lenth’s method assumes model is the model used in RSM. Similar to the factorial
that all the effects should be normally distributed with a mean design, linear regression and ANOVA are the tools for data
of 0, given the hypothesis that they are not significant. If any analysis in RSM. Let’s use a simple example to illustrate this
effects are significantly different from 0, they should be type of design.
considered significant. So we can check the normal probability 4.1 Initial Investigation
plot for effects.
Assume the yield from a chemical process has been found
ReliaSoft DOE++ - www.ReliaSoft.com to be affected by two factors [1]:
Normal ProbabilityPlot of Effect
99.000
Effect Probability • Reaction Temperature
B:Exposure Time
Yield
Y' = Y • Reaction Time
Non-Significant Effects
A:Aperture Setting Significant Effects
Distribution Line The current operating conditions of 230 Fahrenheit and 65
minutes give a yield of about 35%. The engineers decide to
C:Develop Time
AB
50.000
Fahrenheit and [L=55, H=75] minutes to see how the
temperature and time affect the yield. The design matrix is:
10.000
Std. Point A: B:
5.000 Yield (%)
QA
Reliasoft
Order Type Temperature Reaction Time
7/10/2009
1.000
-40.000 -24.000 -8.000 8.000 24.000
3:03:18 PM
40.000
1 1 -1 -1 33.95
Effect
Alpha = 0.1; Lenth's PSE = 0.9375 2 1 1 -1 36.35
3 1 -1 1 35
Figure 1-Effect Probability Plot Using Lenth’s Method
4 1 1 1 37.25
From Figure 1, the main effect A, B, C and the 2-way 5 0 0 0 35.45
interaction AB are identified as significant at a significance 6 0 0 0 35.75
level of 0.1. Since the rest of the effects are not significant, 7 0 0 0 36.05
they can be treated as noise and used to estimate the sum of
8 0 0 0 35.3
squares of error. In DOE, it is a common practice to pool non-
9 0 0 0 35.9
significant effects into error. With only A, B, C and AB in the
model, we get the following ANOVA table. Table 13-Design Matrix for the Initial Experiment
Table 13 is the design matrix in coded values where -1 is the
Source of
DF
Sum of Mean F
P Value lower level and 1 is the higher level. For a given actual value
Variation Squares Squares Ratio
for a numerical factor, its corresponding coded value can be
4 5747.25 1436.8125 560.7073 1.25E-12
calculated by:
Model
90.548
normally distributed.
Given the above reasons, correct analysis methods for
88.955
data from life testing are needed.
87.362
QA
Reliasoft
7/11/2009
5.2 Maximum Likelihood Estimation and Likelihood Ratio
1:16:17 PM
Test [2]
342.929
345.757
348.586
351.414
354.243
357.071
150.858
156.515
162.172
167.828
173.485
179.142
A:Temperature B:Reaction Time
Maximum Likelihood Estimation (MLE) can estimate
X =351.5045 X =168.9973
model parameters to maximize the probability of the
Figure 4-Optimal Solution for the Chemical Process occurrence of an observed data set. It has been used
successfully to handle different data types, such as complete
Figure 4 shows how the response changes with each data, suspensions and interval data. Therefore, we will use
factor. The red dashed line points to the optimal value of each MLE to estimate the model parameters for life data from
factor. we can see that the optimal settings are 351.5 degrees DOE.
Fahrenheit for temperature and 169 minutes for the reaction Many distributions are used to describe lifetimes. The
time. At this setting, the predicted yield is 95.3%, which is three most commonly used are [4]:
much better than the yield at the current condition (about • Weibull distribution with probability density function
35%). (pdf):
β
5. DOE FOR LIFE TESTING β −1 t
−
t β µ
When DOE is used for life testing, the response is the life
f (t ) = e (17)
or failure time. However, because of the cost, time or other η η
constraints, you may not have observed values of life for some • Lognormal distribution with pdf:
β
test units. They are still functioning at the time when the test 1 ln(t ) − µ
−
ends. The end time of the test is called the suspension time for 1
f (t ) = e 2 σ (18)
the units that are not failed. Obviously, this time is not their σt 2π
“life.” Should the suspension time be treated as the life time in • Exponential distribution with pdf:
order to analyze the data? In this section, we will discuss the t
correct statistical method for analyzing the data for life testing. 1 −
f (t ) = e m (19)
First, let’s explain some basic concepts in life data analysis. m
Assume there is only one factor (in the language of DOE), or
5.1 Data Type for Life Test
stress (in the language of accelerated life testing) that affects
When the response is life, there are two types of data the lifetime of the product. The life distribution and factor
• Complete Data relationship can be described using the following graph.
• Censored Data
o Right Censored (Suspended)
o Interval Censored
If a test unit is failed during the test and the exact failure
time is known, the failure time is called complete data.
If a test unit is failed and you don’t know the exact failure
time -- instead, you know the failure occurs within a time
range -- this time range is called interval data.
If a unit does not fail in the test, the end time of the test of
the unit is called right censored data or suspension data.
Obviously, ignoring the censored data or treating them as
failure times will underestimate the system reliability.
However, in the use of the linear regression and ANOVA, an
exact value for each observation is required. Therefore,
engineers have to tweak the censored data in order to use Figure 5-pdf at Different Stress/Factor Levels
linear regression and ANOVA. A simple way to tweak the Figure 5 shows that life decreases when a factor is changed
Outline
Introduction
Design of Experiments
Why DOE
What DOE Can Do
Common Design Types
General Guidelines for Conducting DOE
Huairui (Harry) Guo, Ph.D. Statistic Background
Adamantios Mettas Linear Regression and Analysis of Variance (ANOVA)
Two Level Factorial Design
Response Surface Method
Reliability DOE
2010 RAMS –Tutorial DOE – Guo and Mettas 1
Introduction
Linear Regression
1.5
A linear regression
model attempts to explain
the relationship between
two or more variables
using a straight line.
Introduction
Statistical
T-test for the Coefficients Background
Two Level
T-test is an alternative to the F-test. Factorial Design
T-test is used to test whether or not a coefficient is 0.
For example,
Response
Surface Method Two Level Factorial Design
H0: β1 = 0; H1: β1 ≠ 0
Reliability DOE
The test statistic is:
βˆ1 Summary
T0 =
se( βˆ1 )
β̂1 is the estimated value for β1. se( βˆ1 ) is the standard error.
2010 RAMS –Tutorial DOE – Guo and Mettas 21
Treatment = 4 Replicate = 1
Source of Degrees of Sum of Mean P
Variation Freedom Squares Squares F Ratio Value
Design notation: 2k k is the number of factors.
Model 2 6.14E+04 3.07E+04 36.86 0.0077
X1 1 6.00E+04 6.00E+04 72.03 0.0034
2010 RAMS –Tutorial DOE – Guo and Mettas 23
X2 1 8100 8100 9.72 0.0526
Residual 3 2500 833.3333
Total 5 6.39E+04
Note: The P values from the T-test are the same as in the F-test.
Critical Valu
Significant
X1
Term
X2
QA
Reliasoft
7/28/2009
4:57:54 PM
0.000 2.353 3.600 5.400 7.200 9.000
Effect
Alpha = 0.1; Threshold = 2.3534
Temperature
Num of Units (Celsius) Humidity (%)
25 120 (+1) 95 (+1)
25 85 (-1) 85 (-1)
It is a bad design: Main effects Temperature and Humidity are
Confounded.
From the viewpoint of Linear Regression, at least 3 unique
combinations are needed — why?
Y = β 0 + β1 X 1 + β 2 X 2 + ε
If effect ABC can be ignored in the study, then we can select the
treatments with ABC=1 for a Fractional Factorial Design.
B:Exposure Time
Yield
Y' = Y
Non-Significant Effects
Significant Effects
Experiment
A:Aperture Setting
Distribution Line
AB
C:Develop Time The initial design is a modified factorial design. The Design Matrix is:
A: B:
Std. Order Point Type Temperature Reaction Time Yield (%)
Probability
50.000
1 1 -1 -1 33.95
2 1 1 -1 36.35
3 1 -1 1 35
10.000 4 1 1 1 37.25
5.000
5 0 0 0 35.45
QA
Reliasoft 6 0 0 0 35.75
7/10/2009
1.000 3:03:18 PM 7 0 0 0 36.05
-40.000 -24.000 -8.000 8.000 24.000 40.000
Effect 8 0 0 0 35.3
Alpha = 0.1; Lenth's PSE = 0.9375
9 0 0 0 35.9
Lenth’s method assumes all the effects should be normally distributed with a mean of
0, given the hypothesis that they are not significant. The initial design at the current operation condition is used to search
2010 RAMS –Tutorial DOE – Guo and Mettas 36 the direction for locating the optimal region for optimization.
2010 RAMS –Tutorial DOE – Guo and Mettas 41
Statistical
ANOVA Table
Background
Two Level
Factorial Design
Reliability DOE
RSM: Example
The yield from a chemical process has been Path of Steepest Ascent
found to be affected by two factors:
Reaction Temperature Factor Levels
Reaction Time Step Coded Actual Yield (%)
A B A B
The current operating conditions of Current
0 0 230 65 35
230 Fahrenheit and 65 minutes give a yield Operation
1 2.4 1 242 75 36.5
of about 35%. 2 4.8 2 254 85 39.35
The engineer decides to explore the current 3 7.2 3 266 95 45.65
4 9.6 4 278 105 49.55
conditions in the range (225, 235)
5 12 5 290 115 55.7
Fahrenheit and (55, 75) minutes so that 6 14.4 6 302 125 64.25
steps can be taken to achieve the maximum 7 16.8 7 314 135 72.5
yield. 8 19.2 8 326 145 80.6
9 21.6 9 338 155 91.4
10 24 10 350 165 95.45
11 26.4 11 362 175 89.3
12 28.8 12 374 185 87.65
2010 RAMS –Tutorial DOE – Guo and Mettas 40
Quadratic model
5 0 350 165 94.85
6 0 350 165 95.45
7 0 350 165 95
8 0 350 165 94.55
9 0 350 165 94.7
Y = β 0 + β1 X 1 + β 2 X 2 + β11 X 12 + β 22 X 22 + β12 X 1 X 2 + ε
At the optimal region, the curvature is significant. The simple linear model is not enough any more.
The coefficients show that the direction is (1.1626, 0.4875) or Using a 2 factor factorial design as example:
can be normalized to (2.4, 1). (-1,1) (1,1)
(0, α)
(-1,-1) (1,-1)
(0, -α)
(0, α)
(-1,1) (1,1)
(-α, 0) (α, 0)
(0, 0)
(-1,-1) (1,-1)
(0, -α)
was conducted. 11
12
-1
-1
357.07
350
165
150.86
92.75
88.4
13 -1 350 179.14 92.6
50.000
Residual
0.000
-0.140
10.000
5.000
QA -0.420
Reliasoft
7/11/2009
1:07:07 PM
1.000 QA
-2.000 -1.200 -0.400 0.400 1.200 2.000 Reliasoft
Residual -0.621 7/11/2009
Anderson-Darling =0.3202; p-value =0.4906 -0.700 1:08:33 PM
0.000 3.000 6.000 9.000 12.000 15.000
Run Order
2010 RAMS –Tutorial DOE – Guo and Mettas 53 Alpha =0.1; Upper Critical Value =0.6209; Lower Critical Value =-0.6209
How to Decide α
Several methods have been developed for deciding Optimization
α. Y = 94.91 + 0.74 X 1 + 1.53 X 2 − 1.52 X 12 − 2.08 X 22 + 0.45 X 1 X 2
ReliaSoft DOE++- www.ReliaSoft.com
Optimal Solution 1
Special values of α can make the design have special Central Composite Design
parameters, or that can better explore the optimal region. 93.733 Factor Value
Response Value
Y = 95.3264
(Maximize)
Yield
1/ 4 90.548
2 k − f (n f )
α = 88.955
ns
87.362
QA
Reliasoft
7/11/2009
1:16:17 PM
150.858
156.515
162.172
167.828
173.485
179.142
ns is the number of replicates of the runs at the axial points. A:Temperature B:Reaction Time
2k-f represents the original factorial or fractional factorial design. X =351.5045 X =168.9973
σ t 2π
Summary
Exponential distribution pdf:
t
1 − m
f (t ) = e
m
2010 RAMS –Tutorial DOE – Guo and Mettas 56
where:
Instead of considering the entire scale of the pdf, the life µ ' = ln(η ) or µ'= µ or µ ' = ln(m)
characteristic can be chosen to investigate the effect of potential
factors on life.
xj : jth factor value
The life characteristics for the three commonly used distributions Note that a logarithmic transformation is applied to the life
are: characteristics of the Weibull and exponential distributions.
Weibull: η Lognormal: µ Exponential: m This is because η and m can take only positive values.
2010 RAMS –Tutorial DOE – Guo and Mettas 63
failure time. -1
-1
1
1
-1
1
1
-1
-1
1
8~10
18~20
10~12
20+
1 -1 -1 -1 1 20+ 20+
Problem: The above assumptions and adjustments 1
1
-1
1
1
-1
1
-1
-1
-1
12~14
16~18
20+
20+
are incorrect. 1 1 1 1 1 12~14 14~16
2010 RAMS –Tutorial DOE – Guo and Mettas 69 …assuming that the interactions are absent.
2010 RAMS –Tutorial DOE – Guo and Mettas 70
β̂1 is the estimated value for β1. se( βˆ1 ) is the standard error.
B:B
Term
E:E
Note: The P values from the Z-test are slightly different from the Likelihood Ratio test. A:A
2010 RAMS –Tutorial DOE – Guo and Mettas 73 From both LR test and the Z-test, A, B, D and E are significant.
2010 RAMS –Tutorial DOE – Guo and Mettas 74
Model Diagnostic
When using the Weibull distribution for life, the residuals from the life- Model Diagnostic (cont’d)
factor relationship should follow the extreme value distribution with a
mean of zero. Residuals against run order plot
In order to improve the life, factors A and E should be set to the high
level; while factors B and D should be set to the low level.
MLE Information
Term Coefficient
A:A 0.1052
B:B -0.2256
C:C -0.0294
D:D -0.2477
E:E 0.1166
Introduction
Statistical
Summary: Topics Covered Background
Two Level
Why DOE, what DOE can do and common design Factorial Design
types Response
Surface Method
Summary
General guidelines for conducting DOE
Reliability DOE
Linear regression and ANOVA
Summary
2-level factorial and fractional factorial design
Response surface method
Reliability DOE 2010 RAMS –Tutorial DOE – Guo and Mettas 80