Linder 2014
Linder 2014
An Analysis of Industrial
SRAM Test ResultsVA
Comprehensive Study
on Effectiveness and
Classification of March
Test Algorithms
Michael Linder and Alfred Eder Klaus Oberländer
University of Applied Sciences Augsburg Infineon Technologies AG
Ulf Schlichtmann
Technische Universität München
have been tested during full industrial
production. This allows a meaningful
Editor’s notes:
statistical analysis within the range of
This paper deals with efficient test algorithm identification of embedded
SRAMs. The approach is based on silicon test results of automotive parts per million (ppm). To the best of
microcontroller devices, which tries to identify efficient tests by removing our knowledge, no such detailed and
unnecessary test patterns that cover the same subset of faults. Results of an comprehensive evaluation of test algo-
industrial case under 29 test algorithms and a large amount of chips are rithms based on large volumes of
obtained; among them, efficient ones are identified. production ICs has been published to
VCheng-Wen Wu, NTHU date. Our analysis took place during the
burn-in phase of production, i.e., after
wafer test and packaging.
h MUCH HAS BEEN published on the development In production test sets, an efficient selection of
and properties of memory test algorithms [1]–[9]. test algorithms is essential to achieve high fault
The theory of functional fault models (FFMs), coverage and low test time at the same time. Until
performance, and effectiveness of test algorithms now, however, test engineers have mostly relied
are commonly confirmed by simulation. In this more on personal experience and even gut feeling
paper, we present a study on the effectiveness of when deciding on production test sets rather than
memory test algorithms based on test results on reliable quantitative data.
gathered during production memory tests of em- In this paper, we present production SRAM test
bedded SRAMs. Hundreds of thousands of devices results and a statistical analysis of these data that
work out the effectiveness of test algorithms and
Digital Object Identifier 10.1109/MDAT.2013.2279752 efficiency of test combinations, which is based
Date of publication: 28 August 2013; date of current version: on approximately 2500 faults. The effectiveness of
22 July 2014. single test algorithms is measured by their fault
42 2168-2356/14 B 2013 IEEE Copublished by the IEEE CEDA, IEEE CASS, IEEE SSCS, and TTTC IEEE Design & Test
coverage, and the efficiency of test combinations is memory test sets [5], as well as new algorithms, such
shown by a comprehensive analysis of union and as March LR [6] and March RAW [7]. In contrast to
intersection of sets of faults detected by combina- older test algorithms, these new test algorithms have
tions of different test algorithms. These analyses principally been developed based on FFMs. The
allow us to classify the test algorithms and to group concept of FFMs has been established during the
them into sets. The findings of this classification are 1990s by Hamdioui [1] and van de Goor [5]. It is a
combined with information about correlation of formal description of any memory fault indepen-
march sequences and FFMs from literature, so that dent of memory type or technology. With the
sets of test algorithms could be allocated to a introduction of FFMs, test algorithms could be
specific set of FFMs. An efficient selection of test developed systematically and targeted for specific
algorithms for production SRAM testing can now be purposes. Additionally, three algorithms within the
based on this classification to avoid similar tests test set are not taken from literature, but are
within a test set and so to avoid redundancy and designed with respect to the specifications of our
save test time. configurable MBIST. These algorithms are Ham5R,
Ham5W, and Ham_Walk.
Test setup and data acquisition The notation of linear march algorithms is ac-
The test suite used for the analysis contains cordingly as described in [10]. An algorithm is
29 linear march algorithms that are executed by a delimited by curly brackets ‘‘{’’ and ‘‘}’’ and a march
configurable memory built-in self-test (MBIST) on element is delimited by parentheses. A +, *, or m
each of the embedded memories of microcontroller prior to the march elements denotes the addressing
devices in 130-nm technology. In total, there have direction up, down, or arbitrary, respectively. An
been 13 embedded SRAMs, sized between 1.38 and operation applied to a cell can be a ‘‘w0’’ (write ‘‘0’’),
128 kB, tested in parallel with a total memory size of an ‘‘r0’’ (read ‘‘0’’), a ‘‘w1,’’ or an ‘‘r1,’’ and all oper-
261.56 kB. Our memory design uses single-port, six- ations of one march element are applied suc-
transistor SRAM cells for all embedded memories in cessively to one cell, before they are applied to the
the study. next cell. A D in the notation of March G denotes
We chose devices of 130-nm technology because delay time between two march elements.
of the high production volumes that are necessary to
achieve enough test data for a meaningful statistical Test environment
analysis. The test volume of a more recent and The tests of this analysis were placed into the
smaller technology (e.g., 90 nm) would not have production test flow of embedded SRAMs of micro-
been sufficient for such a comprehensive analysis. controller devices during burn-in. That means that
For a full access to the high volume of devices, all the tests took place after wafer test and packaging
tests of the study took place during the production (see Figure 1). This strategy allows gathering com-
test flow. prehensive test results at different environmental test
conditions (temperature and supply voltage), and
Memory test algorithms allows extensive test time, as the devices remain up
The set of memory test algorithms contains 29 to 72 h in burn-in so that the test does not limit this
linear march algorithms that are well known and study. During the tests, the results are stored on-chip.
have previously been described in the literature or They are read out and stored in a database after the
are designed due to the properties of the MBIST we test flow has been completed.
used for the study. The set of algorithms is listed in The tests for the analysis presented in this paper
Table 1. The number of test algorithms was restricted took place at a temperature of 145 C and increased
by limited storage space for the test program and by supply voltage of 1.80 V after a high-voltage and
the fact that our MBIST only supports linear high-temperature stress phase during burn-in.
marching algorithms. Hence, unfortunately, no
complex and nonlinear algorithms such as GALPAT Data acquisition and preparation
[5] could be included. However, the test set con- After testing has been completed, the results are
tains commonly used marching algorithms, such as read out from the devices, processed, and written
SCAN, MATS, or March C-, that are often used in into a database. This database allows combining the
May/June 2014 43
An Analysis of Industrial SRAM Test Results
data in any way and filtering the test results for the
analysis.
While this approach enables us to achieve a large
number of test results for a statistical analysis, the
analysis of the data is restricted to the possibilities that
are given by the production test flow. It was not
possible to pick and collect faulty devices during the
test process as all analyses were done offline after a
Figure 1. SRAM test flow (simplified, schematic). sufficient number of test results were available. That
May/June 2014 45
An Analysis of Industrial SRAM Test Results
with the knowledge about characteristic march The following analyses of fault coverage and
sequences from theory, and the sets of algorithms efficiency of test algorithms are based on the values
are allocated to a set of FFMs. of Table 3.
May/June 2014 47
An Analysis of Industrial SRAM Test Results
Figure 3. Venn diagrams showing union and intersection of test algorithms. (a) Algorithm B, March U,
March LR, and March SR. (b) March RAW, March AB, and March BDN. (c) March RAW and March RAW1.
(d) SCAN, SCAN+, and March G. (e) SCAN, March RAW, and Ham5W.
memories, the combination of algorithms is of more space reasons, but would be similar. Only Ham_Walk
interest than the fault coverage of single test is a little outstanding as some additional faults are
algorithms Therefore, the analysis of test data is detected that are not covered by the other algo-
extended to union, intersection, and Qeff . rithms. So, in addition to the previous statement, the
sets of faults detected by Algorithm B, March U,
Analysis of effectiveness and similarity March LR, or March SR are approximately a subset of
Analyzing the results listed in Table 3 is a very those faults detected by Ham_Walk. For these
simple way to determine the efficiency and similar- relationships, we use as a symbol to denote ‘‘is
ity of test algorithms in this study. The similarity, and approximately a subset of’’
hence also the efficiency of test algorithms, is simply
given by a shading. fFAlgB ; FU ; FLR ; FSR g FHam Walk : (4)
The following results are based on the empirical
and statistical data gathered during the study. The This means, in terms of efficiency and test set
five algorithms Algorithm B, March U, March LR, development, that Ham_Walk is able to replace the
March SR, and Ham_Walk are not only outstanding other algorithms. Using more algorithms of this
concerning their fault coverage, but are also highly group in one set would mean redundant testing and
similar. Comparing these algorithms shows that the so an increase of test time that would hardly
intersection of any pair of these algorithms is increase the fault coverage. Such combinations
extremely high, and so is Qeff . This means that not should be avoided during industrial memory testing
only the cardinality of these algorithms is very as test time is expensive.
similar, but also mostly the same faults are covered Another interesting relationship of algorithms
by these algorithms: can be observed with March RAW, March RAW1,
March AB, March AB1, and March BDN. These
FAlgB FU FLR FSR FHam Walk : (3)
algorithms have been developed with a focus to
This relationship and also the proportions are detect dynamic faults. In [8] and [9], March AB
shown in Figure 3a, where March LR is taken as base respectively March BDN are described to detect the
of the comparison and all other algorithms are same set of faults as March RAW but with reduced
related to March LR. The Venn diagrams of any other test length. This statement can be confirmed by our
combination of these algorithms are omitted for test results. The fault coverage of these algorithms is
May/June 2014 49
An Analysis of Industrial SRAM Test Results
almost the same. Figure 3b illustrates combinations be expected due to the fact that the march
of these algorithms and shows that sequence of SCAN is a proper subset of SCAN+
(see Table 1). This discrepancy between theory
FRAW FAB FBDN : (5) (same structure of march elements) and practice
Moreover, the test results confirm the assumption (no proper subset) remains a subject for future
that the set of faults detected by March RAW1 analysis, as no individual devices could be picked
respectively March AB1 is a subset of March RAW from the test flow in the production test environ-
respectively March AB [7], [9]. Figure 3c shows ment used in our study.
As defined above, efficient combinations of test
FRAW1 FRAW (6) algorithms are those where the intersection is low
and the union is high at the same time. One of
and
these combinations is SCAN with Ham5W (Qeff ¼
FAB1 FAB : (7) 0:016; see Figure 3e). However, for an efficient
test selection, the whole set of algorithms that is
March RAW1 and March AB1 are assigned to applied needs to be taken into account. Although
single-cell dynamic faults whereas March RAW and the combination of SCAN and Ham5W is very
March AB additionally detect two-cell dynamic efficient, both algorithms are covered by March
faults [9]. So, the intersection of March RAW and RAW. Hence, if March RAW is used in a test set,
March RAW1, and the intersection of March AB and none of the other algorithms would improve the
March AB1 denote those faults within our experi- test results significantly. All relationships between
mental results that contain single-cell dynamic SCAN, Ham5W, and March RAW are illustrated in
faults, and the remaining faults detected by March Figure 3e.
RAW respectively March AB are then assumed to be For any combination of algorithms, the data are
two-cell coupling faults. The results permit an esti- analyzed and are usedVtogether with the theory
mation of the ratio of single-cell to two-cell dynamic about characteristic march elementsVfor the clas-
faults. In our results, 584 faults are detected by sification of test algorithms and FFMs.
March RAW but not by March RAW1, and the inter-
section of these two algorithms contains 979 faults. Grouping and Classification
The union of these algorithms contains 1604 faults The previous section described how to deter-
(see Figure 3c). So, about one third of the faults (584 mine and to describe similar performance of mem-
of 1604) is exclusively detected by March RAW. This ory test algorithms. The findings of sets, subsets, and
third contains the two-cell dynamic faults. About supersets of test algorithms are now used for the
two thirds of the faults (979 of 1604) are detected by following classification of algorithms and faults. In
both algorithms. They contain the single-cell dyna- addition to our experimental gathered results, struc-
mic faults. However, the part of static faults is not tural similarities of test algorithms are also taken into
excluded in the results. Assuming that static faults account for the grouping. In literature introducing
are uniformly distributed, the ratio of single-cell to new test algorithms, the fault coverage for specific
two-cell dynamic faults in our results is approxi- fault models of the new algorithms is compared to
mately 2 : 1. that of recent algorithms by simulation [4], [6], [8],
Two algorithms (SCAN and SCAN+) are completely [9]. We, however, combine theoretical approaches,
covered by March G (Figure 3d). So, all faults detected e.g., characteristic march sequences (Table 4) de-
by SCAN or SCAN+ are already covered by March G. scribed in [4] which are related to FFMs, and the
So, F SCAN and F SCANþ are proper subsets of F G findings of our experiments described in the Exper-
FSCAN FG (8) imental Test Results section. The biggest difference
between simulation based comparison and our ap-
and proach is that we do not know the kind of fault from
our results. Thus, the information about detectability
FSCANþ FG : (9)
of specific fault models, i.e., the properties of test
However, the set of faults detected by SCAN is algorithms, is taken from previous published work. For
not a proper subset of SCAN+, although this should the classification based on our statistical analysis, four
kinds of functional faults are taken into account. rithms for production test sets, as similar algo-
These are rithms, and so redundant testing, can be avoided.
From the sets determined and the analysis of
h simple static single-cell faults; similarity and efficiency, the following conclusions
h simple static coupling faults; can be drawn:
h linked faults;
h dynamic faults. h sets I and II are covered by set III;
h the march sequences of March LR are similar to
Based on theory, i.e., the formal description of those of algorithms in set III;
which algorithms are able to detect specific fault h march RAW covers other algorithms of set V;
models given in literature, and also based on the h Ham_Walk covers algorithms of set IV and III.
evaluation of or test results, the algorithms within
our test set are classified into five sets. Table 5 lists An efficient test set which detects many different
the groups of algorithms and allocated FFMs. Here, faults would consist of algorithms of different sets
the similarity of algorithms is necessary to determine which are irredundant. Hence, a combination of
whether algorithms should be in the same set. This March RAW and Ham_Walk would be expected to
classification is similar to that shown in [5], be very efficient. For the evaluation of our results,
grouping simple march algorithms like SCAN, this means that 2347 out of a total of 2439 faults are
MATS, and March C-, and fault modes. detected by the combination of these algorithms.
Each set of algorithms is allocated to a certain That means 96.2% of the faults can be covered with
number of FFMs which are detected predominantly an effort of only two test algorithms. However, this is
by these algorithms, but not only. For example, not yet 100% fault coverage, but our results have
algorithms of set I, II, III, and IV may also detect some shown that at least 13 test algorithms would have
dynamic faults, but the major part of dynamic faults been necessary to detect all faults. This would
is allocated to algorithms of set V. The allocation of mean too much effort for production testing as the
FFMs is based on the description of typical march MBIST needs to be reconfigured for each algorithm
sequences and test algorithms in [4]. in the test set. Hence, our approach is useful for
This grouping of algorithms and fault models is such test stages during a production test flow, where
the basis for an efficient selection of test algo- only a few test algorithms can be used as test time is
May/June 2014 51
An Analysis of Industrial SRAM Test Results
Summary
In this paper, we present a comprehensive h References
analysis of an industrial SRAM test. During produc- [1] S. Hamdioui, Testing Semiconductor Random Access
tion memory testing. hundreds of thousands of em- Memories: Defects, Fault Models and Test Patterns.
bedded memories have been tested and the test Boston, MA, USA: Kluwer, 2004.
results are analyzed with respect to efficiency and [2] S. Hamdioui, Z. Al-Ars, and A. J. van de Goor, ‘‘Testing
similarity of memory test algorithms. The tests were static and dynamic faults in random access memories,’’
implemented in high volume production testing of in Proc. IEEE VLSI Test Symp., 2002, pp. 395–400.
130-nm technology automotive microcontroller de- [3] S. Hamdioui, A. J. van de Goor, J. Delos Reyes, and
vices. The overall aim of this study is to get a basis for M. Rodgers, ‘‘Memory test experiment: Industrial
simple and quick test set generation that fulfills two results and data,’’ Comput. Digital Tech., vol. 153,
requirements: high fault coverage and low test effort no. 1, pp. 195–205, 2006.
at the same time. Test time is an important issue for [4] S. Hamdioui, A. J. van de Goor, and M. Rodgers,
production memory testing, as is high fault cover- ‘‘Linked faults in random access memories: Concept,
age. Hence, algorithms with similar properties and fault models, test algorithms and industrial results,’’
fault coverage (i.e., that address the same faults) IEEE Trans. Comput.-Aided Design Integr. Circuits
should be avoided in production test sets as this Syst., vol. 23, no. 5, pp. 737–757, May 2004.
means redundant testing and additional effort. Be- [5] A. J. van de Goor, Testing Semiconductor Memories:
sides the simple fault coverage of single test algo- Theory and Practice. Gouda, The Netherlands:
rithms, the ratio of intersection and union of faults ComTex, 1998.
detected by different test algorithms are taken into [6] A. J. van de Goor, G. N. Gaydadjiev, V. N. Yarmolik,
account as a measure of efficiency and similarity of and V. G. Mikitjuk, ‘‘March LR: A test for realistic
test algorithms in a set. A comprehensive analysis of linked faults,’’ in Proc. IEEE VLSI Test Symp., 1996,
29 linear marching algorithms was performed. The pp. 272–280.
result of this study is a classification of algorithms. [7] S. Hamdioui, Z. Al-Ars, A. J. van de Goor, and M. Rodgers,
Groups of similar algorithms are allocated to FFMs. ‘‘Dynamic faults in random-access memories: Concept,
This serves as a foundation for an efficient combi- fault models and tests,’’ J. Electron. Testing, Theory Appl.,
nation of test algorithms. The analysis of our experi- vol. 19, pp. 195–205, 2003.
mental results shows that already 96% fault coverage [8] A. Benso, A. Bosio, S. Di Carlo, G. Di Natale, and
can be achieved by combining only two out of the P. Prinetto, ‘‘March AB, March AB1: New March tests
29 considered test algorithms. The findings pre- for unlinked dynamic memory faults,’’ in Proc. IEEE Int.
sented in this paper have directly influenced the Test Conf., 2005, pp. 834–841.
wafer test stage of microcontroller devices and the [9] A. Bosio and G. Di Natale, ‘‘March BDN: A new March
development of a new memory BIST generation at test for dynamic faults,’’ in Proc. IEEE Int. Conf. Autom.
Infineon Technologies AG to force the efficiency of Quality Testing Robot., 2008, pp. 85–89.
embedded SRAM self-tests. [10] A. Offerman and A. J. van de Goor, ‘‘An open notation
for memory tests,’’ in Proc. Int. Workshop Memory
THE AUTHORS APPRECIATE the evaluation test Technol. Design Testing, 1997, pp. 71–78.
results gathered during burn-in of hundreds of
thousands of ICs. The enormous number of test Michael Linder is currently with NXP Semicon-
results gathered during this study enabled them to ductors, Hamburg, Germany, with an emphasis on
demonstrate the relationship between march test test development for security smart card devices.
algorithms and FFMs described in theory based on Linder has a Dipl-Ing (FH) in mechatronics from
statistical analysis in practice. h Hochschule Augsburg, Augsburg, Germany, an MS
May/June 2014 53