Kessler, 2009
Kessler, 2009
orthographic correspondences
Brett Kessler
Washington University in St. Louis, MO, USA
Abstract
The English writing system deviates widely from the alphabetic ideal of uniform
one-to-one correspondence between graphemes and phonemes, but its inconsis-
Correspondence: tency is greatly reduced when conditional sound–spelling rules are applied. When
Brett Kessler,
Psychology Department,
reading or writing one part of a word, children and adults evinced knowledge of
Washington University rules sensitive to the identity of other letters or phonemes, even those appearing
in St. Louis, much later in the word. Adults also showed sensitivity to the distinction between
Campus Box 1125, the basic and Romance subsystems of English (Albrow’s Systems 1 and 2).
One Brookings Drive, Children as young as 6 years applied conditional rules that they were not taught,
St. Louis, MO 63130-4899,
indicating implicit statistical learning of patterns observed in text. But learning is
The English writing system is routinely criti- one-offers like ‘hiccough’, where <gh> = /p/, and
cized for falling far short of the alphabetic ideal ‘of’, where <f> = /v/.1 It also provides a straight-
(Caravolas, 2004) of one-to-one correspondence forward explanation of how children can learn to
between phonemes and graphemes. Almost all its read and write in classrooms whose curriculum
phonemes have multiple spellings—sound-to-letter, is based on whole-word learning with no expla-
or spelling inconsistency—and almost all its let- nation of how sound–spelling correspondences
ters have multiple pronunciations—letter-to-sound, work. But there is also much evidence that
or reading inconsistency. A study by Hanna et al. whole-word memorization cannot be the whole
(1966) showed that only 73% of all phonemes picture. All other factors being equal, people are
would be spelt correctly if the writer picked the most slower and more error-prone at processing words
common spelling for each phoneme: on average, with unusual sound–spelling correspondences
a person who spells by the alphabetic principle would (Balota et al., 2004), which should not be a fac-
make a mistake on every fourth grapheme. tor if they memorized all words as arbitrary letter
But arguably the speller’s loss is the psycholin- sequences. Even more convincing is the fact that
guist’s gain. The fact that most mature spellers do people can come up with plausible spellings when
much better than 73% raises questions as to how asked to write non-words, and other people can
people read and write words, if applying one-to-one read such spellings. These feats would hardly be
correspondences is not enough. Some theories that possible if the only route to reading and writing
have been adduced are: were memorizing words as unanalysable wholes.
• Learning large-unit sound–spelling correspon-
• Whole-word memorization: people memorize dences: it has often been noted that the rimes
entire words as arbitrary sequences of letters. (the vowel plus the following, coda, consonants)
Such a theory accounts for the fact that people of riming words are often spelt alike. There may
learn all sorts of inconsistent spellings, including be many ways to spell /aɪ/ in English, but the
Writing Systems Research, Vol. 1, No. 1, 2009. © The Author 2009. Published by Oxford University Press. 19
For Permissions, please email: journals.permissions@oxfordjournals.org
doi:10.1093/wsr/wsp004
B. Kessler
spelling of the rime /aɪld/ is consistently <ild>, chess: a set of linguistically encoded instructions
at least in simple monomorphemic words like that one consciously learns from a teacher and can
‘wild’ and ‘child’. The idea that people might easily formulate and pass on to one’s student. Even
learn the spelling of whole rimes as units is spelling researchers may have a hard time explaining
attractive because there is much evidence that that /k/ is spelt <ck> at the end of word-final stressed
preliterate children are good at breaking syllables syllables when the rime of the uninflected lexeme
down into onsets (the part before the vowel) and would otherwise contain fewer than three letters
rimes—while breaking rimes down into vowels (e.g. ‘sick’ versus ‘silk’, as discussed below), and
and codas is often very challenging for them most people who follow the rule have never been
(Goswami, 1993; Treiman and Kessler, 1995). taught it overtly (Hayes et al., 2006). An alternative
Much work in literacy research has proceeded hypothesis is that complex sound–spelling corre-
from the point of view that onsets and rimes spondences are learnt the same way we learn many
are the effective units of reading and writing other patterns in life: by observing and internalizing
(Stanback, 1992; Ziegler et al., 1997), which the relative frequency with which objects and events
is not to claim that these researchers subscribe occur and co-occur. Perhaps through repeated expo-
to the theory that literacy is nothing more that sure to text, people gradually pick up both uncon-
memorizing the spellings of whole onsets and ditional and conditional correspondences, without
rimes. One problem with such a theory is that necessarily formulating any conscious accounts of
correspondences in one part of the syllable is Table 1 Sound–spelling consistencies of the parts of
improved when rules are sensitive to the contents monosyllabic words
of other parts of the syllable. Because the syllabifi-
Contexta Onset Vowel Coda
cation of intervocalic consonants is a moot issue in
English, we found it expedient to restrict this first Reading b
that occurred in Old English, and English has had 2 Spelling Vowels in
many other vowel changes that were conditioned by
specific codas. In contrast, very few sound changes
Non-words Using Conditional
in the onset were conditioned by the coda, or vice Correspondences
versa, which accounts for the non-significant con-
nections between those syllable parts. The next question is whether humans are sensitive
It bears remembering that these conditional to, learn, and use conditional patterns. Treiman
consistencies were all derived by mathematically et al. (2002) explored how people spell vowels,
analysing the vocabulary itself: the data do not say which, as Table 1 shows, have the most to gain
that people take advantage of these patterns, but from conditional reference to context. Do adults
they do prove that the patterns are there (a full list take adjacent consonants into account when they
of the individual patterns can be found at http://spell spell vowels? We selected nine vowels for which
.psychology.wustl.edu/RelSoundLetMono). The data our vocabulary analysis had revealed that a given
also give us some basis for addressing the large-unit spelling was much more likely in one context than
hypothesis raised earlier: would it make sense to another. These are listed in Table 2. For instance,
memorize the spellings of entire onsets and rimes? it turns out that stressed /i/ is spelt <ee> in 64% of
The fact that the consistencies of vowels are so much all words where it appears before word-final /d/ or
higher when the coda is taken into account, and vice /p/, but that <ee> is used only 15% of the time, on
Table 2 Increase in proportion of vowels that adults spelt with the conditioned spelling when a conditioning consonant
is present
Before coda
/ε/ <ea> /d/ head 0.40 0.06
/i/ <ee> /d/, /p/ feed 0.49 0.27
/o/ <o> without <e> /l/ roll 0.33 0.11
/ʊ/ <u> /l/, /s/, /ʃ/, /tʃ/ push 0.73 0.11
/aʊ/ <ow> /l/, /n/ clown 0.89 0.43
/aɪ/ <igh> /t/ night 0.39 0.27
After onset
/ɑ/ <a> /w/ wasp 0.75 0.72
/ɚ/ <or> /w/ word 0.52 0.28
/u/ <oo> Non-coronals food 0.57 0.58
a
Based on standard spellings at the end or beginning of English words. Proportion of words with indicated vowel and consonant that
have the indicated spelling, minus proportion of words with the vowel and another consonant that have the indicated spelling.
b
Results from experimental trials, measured the same way as the vocabulary statistics
Three of the patterns involved the influence of the /ɛ/ = <ea> before /d/ could be due to a different
onset. For example, we set up stimuli that tested mechanism whereby spellers memorize spellings for
whether people were sensitive to the pattern by which whole rimes (/ɛd/ = <ead>) and onsets. In this experi-
General American /ɑ/ (this would be /ɒ/ in some ment, that possibility is contradicted by the fact that
other accents) is spelt as <a> 86% of the time it the same spellers who drew on the identity of the
appears after /w/ (‘wand’), but only 11% of the coda to spell vowels (intra-rime effects) also took
time after other consonants (not ‘∗fand’ but ‘fond’; into account the identity of the onset. For example,
the difference 0.75 appears in the table). Thus, we they spelt /ɑ/ as <a> the great majority of the time
investigated a wide variety of patterns. after /w/, but comparatively rarely after other conso-
To test whether spellers were sensitive to a par- nants. Thus, a whole-rime spelling strategy appears
ticular pattern, we designed ten pairs of non-words to be not only an insufficient explanation, but also an
like /θεd/ and /θεk/ and asked adults to spell them unnecessary one: the same type of (implicit) aware-
as if they were English. The pairs were split up and ness of conditional patterns that is needed to explain
interleaved with similar questions for the other eight the influence of the onset can be used to explain the
patterns, so that the participants would not deduce influence of the coda.
any patterns from the stimuli in the experiment itself.
In this particular case, we tallied whether they used
<ea> significantly more often in non-words like /θεd/ 3 Conditional Spelling of
different strategies than when writing familiar mind is that we intentionally selected the critical
words. Consequently, we configured another ver- stimuli from the minority of words that go against the
sion of the experiment to use actual words (Treiman general pattern. Even if sensitivity to context leads
et al., 2002). We asked the participants to spell pairs some spellers astray on a few words like ‘wombat’,
like ‘shred’ and ‘fleck’. We then performed error that same sensitivity should help them out on the
analyses to see how often they misspelt the vowel much larger number of words in which /ɑ/ is spelt
as <ea>, testing the hypothesis that such an error <a> after <w>, such as ‘waffle’ and ‘wallow’.
would be more likely in the context (coda /d/) that
is more strongly associated with the /ε/ = <ea> corre-
spondence in English orthography. Other examples 4 Reading Vowels Using
included looking for <ee> in ‘reap’ versus ‘ream’; Conditional Rules
<o> without silent <e> in ‘shoal’ versus ‘croak’; <u>
in ‘swoosh’ versus ‘nook’; <ow> in ‘pronoun’ versus Much more research has studied how people read
‘slouch’; <igh> in ‘contrite’ versus ‘confide’; <a> than how they spell. It has often been reported that
in ‘wombat’ versus ‘possum’; <o> in ‘whir’ versus readers’ pronunciation of vowels varies depend-
‘blurt’; and <oo> in ‘scuba’ versus ‘frugal’. Because ing on the coda (e.g. Johnson and Venezky, 1976;
errors of any type would be more numerous on lon- Glushko, 1979; Ryder and Pearson, 1980; Andrews
ger or less familiar words, the stimuli were matched and Scarratt, 1998). However, the prior literature left
onset conditioning. Table 3 shows the patterns that and because the great majority of contextual effects
were tested, along with their reliability in text, on vowel pronunciation come from within the rime
measured as in Table 2. Most of the patterns were (i.e. the coda), one might expect that people would
quite strong, but not completely exceptionless (e.g. be locked in to look for patterns within rimes, or
‘flange’ is pronounced with /æ/). at least learn and apply them much more robustly.
The last column in Table 3 shows that the readers Instead, it appears that people learn statistical pat-
were sensitive to the patterns. The numbers are the terns wherever they exist.
difference between the proportion of pronunciations
in the conditioning context (e.g. <nge>) that have the
conditioned vowel (/e/) and the proportion of those in 5 Magnitudes and Computer
the non-conditioning context (<nce>) that have that Modelling
vowel. Thus, any positive proportion in this column
shows a trend towards sensitivity to the context; its A curious fact about the experiments with non-word
maximum is 1.0. In all cases, the differences between stimuli is that the participants’ rate of use of statisti-
the two contexts were statistically significant. cal patterns was lower than that found in the gen-
It is important to note that the two sets of items eral vocabulary. For example, while 40% of English
that involved an onset context had just as strong words ending in stressed /ɛd/ are spelt with <ea>, the
a response as the items that had only a coda con- participants in our experiment used <ea> for only
Table 3 Increase in proportion of vowels read with conditioned pronunciation when a conditioning consonant is present
Conditioned
Vowel Context Example Vocabulary Responses
pronunciation
Before coda
<a> /e/ <nge> range 0.95 0.54
<a> /ɔ/ <ld>, <lt> bald 0.79 0.86
<ea> /ɛ/ <d> dread 0.78 0.12
<i> /aɪ/ <nd>, <ld> find 0.76 0.33
<o> /o/ <ld>, <lt> bold 0.96 0.83
<oo> /ʊ/ <k> look 0.96 0.70
After onset
<a> /ɑ/ <u>, <w> wand 0.93 0.58
<a> /ɔ/ <u>, <w>, before <r> warm 0.96 0.16
Numbers are differences between proportions, analogous to those in Table 2.
It is always a challenge to predict how strongly do a good job simulating human reading of vowels
people will react to environmental stimuli, and I will whose pronunciation is conditional, and this is partly
not conclude this section with a bold new model true. Like our research participants, the connection-
that accounts for why the vocabulary numbers are ist models did learn the conditional patterns, and
so much greater than the participants’. However, they did tend to use the conditioned pronunciations
it was instructive to look at computer models of more in the strongly conditioning environments than
reading to see how their numbers compared with in other environments—e.g. they pronounced <a> as
those of the vocabulary and of our participants. We /e/ more before <nge> than before <nce>. They also
examined the output of twelve different simula- matched human performance quite closely on non-
tors and looked for implementation characteristics words that have environments that only weakly con-
that provided the closest match with the human dition pronunciations, e.g. stimuli that end in <nce>,
behaviour (Treiman et al., 2003). with agreement levels as high as 94–96% for some
The most prominent of the current spelling models (those of Plaut et al., 1996; Zorzi et al., 1998;
models is DRC (the dual-route, cascaded, model of Powell et al., 2001; Harm and Seidenberg, 2004).
Coltheart et al., 2001). This model pronounces non- These are all notable achievements. However, the
words by applying a series of letter-to-sound rules connectionist models did not match the magnitudes
that are almost all unconditional. Not surprisingly, of human performance very closely on the strongly
DRC scored 0.00 on all patterns: that is, it was no conditioning environments like <a> before <nge>.
vocabulary, but also on individual letter-to-sound But literacy researchers pay at least as much atten-
rules like <a> = /æ/, because many children are heav- tion to how children read and write words. Children
ily exposed to these unconditional rules in school, who have not yet mastered a writing system are the
and so may tend to over-apply them, therefore best sources of information on how literacy skills are
under-applying conditional rules. Although that is acquired. There is also an important practical benefit
an ingenious idea, it did not have a noticeable effect to understanding the development of orthographic
in bringing down the programs’ rate of application knowledge, in that educators can take into account
to human levels. what sort of information children are capable of
The models also differed in many other details, dealing with at various ages, and in what order they
any of which could account for substantial differences naturally acquire different types of knowledge. See,
among their outputs. Therefore, it is perhaps best to for example, Cook (2004) for an introduction to the
think of their outputs not as rigorous tests of any par- voluminous literature on theories of literacy acqui-
ticular theory of statistical learning, but as sources sition. With respect to the question of conditional
of ideas for future research. Norris’s (1984) way of orthographic patterns, the central concerns here are
incorporating context seems promising. Instead of how old children are when they acquire sensitivity to
learning only the highly discriminating contextual context; at what rate their application of conditional
effects that we tested on—and which most connec- rules increases over the course of their life; whether
tionist models would learn most thoroughly—his they have conditional rules properly speaking, or
the patterns we investigated. They pronounced <a> patterns children are exposed to changes as they
as /ɑ/ more often after <w> and <qu> than after other progress through school. After all, the highly
onset letters; they pronounced <o> as /o/ more often context-dependent patterns we have been investi-
before <ld> and <lt> than before other coda letters gating mostly constitute what are often considered
in the absence of final <e>; and they pronounced irregularities in the orthographic system; perhaps
<oo> as /ʊ/ more often before <k> than before textbooks intentionally hold off on irregular words
other coda letters. At higher reading levels, children until children get older. An examination of the words
were sensitive to all eight patterns. The only excep- that appear in a graded corpus (Zeno et al., 1995)
tion was that the high-school students in our sample showed that this is not the case: The proportion of
showed an atypical reluctance to read <ea> as /ɛ/ such words in books remains remarkably constant
before <d>. from first grade on. Therefore, we cannot hypoth-
As for the magnitude of the effect, first-grade esize that first-graders are less likely to read <ea>
readers already showed an appreciable difference before <d> as /ɛ/ because they are not exposed to
of about fourteen percentage points between how so-called irregular words like ‘head’ and ‘dead’.
often they used a conditioned vowel in condition- Alternatively, perhaps children change their read-
ing versus non-conditioning contexts. As reading ing strategies as they get older or more mature,
grade improved, so did this magnitude. This change gradually becoming more and more convinced
was due both to increases in children’s produc- that context is a good thing. A simpler explana-
candidate that clearly dominates across contexts. of these tasks, children as young as second grade
To put what must be a complex, implicit process were sensitive to some of these patterns, generat-
be spelt <c> but is normally spelt <k> before <e>, before front vowels in many words (‘get’, ‘gem’),
<i>, or <y>. We found that children as young as but it is perhaps surprising that our participants
second grade were more likely to spell /k/ as <k> pronounced <c> as /k/ a substantial number of times
in non-words before those letters than before <a>, when it appeared in non-words before <e> or <i>,
<o>, or <u>. a pattern virtually unknown in English. For exam-
ple, when asked to pronounce monosyllables like
<cersh>, they produced /k/ 16% of the time, but for
9 Consonant Reading <garsh>, nobody used /dʒ/. This imbalance appears
to be another instance of people’s tendency to never
One last question concerns the reading of conso- quite let go of the most common unconditional pro-
nants. Treiman et al. (2007) looked at how condi- nunciation of a letter, even when a specific context
tional rules affect the way adults read onsets. We does not support it. The same type of error has been
were interested in the fact that the pronunciations of reported in the Romance languages, even though
word-initial <c> and <g> are not only conditioned the rules for front and back pronunciations of <c>
by the following vowel, but also by the different and <g> apply in a very large number of words and
subsystems of English orthography. In the terminol- are virtually exceptionless (Content and Peereman,
ogy of Albrow (1972), in System 1, the basic system, 1992, for French; Job et al., 1998, for Italian).
word-initial prevocalic <g> is always /ɡ/, and <c> We also conducted several experiments where
rules discussed up to now have only required the place for increased attention to conditional patterns
reader or speller to consult the adjacent letter or in literacy instruction after all.
sound, this rule requires the reader to evaluate
a variety of clues to guess the appropriate spelling
system. Some of these clues involve looking ahead Funding
quite a few letters: sometimes only the end of a word This work was supported by the (US) National
tells how the initial letter should be pronounced. Institutes of Health [R01HD051610-02].
Lookahead is a challenge for models of reading such
as the DRC (Coltheart et al., 2001), which assume
that processing is strictly sequential.
References
Albrow, K. H. (1972). The English Writing System: Notes
10 Conclusion Toward a Description. London: Longman Publishing
Group.
Although English orthography appears chaotic, there Andrews, S. and Scarratt, D. R. (1998). Rule and
are many conditional rules that significantly increase analogy mechanisms in reading nonwords: hough
dou peapel rede gnew wirds? Journal of Experimental
its consistency. Such rules are somewhat compli-
Psychology: Human Perception and Performance, 24(4):
cated, but a large body of evidence shows that even 1052–86.
Cummings, D. W. (1988). American English Spelling: Norris, D. (1994). A quantitative multiple-levels model
An Informal Description. Baltimore, MD: Johns Hopkins of reading aloud. Journal of Experimental Psychology:
University Press. Human Perception and Performance, 20, 1212–32.
Flexner, S. B. (ed.) (1987). Random House Dictionary, Nusbaum, H. C., Pisoni, D. B., and Davis, C. K. (1984).
2nd edn. New York: Random House. Sizing Up the Hoosier Mental Lexicon: Measuring the
Familiarity of 20,000 Words. Bloomington: Indiana
Glushko, R. J. (1979). The organization and synthesis University Press.
of orthographic knowledge in reading aloud. Journal
of Experimental Psychology: Human Perception and Plaut, D. C. and McClelland, J. L. (1993). Generalization
Performance, 5: 674–91. with componential attractors: word and nonword reading
in an attractor network. In Proceedings of the Fifteenth
Goswami, U. (1993). Phonological skills and learning to Annual Conference of the Cognitive Science Society.
read. In Tallal, P., Galaburda, A. M., Llinas, R. R., and Hillsdale, NJ: Lawrence Erlbaum Associates, pp. 824–9.
von Euler, C. (eds), Temporal Information Processing
Plaut, D. C., McClelland, J. L., Seidenberg, M. S.,
in the Nervous System: Special Reference to Dyslexia
and Patterson, K. E. (1996). Understanding normal
and Dysphasia. New York: New York Academy of
and impaired word reading: computational principles
Sciences, pp. 296–311.
in quasi-regular domains. Psychological Review, 103:
Hanna, P. R., Hanna, J. S., Hodges, R. E., 56–115.
and Rudorf, E. H. (1966). Phoneme–grapheme Powell, D., Plaut, D. C., and Funnell, E. (2001).
Correspondences as Cues to Spelling Improvement. A Developmental Evaluation of the Plaut, McClelland,
Treiman, R., Kessler, B., and Evans, R. (2007). Antici- the International Phonetic Association (1999), and are
patory conditioning of spelling-to-sound translation. enclosed in slash marks. The Appendix gives keywords
Journal of Memory and Language, 56(2): 229–45. illustrating how I applied the symbols of the IPA to
Treiman, R., Kessler, B., Zevin, J. D., Bick, S., and American English.
Davis, M. (2006). Influence of consonantal context
on the reading of vowels: evidence from children.
Journal of Experimental Child Psychology, 93(1): 1–24. Appendix
Treiman, R., Mullennix, J., Bijeljac-Babic, R., and
Table A1 Keywords for use of International Phonetic
Richmond-Welty, E. D. (1995). The special role of
rimes in the description, use, and acquisition of English Alphabet symbols for American English
orthography. Journal of Experimental Psychology: IPA Keyword
General, 124: 107–36.
aɪ ice
Treiman, R. and Zukowki, A. (1988). Units in reading
aʊ out
and spelling. Journal of Memory and Language, 27:
æ add
466–77. ɑ odd
Venezky, R. L. (1970). The Structure of English d dig
Orthography. The Hague: Mouton. dʒ jump
ð this
Venezky, R. L. (1999). The American Way of Spelling: e ace