Intelligibility in Speaking
Intelligibility in Speaking
Intelligibility in Speaking
33
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
34 A Framework for the Teaching of Spoken Language
desirable goal, and it shows that they understand that any deviation
from that norm can mark them as being different or as being hard to
understand. Their desire assumes a dichotomous state of affairs: native
(and therefore unaccented and easy to understand) or nonnative (and
therefore accented and harder to understand). Even though it is
unlikely that they will ever reach their goal, their desire comes from
a noble motive in that they want their spoken language to facilitate
their communication. Not being language teachers, they do not realize
that reaching a good-enough pronunciation is neither a sign of weak-
ness nor of low standards, but is instead all that is really necessary (or
all that is really possible, in most cases).
Unfortunately, few teachers or students have the time, aptitude, or
the age necessary to achieve the kind of mastery needed for native-like
pronunciation. An intelligibility-based approach, in contrast, requires
prioritizing what is taught. Teaching to achieve intelligibility is chal-
lenging precisely because of this prioritized approach to errors. Such
an approach is based on the assumption that some errors have a
relatively large effect on understanding and others a relatively small
effect. Judy Gilbert (personal communication) talks about priorities in
terms of a battlefield medical image – triage. When faced with many
people who are wounded, medics must prioritize treatment. Applied to
pronunciation, the image suggests that certain errors (injuries to com-
munication, to follow the metaphor) should be treated first because
they are more likely to harm communication than are others. Other
injuries to communication are far less problematic, and neither listen-
ers nor speakers will be harmed by lack of accuracy in such cases.
What such an approach should look like in the classroom is not
clear, however, partly because proposals based on the Intelligibility
Principle conflict, and partly because the Nativeness Principle
continues to strongly influence classroom practice and teachers’ atti-
tudes. Also, the Intelligibility Principle must be context-sensitive and
connected to both speaking and listening – speakers need to be intelli-
gible to listeners, and listeners need to be able to understand speakers.
So decisions about priorities not only involve helping learners produce
speech in an accessible way for listeners, but also involve teaching
those same learners to understand the speech they hear.
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Setting Priorities 35
English pronunciation
Rhythm Intonation
Segmentals Word Stress
Rate Loudness
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
36 A Framework for the Teaching of Spoken Language
and stress. The rhythm of the phrase level and the stress patterns of
words affect the ways sounds change in connected speech (e.g., the
phoneme /t/ as realized in city, nature, button, and “can’t you’ ” are
not the phonetic sound [t] in North American English). Pronunciation
of the individual segments, in other words, is dependent upon where
they occur in words, which are in turn affected by where they occur
within a phrase. Prominence occurs on syllables that are emphasized
through a combination of pitch, syllable duration (a rhythmic feature),
and loudness. Prominence typically (but not always) occurs on the
stressed syllable of the prominent word, connecting prominence to
word stress. Prominence results in segments that are pronounced with
particular clarity and precision. The prominent syllable in a phrase is
often marked by pitch, and is the beginning of the final pitch contour
in the phrase – that is, its intonation.
Word-Level Features
Word-level features include segmentals (vowels and consonants) and
word stress (Figure 2.2). They also include consonant clusters, a type
of segmental which may, when mispronounced, have an effect on
syllable structure. This introduction is provided to define how
segmentals and word stress are related to intelligibility and
comprehensibility.
Segmentals include approximately forty phonemes for most
varieties of English (around twenty-four consonant phonemes and
fourteen or more vowel phonemes). The phonemes mask the number
of sounds that English uses, since phonemes often have multiple well-
known allophones that are important for pronunciation teaching. For
example, English regularly employs a glottal stop ([ʔ]) before vowel
initial words (e.g., I ! [ʔaɪ]) or as an allophone for /t/ before a final
nasal (e.g., button –[bʌʔn̩ ]). Other well-known allophones include
Word-level
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Setting Priorities 37
Discourse-Level Features
Discourse-level features (Figure 2.3) include suprasegmental features
that carry categorical (i.e., phonological) meaning differences. In rela-
tion to how listeners may (mis)understand speakers, these supraseg-
mentals are not likely to cause listeners to misunderstand individual
words (making words unintelligible) but are likely to cause listeners to
process meaning with greater difficulty (making speech more effortful
to understand). Another suprasegmental, word stress, as discussed
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
38 A Framework for the Teaching of Spoken Language
Discourse-level
Rhythm Intonation
(2.1)
JOHN has CLIMBED the TREE to GET the CAT that’s been STUCK for a TIME.
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Setting Priorities 39
In English, rhythm and word stress are similar. The discourse level
for rhythm in many ways mirrors the word-level rhythm of lexical
stress. A major difference is that word stress typically is limited to
multi-syllabic words, whereas rhythm includes stress for single-
syllable words. In English, for example, content words (e.g., nouns,
verbs, adjectives, adverbs, negatives), including those of one syllable,
are normally stressed in discourse. Single-syllable function words (e.g.,
prepositions, auxiliary verbs, pronouns, determiners) are typically
unstressed in discourse. Many single-syllable function words are also
among the most frequent words in English, helping to contribute to
the perception of stress timing.
Intonation, the second suprasegmental, includes at least three dis-
tinct ways in which meaning is communicated: prominence, tune, and
range. The example in (2.2) illustrates these three. For context,
imagine that the sentence is spoken in the middle of a lecture.
(2.2)
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
40 A Framework for the Teaching of Spoken Language
Related Areas
Comprehensibility and intelligibility are not only associated with
pronunciation, but also with other characteristics of spoken language
(Figure 2.4) that have an indirect connection to pronunciation. These
areas include fluency (Derwing, Munro, & Thomson, 2007), speech
rate (Kang, 2010), loudness (not typically addressed for L2 pronunci-
ation research), and voice quality (Esling & Wong, 1983; Ladd, Silver-
man, Tolkmitt, Bergmann, & Scherer, 1985; Munro, Derwing, &
Burgess, 2010). Generally, this book will not address these character-
istics in detail, because other than fluency and speech rate (which is a
component of fluency), these things are more idiosyncratic than the
other features.
In particular, research on fluency and speech rate seems to have
significant effects on judgments of comprehensibility and accented-
ness. Pronunciation research on voice quality and its inclusion in
teaching materials has never been common, despite its seeming prom-
ise in pedagogy (Jones & Evans, 1995; Pennington, 1989). Loudness is
an especially important issue in regard to hearing loss, hearing in
noise, and the intelligibility of speech for those with cochlear implants.
Anecdotally, some L2 learners can become more intelligible simply by
speaking at a volume more appropriate to the context (e.g., in a large
classroom), but this has not been typically considered important for an
L2 pronunciation syllabus.
Related areas
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Setting Priorities 41
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
42 A Framework for the Teaching of Spoken Language
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Setting Priorities 43
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
44 A Framework for the Teaching of Spoken Language
contention can be seen from the work of Derwing, Munro, and Wiebe
(1998), who found that listeners rated L2 learners’ comprehensibility
higher after a pronunciation course focusing on global skills and
suprasegmentals than after a course focusing on segmental improve-
ment. Similar results for other groups of learners can be found in the
work of Gordon and Darcy (2016).
McNerney and Mendelsohn (1992) argue for suprasegmentals to be
given top priority in pronunciation instruction. Suprasegmentals con-
trol how information is related, and they also are said to have a special
role in conveying attitudinal meaning (cf. Pike, 1945). McNerney and
Mendelsohn say that “a short-term pronunciation course should focus
first and foremost on suprasegmentals as they have the greatest impact
on the comprehensibility of learners’ English” and because through
suprasegmentals “greater change can be effected in a short time”
(1992, p. 186), although they provide no empirical evidence for their
confident assertion. Suprasegmental features include:
• word stress and rhythm;
• major sentence stress (or focus);
• intonation;
• linking and pausing;
• palatalization in rhythmically related contexts such as “can’t you”
and “did you.”
In another attempt to specify priorities for a short pronunciation
course, Henderson (2008) identified pacing (stressed words per
minute), speech rate (syllables per second), and word stress as import-
ant in promoting more comprehensible speech in spontaneous and
prepared speaking tasks. The author argued that these three areas
were most amenable to changes in the short term, and that learners
of English in this university-level planned speaking course were most
likely to be successful by learning to vary their pacing and speech rate.
Word stress, which was asserted to be important in promoting under-
standing, was also presented as a feature that may not be easy to
change in the short run. It may be significant that both Isaacs and
Trofimovich (2012) and Henderson (2008) studied French learners of
English, a group for whom word stress may be particularly important.
Gilbert (2001) suggests priorities for beginning learners based on
her experience and repeated attempts to distill that experience into
pronunciation features that are likely to be learnable and to make a
difference in comprehensibility. She lists the following as essential:
sound–symbol correspondences for the spelling of key vowel sounds,
consonant sounds (mostly final) that serve as signals of grammatical
meaning, linking between words, epenthesis and deletion of syllables,
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Setting Priorities 45
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
46 A Framework for the Teaching of Spoken Language
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Setting Priorities 47
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
48 A Framework for the Teaching of Spoken Language
notorious stickler for teaching only those things that can be learned,
provides as priorities for beginning learners: linking, word stress, and
distinguishing strong and weak syllables.
What does it mean for a feature to be teachable? Walker (2010) says
this about nuclear stress placement:
[It] is teachable in the sense that the rules are simple enough for learners to
master in the classroom, although for some learners there may be a noticeable
gap between receptive and productive competence. As a result, our primary
aim in the classroom will be to make learners aware of the existence and
importance of nuclear stress. This should make them more sensitive to its use
by other speakers, and consequently more likely to acquire competence in
its use. (Walker, 2010, p. 64)
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Setting Priorities 49
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Table 2.1 Pronunciation priority recommendations from various authors
50
Study Recommended targets Recommended for exclusion Source of evidence
Studies related to ESL/EFL contexts
Kenworthy Sound substitutions, deletions, and Reasoning based on experience
(1987) additions;
linking; word stress; rhythm; intonation
Jenner (1989) Vowel length; most consonants; syllable Vowel quality; [ɫ] Reasoning based on features shared
structure; stress-based rhythm; by most NS varieties
prominence; movements of pitch
Brown (1991) High functional load contrasts, e.g., /p, b/, Low functional load Functional load calculations based
/p, f/, /l, r/, /l, n/, /æ, ɛ/ contrasts, e.g., /f, θ/, /ð, d/, on minimal pair frequency
/ʤ, j/, /u, ʊ/ modified by other criteria
Firth (1992) In descending order: None specified Based on a “Zoom Principle,”
general speaking abilities; intonation; a pedagogical approach that
stress/rhythm; consonants and vowels prioritizes general speaking
habits over phonetic details
Isaacs and Word stress; lexical richness; grammatical Pitch range Based on correlations between
Trofimovich control; use of discourse features; scalar comprehensibility ratings
(2012) fluency and careful quantitative analysis,
informed by teacher’s verbal
protocols (one L1 only)
McNerney and Word-level stress/unstress; sentence-level None specified Reasoning based on the belief that
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
Gilbert (2001) Key vowel sound/spelling Priorities for beginning learners
correspondences; final consonants based on experience as a teacher
signaling grammatical meaning; linking; and textbook writer
word stress; strong and weak syllables;
emphasis (prominence)
Morley (1991) An expanded domain for pronunciation, None specified Reasoning based upon the asserted
including (in no particular order): need for pronunciation to take on
communication skills; suprasegmental; expanded roles in the language
segmentals; voice quality; body classroom
language; greater learner and teacher
involvement in developing self-
monitoring skills; contextualization;
linking; greater attention to sound–
spelling relationships; attention to
individual differences among
learners
Henderson Pacing of speech; rate of speech; word A review of principles put forth by
(2008) stress other writers. The choice of
features for the short course are
not clearly justified
Studies related to English as an international language (EIL)/ELF contexts
Jenkins (2000) Most consonant sounds; some consonant Interdental fricatives; [ɫ]; Forty errors in NNS–NNS
cluster simplifications involving consonant cluster interaction, twenty-seven of
51
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
Table 2.1 (cont.)
52
Study Recommended targets Recommended for exclusion Source of evidence
Walker (2010) Same as Jenkins (2000), including rhotic Same as Jenkins (2000) with Jenkins’ (2000) findings modified by
[ɹ] in all positions; intervocalic [t] rather some modifications for trying to implement the LFC.
than flap in city, beauty; word stress as a specific language groups Other research findings also
basis for nuclear stress; weak forms and consulted
vowel reduction for receptive
competence
Deterding and No priorities given A descriptive study of the features
Kirkpatrick that may be part of an emerging
(2006) South-East Asian variety of
English
Kirkpatrick No priorities given A study of the intelligibility of Hong
et al. (2008) Kong English to listeners in
Singapore and Australia. Hong
Kong English speakers were
generally highly intelligible, but
high intelligibility did not
guarantee perceptions of
likeability or intelligence
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
54 A Framework for the Teaching of Spoken Language
other sounds not only based on her evidence but also on her view of
English’s role in the world. This indicates that decisions about prior-
ities must be made not only on explicit evidence, but on how implicit
evidence is interpreted regarding pronunciation’s role in communica-
tive success or failure in particular communicative contexts.
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Setting Priorities 55
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
56 A Framework for the Teaching of Spoken Language
Overall, their markers were more restricted in range than for NTAs
and included innovations that were not likely to be understood easily
by NS interlocutors. At the level of grammatical competence, lexico-
grammatical features may hamper ITAs’ ability to communicate infor-
mation clearly. Tyler, Jefferies, and Davies (1988) found that NTAs
often used strategies to focus listener attention on information to be
foregrounded and backgrounded, but ITAs did not.
In addition, even speech that is completely intelligible may be heard
as heavily accented or take more effort to understand (Munro &
Derwing, 1995). Expectations and implicit stereotypes may affect
how well listeners understand a speaker, despite the speaker being
intelligible. Rubin (1992) played a short lecture given by a female
speaker of General American English to undergraduate students under
two guises: a Caucasian guise and an Asian guise. In the study, some
listeners heard the lecture while looking at a picture of a blond Cauca-
sian woman, while other listeners heard the same lecture (spoken by
the same voice) while looking at a picture of an equally attractive Asian
woman. When asked to demonstrate their understanding of the lecture,
the listeners who heard the lecture in the Asian guise understood
significantly less well than those who heard the lecture in the Caucasian
guise. Comprehension was measured via a cloze of the passage with
every seventh word deleted. In addition, listeners completed a semantic
differential instrument with scales measuring their attitudes, issues
related to background, values, and appearance, as well as items related
to accent, ethnicity, and teaching qualifications. While the Munro and
Derwing (1995) study demonstrated that listeners can decode speech
with 100 percent intelligibility yet find it heavily accented, this study
suggests that lack of understanding can be affected by seemingly unre-
lated nonlanguage factors, in this case the unconscious biases that
listeners bring with them to interactions. More recent research suggests
that this bias may also be connected to congruence between the visual
and the aural. McGowan (2015) used transcription accuracy in noise
to examine whether listeners would be more accurate in transcribing
Chinese-accented speech when presented with a Chinese face, a Cauca-
sian face, or an unspecified silhouette. Listeners transcribed more suc-
cessfully when presented with a congruent face (a Chinese face), a
finding that was consistent despite differences in experience with
listening to Chinese-accented English.
Conclusion
Many recommendations about priorities for pronunciation teaching
are painted in broad brushstrokes that probably mask distinctions
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Setting Priorities 57
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005
Downloaded from https://www.cambridge.org/core. University of Western Ontario, on 24 Sep 2018 at 02:51:23, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/9781108241564.005