thresholds may reflect directly on the contribution of audi- bility deficits to general ..... Figure 4 shows that when 70%-80% of the stimulus is calculated to be ...
Audibility
and recognition of stop consonants in normal
and hearing-impaired subjects ChristopherW. Turner and Michael P. Robb CommunicationSciencesand Disorders,SyracuseUniversity,80.5South CrouseAvenue,Syracuse, New York 13244-2280
(Received 13 November 1986;acceptedfor publication27 January 1987)
The purposeof this studywasto examinethe effectof spectral-cueaudibility on the recognitionof stopconsonants in normal-hearingand hearing-impairedadults.Subjects identifiedsix syntheticCV speechtokensin a closed-setresponsetask. Each syllablediffered only in the initial 40-ms consonantportion of the stimulus.In order to relate performanceto spectral-cueaudibility,the initial 40 ms of eachCV were analyzedvia FFT and the resulting spectralarray waspassedthrougha sliding-filtermodelof the humanauditorysystemto accountfor logarithmicrepresentation of frequencyand the summationof stimulusenergy within critical bands.This allowedthe spectraldata to be displayedin comparisonto a subject'ssensitivitythresholds.For normal-hearingsubjects,an orderly functionrelatingthe percentageof audiblestimulusto recognitionperformancewasfound,with perfect discriminationperformanceoccurringwhenthe bulk of the stimulusspectrumwaspresented at suprathreshold levels.For the hearing-impairedsubjects,however,it wasfoundin many instancesthat suprathresholdpresentationof stop-consonant spectralcuesdid not yield recognitionequivalentto that foundfor the normal-hearingsubjects.Theseresults demonstratethat while the audibilityof individualstopconsonants is an importantfactor influencingrecognitionperformancein hearing-impairedsubjects,it is not alwayssufficientto explainthe effectsof sensorineural hearingloss. PACS numbers: 43.66.Sr, 43.70.Jt, 43.71.Es, 43.71.Gv
INTRODUCTION
Although someresearchershave shownthat sensorineuralhearing-loss subjectsexhibitpoorerthan normalrecognitionof stopconsonants (e.g.,Owensetal., 1972),others have not found such differences(e.g., Van Tasell et al., 1982). Instancesin whichstop-consonant recognitionscores for hearing-losssubjectsare comparableto thoseof normalhearingsubjectsgenerallyoccurwhenthe subjects'hearing lossesare mild and the presentationlevelfor the speechmaterials is high, which suggeststhat the audibility of the speechstimuliis an importantfactor determiningsubjects' performance.Consider,for example,speechmaterialsin which stop consonantsare embeddedin a consonantplus vowel (CV) context.The presentationlevelsof the individual stopconsonantsare typically as much as 20-30 dB less than thoseof accompanyingvowels (Fletcher, 1953). Becausespeechpresentationlevelsare typically expressedin terms of the levelsof the intensitypeaks (usually the vowel sounds),it is likely that the lack of audibility of individual stop consonantsfor hearing-impairedsubjectsmay be an overlookedfactor in explainingtheir recognitionperformance.The presentstudyexaminesthe effectsof audibility of individual stop consonantson the recognitionperformanceof normal and hearing-impairedsubjects. Considerableresearcheffortshave beendirectedat sepPortionsofthispaperwerepresented at the 111th MeetingoftheAcoustical Societyof Americain May 1986 [J. Acoust.Soc.Am. Suppl.1 79, S24 (1986)].
1566
J. Acoust. Soc.Am.81 (5),May1987
aratingaudibilityfactorsin speech recognition frompossible effectsof psychoacoustic deficitsthat mayaccompany cochlear hearingloss,suchas abnormalfrequencyresolution, frequencydiscrimination, or temporalacuity.Oneapproach hasbeento comparethe performanceof hearing-losssubjectsagainsttheperformance of normal-hearing subjects listeningto speechmaterialsthat havebeendegraded(filtered or masked) to simulatea sensitivityloss. Severalexperimentsin whichexceptional carewastakento matchthe sensitivitythresholdsof the simulatedhearinglossesin normalhearing subjectsto the individual hearing-losssubjects' thresholdsmay reflectdirectly on the contributionof audibility deficitsto generalspeechrecognition.Milner's (1982) findingssuggestedthat usingmaskingnoiseto simulate hearinglossdid not accuratelymodelcochlearhearingloss; the maskednormal-hearingperformance(in termsof percent correct) was better than that of the hearing-lossears. FabryandVanTasell(1986) simulatedhearinglosses using filteringand maskingnoiseand alsomeasuredperformance in termsof both percentcorrectand error patterns.For onehalf of theirsubjects, bothmaskedandfilteredsimulationsof hearinglosssuccessfully approximatedthe effectsof the cochlearloss.For the remainingsubjects,eitheroneor both of the simulationsyieldedperformancethat wasbetterthan (percentcorrect) or differentfrom (in error patterns)the actual sensorineurallosses.In contrast,Zurek and Delhorne
(1986) showedthat usingmaskingnoisein normal-hearing subjectsto simulatehearinglosssuccessfully approximated the percent-correct performanceof many hearing-loss sub-
0001-4966/87/051566-08500.80 @ 1987Acoustical Society ofAmerica
1566
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 132.181.141.150 On: Mon, 30 Mar 2015 21:39:42
jectsacrossa wide varietyof speech-in-noise conditions.Resuits from these studies do not lead to firm conclusions re-
gardingthe relative contributionof audibility deficitsto speechrecognitionby the hearingimpaired.They do, however,provideevidencethat underconditionsof limited audibility for speechmaterials(resultingeitherfrom an actualor a simulatedhearing loss), the sensitivityloss alone can sometimesaccount for the poorer performanceof some hearing-impairedsubjects. An alternateapproachto separatingthe effectsof sensitivitylossfromotherpsychoacoustic factorsin speech recognitiontaskshasbeento computea measureof audibilityfor the speechmaterialsusedin eachpresentationcondition. Thisaudibilitymeasure isthenusedto comparehearing-loss subjects' performanceto that of normalsat equivalentaudibility levels.Dugal et al. (1980), usingthe articulationindex (AI) (Kryter, 1962), analyzedthe speechrecognitionresuitsof Skinner(1980) in sucha manner.They foundthat, althoughthe AI provideda well-orderedpredictionof hearing-loss results,theperformance of thehearing-loss subjects waspoorerthan that of normalsat equivalentlevelsof audibility. Pavlovic (1984) and Kamm et al. (1985) reached similarconclusions, in that manyhearing-loss subjects performedmorepoorlythan predictedby the normal-hearing assumptions of the AI. In thestudiescomparingdegraded normalperformance
(viamasking orfiltering)tothatofhearing-loss subjects, the comparisons are generallymadeat low levelsof audibility for bothsubjectgroups.Thesestudies,therefore,do not ad-
dressthequestion ofwhether' hearing-loss subjects canaccuratelyrecognize individualstopconsonants at highpresentation levels when the entire consonant is made audible. While
the useof the AI is well suitedto investigatethe relation betweendegreeof audibilityandrecognition performance of many speechmaterials,it wasnot designedto predictthe audibilityof individualstopconsonants. Recallthat the AI calculations are baseduponthe long-termaveragesound levelsof speech,whichare primarilyrepresentative of the moreintensevowelsounds. The conceptof the audibilityof individualstopconsonants hasbeenlargelyignoredin the
nants.In particular,we wereinterested in determining if cochlearhearing-loss subjects with substantial sensitivity losses couldachievehighrecognition performance whenthe stop-consonant stimuliwerepresented at levelsyieldinga highdegreeof audibility.The presentexperiment deriveda measureof the audibilityof short-duration stop-consonant sounds to examinestop-consonant recognition performance as a functionof the audibilityof the individualspeech sounds.
I. METHOD
A. Subjects
Fournormal-hearing andfivesensorineural hearingimpaired adultsubjects participated in theexperiment. The normal-hearing subjects hadpure-tonethresholds of better
than20dBHL (ANSI, 1969)at audiometric testfrequenciesfrom250to 4000Hz. Thehearing-impaired subjects' losses rangedfrommoderate to severe. Sensitivity thresholdsofthefivehearing-impaired subjects areshownin Table I. The thresholds weremeasured usinga four-alternative forced-choice adaptiveprocedurethat converged uponthe 71%-correct level of detection (Levitt, 1971). Several
thresholddeterminations wereaveraged to yieldfinalvalues. All of the hearing-impaired listenerswereregularusersof hearingaids.Only oneearof eachsubjectwastestedand,in the hearing-impairedsubjects,the testear waschosenasthe ear with the most sensitive thresholds. B. Stimuli
Consonant plusvowel(CV) speech tokensweregeneratedvia digitalsimulationof a five-formant terminal-analog speechsynthesizer(Klatt, 1980). The CV syllableswere constructed by pairingthesixstopconsonants/b,d,g,p,t,k/
with the vowel/a/to producetokensof 305 msin duration. We designed the C¾ tokensto differonlyin the initial 40-ms consonant portionofthestimuli.For thelabialstops/p/and /b/, formant-transition startingfrequencies were200, 900, and2000 Hz for F1, F2, andF3, respectively. For the alveolar stops/t/and/d/, the startingfrequencies were400, previous literature, although Kentetal. (1979)didreport 1700,and2800Hz; andfor thevelarstops/k/and/g/, the that in normal-hearing subjects, the recognition of individ- starting frequencieswere 300, 1610, and 1750 Hz. For the ualconsonants in CV contextappeared to showconsiderable unvoicedstops,voicinginitiated40 msfollowingthestimudependence upontheintensityof that particularconsonant. lusonset, whileforthevoicedstops, voicing beganfollowing The presentexperimentexaminedthe relationbetween a 5-ms burst at stimulus onset. Bursts were wideband in stop-consonant recognition by normalandhearing-loss sub- spectrumfor/p/and/b/, centeredat 3850 Hz for/t/and jectsandthe degreeof audibilityof individualstopconso- /d/, andcenteredat 1750Hz for/k/and/g/. Eachstimulus
TABLE I. Sensitivity thresholds (indBSPL) forthefivehearing-impaired subjects. Thesmall letter following each subject's intitials refers totheappropriate
panelin Fig. 7 for that subject's data.
Thresholds (dB SPL)
Subject
Sex
HB (a) MK (b) PD (c) LS (d) BE (e)
M F F M F
1567
0.25
0.50
1.0
1.5
2.0
3.0
4.0(kHz)
31 27 38 62 98
10 29 20 50 98
18 53 36 52 96
33 54 55 60 85
61 58 72 59 67
71 63 73 66 45
74 72 73 74 35
J.A½oust. So½. Am.,Vol.81,No.5,May1987
C.W.Turner andM.P.Robb: Audibility ofstopconsonants 1567
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 132.181.141.150 On: Mon, 30 Mar 2015 21:39:42
ended with a 265-ms vowel (F 1at 720 Hz, F 2 at 1240 Hz, and F 3 at 2500 Hz). Here F 4 and F 5 were setat a constant center
ted. This resultedin a scoringsystemin which a scoreof 0%
frequency(3600 and3850Hz, respectively)throughoutthe
the phonemeand a 100% scorerepresented perfectidentifi-
duration of the stimuli. All speechtokenswere stored on computerdisk, and for presentationwere directedat a 10kHz samplingrate into a low-distortion12-bit digital-to-
cation.
information
transmitted
reflected chance identification
of
In order to predict the audibility of the consonantportion of eachsyllable,an acousticanalysisof the initial portion of each CV stimuluswas obtained and subjectedto a analogconverter(Data Translation3366) followedby a transformationdesignedto simulatehuman auditoryproplow-passfilter (cutofffrequencyof 4800 Hz). erties.Thesepropertiesincluded:( 1) logarithmicfrequency representationand (2) summationof energy within each C. Procedures auditoryfilter or criticalband.This transformationincluded All subjectsweretestedindividuallyin a sound-isolated the followingsteps. booth. Prior to the actual data collection,all subjectsre( 1) A fastFourier transform(FFT) of the acousticsigceiveda minimum of 2 h of practicewith the test stimuli, nal developed in a 6-cm3 couplerfor eachstimulus wasobpresentedat comfortablelisteninglevels.A singleexperitainedusinga model3561 Hewlett-Packard spectrumanamental run consistedof the six CV speechitems presented lyzer. An exponential-decay weightingfunctionwith a time ten times each in randomizedorder. Subjectsidentifiedthe constantof 40 ms, beginningat stimulusonset,was usedto speechitems in a closed-setformat by pressinga response analyzethe initial stop-consonant portion of the CV syllabutton correspondingto that syllable.A 200-ms warning bles.This time window deemphasizedthe later portionsof light precededthe onsetof each stimulusand a response the stimulus(cf. Stevensand Blumstein,1978). Figure 1 is intervalof at least3 s followedeachresponse,servingasthe an exampleof the FFT for the stimulus/da/. The resulting interstimulusinterval. The subjectswereinstructedto guess FFT array isa constantbandwidth(12.5-Hz) analysisof the at the correct answer when they were unsureof their reinitial portionof eachstimulus. sponse. (2) A slidingfilter modelof the humanauditorysystem Data were obtainedfrom subjectsin 5- or 10-dB steps was applied to the FFT of each stimulusitem in order to acrossa wide rangeof presentationlevels,rangingfrom the accountfor the approximatelylogarithmicrepresentation of level at which the subjectobtainedno better than chance frequencyin theauditorysystemandthe summationof stimperformanceon the closed-settask (16.7% correct), to a ulusenergywithin a "critical band" (Green, 1958;Gfissler, maximumof 100dB SPL. The 100-dB-SPLpresentationlev1954). This was done in order to compare this "transel wasthe highestthat mostsubjectsfoundtolerablein this formed" versionof the stimulusto a subject'ssensitivity experiment.This nominal presentationlevel was expressed thresholds.The roex (p,r) exponentialauditoryfilter shape asthermsaverage ofacontinuous vowel,withspectral char- as describedby Pattersonet al. (1982) was usedas an apacteristicsidenticalto the vowelcomprisingthe final 265 ms proximationof the appropriateauditory filter shape.This of each CV stimulus. A subject'sdata were considered filter is describedby a weightingfunction W(g) that operasymptoticand final in a givenconditionwhen the subject atesuponthe stimuluspower;the total powerpassedby the showedno further improvementsin performancewith refilter is describedby: peatedruns,and whensuccessive three-runpercent-correct IV(g) = (l-- r)(l +pg)e -p• + r, averagesdifferedfrom oneanotherby lessthan 10%. D. Data analysis
We wereinterested in contrasting theaudibilityof individualstopconsonants for eachsubjectwith that subject's recognition performance for the samephoneme. Unfortunately,a simplemeasure ofpercent-correct identification for eachofthesixphonemes couldbecontaminated bysubjects' response bias.For example,if a subjectadopteda response strategyof simplyresponding/ba/following all presentations,thepercent-correct scorefor that phoneme wouldbe artificiallyhigh.For this reason,eachsubject's finaldata, based uponanaverage ofthefinal180trialsateachpresenta-
wheregisthenormalizedfrequencydistancefromthecenter of the filterto a signalcomponent, p is a parameterdescribing the in-bandwidthof the filterand r describes the filter "tail." Valuesofp = 25 and r = 0.0001 wereemployedin
1o
dB /OlV
tionlevel,werecastintosixindividual2 X 2 confusionmatri-
ces,contrasting the correctidentification of an individual phoneme versusincorrectidentification of that phoneme (e.g.,/ba/versus not/ba/). These2X 2 matrices werethen corrected for theresultingunequalcellfrequencies andsub- START• 100 Hz Bllh ] 2.5 Hz STOP, 5 100 Hz jectedto a bit analysis asdescribed by Miller and Nicely (1955). For the2 X 2 matrices,themaximumpossible inforFIG. 1.Example ofanFFT performed ontheacoustic stimulus/da/.A 40mationtransmittedwas 1.00bits;we thenexpressed the re-
msexponential timewindowwasusedto reflecttheinitialconsonant por-
suits in terms of relative identification information transmit-
tion of the stimulus.
1568
J. Acoust. Soc. Am., Vol. 81, No. 5, May 1987
C. W. Turner and M.P. Robb:Audibilityof stop consonants
1568
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 132.181.141.150 On: Mon, 30 Mar 2015 21:39:42
II. RESULTS 11o
A. Normal-hearing subjects
1oo 9o
m
7o
•
6o
•
5o 4o
•o 2o 1o i
i
i
i
.5
I
2
•
i
45
FREQUENCY (kHz)
FIG. 2. Sliding-filtertransformation(seetext) of the FFT of the syllable /da/, resultingin an "audibilityspectrum"of the stopconsonant/d/. The presentationlevel was 110 dB SPL.
the presentcalculation,followingPatterson'sdescriptionof thosevaluesasbeingrepresentative for normal-hearingsubjects.The resultingfilterhasanequivalentrectangularbandwidth of 0.16 timesthe filter centerfrequency.The outputs of thisfilterwerecalculatedfor 200 pointsspacedlogarithmicallyin frequencyrangingfrom 100 to 5250 Hz. This resultedin what we havetermedan "audibility spectrum"of an individualstopconsonant.Sucha transformationis similar in principle to that describedby Searle et al. (1979), Carlsonet al. (1979), and Kewley-Port (1983), with the additionalfeaturethat absolutesoundpressurelevelswere determinedfor eachpoint.Figure2 showsan exampleof the audibilityspectrumfor the stopconsonant/d/at a presentation levelof 110dB SPL. It is interestingto notethat for this 110-dB-SPLstimuluspresentation level,the audibilityspectrum of the initial stop-consonant portion lies between70 and 90 dB SPL.
Figure 3 showsthe meanaudiometricthresholdsfor the four normal-hearingsubjects plottedalongwith severalrepresentations of the audibilityspectrumof the stop-consonant /g/at variousnominal presentationlevels.All valuesof the audibility spectrum and the subjects'thresholdswere expressedin termsof levelsdevelopedin a 6-½m 3 acoustic coupler.The percentageof informationtransmittedfor the phoneme,as describedpreviously,is noted to the right of each audibility spectrum.An orderly relation betweenthe degreeof stop-consonant audibilityand recognitionperformance is observed.Essentiallyperfect recognitionperformance occurred when the audibility spectrumfrom 2504000 Hz was at suprathresholdlevels. Note that performancewasessentiallyequalto chance( 1% ) whenthe entire audibility spectrumwasbelowthe subjects'thresholds.Similar resultswereobtainedfrom the normal-hearingsubjects on the other five stop consonants.On the basisof theseresuitsfrom the normal-hearingsubjects,we concludedthat our calculationsof an audibilityspectrumfor eachstopconsonantprovided a meaningfuland orderly expressionof stop-consonantaudibility. In order to condensethe resultsinto a more manageable form, a "single-number"measureof audibilitywascomputed for each audibility spectrumin relation to a subject's thresholds.The percentageof logarithmicallyspacedpoints on the audibility spectrumbetween250 and 4000 Hz that were abovethe subject'sthresholdcurvewas usedas a measure of overall audibility for that stimulus.Portions of the audibility spectrumthat were at low sensationlevelswere assigneda reducedimportance.Each point lying 10 dB or more above the thresholdcurve was given full weighting (i.e., 1.0), while thosepointsbetween0 and 10 dB abovethe thresholdcurve were given interpolatedweightsbetween0
11o lOO
19o/
9o BO
FIG. 3. Audibilityspectrafor /g/
7O
presentedat various levels. The con_J 60
nectedstarsrepresentthe mean au-
diometric thresholds forthegroupof m 5O J w
normal-hearingsubjects.The average percentage information trans-
40
w
J
99%
@ 70dB
97%
@ 50dB
72%
@ 40dB
mittedandthepresentation level(in dB SPL) for thenormalsubjects are shownto therightof eachaudibility
30
spectrum.
2o 10
_
7%
@ 30dB
1%
@ 2OdB
-10 I
.25
.5
1
2
3
4
5
FREDUENCY(kHz)
1569
J. Acoust.Soc.Am.,Vol. 81, No. 5, May 1987
C.W. Turnerand M.P. Robb:Audibilityof stopconsonants
1569
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 132.181.141.150 On: Mon, 30 Mar 2015 21:39:42
•
loo
•
õo
•
7o
•
functionof the meanpercentage of audibilityfor eachof the sixstopconsonants. The functionsaremonotonicandorderly in eachcase.Figure 4 showsthat when70%-80% of the stimulusis calculatedto be audiblefor the normal-hearing subjects,recognitionperformanceis essentially perfect. We should mention that the data were also analyzed usingseveralalternativeschemesfor calculatingstimulus audibility. For example, the present 250-4000-Hz bandwidth wasreducedto 250-3000 Hz, and alternativeweighting schemes rangingfrom a 0-20-dB sensation levelfor the audibilityof a singlespectrumpointwerecalculated.Modificationsof the weightingschemeresultedin shiftsof the functionsin Fig. 4 to the left or the right, but causedno changein the relativeorderingor monotonicityof the functions.Minor variationsin the analysisbandwidthresultedin little changein the form of the PA functions.
eo
•
50
•
•0
•
•o
•
•o
•
o i
o
i
io
i
20
30
PERCENT
40
i
50
60
^UDIBILITY
i
70
80
90
iO0
( % )
FiG. 4. Percentageinformationtransmittedfor the six stop consonants plotted asa functionof phonemeaudibility (seetext) for the normal-hearing subjects.The consonantsdepictedare: (ß)--/b/, ([])--/d/, ( ß )• /g/, (A)•/p/, (ß)•/t/, and (¸)•/k/.
and 1.0. This partial weighting of the low-sensation-level points on the audibility spectrumwas usedin order to accountfor the poorerpsychoacoustical abilitiesthat subjects generallyshowfor low-sensation-level stimuli, and also to reflectthe generalphilosophyof deemphasizing low-sensation-levelportionsof the speechspectrumthat has proven successfulin Articulation Index calculations (Kryter, 1962). The resulting percentageof audible spectra was termed the "percent audibility" of the stop consonantfor eachconditionand subject.Figure 4 displayssix "performance-audibility (PA) functions,"plottingthemeanrecognition performanceof the four normal-hearingsubjectsasa
I
I
I
I
B. Hearing-impaired subjects
Figures5 and 6 are examplesof audibilityspectraplotted in relationto the sensitivitythresholdsfor two hearinglosssubjects.In Fig. 5, the subjecthas a moderatehearing lossand it is evident that when the stop consonant/d/is fully audible, the recognitionperformanceis accurate.In this particularcase,overcomingthe attenuativeeffectof the hearinglossthrough simpleincreasesin presentationlevel restorednormal performance. In Fig. 6, such is not the case,as this subjectfails to achievecompletelyaccuraterecognitionfor the stopconsonant/p/, even when the calculatedaudibility of the stop consonantis complete.We observedboth typesof eventsin our analysisof the fivehearing-loss subjects'data.That is, in somesubjectsfor somephonemes,accuraterecognitionoccurred when the stimuli were made audible, while in other
casesaudibilitywas not sufficientfor accuraterecognition.
I
I
I
11o
/do/ lOO 9o
96%
8o
0100d8-
7o
100% •
co 5o •
i) 85dB
6o
70%
o 70dB-
lO%
i) 55dB
40
FIG. 5. Audibility spectraof/d/at several presentationlevels plotted with the sensitivitythresholdsfor moderatehearing-losssubjectMK. Percentageinformationtransmitted and presentation level (in dB SPL) for each condition are shown to the
right of the curve. 2o lO
-lO i
i
i
i
i
i
.25
.5
1
2
3
4
FREQUENCY(kHz>
1570
J. Acoust. Soc. Am., Vol. 81, No. 5, May 1987
C. W. Turner and M.P. Robb: Audibilityof stop consonants
1570
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 132.181.141.150 On: Mon, 30 Mar 2015 21:39:42
I
I
I
I
I
I
I
11o
/pal lOO
9o
m
8o
7o -
71Z
0100d8
7DY.
0
9DdB
46Z
0
70dB
60
m 50 -•
40
'
•0
--
FIG. 6. Audibility spectraof/p/at several presentationlevels plotted with the sensitivitythresholdsfor moderatehearing-losssubjectPD. Percentageinformationtransmitted and presentationlevel (in dB SPL) for each condition are shown to the
927.
2O
0
right of the curve.
50dB --
10
-10 I
I
I
I
I
I
.25
.5
1
2
•
4
FREOUENCY(kHz)
In order to summarizethe data from the five hearinglosssubjects,PA functionsfor each subjectand phoneme conditionwerecalculated.Thesefunctions,relatingaudibil-
theaudibilityof stopconsonants to recognitionperformance for the normal-hearingsubjects,asillustratedin Figs. 3 and 4. Usingthis samemethodof expressing audibilityfor the ity of thestopconsonant to therecognition performance,hearing-impaired subjects allowedfor comparisons with the were comparedwith the correspondingdata from the nornormalsubjects.We foundthat, in mostcases,comparisons mal-hearingsubjects.Figure 7 displaysthe six PA functions at equivalentlevelsof calculatedaudibilitybetweenthe two from eachhearing-losssubjectin comparisonto the normalgroupsrevealedpoorer performancefor the hearing-imhearing subjects'functions.In general,the PA functions pairedsubjects thanfor thenormals.Thus,in mostcases,the from the hearing-impairedsubjectstendedto lie below the hearing-impaired subjectswereunableto utilizethe audible normal functions,indicatingthat, in mostcases,equivalent portionsof stopconsonants asefficientlyasdid the normals. levelsof stimulusaudibilityfor the two groupsdid not result This evidencesupportsthe existence,and importanceto in equivalentlevelsof recognition.For the four subjectswith speechrecognition,of other psychoacoustical deficitsbesensitivitylossesat high frequencies[ Fig. 7 (a), (b), (c), yond the simplesensitivitylossof a sensorineural hearing and (d) ], only four instanceswere found where their PA impairment.Our results,therefore,supportthe conclusions functions reached the same levels as the normals (/b/for of Dugal et al. (1980), Pavlovic (1984), Kamm subjectHB,/d/for subjectMK, and/k/and/d/for subject et al. (1985), and Fabry and Van Tasell (1986), who found PD). The remaining20 comparisonsrevealedthat equivathat the lack of audibility of speechmaterialsfor hearinglent levelsof audibility for thosesubjectsdid not yield norimpairedsubjects is only partiallysuccessful in accountin. g mal levelsof performance.In contrast,for the low-frequency for their reducedperformance. hearing-loss subjectBE in Figure7 (e), the functionswerein On the other hand, despitehigh nominalpresentation many casescloseto thoseof the normals,althoughthe severe levels,mostof our impairedsubjectsdid not receivefull auhearinglossof this subjectlimited the availableaudibility of dibility of the availablestop-consonant spectralcuesat the the spectralcuesevenat the highestlevels. highesttolerablepresentationlevels.We hypothesizethat It isinstructiveto keepin mindthat the right-mostpoint thisoccurredbecausethe moreintensevowelportionof each on eachhearing-impaired subject'sfunctionin Fig. 7 repre- stimulusservedto limit the acceptablepresentationlevelof sentsthe 100-dB-$PLpresentationlevel.In nearlyeveryinthe CV stimuli for the subjects.This lendssupportto the stance,the stopconsonantdoesnot reach 100% audibility notion that incompleteaudibility of stop consonantsfor for the hearing-impairedsubject.This suggeststhat few hearing-loss subjectscan serveas a limiting factor in their hearing-loss subjects wereeverableto perceivethefull range recognition performance. In manypanelsof Fig. 7, thehearof audiblespectralcuesfor the stopconsonants(audibility ing-impairedsubjects'PA functionswerestill risingas the nearor at 100%), evenat presentation levelsof 100dB $PL. functionapproached the 100-dBlevel,leadingto the speculation that further increasesin the audibilityof the stopconiii. DISCUSSION sonantswould have resultedin better recognitionperforThemeasureof stop-consonant audibilityderivedin this mance. The presentstudyis thereforeunableto offerunqualistudyprovidedanorderlyanddescriptive methodof relating 1571
J. Acoust. Soc. Am., Vol. 81, No. 5, May 1987
C. W. Turner and M.P. Robb: Audibilityof stop consonants
1571
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 132.181.141.150 On: Mon, 30 Mar 2015 21:39:42
(a) H.B.
!b) M.K. lOO
//0
5o
•/ii-i•/, ,,, 100
_,k,
'd'
/
5(]
//,,, 50
100
PERCENT
50
50
100
AUDIBILITY
(
50
100
50
PERCENT
(C) P.D.
o w
100
% )
100
50
AUDIBILITY
(
Z
100
)
(d) L.$.
lOO
100 L'- ß---•---•' ' ','',........
't'
5O
-
z ,
vvv ,
,
,
,
,
,
,
',r ,
,
,
i
i
,
,
,
,
,
I
I
I
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,',
,
,
,
,
,
lOO
50 -•••• 100 •'• %'d' ,• '9' •
5o
50
100 PERCENT
50
100
AUDIBILITY
50
(
100
50
100
% )
50
PERCENT
AUDIBILITY
100
50 (
100
% )
(e) B.E. ,
o
m
z
,
100
50
z
,.,
100
,•-•' ß •'' ] '9' 50
100 PERCENT
50 AUDIBILITY
100
50
100
( % )
FIG. 7. Performance-audibility (PA) functionsfor all fivehearing-impaired subjects.Eachbrokenline represents a hearing-impaired subject'sperformance for a particularstopconsonant,while the solidlinesrepresentthe meanresultsfrom the normal-hearingsubjects'group.The small letter identifyingeach panel [ (a)-(e) ] refersto that subject'ssensitivitythresholdsaslistedin Table I.
fledsupportforor against thehypothesis thataudibilitydeficitsalonecanaccountfor thepoorerrecognitionof stopconsonantsin sensorineuralhearing-losssubjects.Instead,we concludethat both audibilityand otherpsychoacoustic fac1572
J. Acoust. Soc. Am., Vol. 81, No. 5, May 1987
tors are mostlikely responsible. The methodsof the present study might offer a meansof separatingthesefactorsfor individual stop-consonant sounds. There are, however,somecriticismsthat may be directC.W. Turner and M.P. Robb: Audibilityof stop consonants
1572
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 132.181.141.150 On: Mon, 30 Mar 2015 21:39:42
edat theuseof thepresentmethodof analyzingthesedata. There is evidencethat cochlearhearing-loss subjectsoften have critical bandwidths that are wider than normal (e.g.,
Margolisand Goldberg,1980;Tyler et al., 1982).These wider bandwidths(or filters) in the hearing-impairedsub-
jectswerenotincorporated intothepresent model,although if theywere,thecalculated audibilityof thestimulifor the hearing-impaired subjects wouldhavebeengreaterthanthat reported here.In thiscase,ourconclusion thatmanyhearing-impaired subjects do not recognize stopconsonants as wellasnormals,whencompared at equivalent levelsof stimulusaudibility,wouldremainunchanged. It might,however,tendto argueagainsttheconclusion that stopconsonants tend to be inaudible for many hearing-impaired
subjects, evenat highnominalpresentation levels. Another considerationin the data analysisis that the
frequency rangeusedto compute audibilityof thestimuli (250-4000Hz) mayhavebeeninappropriate andshouldbe modified.Dubnoet al. (1986) havepresented evidencethat therecognition of individualstopconsonants is dependent upondifferent frequency regions for particularconsonants. We notedearlier,however,thatsimplychanging thegeneral bandwidthof the analysis did notaffectthetrendsobserved in thedatatoanygreatextent.A relatedapproach wouldbe to calculate an importance function,or information-by-frequencyweighting,for the presentstimulusset.Theseapproaches mayperhaps yielddifferentconclusions for some of the questions of the presentstudyand deservefurther investigation. Asforchoosing different frequency regions or frequency-importance weightings for theanalysis of differentindividualstopconsonants, wesuggest thatsuchmodificationsof the presentschememaybe helpfulin futureexperiments that seek to identify specific acoustic characteristics or frequency regionsthat areresponsible for the identificationof specificphonemes. ACKNOWLEDGMENTS
Thisresearchwassupportedin partby a grantfrom the Deafness Research Foundation. Thanks are due to James
Hillenbrand for assistancein constructingthe synthetic
speech stimuliandto DianneVanTasellfor severalhelpful discussions regardingthe scoringof speechrecognitionresultsthat werehelpfulin our dataanalysis.EdwardConture
providedhelpfulcommentson an earlierversionof this manuscript. We alsothankLenoreHolteandSwatiLotlikar for assistance in collectingthe data.
relative perceptual saliency of selectedacoustic manipulations,"J. Acoust. Soc.Am. Suppl. 1 65, S6. Dubno,J. R., Schaefer,A. E., and Dirks, D. D. (1986). "Critical frequency regionsfor consonantplaceperceptionby normal-hearingand hearingimpaired listeners,"presentedat the Ninth Midwinter Meeting of the Associationfor Researchin Otolaryngology,ClearwaterBeach,FL. Dugal, R. L., Braida, L. D., and Durlach, N. I. (1980). "Implicationsof previousresearchfor the selectionof frequency-gain characteristics," in AcousticalFactorsAffectingHearing Aid Performance,edited by G. A. Studebakerand I. Hochberg (University Park, Baltimore, MD). Fabry, D. A., and Van Tasell, D. J. (1986). "Masked and filtered simulation of hearingloss:Effectson consonantrecognition,"J. SpeechHear. Res. 29, 170-178.
Fletcher, H. (1953). Speechand Hearing in Communication(Van Nostrand, New York).
G/issler, G. (1954).0ber dieH/3rschwelle f/ir Schallereignisse mit Verscheidenbreitem Frequenzspektrum,"Acustica 4, 408-414. Green, D. M. (1958). "Detection of multiple componentsignalsin noise," J. Acoust. Soc. Am. 30, 904-911.
Kamm, C. A., Dirks, D. D., and Bell, T. S. (1985). "Speechrecognitionand the Articulation Index for normal and hearing-impairedlisteners,"J. Acoust. Soc. Am. 77, 281-288.
Kent, R. D., Wiley, T. L., and Strennan, M. L. (1979). "Consonant discriminationasa functionof presentationlevel," Audiology18, 212-224. Kewley-Port,D. (1983). "Time-varyingfeaturesascorrelatesof placeand articulationin stopconsonants,"J. Acoust. Soc.Am. 73, 322-335. Klatt, D. H. (1980). "Softwarefor a cascade/parallelformant synthesizer," J. Acoust. Soc. Am. 67, 971-995.
Kryter, K. D. (1962). "Methods for the calculationand useof the Articulation Index," J. Acoust. Soc. Am. 34, 1689-1697. Levitt, H. (1971). "Transformedup-down methodsin psychoacoustics," J. Acoust. Soc. Am. 49, 467-477.
Margolis,R. H., and Goldberg,S. M. (1980). "Auditory frequencyselectivity in normaland presbycusic subjects,"J. SpeechHear. Res.23, 603613.
Miller, G. A., and Nicely, P. E. (1955). "An analysisof perceptualconfusionsamong someEnglish consonants,"J. Acoust. Soc. Am. 27, 338352.
Milner, P. (1982). "Perceptionof filteredspeechby hearing-impairedlistenersand by normally hearinglistenerswith simulatedhearingloss," unpublisheddoctoraldissertation,City Universityof New York. Owens,E., Benedict,M., and Schubert,E. (1972). "Consonantphonemic errorsassociated with pure-toneconfigurationsand certainkindsof hearing impairment," J. SpeechHearing Res. 15, 308-322. Patterson,R. D., Nimmo-Smith, I., Weber, D. L.; and Milroy, R. (1982). "The deteriorationof hearingwith age:Frequencyselectivity,the critical ration, the audiogram,and speechthreshold,"J. Acoust. Soc.Am. 72, 1788-1803.
Pavlovic,C. V. (1984). "Use of the Articulation Index for assessing residual auditoryfunctionin listenerswith sensorineural heatingimpairment," J. Acoust. Soc. Am. 75, 1253-1258.
Searle,C. L., Jacobsen,J. F., and Rayment, S. G. (1979). "Stop-consonant discriminationbasedupon human audition," J. Acoust. Soc. Am. 65, 799-809.
Skinner,M. W. (1980). "Speechintelligibilityin noise-inducedhearingloss:Effectsof high-frequencycompensation,"J. Acoust. Soc.Am. 67, 306-317.
Stevens,K. N., and Blumstein,S. E. (1978). "Invariant cuesfor place of articulationin stopconsonants,"J. Acoust.Soc.Am. 64, 1358-1368. Tyler, R., Fernandes,M. A., and Wood, E. J. (1982). "Masking of pure tones by broadbandnoise in cochlear-impairedlisteners,"J. Speech Hear. Res. 25, 117-123.
ANSI (1969). ANSI S3.6-1969(R1973), "American National Standard
Specifications for Audiometers" (Am. Natl. Stand.Inst.,New York). Carlson,R., Granstrom,B., andKlatt, D. (1979). "Vowelperception:The
1573
J. Acoust.Soc.Am.,Vol.81, No. 5, May 1987
Van Tasell, D. J., Hagen, L., Koblas, L., and Penner,S. (1982). "Perception of short-termspectralcuesfor stop-consonant placeby normaland hearing-impaired subjects,"J. Acoust.Soc.Am. 72, 1771-1780. Zurek, P.M., and Delhorne, L. A. (1986). "Speechreceptionin noiseby hearing-impairedlisteners,"J. Acoust.Soc.Am. Suppl. 1 79, S23.
C.W. Turnerand M.P. Robb:Audibility of stopconsonants
1573
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 132.181.141.150 On: Mon, 30 Mar 2015 21:39:42