Hca PDF
Hca PDF
Sign Up
Or visit LWW.com
HANDBOOK OF
CLINICAL
AUDIOLOGY
SEVENTH EDITION
XXXDBNCPEJBNFECMPHTQPUDPN]#FTU.FEJDBM#PPLT]$IZ:POH]$SFEJU4503.
HANDBOOK OF
CLINICAL
AUDIOLOGY
SEVENTH EDITION
EDITOR-IN-CHIEF EDITORS
7th edition
Copyright © 2009, 2001 Lippincott Williams & Wilkins. Copyright © 1993, 1985, 1978, 1972 Williams & Wilkins.
Two Commerce Square
2001 Market Street
Philadelphia, PA 19103 USA
LWW.com
Not authorised for sale in United States, Canada, Australia, New Zealand, Puerto Rico, or U.S. Virgin Islands.
All rights reserved. This book is protected by copyright. No part of this book may be reproduced or transmit-
ted in any form or by any means, including as photocopies or scanned-in or other electronic copies, or utilized
by any information storage and retrieval system without written permission from the copyright owner, except
for brief quotations embodied in critical articles and reviews. Materials appearing in this book prepared by
individuals as part of their official duties as U.S. government employees
are not covered by the above-mentioned copyright. To request permission, please contact Wolters
Kluwer Health at Two Commerce Square, 2001 Market Street, Philadelphia, PA 19103, via email at
[email protected], or via our website at lww.com (products and services).
987654321
Printed in China
Handbook of clinical audiology / editor-in-chief, Jack Katz ; editors, Marshall Chasin, Kristina English,
Linda J. Hood, Kim L. Tillery. – Seventh edition.
p. ; cm.
Includes bibliographical references.
ISBN 978-1-4511-9163-9
I. Katz, Jack, editor. II. Chasin, Marshall, editor. III. English, Kristina M., 1951- editor. IV. Hood, Linda J.,
editor. V. Tillery, Kim L., editor.
[DNLM: 1. Hearing Disorders. 2. Hearing–physiology. WV 270]
RF291
617.8–dc23
2014014240
Care has been taken to confirm the accuracy of the information presented and to describe generally accepted
practices. However, the authors, editors, and publisher are not responsible for errors or omissions or for any
consequences from application of the information in this book and make no warranty, expressed or implied,
with respect to the currency, completeness, or accuracy of the contents of the publication. Application of this
information in a particular situation remains the professional responsibility of the practitioner; the clinical
treatments described and recommended may not be considered absolute and universal recommendations.
The authors, editors, and publisher have exerted every effort to ensure that drug selection and dosage set
forth in this text are in accordance with the current recommendations and practice at the time of publication.
However, in view of ongoing research, changes in government regulations, and the constant flow of informa-
tion relating to drug therapy and drug reactions, the reader is urged to check the package insert for each drug
for any change in indications and dosage and for added warnings and precautions. This is particularly impor-
tant when the recommended agent is a new or infrequently employed drug.
Some drugs and medical devices presented in this publication have Food and Drug Administration (FDA)
clearance for limited use in restricted research settings. It is the responsibility of the health care provider to
ascertain the FDA status of each drug or device planned for use in his or her clinical practice.
LWW.com
CO N T R I B U TO R S
v
vi Contributors
The Seventh Edition of the Handbook of Clinical Audiol- tists (both of whom he considered as audiologists) to learn
ogy is Dedicated to Raymond Carhart (192?–1975), who is about the hearing process and develop ways to help persons
recognized as the “Father of Audiology.” He talked about the living with hearing loss.
shared responsibility of the clinician and the hearing scien-
ix
FO R EWO R D
It is fitting that this book is dedicated to Dr. Raymond Four of those dimensions were (1) sensitivity (how faintly
Carhart. He stated in a 1976 interview that he conceived of can one hear); (2) clarity of sound (speech or otherwise)
an audiologist “as someone who has a prime commitment in quiet; (3) clarity in noise; and (4) tolerance (how loud
to learning about hearing and its processes as well as a com- can sound be without becoming a problem). Two of these
mitment to understanding and coping with its problems.” dimensions (sensitivity and clarity), which were detailed by
He talked about the shared responsibility of the clinician Carhart in a 1951 paper, became the basis of Plomp’s (1978)
and the hearing scientists (both of whom he considered as two-component model of hearing loss involving audibility
audiologists) to learn about the hearing process and ways and distortion. Carhart said there were many more dimen-
to help the persons with hearing impairment. The seventh sions to hearing, but those four should always be measured
edition of Handbook of Clinical Audiology book strives to do when fitting hearing aids and working with patients with
that, as have the previous editions. hearing loss.
Carhart has been referred to as the “Father of Audiol- Although Carhart worked as a speech scientist, a clini-
ogy”—or sometimes the “Grandfather of Audiology.” Per- cian (in speech and in hearing), a researcher in speech and
haps it would be most appropriate to call him the “Grand especially in hearing, his primary contribution is probably as
Father of Audiology.” a teacher and educator. An educator can be described as one
Although he came to the field somewhat indirectly, his who conveys learning in which the knowledge, skills, and
contributions were enormous. habits of a group of people are transferred from one genera-
Dr. Carhart was born in Mexico City. He received his tion to the next through teaching, training, or research, and
Bachelor’s degree from Dakota Wesleyan University in 1932 that certainly describes Dr. Carhart.
in speech pathology and psychology; his Master’s and Ph.D. In his capacity as an educator, Carhart directed some 35
degrees from Northwestern in 1934 and 1936, respectively, dissertations, beginning in 1946 with a study by John Keys
in Speech Pathology, Experimental Phonetics, and Psychol- entitled “Comparative Threshold Acuity of Monaural and
ogy. He was an instructor in speech reeducation at North- Binaural Hearing for Pure Tone and Speech as Exhibited by
western from 1936 to 1940 and then an assistant, and associ- Normal and Hard of Hearing.” Although his primary inter-
ate professor in 1943 in speech science. est was in speech and speech understanding, the disserta-
Although Carhart initially worked in speech science, he tions he directed covered a range of hearing problems from
was asked to replace C.C. Bunch following Bunch’s untimely difference limens, to effects of surgery and specific diseases
death in June, 1942. Carhart then began to teach Bunch’s on hearing, to auditory fatigue, loudness and many more
courses in hearing and became so interested in the problems topic areas. In addition, as an educator he taught some of
that, as he said, “I’ve been working with them ever since.” the leaders in the field of audiology like James Jerger, Don-
In 1943, Carhart joined the Medical Administrative ald Dirks, Cornelius Goetzinger, Jack Willeford, and many
Corps, U S Army, as a captain, he was assigned to DeShon more. Many of those went on to teach, and to educate other
Hospital in Butler, Pennsylvania as Director of the Acous- students in audiology programs at our most prestigious
tic Clinic and as Acoustic Physicist where he was asked to universities.
develop a program for veterans who had lost their hearing In 1949, he directed the dissertation of Miriam Pauls
during the war. In that capacity he contacted the scien- Hardy, who may have been the first female to graduate with
tists at the Psycho-Acoustic Laboratory (PAL) at Harvard, a Ph.D. in audiology. Unlike some of the professors of the
who, among other things, had come up with word lists that time, Dr. Carhart did not discriminate on the basis of gen-
might be used in evaluating a person’s ability to understand der. He believed that it was the mind—not the gender—that
speech. He also developed a fairly comprehensive rehabilita- was important. He did, however, believe that one should do
tion program that involved selecting and fitting hearing aids the work, not just talk the talk. He set an example in that one
(which were not so complex or elaborate as they are today), often found him in his office or laboratory in the evening
and teaching the soldiers and veterans how to use them. and on weekends.
When Carhart returned to Northwestern in 1946, he His early research interests at Northwestern were in con-
convinced the dean to establish an academic program in ductive hearing loss (the “Carhart notch,” which can be an
Audiology, which was the name that Dr. Norton Canfield indicator of possible otosclerosis was named for him), includ-
chose for the department at DeShon. He became the first ing a method of checking the accuracy of bone conduction
professor of Audiology at Northwestern. measurements before there was an artificial mastoid, let alone
Carhart later said (in class if not in print) that sound— an ANSI standard. He was interested in masking (forward,
and especially speech—was comprised of many dimensions. backward, and perceptual, which we now call informational
xi
xii Foreword
masking), and did much to enhance our understanding of the We dedicate this seventh edition to Dr. Carhart, because
way speech is processed by the damaged ear. like him the Handbook of Clinical Audiology has educated so
Before there were computers in our clinics and most many leaders of our field and has inspired countless audiol-
academic research laboratories, he developed a key-sort sys- ogists throughout the world. This edition of the Handbook
tem as a way of classifying audiograms so that one could will provide a broad perspective of the field of audiology
better interpret the puretone audiogram. by nearly 100 contributing experts in the field. They offer
Finally, Carhart believed that audiology was more their knowledge, wisdom, and enthusiasm to help another
than a clinical field and that the clinicians who practiced it generation of audiologists to fulfill their mission.
should continue to explore and research the ways in which
we hear and how to improve the lives of those who do not
hear normally. REFERENCES
Raymond Carhart died at his desk in October 1975, Carhart R. (1951). Basic principles of speech audiometry. Acta
leaving behind a legacy to the academic discipline of audiol- Otolaryngol. 40:62–71.
ogy, the numerous leaders in the field of audiology whom he Plomp R. (1978). Auditory handicap of hearing impairment
had educated, and the greater understanding of audiologic and the limited benefit of hearing aids. J Acoust Soc Am.
assessment and hearing aid rehabilitation for which he is 63:533–549.
known as the father—or Grand Father of Audiology. Laura Ann Wilber
P R E FAC E
xiii
xiv Preface
SECTION II:
SPECIAL POPULATIONS 435
PHYSIOLOGICAL PRINCIPLES 23 Newborn Hearing Screening . . . . . . . . . . . . . . .437
AND MEASURES 135 Karl R. White
xv
xvi Contents
30 Central Auditory Processing Disorder: 40 Hearing Aid Fitting for Children: Selection,
Therapy and Management . . . . . . . . . . . . . . . .561 Fitting, Verification, and Validation. . . . . . . . .759
Jack Katz, Jeanane Ferre, William Keith, and Angela Loucks Susan Scollie
Alexander
41 Hearing Aid Fitting for Adults: Selection,
31 Individuals with Multiple Disabilities . . . . . .583 Fitting, Verification, and Validation. . . . . . . . .777
Anne Marie Tharpe and Samantha Gustafson Michael Valente and Maureen Valente
32 Noise Exposure . . . . . . . . . . . . . . . . . . . . . . . . . . .595 42 Building and Growing an Audiologic
Brian Fligor, Marshall Chasin, and Rick Neitzel Practice . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .805
33 Nonorganic Hearing Loss. . . . . . . . . . . . . . . . . .617 Melanie Herzfeld
Frederick N. Martin and John Greer Clark 43 Implantable Hearing Devices . . . . . . . . . . . . . .817
34 Hearing Loss in the Elderly: A New Look Teresa A. Zwolan
at an Old Problem . . . . . . . . . . . . . . . . . . . . . . . .631 44 Intervention, Education, and Therapy
Barbara E. Weinstein for Children with Hearing Loss . . . . . . . . . . . .835
35 Tinnitus and Hyperacusis . . . . . . . . . . . . . . . . .647 Christine Yoshinaga-Itano and Kristin M. Uhler
Richard S. Tyler, William Noble, Claudia Coelho, Eveling Rojas 45 Audiologic Rehabilitation . . . . . . . . . . . . . . . . .849
Roncancio, and Hyung Jin Jun Joseph Montano
36 Tele-audiology . . . . . . . . . . . . . . . . . . . . . . . . . . .659 46 Infection Control . . . . . . . . . . . . . . . . . . . . . . . . .861
De Wet Swanepoel A.U. Bankaitis
3
4 SECTION I • Basic Tests and Procedures
the cochlea. The field of audiology has expanded to include Evanston, Illinois went on to contribute significantly to the
the assessment of more complex functions at all levels of the field of audiology in their own right.
peripheral and central auditory nervous system. Immittance
measurements enabled audiologists to assess mechanical
properties of the auditory system of the outer and middle
ABOUT THIS HANDBOOK
ears, as well as middle-ear muscle responses that rely on The first edition of the Handbook of Clinical Audiology was
auditory nerve and brainstem activity. Specialties such as published in 1972 and subsequent editions have served sev-
auditory processing disorders, educational audiology, ves- eral generations of audiologists in the United States and
tibular function, and interoperative monitoring have added increasingly throughout the world. It is used widely as both
to the breadth and depth of the field. a text and reference book by students and professionals in
The growing sophistication and understanding of audi- various fields.
tory functions and development can be seen in the lowering Currently, for this edition, we have five editors who
of the target ages for various services. In the mid-1950s it have diverse areas of specialization in clinical, research, and
was taught that we should wait until deaf children are per- teaching aspects for which they are responsible. To broaden
haps 10 years of age before testing them and presumably our horizons and to be as inclusive as possible, in this edi-
provide amplification after that. Given our current state of tion we have tried to include more international voices and
knowledge, in retrospect, this seems absurd and counterpro- procedures.
ductive. At that time we did not understand that develop- We have squeezed in as much information in 1,000
mental problems should be identified at the earliest possible pages as we could. The more than 90 contributors are highly
time. Otherwise, the person could miss critical periods and regarded audiologists who also have clinical, research, and
lose plasticity, as well as fall further behind with reduced teaching experience. This makes the chapters authoritative,
learning and more acquired misconceptions. Now, neona- well organized, and geared for sharing our knowledge in a
tal hearing screening is widespread and we strive to begin field that we love. We have always considered readability an
habilitation by 6 months of age. In fact, in the past, one important feature of this book and especially now that it is
audiologist was ridiculed when she advocated that audiolo- used by many people whose first language is not English.
gists fit hearing aids for children as young as 1 year of age. The 46 chapters are divided into four sections. Section I
Once we realized the critical importance of the early years deals with basic tests and procedures that are used by most
for later development, early identification and assessment audiologists for most of the people with whom they work.
procedures, as well as training procedures were targeted and This involves puretone air and bone conduction, as well as
developed. standard speech audiometry. Calibration and case history
As the field of audiology expanded so did the academic chapters are also important components for any audiologic
demands on the practitioners. Initially, a bachelor’s degree assessment. The Diagnostic Audiology chapter helps the
was required to practice and then a master’s degree was reader to combine all of the previous information into a
the entry level along with basic clinical certification. As in coherent diagnosis.
the past a Ph.D. was generally desired for university teach- Section II introduces the various physiological and
ing and research. In more recent years (in the United States) electrophysiological procedures used by audiologists at this
the Doctorate of Audiology (Au.D.) degree was introduced time. These include immittance measures that primarily
to provide even broader clinical teaching and training expe- reveal the status of the middle ear. Electrocochleography and
riences. Also, higher levels of competency and certification Otoacoustic Emissions provide detailed information about
are generally required today to practice audiology. Students the responses from the cochlea, the end organ of hearing.
interested in a career that includes independent research Five chapters in this section discuss the electrophysiological
continue to pursue a Ph.D. in audiology, hearing science, responses from the auditory nerve, brainstem, and various
or related areas. Now many of the top university programs areas of the brain. The chapter on intraoperative monitor-
in audiology have both Ph.D. and Au.D. trained professors ing describes the analysis of the auditory system during sur-
to provide the student the best of both worlds. We also see gery that informs the surgeons about the status and pos-
combined Au.D./Ph.D. programs that offer students excel- sible adverse effects of their manipulations of the auditory
lent ground for both clinical and research endeavors. system. The final three chapters in this section deal with the
We owe a debt of gratitude to those early audiologists vestibular system. They begin with the study of vestibular
who helped to form this vibrant and vital health profession. neurophysiology and end with vestibular rehabilitation.
Although we cannot mention the many important con- Section III is called Special Populations. This recog-
tributors, it is perhaps appropriate to mention Raymond nizes that certain groups often require modifications in
Carhart (1912 to 1975) who is generally recognized as “The audiometric procedures or accommodations. Evaluation of
Father of Audiology.” He was an important contributor to young children offers a special challenge to the audiologist
the developing field of audiology and an excellent teacher. because they do not have the auditory or cognitive devel-
Many of his students from Northwestern University in opment needed for some of the tests, and it is sometimes
CHAPTER 1 • A Brief Introduction to Clinical Audiology and This Handbook 5
difficult to have them perform in the expected fashion. This known syndrome. Since audiologists are often the first
chapter describes ways to obtain the desired results. Hear- professionals to suspect a genetic basis for a hearing loss,
ing, screening, and educational audiology generally involve it is important to have current information available as
the work carried out in schools with those who have nor- well as the knowledge of resources.
mal hearing as well as those with auditory impairments. e. Audiology Tele-practice follows the global trend to pro-
This section also includes chapters that deal with those who vide appropriate services at a distance from the profes-
have multiple disabilities, hereditary hearing loss, and the sional. Using a range of communication technologies
elderly. Other special groups are those with noise-induced and appropriate training of para-professionals, audiolo-
hearing loss, those who have tinnitus, and individuals who gists can treat individuals in remote places who might
have “nonorganic” hearing loss. Four of the other chapters otherwise not receive care. Tele-practice also provides
involve auditory processing disorders, which include the convenience to patients who live relatively close by, but
bases of central auditory problems, diagnostic procedures, nonetheless find it challenging to visit the clinic for rou-
and subsequent remediation. This section concludes with tine problems. By making oneself available using tele-
a chapter on tele-practice in which audiologists can work technology, the audiologist helps patients conserve their
with people at far distances via communication systems. physical energy, time, and travel expenses, while keeping
Section IV deals with the management of hearing dis- abreast of the patient’s challenges as they develop.
orders. It begins with acoustical environments and tech- f. The topic of Infection Control relates to every aspect of
nologies that are used to aid the hard-of-hearing person in audiology, because it is important not to harm the peo-
reducing the influence of noise and other factors that can ple whom we are here to help. Infection control is part
compromise communication. This is followed by four chap- of every aspect of our work and for this reason it is the
ters related to various aspects of hearing aids and hearing first of many chapters, in the future, that will be available
aid fittings. Another chapter, which deals with implantable from the Point on internet.
hearing devices, is a rapidly expanding area. The chapter
discusses cochlear implants and other devices that are sur- Other New Features in This
gically imbedded into the person with a hearing loss. Two
other chapters deal with management of those with hearing Handbook
problems in the classroom and with rehabilitation of adults. In this edition of the Handbook we have reduced the num-
There is also a chapter in this section that advises audiolo- ber of references provided in each chapter, but there are
gists on how to start an effective audiologic practice. extensive lists of references for the interested students, pro-
fessors, and researchers on the Point. In this way the reader
New Chapters in This Edition is not encumbered with reading through or skipping over
many references when trying to understand the concepts
a. Diagnostic Audiology serves an important purpose in and to remember the facts in this book. At the same time
bringing together the information from the basic evalua- there are thousands of references organized by chapters
tion procedures in this book to form an audiologic inter- online for those who are interested in research or for greater
pretation and an understanding of the patient’s needs. depth on the topics covered in this book.
This chapter will also discuss some procedures that are Another new feature is the thought questions at the
not covered in the preceding chapters, as well as men- end of each chapter. They will ask how and what you
tioning what our tests do not tell us. would do in dealing with, or solving, problems associated
b. The Dizzy Patient and Vestibular Rehabilitation chapter with the information in the chapter. This is not another
is an extension of a former Handbook chapter, apply- hoop to jump through but a valuable exercise. The stu-
ing diagnostic information to enable appropriate treat- dent must take what they have learned from the chapter
ment decisions for patients with vestibular problems. It and combine it with their other knowledge to figure out
will describe audiologic procedures designed to relieve a good solution to a problem/question. In this way they
patient’s symptoms, as well as the role of physical therapy take what was on page and internalize it, while it is fresh
and the necessity of collaboration among healthcare pro- in their minds, and put the information to a practical use.
fessionals. This will help you to internalize the information and make
c. Hearing Screening discusses newborn hearing screen- the material your own.
ings, school screenings, and other screening procedures
using universal approaches and targeted population
approaches. The specific procedures, their value, and
Terminology
outcomes of screening programs will be discussed. Most of the terms used in this edition are standard in
d. Hereditary Hearing Loss describes much-needed infor- the field at this time. However, when a change is made it
mation for audiologists related to genetic aspects of should be for a worthwhile purpose and not one that creates
hearing loss that may be nonsyndromic or part of a important problems. For example, this writer was pleased to
6 SECTION I • Basic Tests and Procedures
see a recent change back to a previous term. What was once For this edition we will combine both the Jacobson and
called Central Auditory Processing was changed to Auditory Northern and the Martin and Clark approached as this
Processing and recently was changed back to the clearer and seems to be better than sensory-neural and avoids the prob-
more specific Central Auditory Processing again (American lems that ‘sensorineural’ has caused.
Academy of Audiology, 2010).
PURETONE
SENSORY/NEURAL The reader might infer that the writer does not like com-
A conductive loss is a mechanical impairment of hearing, pound words (two words that are combined to form
associated with the outer and/or middle ears. For many years a composite of the two, e.g., flashlight, textbook). We
a nonconductive loss had been called a “nerve loss.” After rarely combine opposites (e.g., dogcat, daynight, or even
WWII it was changed to “sensory-neural loss” when ENT sensorineural). But when two words are frequently spo-
doctors and audiologists were then able to separate sensory ken together (e.g., base and ball) often the first step is to
(cochlear) from neural (acoustic nerve or brainstem) dis- hyphenate them (base-ball) and when people get used to
orders. For example, cochlear problems (such as Meniere’s this expression, they are often combined and made a com-
disease) were demonstrated by a rapid growth of loudness pound word (baseball).
when a sound was presented above the person’s threshold The term “pure tone” is shown one or more times on
of hearing. On the other hand with retrocochlear losses every audiogram and appears in almost every report and
(e.g., auditory nerve or brainstem) there was no accelerated is a very common type of audiometer (but in that case it
growth of loudness with sounds above the neural hearing might be hyphenated because it is followed by a noun, e.g.,
level (as with a person who had an auditory nerve tumor). pure-tone audiometer). Because (1) we have to explain this
However, after a number of years the term sensory-neural to students and often have to decide if it needs a hyphen
was changed to “sensorineural.” There was little reaction to when we are writing, and (2) it is surely time to graduate
this minor change. from pure-tone to puretone, this change seems appropriate.
I was shocked, however, to receive an angry phone call In this case there is no compelling reason for doing so (as it
from a doctor who claimed that I made a mistake which would be in the case of sensorineural) but it seems that it is
caused him to delay surgery for his patient’s auditory nerve time for “pure” and “tone” to be officially married and to be
tumor. From a review of my report it was abundantly clear a compound word forever more.
that the patient had retrocochlear characteristics that are
consistent with an “eighth nerve or brainstem involvement” ESPECIALLY FOR STUDENTS—
and not cochlear involvement. How could that have been
misinterpreted? The physician only had read up to the first
SOME SUGGESTIONS
test result, that puretone testing showed a “sensorineural As a student, it is most helpful to educate yourself broadly
loss in the right ear.” On seeing the term “sensorineural” he in your profession and related subjects. You may benefit
incorrectly concluded that it was a cochlear problem and not from speech, psychology, and many other courses as much
a very dangerous auditory nerve tumor. He did not know as from some of your audiology courses. The ability to take
that the term sensorineural could represent two importantly a broader view is certainly an advantage no matter how you
different types of hearing loss. Puretone thresholds distin- plan to practice audiology.
guish conductive from both sensory and neural disorders. When you have a choice in taking your first job, it is well
Later on similar mistakes, with the term sensorineural, to take one that covers a wider area of professional activity
were made by knowledgeable audiologists in two separate over one that is narrow. You may find that an area that pre-
publications. This convinced me that the term sensorineural viously did not seem too interesting is one that you realize
can create serious problems that should be less problematic is very interesting or gratifying. Also, if you have a broad
with the original term sensory-neural. experience you can qualify for more opportunities later on.
Since the second edition of the Handbook we have used As you get deeper into your areas of major interest you
the term sensory-neural to avoid the errors caused by senso- will necessarily reduce how broadly you can practice. But
rineural (Katz, 1978). If those who coined the term sensori- having a prior background or learning can help you in what
neural originally did not try to combine two auditory com- you are doing and perhaps provide variety in your profes-
ponents that we try hard to distinguish from one another, sional activities. Later on, if you have specialized in one
it is likely that fewer problems would have occurred. Other area then an exciting and enriching aspect is to carry out
authors have recognized the problem with the term senso- research to improve your success or simply to obtain a better
rineural. Jacobson and Northern (1991) suggest using just understanding. One way to repay your profession for train-
sensory or neural, when it is clearly one or the other. Martin ing you is to supervise students in your external practicum
and Clark (2012) avoid the confusion by using the term site. Mentoring students and sharing what you have learned
sensory/neural which is also a good way to clarify the term. can be most rewarding, but in addition you may learn some
CHAPTER 1 • A Brief Introduction to Clinical Audiology and This Handbook 7
new concepts from the students that you may have missed the main reason for choosing each of them for your
or learn from having to answer their questions. department.
It is our pleasure to provide you with this book full of 3. If you were the editor of Handbook of Clinical Audiology
knowledge that was written by dozens of audiologists who and could only add one chapter to this edition, based on
have enjoyed sharing with you their hundreds of years of what you know or imagine, which of the six new chapters
experience in this wonderful field. Finally, as professionals (see above) would you choose and why?
we should be committed to helping those we serve. We also
need to follow the rules. Of course, in addition your work
needs to provide you with the necessities of life. Despite
KEY REFERENCES
these constraints, to a great extent, your profession is pretty A full list of references for this chapter can be
much what you make of it. found at http://thePoint.lww.com. Below are the key refer-
ences for this chapter.
American Academy of Audiology. (2010) Guidelines for diagnosis,
FOOD FOR THOUGHT treatment and management with children and adults with cen-
1. What personal characteristics and experiences do you tral auditory processing disorders. Available online at: http://
www.audiology.org/resources/documentlibrary/documents/
have that you think will be helpful to you as an audiolo-
CAPDGuidelines 8–2010.pdf.
gist? Jacobson J, Northern J. (1991) Diagnostic Audiology. Austin, TX:
2. You are the Director of an Audiology Department at Pro-Ed; p 8.
a medical center. There is a need to establish guide- Katz J. (1978) Clinical audiology. In: Katz J, ed. Handbook of Clini-
lines for the audiologists to provide a degree of consis- cal Audiology. Baltimore, MD: Williams & Wilkins Co.; p 5.
tency (e.g., in reports). You have seen “sensorineural” Martin F, Clark JG. (2012) Introduction to Audiology. Boston:
spelled like that and also as sensory/neural. Consider Pearson; p 446.
C H A P T ER 2
Calibration
9
10 SECTION I • Basic Tests and Procedures
members. This is the basic procedure for development and equipment, if daily listening checks are strictly enforced,
approval of standards. For more information on the stan- transducers should be verified at least annually, unless there
dards process, the reader is referred to Melnick (1973) and is reason to suspect that the output has changed. If daily
Wilber (2004). There are three primary sources of funding listening checks are not strictly enforced more complete
for the production of standards in acoustics: financial sup- checks might be necessary. In addition to regularly sched-
port from Acoustical Society of America (ASA), fees paid by uled checks, audiometers should be tested whenever the
the voting members of an Accredited Standards Committee, clinician notices anything unusual in their performance.
and income from the sales of standards. Through your pur- Sometimes test results themselves reveal the need for
chase of standards, you are supporting the efforts of those an immediate calibration check (e.g., when the same air–
professionals who donate their time and effort to develop bone gap is obtained for two successive patients). It is always
and maintain ANSI standards. Contact information of the better to check the audiometer first rather than assume the
secretariat of ANSI S1, S2, S3, and S12 is: problem lies with the client or clinician. A quick biologic
check (described later) can always be performed. If this con-
Acoustical Society of America
firms the probability of an equipment problem, then a more
ASA Secretariat
elaborate electroacoustic check should be carried out.
35 Pinelawn Road, Suite 114E
If the audiologist discovers that the frequency or time
Melville, NY 11747-3177
components of the audiometer are out of calibration, then
E-mail: [email protected]
in most instances the manufacturer or a local representa-
tive should be contacted for immediate repair and/or proper
calibration of the instrument. However, if there is a stable
PARAMETERS OF CALIBRATION deviation in output level at a given frequency, calibration
The first step in learning how to check calibration should corrections can be made by adjusting the trim pots (potenti-
always be to read the appropriate manual(s) that accom- ometers) on the audiometer, by using the audiometer’s self-
pany the audiometric equipment that you have purchased. calibrating mechanism, or by posting a note on the front of
Additional resources include electronic parts, stores that the audiometer indicating the corrections. If paper correc-
often have basic manuals on test equipment, ASHA and tions must be used, then the adjustment in decibels (plus
ASA. A number of books have also discussed procedures or minus) that should be made at the various frequencies
for acoustic measurements and equipments that might be should be shown for each transducer. Note that if the SPL
used in such measurements (Beranek, 1988; Decker and output is too high (e.g., by 5 dB), then you must increase
Carrel, 2004; Silverman, 1999). The United States Govern- their audiometric threshold (e.g., by 5 dB HL). Most mod-
ment Printing Office is also a good source of information ern audiometers provide some sort of internal (typically
on basic test procedures. The specific parameters that must software based) calibration system for earphones, and many
be checked in an audiometer are outlined in standards pro- also provide this for bone conduction or sound field. If one
vided by the ANSI and the IEC. See Table 2.1 for a listing of plans to use bone vibrators for both mastoid and frontal
standards relevant to calibration of audiometric equipment. bone testing or two sets of earphones with the same audi-
It is beyond the scope of this chapter to discuss each area of ometer (e.g., supra-aural earphones and insert receivers), it
calibration in detail. For the readers who intend to perform is probably advisable to use “paper corrections,” rather than
their own calibration of audiometric equipment, they need trying to adjust trim pots between each transducer’s use. If
to purchase copies of the latest standards to verify the exact frequent level adjustments are required, it is probably wise
parameters to be checked and their permissible variability. to check with a qualified technician.
To better understand the procedures for checking calibra-
tion, one must first understand the parameters that need to
be checked, as well as the equipment used to perform these
INSTRUMENTATION
calibration checks. For puretone and speech audiometers, As mentioned earlier, the calibration of an audiometer
the three parameters are (1) frequency, (2) level (sound pres- requires the use of various pieces of electroacoustic and
sure level [SPL] or force level or [FL]), and (3) time. These electronic instrumentation. Most, if not all, graduate audi-
parameters apply whether one is using a portable audiom- ology programs will have the instrumentation needed to
eter, a standard diagnostic audiometer, or a computer-based at least evaluate whether the audiometer meets the refer-
audiometric system. ence equivalent threshold sound pressure level (RETSPL),
Some organizations, such as ASHA and OSHA, specify frequency, linearity, and distortion standards specified in
time intervals at which calibration checks should be made. ANSI S3.6 Specification for Audiometers. In this section, we
With current solid-state electronic circuitry, frequency, and will review the use of several basic instruments, including
time, parameters should be checked when the audiometer sound level meter (SLM), multimeter, frequency counter,
is first acquired and at yearly intervals thereafter. Older oscilloscope, and digital spectrum analyzer. More details on
equipment should be checked at least biannually. For newer acoustics and instrumentation can be found in numerous
CHAPTER 2 • Calibration 11
TA B L E 2 .1
ANSI, IEC, and ISO Standards for Audiometers and Audiometric Testing
Number Title
ANSI S3.1-1999 (R 2008) Maximum Permissible Ambient Noise for Audiometric Test Rooms
ANSI S3.2-2009 Method for Measuring the Intelligibility of Speech Over Communication Systems
ANSI S3.6-2010 Specification for Audiometers
ANSI S3.7-1995 (R 2008) Coupler Calibration of Earphones, Method for
ANSI S3.13-1987 (R 2012) Mechanical Coupler for Measurement of Bone Vibrators
ANSI S3.20-1995 (R 2008) Bioacoustical Terminology
ANSI S3.21-2004 (R 2009) Method for Manual Pure-Tone Threshold Audiometry
ANSI S3.25-2009 Occluded Ear Simulator
ANSI S3.36-2012 Specification for a Manikin for Simulated In Situ Airborne Acoustic Measurements
ANSI S3.39-1987 (R 2012) Specifications for Instruments to Measure Aural Acoustic Impedance and Admittance
(Aural Acoustic Immittance)
ANSI S1.4-1983 (R 2006) Specifications for Sound Level Meters
IEC 60318-1:2009 Electroacoustics: Simulators of Human Head and Ear. Part 1—Ear Simulator for the
Calibration of Supra-aural and Circumaural Earphones
IEC 60318-4:2010 Electroacoustics: Simulators of the Human Head and Ear. Part 4—Occluded-Ear Simula-
tor for the Measurement of Earphones Coupled to the Ear by Means of Ear Inserts
IEC 60318-6:2007 Electroacoustics: Simulators of Human Head and Ear. Part 6—Mechanical coupler for
the measurement on bone vibrators
IEC 60645-3:2007 Electroacoustics: Audiometric equipment. Part 3—Auditory Test Signals of Short Dura-
tion for Audiometric and Neuro-otological Purposes
IEC 60645-5:2004 Electroacoustics: Audiometric Equipment. Part 5—Instruments for the Measurement of
Aural Acoustic Impedance/Admittance
IEC 60645-6:2009 Electroacoustics: Audiometric Equipment. Part 6—Instruments for the Measurement of
Otoacoustic Emissions
IEC 60645-7:2009 Electroacoustics: Audiometric Equipment. Part 7: Instruments for the Measurement of
Auditory Brainstem Responses
IEC 60645-6:2009 Electroacoustics: Audiometric Equipment. Part 6: Instruments for the Measurement of
Otoacoustic Emissions
IEC 60645-7:2009 Electroacoustics: Audiometric Equipment. Part 7: Instruments for the Measurement of
Auditory Brainstem Responses
ISO 8253-1:2010 Acoustics: Audiometric Test Methods. Part 1: Basic Pure-Tone and Bone Conduction
Threshold Audiometry
ISO 389-1:l998 Acoustics: Reference Zero for the Calibration of Audiometric Equipment. Part 1: Reference
Equivalent Threshold Sound Pressure Levels for Pure Tones and Supra-aural Earphones
ISO 389-2:l994 Acoustics: Reference Zero for the Calibration of Audiometric Equipment. Part 2: Reference
Equivalent Threshold Sound Pressure Levels for Pure Tones and Insert Earphones
ISO 389-3:l994 Acoustics: Reference Zero for the Calibration of Audiometric Equipment. Part 3: Reference
Equivalent Threshold Force Levels for Pure Tones and Bone Vibrators
ISO 389-4:l994 Acoustics: Reference Zero for the Calibration of Audiometric Equipment. Part 3: Reference
Equivalent Levels for Narrow-Band Masking Noise
ISO 389-5:2006 Acoustics: Reference Zero for the Calibration of Audiometric Equipment. Part 5: Reference
Equivalent Threshold Sound Pressure Levels for Pure Tones in the Frequency Range
8 kHz to 16 kHz
ISO 389-6:2007 Acoustics: Reference Zero for the Calibration of Audiometric Equipment. Part 6: Reference
Threshold of Hearing for Test Signals of Short Duration
ISO 389-7:2005 Acoustics: Reference Zero for the Calibration of Audiometric Equipment: Part 7: Reference
Threshold of Hearing under Free-Field and Diffuse-Field Listening Conditions
ISO 389-8:2004 Acoustics: Reference Zero for the Calibration of Audiometric Equipment. Part 8: Reference
Equivalent Threshold Sound Pressure Levels for Pure Tones and Circumaural Earphones
ANSI, American National Standards Institute; ASHA, American Speech-Language-Hearing Association; IEC, International Electrotechnical
Commission; ISO, International Organization for Standardization.
NOTE: All ANSI, ISO, and IEC Standards referred to in this chapter are listed in this table.
12 SECTION I • Basic Tests and Procedures
texts (e.g., Decker and Carrell, 2004; Harris, 1998; Rosen correct this problem. If the frequency counter is an option
and Howell, 1991; Speaks, 1996). in a multimeter, there is often no adjustable trigger level,
and the signal level must be changed to correctly trigger the
counter function.
Multimeter
The term “multimeter” indicates that this device can be
used to make multiple measurements. In most cases, a
Sound Level Meter
multimeter will allow one to make measurements of volt- The SLM is actually multiple instrumentation components
age, current, and resistance. Each of these measurements provided in a single instrument. You can combine separate
is made differently, and we will limit our discussion herein instruments into a usable device when an SLM is not avail-
to making voltage measurements. To measure voltage, we able. At a minimum, for checking the calibration of RET-
must make the measurement in parallel to (across) the SPL (i.e., 0 dB HL values on the audiometer), you need an
device of interest. For example, if we are interested in atten- acoustic calibrator, an appropriate coupler (2 cc and/or 6
uator linearity, we want to place the leads of the multime- cc), a microphone, and the SLM. SLMs used for checking
ter across the earphone leads, with the earphone plugged the calibration of audiometers should be Type 1, as should
into the audiometer output. We can replace the earphone microphones used for such calibrations. The most com-
with an equivalent impedance (in most cases, a 10-, 50-, or monly used Type 1 microphone is a condenser microphone.
300-ohm resistor for ER-3 A, TDH-39, TDH-49, or TDH-50 Condenser microphones come in four standard sizes (refer-
earphones). Simply unplugging the earphones and plug- ring to their diameter): 1/8″, 1/4″, 1/2″, and 1″. For calibra-
ging in the multimeter will likely produce inaccurate tion of a supra-aural earphone, a 1″ microphone is specified
results, because this approach in most cases will change the in ANSI S3.6-2010 (because of its sensitivity—see the state-
load impedance of the audiometer output. It is important ment that follows). In general, the smaller the microphone
to purchase a true root mean square (RMS) multimeter is, the higher its upper frequency cutoff and the less its sen-
for accurate RMS voltage readings. It is important to set sitivity. Sensitivity is a measure of its efficiency transferring
the meter to AC, or alternating current (vs. DC, or direct sound pressure into voltage and is commonly reported as
current), voltage. The meter is most accurate when set to millivolts per pascal, or in dB re: 1 V/Pa. Many condenser
the lowest voltage range possible. In most cases, the volt- microphones require a DC polarization voltage of 200 V.
age range is set in powers of 10, where the listed voltage is Some condenser microphones are prepolarized and hence
the maximum voltage possible for that voltage range. When do not require an externally applied polarization voltage.
this maximum voltage is exceeded, an overload is indicated Microphones also come as pressure microphones (to be
(see multimeter manual for the overload indicator for your used in a coupler), free-field microphones (to be used in
multimeter). You adjust the multimeter range until you sound field recordings such as when measuring the ambi-
have the most sensitive range (lowest maximum voltage) ent noise in the sound booth), or random-incidence micro-
where the output is NOT overloaded. phones (for measures in, e.g., reverberant environments).
More detailed information about microphones and SLMs
can be found in Johnson et al. (1998) and Yeager and Marsh
Frequency Counter (1998). It is important that your SLM and microphone be
This might be a stand-alone device, or it might be an option compatible (i.e., provide the correct polarization voltage),
on your multimeter. In the case of a stand-alone device, a fre- or equipment damage and/or incorrect SPL measures may
quency counter will often have a trigger adjust (the voltage result.
level and direction: positive-going or negative-going) that The SLM also contains amplifiers (whose gain is
determines when an event is triggered. The frequency coun- changed when you change the SPL range), time-weighting
ter combines an event counter with an accurate clock. The circuits (for fast, slow, and possibly impulse and peak time
ratio of events (i.e., cycles) divided by the time elapsed gives weightings), various filter settings (e.g., dBA, dBC, and
you the frequency (in hertz). Thus, if 20 events are measured octave and/or third-octave band filters), as well as a display
in 10 ms (one-hundredth of a second), then the cycles per function (this could be a volume unit (VU) meter, an LED
second (or hertz) = 20 cycles/0.01 s = 2,000 cycles/s (Hz). If indicator, and/or a digital readout). The gain of an amplifier
the counter does not trigger (no events counted), you need in the SLM must be adjusted to account for the sensitivity of
to reduce the trigger level or turn up the signal (e.g., increase each microphone. For example, a 1″ microphone might have
the dB HL on the audiometer dial). If the frequency counter a sensitivity of 50 mV/Pa, whereas a 1/4″ microphone might
reads a number substantially larger than expected, then it have a sensitivity of 1 mV/Pa. If the SLM were adjusted
is possible that the trigger level is set too low (or the signal appropriately for the 1/4″ microphone, then when 1 Pa
presented is set too high) and that multiple triggers per cycle of pressure was presented to the microphone diaphragm,
are occurring. In this case, turning the signal level down or the SLM would read 94 dB SPL [20 log(1 Pa/0.0002 Pa)
increasing the trigger level of the frequency counter should = 94 dB SPL]. If we replaced the 1/4″ microphone with the
CHAPTER 2 • Calibration 13
1″ microphone but did not change the SLM amplifier gain, allow the “freezing” of the signal on the oscilloscope. To
the 1″ microphone would read 128 dB SPL [94 dB SPL + measure, for example, peak-to-peak voltage, one counts the
20 log(50 mV/1 mV)]. How, then, do we calibrate the SLM number of vertical divisions (usually a division is a centi-
so that it displays the correct SPL? In most instances, we meter) extending from the positive to the negative extremes
would use a device that presents a known SPL to the dia- and multiplies this number of divisions by the voltage per
phragm of the microphone. Two types of calibration devices division to obtain the peak-to-peak voltage. It should be
are commercially available for this purpose: pistonphones noted that measurements made on an analog oscilloscope
and acoustic calibrators. The former produces sound by a are assumed to have an error of 5% or more.
mechanical piston, whereas the latter uses an electrical oscil-
lator and a transducer to produce the tone. Each calibrator
produces a specified SPL at a specified frequency, and this
Spectrum Analyzer
calibrator should be periodically sent back to the manu- Numerous devices can be used to provide a frequency-
facturer to assure it remains within specified tolerances of domain representation of a signal (including the octave
frequency and SPL. These calibrators can accommodate a or third-octave band filters available on many SLMs). In
variety of microphone sizes by inserting nesting adapters. this section, we will limit our discussion to instruments
Using an acoustic calibrator is very simple: turn on the SLM, referred to as digital spectrum analyzers. These instruments
place the calibrator snugly over the microphone, and turn may be stand-alone hardware devices or might be part of
on the calibrator. Making sure that the frequency response a computer-based hardware/software application. These
of the SLM is wideband (flat, or dBC if flat weighting is not devices convert an analog input signal to digital format by
available), adjust the gain of the SLM (by trimming a cali- use of an analog-to-digital converter. It is important that
bration potentiometer using a screwdriver or via software) the reader understand that if the sampling rate used dur-
until the specified output of the calibrator (e.g., 114 dB SPL) ing analog-to-digital conversion is too slow, it can cause the
is displayed on the SLM. generation of “false frequencies” in a process called aliasing.
Once the SLM is calibrated, you must remove the Many spectrum analyzers preclude aliasing by judicious use
acoustic calibrator (or pistonphone) and place an appropri- of a low-pass filter (called an antialiasing filter). It should
ate coupler over the microphone: a 2-cc coupler for insert also be noted that not all possible signal amplitudes can be
earphones (e.g., Etymotic ER3 A earphones) or a 6-cc cou- encoded following analog-to-digital conversion, but sig-
pler for supra-aural earphones (such as TDH-39, TDH-49, nal level is rounded off (“quantized”) and that the magni-
or TDH-50 earphones). ANSI S3.6-2010 has RETSPL values tude of possible quantization error is related to the voltage
for both insert and supra-aural earphone for several 6-cc range and the resolution (related to the number of bits) of
(National Bureau of Standards [NBS] 9-A, IEC 318) and the analog-to-digital converter. The time-domain signal is
2-cc (HA-1, HA-2, occluded ear simulator) couplers. digitized over a limited time period, called the time window
or the time epoch. Once the signal is digitized into a time
epoch, it is converted into the frequency domain by Fourier
Oscilloscope transformation. (See Rosen and Howell, 1991 for a more
The oscilloscope, in its most common display mode, pres- complete explanation of aliasing, antialiasing, quantizing,
ents voltage as a function of time. Oscilloscopes come in and digitization.) The fast Fourier transform (FFT) is one
analog and digital types. In the analog oscilloscope, the out- of many algorithms that have been developed to convert a
put of an electron gun transiently illuminates the screen of time-domain (voltage over time) signal into a frequency-
a cathode ray tube. Freezing the display on the oscilloscope domain (amplitude across frequency) signal. Another term
screen involves repeated triggering of the oscilloscope on a for the frequency-domain representation is the spectrum.
fixed phase of the stimulus. Specialized analog oscilloscopes In addition to the possibility of quantization errors and
that can freeze a display for prolonged periods of time are aliasing, you must be aware that signal processing prior to
called storage oscilloscopes. A digital oscilloscope is simi- Fourier transformation can have an influence on the results.
lar to an analog oscilloscope, except that instead of electron Because of some underlying assumptions about the peri-
guns and a cathode ray tube, the signal is recorded by an odic nature of the discretely sampled signal, the spectrum of
analog-to-digital converter and displayed on a flat panel dis- the signal is distorted unless an integer number of cycles of
play. Digital oscilloscopes often have features that are not all frequencies is contained in the time epoch over which
typically available on analog oscilloscopes (e.g., storage of the signal is digitized. To prevent the distortion (often called
waveforms, cursor functions, and summary statistics such leakage) that occurs when a noninteger number of cycles
as peak-to-peak and RMS voltage calculations). Simple is contained in the time epoch, the digitized time epoch
amplitude and voltage measurements are easily performed can be shaped. This shaping multiplies the signal by val-
on a signal using an oscilloscope. Manipulations of the time ues at or near zero, near the beginning and end of the time
base (in time per division) and amplitude (in volts per divi- window and weights them at or near 1, near the middle of
sion), as well as the appropriate adjustment of the trigger, the time window. One popular windowing function is the
14 SECTION I • Basic Tests and Procedures
irregularity when the problem switch or dial is manip- caused by faulty external wiring between the examiner’s
ulated. The danger of an audible transient, whether booth and that of the test subject or within the audiom-
mechanical or electronic, is that the patient may respond eter itself. Cross-talk must be corrected before any testing
to the transient rather than the stimulus tone. Sometimes is carried out.
an antistatic or contact-cleaner spray can alleviate the 7. The clinician should listen to the signal while the attenu-
problem of electronic transients. ation dial is changed from maximum to minimum levels.
5. The audiologist should listen for hum or static with the For instance, a tone may be present at 20 dB HL on the
hearing level dial at a high value, both when a stimulus dial, whereas no tone is present at 15 dB HL on the dial.
signal is present and when it is absent. One should not In some cases, the tone stays at the same hearing level
hear static or hum at levels below 60 dB HL on the dial. from 20 dB HL to −10 dB HL on the dial. These problems
6. “Cross-talk” may occur between earphones, that is, the are easily detected by listening to the audiometer.
signal that is sent to one earphone may be heard in the 8. Finally, the threshold of the clinician (or a person with
contralateral earphone. Such a problem could greatly known hearing thresholds) should be checked with the
affect the audiometric thresholds obtained on that audi- earphones and bone vibrators to make sure that the
ometer, especially for cases with unilateral hearing loss. outputs are approximately correct. If the levels are not
Cross-talk may be detected by unplugging one earphone, within 10 dB of the previous threshold values, the output
sending a signal to that phone, and listening to the other levels should be checked electronically.
earphone. As before, when removing the earphone, a
proper resistive load must be put in its place. The signal Aside from these gross problems, which can be detected
at a suprathreshold dial setting (e.g., 70 dB HL) should by looking or listening (see Figure 2.1 for an example of a
not be heard in the opposite earphone when a signal form that may be used to aid the clinician in carrying out
is presented in the normal manner. Cross-talk may be the listening check), the precise accuracy of the output levels
Audiometer serial #
Date:
Time:
Checked by:
Earphone cords
Power cord
Attenuator cord
Hum
Dials
Frequency
Attenuation
Intensity right phone
Intensity left phone
Tone interrupter
Tone pulse rate
Cross-talk
FIGURE 2.1 Form for biologic
Acoustic radiation check of audiometer. (Reprinted
Bone vibrator(s) from Wilber L. (1972) Calibration:
pure tone, speech and noise
Loudspeakers
signals. In: Katz J, ed. Handbook
Other comments of Clinical Audiology. 1st ed.
Baltimore, MD: The Williams &
Wilkins Company; pp 11–35, with
the permission of Lippincott
Williams & Wilkins.)
16 SECTION I • Basic Tests and Procedures
must be evaluated when the audiometer is first purchased or storage oscilloscope. When gating the signal on, rise time
and at regular intervals thereafter. Frequency, output level, is the length of time it takes for the signal to increase from
linearity of attenuation, and percentage of harmonic dis- −20 to −1 dB (10% to 90%) of its final steady-state value.
tortion should all be checked electronically, in addition to The fall time is the length of time between −1 and −20 dB
the biologic check. Section 5.4 of ANSI S3.6-2010 describes (90% to 10%) relative to its steady-state value. This is usu-
various checks for unwanted sound from the transducer or ally checked at a hearing level of 60 dB HL or less. ANSI
audiometer. S3.6-2010 specifies a rise time as well as a fall time of not less
than 20 ms and not more than 200 ms. A detailed descrip-
tion of the rise and fall characteristics is given in section
Frequency Check 7.5.3 of ANSI S3.6-2010.
The frequency output from the audiometer should be
checked by using an electronic frequency counter. This
instrument will tell the exact frequency of the output signal. Linearity Check
Quite accurate frequency counters are often included in a Attenuator linearity (the hearing level dial) may be checked
digital multimeter. The electrical output from the audiome- electrically, directly from the audiometer, or acoustically
ter may be routed directly to the instrument (i.e., unplug the through its transducer (earphone or bone vibrator). If mea-
earphone, then plug in the frequency counter input to the surements are to be made electrically, the earphone should
audiometer output) because the frequency is determined by remain in the circuit and the voltage should be measured
an oscillator in the audiometer rather than the transducer. in parallel to the earphone, or a dummy load that approxi-
By using an electronic frequency counter, one can easily mates the earphone impedance should replace the trans-
determine if the output from the audiometer corresponds ducer. To check linearity, the audiometer should be turned
to the nominal frequency. The standard for audiometers to its maximum output and then attenuated in 5-dB steps
allows a tolerance of ±1% of the indicated frequency value until the output can no longer be read. Each attenuator on
for Type 1 and 2 audiometers; ±2% for Type 3 and 4 audi- the audiometer should be checked separately. To meet the
ometers; and ±3% for Type 5 audiometers. For example, if ANSI S3.6-2010 standard, the attenuator should be lin-
the audiometer dial reads 1,000 Hz, then the actual output ear within 0.3 of the interval step or by 1 dB, whichever is
must be between 990 and 1,010 Hz for a standard diagnostic smaller. That is, if you change the level in 5-dB steps, the
(Type 1) audiometer. audiometer must attenuate between 4 and 6 dB per step.
Frequency should be checked on initial receipt of the If the attenuation step is 2 dB, then the reading should be
audiometer and at yearly intervals thereafter. Neverthe- between 1.4 and 2.6 dB per step (0.3 × 2 dB = 0.6 dB, which
less, it is appropriate to listen to the audiometer each day to is less than 1 dB). As noted in section 7.2 (and section 7.3.3)
judge whether the frequencies are maintaining reasonably of ANSI S3.6-2010, the SPL or FL of earphones, speakers, or
good accuracy. bone vibrators can vary by no more than ±3 dB from 125 to
5,000 Hz and no more than ±5 dB at 6,000 Hz and above, at
Harmonic Distortion Check any dB HL dial setting.
Attenuator linearity should be checked annually. If a
Linearity measurements may also help detect distortion “fixed loss pad” (i.e., a device that automatically changes
in a transducer or in the audiometer itself. Distortion may the signal level by a set amount, e.g., 20 dB) is present
appear as a lack of linear attenuation, especially at high in the audiometer, its attenuation must also be checked. If
output levels (90 dB HL and above). Harmonic distortion the audiometer attenuates in 1- or 2-dB steps, then these
must be checked through the transducer itself. Excessive smaller attenuation steps should be checked if they are used
harmonic distortion is rarely caused by the audiometer but clinically.
often arises in the various transducers. The maximum per-
missible total harmonic distortion in the current standard
(ANSI S3.6-2010) is 2.5% for earphones and 5.5% for bone EARPHONE LEVEL CALIBRATION
vibrators. The standard also shows the maximum permis-
sible distortion for the second, third, fourth, and higher
Real Ear Methods
harmonics, as well as the subharmonics, across audiometric There are two basic approaches for the calibration of ear-
frequency. phones. One is the “real ear” method and the other is the
“artificial ear” or coupler method. With the original real ear
method, one simply tested the hearing of a group of normal-
Rise–Fall Time hearing persons, averaged the results, and checked to see
The rise–fall time of the tone is a basic parameter of the that the average hearing of this group was at zero on the dial
audiometer, which may be checked by taking the output for each frequency. Although this is theoretically feasible
directly from the audiometer and routing it into a digital with a large population sample, it is not a recommended
CHAPTER 2 • Calibration 17
procedure. ANSI S3.6-2010, Appendix D, describes probe is described in ANSI S3.25-2009, but RETSPLs are not given
tube, loudness balance, and threshold procedures that may for supra-aural or insert receivers using the Zwislocki cou-
be used for this purpose. Clearly, these procedures are pos- pler or the manikin.
sible but quite unwieldy. For audiometers, this approach is When checking the audiometer earphone output, the
technically incorrect because the ISO 389-1:1998 reference supra-aural earphone is placed on the coupler and a 500-g
(which is also used in ANSI S3.6-2010) is not tied to normal weight is placed on top of it. If using an SLM (rather than
hearing per se, but simply refers to an arbitrarily accepted a microphone preamplifier), the output is read in dB SPL,
SPL (i.e., the RETSPL or FL). If the audiologist wishes to use where SPL = 20 log10 P/Pref (where P is the observed sound
a new earphone (that is not listed in ANSI S3.6-2010 Stan- pressure and Pref = 20 µPa). After the earphone is placed on
dard, its appendix, or any subsequent revision), a real ear the coupler, a low-frequency tone (125 or 250 Hz) is intro-
procedure might be the only way to check calibration, but if duced and the earphone is reseated on the coupler until
generally accepted earphones are used, it is much easier and the highest SPL value is read. This helps assure optimal
more efficient to use an artificial ear/coupler method. earphone placement on the coupler. The output from the
earphone is then compared to the expected values at each
frequency. The standard SPL values that are used are given
Artificial Ear (Coupler) Methods in (1) ISO 389-1:1998, often referred to as ISO-1964 because
The most commonly used procedure today is that of the of its initial publication date, and (2) ANSI S3.6-2010. These
“artificial ear,” which consists of a condenser microphone values evolved through a “round robin” in which several
and a 6-cc coupler (for supra-aural earphones) or 2-cc cou- earphones were measured on various couplers at a group of
pler (for insert earphones). The 6-cc coupler was originally laboratories throughout the world (Weissler, 1968).
chosen because it was thought that the enclosed volume was The current ANSI standard includes RETSPLs for the
approximately the same as the volume under a supra-aural TDH-type earphones, as well as insert earphones. It also
earphone for a human ear (Corliss and Burkhard, l953). provides values for both the IEC and NBS couplers for
However, since volume displacement is only one com- supra-aural earphones and values for insert phones using an
ponent of acoustic impedance, it cannot be assumed that occluded ear simulator, HA-1 or HA-2 coupler. Figure 2.2
the coupler actually represents a human ear. Burkhard and shows an audiometer earphone calibration worksheet,
Corliss (1954) pointed out that the impedance characteris- which contains the expected values at each frequency with
tics of a 6-cc coupler probably simulates the impedance of TDH-39 or TDH-49 (or TDH-50) earphones in Telephonics
the human ear over only a small part of the frequency range. type 51 cushions on an NBS 9-A coupler and insert receivers
Because the 6-cc coupler does not replicate the impedance using an HA-1–type coupler. ANSI S3.6-2010 allows a toler-
of the human ear, it cannot be considered a true artificial ance from the listed values of ±3 dB from 125 to 5,000 Hz
ear. Subsequent work by Cox (1986), Hawkins et al. (1990), and ±5 dB at 6,000 Hz and higher.
Killion (1978), and Zwislocki (1970, 1971) has quantified The supra-aural output measurements referred to above
the differences between real ear and coupler values. In an are only valid when a supra-aural–type earphone cushion
attempt to solve this problem, the IEC 318 coupler was (which touches the pinna) such as the Telephonics 51 is
developed. However, there is still some disagreement as to used and not when a circumaural cushion (which encircles
the accuracy of this ear simulator (formerly called an arti- the pinna) is used. ANSI S3.6-2010 provides RETSPL val-
ficial ear) because its impedance characteristics are also not ues for several circumaural earphones (Sennheiser HDA200
exactly those of a real human ear. However, it is clearly more and Koss HV/1 A) with an IEC 60318-2 coupler and a type
accurate than the present NBS 9-A coupler. 1 adapter (Sennheiser earphone) or type 2 adapter (Koss
In addition to the problem of acoustic impedance char- earphone). When the output of the audiometer through the
acteristics, the NBS 9-A coupler is known to have a natu- earphone has been established, it is compared to the appro-
ral resonance at 6,000 Hz (Rudmose, 1964). This interferes priate standard to determine whether it is in calibration or
with the measurement of the output of an audiometer ear- not. If possible, the audiometer trim pots (or by software
phone around that frequency. Other coupler problems are adjustments in newer digital audiometers) should be used to
its size, its shape, and the hard walls that permit the possibil- bring the audiometer into calibration. However, when this is
ity of standing waves at frequencies above 6,000 Hz. Despite not possible or when different earphones will be used with
these difficulties, the NBS 9-A coupler remains the accepted the same audiometer, and when corrections are less than
device (by ANSI S3.6-2010) for measuring the acoustic out- 15 dB, a calibration correction card may be placed on the
put from the audiometer through a supra-aural earphone. A audiometer showing the discrepancy from the established
coupler developed by Zwislocki (1970, 1971, 1980) appears norm. It should be noted that if the output of the audiome-
to very closely approximate the acoustic impedance of the ter is, for example, 10 dB too low, then the dB HL correction
human ear. It is used in KEMAR (a manikin that has a pinna sheet must be decreased by 10 dB. Such corrections must then
and an ear canal, as well as a coupler and microphone) be taken into consideration when an audiogram is plotted.
(Burkhard, 1978; Burkhard and Sachs, 1975). This manikin If an audiometer is off by more than 15 dB at any frequency
18 SECTION I • Basic Tests and Procedures
FREQUENCY: 125 250 500 750 1000 1500 2000 3000 4000 6000 8000
1. SPL*
2. Audiometer dial
setting
3. Nominal ref. SPL
(Line 1 – Line 2)
4. Equipment and mike
correction
5. Corrected ref. SPL
(Line 3 – Line 4)
6a. TDH – 49/50 47.5 26.5 13.5 8.5 7.5 7.5 11.0 9.5 10.5 13.5 13.0
earphones**
TDH - 39 45.0 25.5 11.5 8.0 7.0 6.5 9.0 10.0 9.5 15.5 13.0
6b. ER 3-A 26.5 14.5 6.0 2.0 0.0 0.0 2.5 2.5 0.0 -2.5 -3.5
earphones***
7. Calibration error
(Line 5 – Line 6)
8. Corrections @
* SPL = sound pressure level in dB re: 20 µPA
** TDH-49/50 values from ANSI S3.6-1996, p. 18 (see standard for coupler and cushions)
*** ER3-A values from ANSI S3.6-1996, p. 20 using HA-1–type coupler (see standard for different coupler values)
@ Correction – Rounded to the neatest 5 dB; – = audiometer weak, make threshold better
+ = audiometer weak, make threshold better
FIGURE 2.2 Earphone calibration worksheet. (Reprinted from Wilber L. (1972) Calibration: pure tone,
speech and noise signals. In: Katz J, ed. Handbook of Clinical Audiology. 1st ed. Baltimore, MD: The
Williams & Wilkins Company; pp 11–35, with the permission of Lippincott Williams & Wilkins.)
average, for a large group of people, it cannot be expected to compact disc (CD) input of the speech circuit of the audi-
be true for any individual or for small groups (Studebaker, ometer. The input level should be adjusted so that the moni-
1967; Wilber and Goodhill, 1967). toring VU meter on the face of the audiometer reflects the
appropriate level, usually 0 dB. The output from the trans-
ducer is then measured. For most speech stimuli used for
Artificial Mastoid Procedure audiologic purposes, there is a 1,000 Hz tone on the tape or
The preferred procedure for calibrating bone vibrators CD (or in other digital forms) that has an RMS voltage that
involves the use of a mechanical coupler, often referred to is similar to the RMS voltage of the speech stimuli. Details
as an artificial mastoid. Artificial mastoids were proposed concerning the calibration of the speech circuit of an audi-
as early as 1939 by Hawley (1939). However, it was not ometer are given in section 6.2 of ANSI S3.6-2010.
until Weiss (1960) developed his artificial mastoid that they ANSI S3.6-2010 states that the speech output for the
became commercially available. Just as replication of the 1,000-Hz tone at 0 dB HL should be 12.5 dB above the
acoustic impedance of the human ear is difficult with a cou- RETSPL for the earphone at 1,000 Hz. Bone vibrators
pler, replication of the mechanical impedance of the head should be calibrated separately. All subsequent speech test-
is difficult with an artificial mastoid. Because no commer- ing must be carried out with the monitoring meter peaking
cially available artificial mastoid met the mechanical imped- at the same point as during the calibration check. If, for
ance requirements of the ANSI (S3.13-1972) or IEC (IEC example, one prefers −3 dB on the meter rather than 0 dB,
60373:1971) standards, both the ANSI and IEC standards then calibration of the 1,000-Hz tone must be peaked at
were revised to conform more closely to an artificial mas- −3 dB, or an appropriate correction must be made in
toid that is available (ANSI S3.13-1987; IEC 60318-6:2007). reporting measurements.
ANSI S3.6-2010 gives threshold values in reference equiva- The required flatness of the frequency response of the
lent threshold force levels (RETFLs) that are appropriate for speech audiometer circuit is defined as ±3 dB for the fre-
a bone vibrator such as the B-71 or B-72, or one meeting quencies of 250 to 4,000 Hz and from 0 to −10 dB between
the physical requirements described in section 9.4.3 of ANSI 125 and 250 Hz and ±5 dB between 4,000 and 6,000 Hz.
S3.6-2010. The ISO standard (ISO 389-3:1994) gives one set ANSI S3.6-2010 gives specific requirements for checking
of values that are to be used for all bone vibrators having the microphone circuit as well as the other speech input cir-
the circular tip described in the ANSI and IEC documents. cuits. If the puretone and speech audiometers are separate
These values are also used in the ANSI standard. It is impor- machines, then the speech audiometer must also be checked
tant to recognize that both the ANSI and the ISO values for cross-talk, internal noise, and attenuator linearity as
are based on unoccluded ears using contralateral masking. described earlier. More specific information on calibra-
Thus, the values presuppose that masking will be used in the tion of the speech circuit may be found in section 6.2.10 of
contralateral ear when obtaining threshold. One can use the ANSI S3.6-2010.
same type of worksheet for bone as for air—substituting
the appropriate RETFL values. In both earphone and bone
vibrator calibration, it is important to check distortion as
MONITORING METER
well as overall level through the transducer. Distortion may Monitoring (or VU) meters are indicators of signal level and
be measured directly with software integrated into the SLM are found on the face of most audiometers. The monitoring
or by routing the output of the artificial mastoid and SLM meter is calibrated relative to the input signal that it moni-
to a spectrum analyzer. As mentioned earlier, allowable dis- tors and should not be interpreted as yielding any absolute
tortion values for bone vibrators are more lenient than for values such as 0 dB SPL. On a speech audiometer, the meter
earphones. This is because bone vibrators have more distor- is used to monitor the speech signal or to aid the audiolo-
tion than earphones. In addition to the earlier mentioned gist in adjusting the input calibration tone that precedes the
physical measurement procedures, the importance of just recorded speech materials. The specifications for the meters
listening to the audiometer output through the bone vibra- may be found in section 6.2.10 of ANSI S3.6-2010. In gen-
tor cannot be overstated. The normal ear (with audiologist eral, it is important that the meter be stable, that there is
attached) should be able to perceive gross attenuation and minimal undershoot or overshoot of the needle indicator
distortion problems. The electroacoustic procedures, how- relative to the actual signal, and that any amplitude change
ever, serve to quantify the problems that the human ear can is accurately represented on the meter. The audiologist may
only identify subjectively. check the meter and its entire accompanying input system
as described below.
A puretone should be fed from an oscillator through an
SPEECH AUDIOMETERS electronic switch to the input of the audiometer. The tone
Because running speech fluctuates in SPL (as well as fre- should be monitored by a voltmeter or an oscilloscope. By
quency content) over time, the preferred method is to intro- activating the electronic switch to produce a rapidly inter-
duce a puretone (1,000 Hz) into the microphone, tape, or rupted signal, one can watch the meter to ascertain whether
20 SECTION I • Basic Tests and Procedures
stimuli should be checked electroacoustically at least once available, one can record across the terminals of the tim-
every 12 months. However, if the CD or tape player is in reg- ing mechanism inside the audiometer. It is difficult to check
ular use, weekly maintenance should be carried out (such as the pulse speed on a graphic level recorder because of pen
cleaning and demagnetizing the heads for tape players). The damping, but it is possible to check it on a digital or storage
instruction manuals normally outline the procedures to be oscilloscope. It is not difficult to estimate whether there is
used with the particular tape or CD player. If not, any good roughly a 50% duty cycle (on half the time and off half the
audio equipment dealer can explain the procedure. In addi- time), but it is quite difficult to judge whether the signal is on
tion, the frequency response and time characteristics of the for 200 ms versus 210 ms. The characteristics of the pulsed
tape player should be checked. tone are described in section 7.5.4 of ANSI S3.6-2010.
At present, there are no standards for tape players used If both pulsed and continuous signals are used, it is
with audiometers. However, the frequency response and important to check the relative level of the pulsed and
time characteristics of the tape player may be checked by continuous signals. If they are not equal, this should be
using a standard commercial tape recording of puretones of corrected. The relative levels can be compared by observ-
various frequencies. If you do not have access to such a tape, ing the envelope of the waveform on an oscilloscope or by
it is possible to make one by introducing puretones from an recording the output with a graphic level recorder if there is
audio oscillator into the machine, recording them, and play- no damping problem. The attenuation rate and pulse rate
ing them back. This enables the operator to check both the should be checked annually unless there is a reason to sus-
record and playback sections of the tape recorder. Unfortu- pect a problem earlier.
nately, if both the record and playback are equally reduced Computerized audiometers are becoming commer-
(or increased) in frequency, the output will appear at the cially available. It should be noted that computerized audi-
nominal frequency. The output from the oscillator should ometers, just like manual audiometers, must meet all of the
be monitored with a voltmeter to make certain that a con- technical specifications included in ANSI S3.6-2010.
stant voltage signal is used. Distortion of the puretone from
the tape player should also be checked. If none of this is pos- AUDITORY-EVOKED POTENTIAL
sible, the speed of the tape player can be checked grossly by
marking a tape and then, after timing a segment as it goes
INSTRUMENTS
across the tape head, measuring to see how many inches There is an IEC standard for auditory test signals of short
passed over the heads per second. Also, if the machine is duration for audiometric and neuro-otologic purposes (IEC
badly out of calibration, it will be audible as a pitch change 60645-3:2007). There is also an IEC standard for auditory
in the recorded speech (higher if too fast, lower if too slow). brainstem response instruments (IEC 60645-7:2009 Electroa-
coustics—Audiometric Equipment—Part 7: Instruments
AUTOMATIC (AND COMPUTERIZED) for the measurement of auditory brainstem responses). ISO
AUDIOMETERS 389-6:2007 reports RETSPLs for clicks and standard (2–1–2
cycle) tonebursts. There is currently no ANSI standard that
A calibration check of automatic (or Bekesy) audiometers provides RETSPL for clicks and tonebursts.
begins with frequency, level, cross-talk, and other aspects The basic parameters of the acoustic signals used for
described for manual puretone audiometers. In addition, auditory-evoked potentials (AEPs) are the same as for con-
the attenuation rate and interruption rate for pulsed signals ventional audiometry. One must check output level, fre-
should be checked. ANSI S3.6-2010 requires that a rate of quency, and time. When calibrating acoustic transients from
change of 2.5 dB/s be provided for Type 1, 2, and 3 audiom- an AEP instrument, the instrumentation used to calibrate
eters. Permissible rates for all types of audiometers are given an audiometer may be inappropriate. It is especially impor-
in the ANSI S3.6-2010 standard. As in manual audiometers, tant to check the output from the AEP unit acoustically as
the permissible variance in level per step is 1 dB or 0.3 of the well as electrically. It is easy to display the electrical output
indicated step size, whichever is smaller. The attenuation rate from the AEP unit on an oscilloscope, but to analyze that
may be measured quite easily with a stopwatch. After starting display, one needs to repeat it very rapidly or, preferably,
the motor, a pen marking on the chart is started at the same use a digital or storage oscilloscope. Determination of the
instant as a stopwatch is started. One reads the chart to deter- acoustic level of these acoustic transients requires an SLM
mine how far the signal was attenuated (or increased) during that can record true peak SPL (pSPL) or that allows rout-
the measured time interval. By dividing the duration (in sec- ing the output to an oscilloscope to determine pSPL or peak
onds) into the decibel change in level, one can find the deci- equivalent SPL (peSPL).
bel per second attenuation rate. The audiometer should be
checked for signals both increasing and decreasing in level.
To check the pulsed stimulus duration, one may go
Calibration of Acoustic Transients
from the “scope sync” output on the back of the audiom- Acoustic transients must be calibrated utilizing specialized
eter (if such exists) to an electronic counter, or if that is not instrumentation and procedures. For clinical procedures,
22 SECTION I • Basic Tests and Procedures
voltage on the oscilloscope is measured. The microphone is microphone conditioning/amplifier or SLM is routed to a
calibrated by using a sound source with a known SPL (i.e., spectrum analyzer or to an analog-to-digital converter to a
a pistonphone or acoustic calibrator), or the microphone computer that is programmed to do a Fourier analysis of
sensitivity curve can be used to convert microphone out- the signal. In each case, a display of the spectrum of each
put voltage to input sound pressure (and ultimately to SPL). signal type can be obtained. NOTE: Most clinics will likely
The acoustic calibrator is coupled to the microphone, and not have the instrumentation or perhaps the expertise to do
the voltage out of the microphone conditioning amplifier the calibration of transient stimuli. These calibration guide-
is measured. In this way, a given voltage is produced when a lines will thus be most useful to the manufacturers and to
specified SPL is present at the microphone diaphragm. For the technician who periodically calibrates the AEP instru-
example, an acoustic calibrator produces 114 dB SPL, and 1 mentation.
V is measured at the conditioning amplifier output. Then, Many AEPs are elicited by presentation of brief acous-
a 2-cc coupler is placed on the microphone. An Etymotic tic transients. The two most commonly used transients
ER-3 A insert microphone is then mated to the coupler. A are clicks and tonebursts. A click is produced by exciting
sine wave is presented through the earphone, and 50 mv is a transducer with a brief-duration electrical pulse. For use
measured. The SPL is with human subjects, a common click duration is 100 µs.
In a pulse with duration d, there are spectral zeroes (energy
20 log (50 mv/1,000 mv) + 114 dB SPL = 88 dB SPL
at that frequency dips toward zero) at frequencies that are
The first part of the formula estimates the dB re: 114 dB integer multiples of 1/d. Thus, for an electrical pulse with a
SPL, which is −26 dB; 114 is added to convert from “dB re: duration of 100 µs, the first spectral zero occurs at 10,000 Hz,
114 dB SPL” to “dB SPL.” with spectral zeroes occurring at 20,000, 30,000, 40,000 Hz,
Using the sensitivity curve of the microphone, the peak and so on. The acoustic representation of this 100-µs pulse,
voltage on the oscilloscope is converted to peak pressure as recorded through a TDH-50 earphone in a 6-cc coupler, is
(e.g., in Pascals). The sensitivity of a microphone relates the shown in Figure 2.4. It is important for the reader to under-
voltage out of the microphone to the sound pressure at the stand that the click has energy over a wide range of frequen-
microphone diaphragm, in for example, millivolts per pas- cies, and thus is a broadband stimulus. It is also important
cal. Then the pressure is converted to SPL. For example, a to understand that the spectrum of a transient (indeed, of
peak voltage of 100 mV is measured. The microphone sen- any signal) can and is influenced by not only the earphone,
sitivity is 50 mV/Pa. If 50 mV represents 1 Pa, then 100 mV but also the coupler (and perhaps the microphone) used in
represents 2 Pa. Converting to pSPL, the calibration process.
Clicks are often the stimuli of choice when using the
pSPL = 20 log (2 Pa/0.00002 Pa) = 20 log 10,000
auditory brainstem response (ABR) for hearing screening,
= ∼100 dB pSPL
site-of-lesion testing, and intraoperative monitoring. In
This value should correspond to the pSPL produced by contrast, when interested in obtaining an electrophysiologi-
an SLM in peak-hold mode, although if the time constant of cal estimate of the behavioral audiogram, then a broadband
the SLM is too long (say 100 µs), then the SLM may produce stimulus is not optimal. Several approaches have been used
a lower value. For a given stimulus, the baseline-to-peak to elicit responses from a limited cochlear region. The vari-
peSPL value should be 3.01 dB less than the true pSPL value. ous approaches can be broken down into two strategies:
This is because the peSPL value is actually referenced to a (1) using stimuli with narrow spectra and/or (2) using
RMS measure of a sine wave, rather than a peak measure. masking procedures to eliminate the contribution of spe-
To obtain the true pSPL value, 3.01 dB must be added to cific regions of the cochlea. The latter approach (the use of
the peSPL (using the baseline-to-peak procedure), because masking procedures) goes beyond the scope of the present
the crest factor (ratio of peak to RMS) of a true sine wave is chapter, but is considered later in this book (Chapter 11:
1.414, or 3.01 dB (3.01 dB = 20 log 1.414). Introduction to auditory-evoked potentials).
In addition to the overall level, it is important to One approach to generating a limited-spectrum stimu-
determine the frequency characteristics of the signal (i.e., lus is to present a sine wave for a brief duration. ANSI S3.6-
its spectrum) as it is played through the transducer. The 2010, which reviews technical specifications for audiom-
acoustic spectrum of a signal is not necessarily identical to eters, states that for audiometric purposes a tone must be
the spectrum of the electrical signal. This is because each presented for a duration of not less than 200 ms, and have
system has its own transfer function (i.e., filtering charac- a rise time and fall time ranging between 25 and 100 ms.
teristics), and the acoustic spectrum of the stimulus will Figure 2.5 (upper panel) shows a toneburst with a carrier
be affected by both the earphone and the coupler used to frequency of 2,000 Hz. The time required for the toneburst
mate the earphone to the SLM microphone. The spectrum envelope to increase from zero to its maximal amplitude
of the signal can be measured by routing the acoustic signal is termed its rise time. The time that the toneburst enve-
through a coupler, condenser microphone, and microphone lope remains at this maximal amplitude is called its pla-
conditioning/preamplifier or SLM. Finally, the output of the teau time. The time required for the toneburst envelope to
24 SECTION I • Basic Tests and Procedures
2
Vpk/div
–10
Vpk 0 s 3.891 ms
Time 1 real part No avg
FIGURE 2.4 Time-domain
waveform (upper panel) and B Done 256 Hz –7.787 dBVpk
spectrum (lower panel) of an 0
dBVpk
electrical pulse with a dura-
tion of 100 µs. (Reprinted from
Burkard R, Secor C. (2002)
Overview of auditory evoked 6
potentials. In: Katz J, ed. Hand- dB/div
book of Clinical Audiology. 5th
ed. Baltimore, MD: Lippincott
Williams & Wilkins; pp 233–248, –60
with the permission of Lippincott dBVpk 256 Hz 25.6 kHz
FFT 1 Log mag uniform Vec avg 64
Williams & Wilkins.)
decrease from its maximal amplitude to zero amplitude is frequency (i.e., the frequency of the sine wave). For the
called its fall time. Figure 2.5 (lower panel) shows the ampli- toneburst, there is significant energy over a range of fre-
tude spectrum (i.e., frequency-domain representation– quencies centered at 2,000 Hz. This spread of energy to fre-
amplitude across frequency) of this toneburst. A tone that quencies above and below the carrier frequency is referred
is infinitely long in duration only has energy at the carrier to as spectral splatter. Acoustic transients are used to elicit
50
mVpk/div
–250
FIGURE 2.5 Upper panel: The mVpk 0 s 15.59 ms
Time 1 real part No avg
time-domain representation of
a toneburst. Lower panel: The B Done 64 Hz –99.354 dBVpk
frequency-domain representa- –20
dBVpk
tion of the same toneburst.
(Reprinted from Burkard R,
Secor C. (2002) Overview of
auditory evoked potentials. In: 10
Katz J, ed. Handbook of Clinical dB/div
Audiology. 5th ed. Baltimore,
MD: Lippincott Williams &
Wilkins; pp 233–248, with the –120
permission of Lippincott dBVpk 64 Hz 12.8 kHz
FFT 1 Log mag uniform Vec avg 64
Williams & Wilkins.)
CHAPTER 2 • Calibration 25
AEPs, because many of these AEPs, such as the ABR, are fundamental frequency level when measured in an HA-1–
onset responses. Onset responses are elicited by the first few type coupler. The probe signal shall not exceed 90 dB SPL
milliseconds of the stimulus onset, and hence are primar- as measured in that coupler, in an attempt to minimize the
ily affected by the stimulus rise time. When using toneburst possibility that the probe signal will elicit an acoustic reflex.
stimuli, it is intended to serve as a compromise between an The range of acoustic admittance and acoustic impedance
audiometric tone (long duration, long rise/fall times, and values that should be measurable varies by instrument type.
a very narrow spectrum) and a click (short duration, very The accuracy of the acoustic immittance measurements
fast rise/fall times, and very broad spectrum). To add even should be within 5% of the indicated value or ±10−9 cm3/Pa
more complexity to this topic, there are many unique gat- (0.1 acoustic mmho), whichever is greater. The accuracy of
ing functions that can be used to shape the onset and offset the acoustic immittance measurement can be determined
of the toneburst, including linear, Blackman, and Hanning by connecting the probe to the test cavities and checking
(cosine2) functions. As mentioned previously, methods for the accuracy of the output at specified temperatures and
the calibration of acoustic transients can be found in IEC ambient barometric pressures. A procedure for checking the
60645-3:2007, and RETSPLs for clicks and select toneburst temporal characteristics of the acoustic immittance instru-
stimuli can be found in ISO 389-6:2007. ment is described by Popelka and Dubno (1978) and by
Lilly (1984).
OTOACOUSTIC EMISSION Air pressure may be measured by connecting the probe
to a “U tube” manometer and then determining the water
DEVICES displacement as the immittance device air pressure dial is
There are currently no ANSI standards for otoacoustic rotated. If the SI unit of decapascals (daPa) is used, then
emission (OAE) devices. IEC 60645-6 is an international an appropriate manometer or pressure gauge must also be
standard that can be used to specify the stimuli used to used. The air pressure should not differ from that stated on
obtain OAEs. If using a click to obtain transient-evoked oto- the device (i.e., 200 daPa) by more than ±10 daPa or ±15%
acoustic emissions (TEOAEs), reporting the level in pSPL of the reading, whichever is greater. The standard states that
or peSPL is appropriate. For distortion product otoacoustic the air pressure should be measured in cavities with vol-
emissions (DPOAEs), verification that the primary signals umes of 0.5 to 2 cm3.
(as measured in an appropriate coupler/microphone/SLM Finally, the reflex-activating system should be
and frequency counter or spectrum analyzer) are close to checked. In checking the activation of a contralateral or
the levels and frequencies as specified by the OAE device is ipsilateral reflex, normally an insert receiver will be used
desirable. Because the DPOAE response is an intermodula- that may be measured on a standard HA-1 coupler. The
tion distortion product (typically the cubic difference tone), frequency of the activator can be measured electrically
it is critical to measure the amplitude of the distortion at this directly from the acoustic immittance device. In this
frequency in a hard-walled cavity to know when measured case, one uses a frequency counter as described earlier
distortion in fact represents distortion in the instrumenta- for checking the frequency of puretones in audiometers.
tion itself, rather than representing nonlinearities generated Frequency should be ±3% of the stated value, and har-
by the inner ear. monic distortion should be less than 3% at specified fre-
quencies for earphones and 5% or less for the probe tube
transducer or insert receiver. Noise bands should also be
ACOUSTIC IMMITTANCE DEVICES checked if they are to be used as activating stimuli. Broad-
The standard for acoustic immittance (impedance/admit- band noises should be uniform within ±5 dB for the range
tance) devices is ANSI S3.39-1987. Note that there is also between 250 and 6,000 Hz for supra-aural earphones.
an IEC standard, IEC 60645-5:2004, for measurement of This can be checked by sending the output through the
aural acoustic impedance/admittance. ANSI S3.39-1987 transducer connected to a coupler, a microphone, and a
describes four types of units for measuring acoustic immit- graphic level recorder or spectrum analyzer. The SPL of
tance (listed simply as Types 1, 2, 3, and 4). The specific tonal activators should be within ±3 dB of the stated value
minimum mandatory requirements are given for Types for frequencies from 250 to 4,000 Hz and within ±5 dB for
1, 2, and 3. There are no minimum requirements for the frequencies of 6,000 to 8,000 Hz and for noise. The rise
Type 4 device. Types 1, 2, and 3 must have at least a 226-Hz and fall times should be the same as those described for
probe signal, a pneumatic system (manual or automatic), audiometers and may be measured in the same way. Daily
and a way of measuring static acoustic immittance, tympa- listening checks as well as periodic tests of one or two per-
nometry, and the acoustic reflex. Thus, to check the acous- sons with known acoustic immittance to check tympano-
tic immittance device, one may begin by using a frequency grams and acoustic reflex thresholds should be performed
counter to determine the frequency of the probe signal(s). to catch any gross problems.
The frequency should be within 3% of the nominal value. In summary, acoustic immittance devices should be
The total harmonic distortion shall not exceed 5% of the checked as carefully as one’s puretone audiometer. Failure
26 SECTION I • Basic Tests and Procedures
Lilly DJ. (1984) Evaluation of the response time of acoustic-immit- Weiss E. (1960) An air-damped artificial mastoid. J Acoust Soc Am.
tance instruments. In: Silman S, ed. The Acoustic Reflex. New 32, 1582–1588.
York, NY: Academic Press. Weissler P. (1968) International standard reference zero for audi-
Melnick W. (1973) What is the American National Standards Insti- ometers. J Acoust Soc Am. 44, 264–275.
tute? ASHA. 10, 418–421. Wilber LA. (2004) What Are Standards and Why Do I care? Semi-
Occupational Safety and Health Administration. (1983) Occupa- nars in Hearing—Current Topics in Audiology: A Tribute to Tom
tional Noise Exposure, Hearing Conservation Amendment. Rule Tillman. Stuttgart, Germany: Thieme; pp 81–92.
and Proposed Regulation. Washington, DC: Federal Register, Wilber LA, Goodhill V. (1967) Real ear versus “artificial mastoid”
United States Government Printing Office. methods of calibration of bone-conduction vibrators. J Speech
Popelka GR, Dubno JR. (1978) Comments on the acoustic- Hear Res. 10, 405–416.
reflex response for bone-conducted signals. Acta Otolaryngol Yeager D, Marsh A. (1998) Sound levels and their measurement. In:
(Stockh). 86, 64–70. Harris C, ed. Handbook of Acoustical Measurements and Noise
Roach R, Carhart R. (1956) A clinical method for calibrating the Control. 3rd ed. Woodbury, NY: Acoustical Society of America;
bone-conduction audiometer. Arch Otolaryngol. 63, 270–278. pp 11.1–11.18.
Rosen S, Howell P. (1991) Signals and Systems for Speech and Hear- Zwislocki JJ. (1970) An Acoustic Coupler for Earphone Calibration.
ing. London: Academic Press. Rep. LSC-S-7, Lab Sensory Commun. Syracuse, NY: Syracuse
Rudmose W. (1964) Concerning the problem of calibrating TDH-39 University.
earphones at 6 kHz with a 9 A coupler. J Acoust Soc Am. 36, 1049. Zwislocki JJ. (1971) An Ear-Like Coupler for Earphone Calibration.
Silverman FH. (1999) Fundamentals of Electronics for Speech- Rep. LSC-S-9, Lab Sensory Commun. Syracuse, NY: Syracuse
Language Pathologists and Audiologists. New York, NY: Allyn University.
and Bacon. Zwislocki JJ. (1980) An ear simulator for acoustic measurements.
Speaks C. (1996) Introduction to Sound. Acoustics for the Hearing and Rationale, principles, and limitations. In: Studebaker G, Hoch-
Speech Sciences. 2nd ed. San Diego, CA: Singular Publishing. berg I, eds. Acoustical Factors Affecting Hearing Aid Perfor-
Studebaker G. (1967) Interest variability and the air-bone gap. mance. Baltimore, MD: University Park Press.
J Speech Hear Disord. 32, 82–86.
C H A P T ER 3
Puretone Evaluation
29
30 SECTION I • Basic Tests and Procedures
120 0
100
Sound pressure level (dB re: 20 µP)
20
60
60
40
80
20
0 100
0.125 0.25 0.5 1.0 2.0 4.0 8.0 0.125 0.25 0.5 1.0 2.0 4.0 8.0
Frequency (kHz) Frequency (kHz)
FIGURE 3.1 Thresholds in dB sound pressure level (SPL; left panel) and dB hearing level (HL; right
panel) as a function of frequency. The solid line represents average normal hearing; the dashed line
represents a person’s threshold who has a high-frequency hearing loss.
hearing loss is conductive or sensory/neural. Tuning fork falsely identified as having conductive losses with the Rinne
tests are promoted by some as an important supplement to test, this test misses many people with significant conductive
puretone audiometry. In a recently published book, otolo- losses (Browning, 1987), including 50% of losses that have
gists are advised to include tuning fork tests as an integral 20-dB air–bone gaps. The Weber test fares equally poorly.
part of the physical examination for conductive hearing loss Browning (1987) reports that a majority of children with
(Torres and Backous, 2010). conductive losses give inappropriate responses on the Weber
The two best known tuning fork tests are the Weber test. From these and other studies, one must conclude that
and Rinne. Judgments about the type of hearing loss are tuning fork tests are not a replacement or even a supplement
made by comparing the pattern of results on both tests. to audiometry. Audiometry is capable of identifying nearly
Air conduction (AC) is tested by holding the tuning fork at 100% of air–bone gaps, as small as 15 dB.
the opening of the ear canal, and bone conduction (BC) is
tested by placing the tuning fork on the mastoid process (the
bony area behind the pinna) or on the forehead or incisors
PURETONE AUDIOMETRY
(British Society of Audiology, 1987). For the Weber test, Audiometers are used to make quantitative measures of AC
a client judges whether sound is perceived in one or both and BC PT thresholds. AC thresholds assess the entire audi-
ears when the tuning fork is placed on the forehead. For tory pathway and are usually measured using earphones.
the Rinne test, the client judges whether sound is louder When sound is delivered by an earphone, the hearing sen-
when presented by AC or by BC. Ideally, conductive hear- sitivity can be assessed in each ear separately. BC thresholds
ing losses produce a pattern of responses that is uniquely are measured by placing a vibrator on the skull, with each
different from the one for sensory/neural hearing losses. In ear assessed separately, usually by applying masking noise
the Weber, the sound is lateralized to the poorer ear with a to the nontest ear. The goal of BC testing is to stimulate the
conductive loss and to the better ear for a sensory/neural cochlea directly, thus bypassing the outer and middle ears.
loss. In the Rinne, the sound is louder by BC in a conductive A comparison of AC and BC thresholds provides separate
loss and by AC with a sensory/neural loss. estimates of the status of the conductive and sensory/neural
Some recommend tuning fork tests to check the valid- systems. If thresholds are elevated equally for sounds pre-
ity of audiograms (Gabbard and Uhler, 2005) or to confirm sented by AC and BC, then the outer and middle ear are not
the audiogram before conducting ear surgery (Sheehy et al., contributing to a hearing loss. By contrast, if thresholds are
1971). However, it is important to recognize that tuning poorer by AC than by BC, then the source of at least some of
fork tests administered to people with known conductive the loss is the outer or middle ear. Figure 3.2 illustrates the
losses have shown that these procedures are often inaccurate AC and BC pathways and how hearing thresholds are typi-
(Browning, 1987; Snyder, 1989). Although only about 5% cally affected by damage to these structures. See Chapter 4
of people with normal hearing or sensory/neural losses are for a complete review of BC assessment.
BC normal
Sensory/neural
hearing loss
AC normal
BC impaired
Mixed
hearing loss
AC impaired
Equipment
AUDIOMETERS
Puretones are generated within an audiometer. Audiometers
have the ability to select tonal frequency and intensity level
and to route tones to the left or right earphone. All audi-
ometers also have an interrupter switch that presents the
stimulus to the examinee. The American National Standards
Institute (ANSI) Specification for Audiometers (ANSI, 2010)
describes four types of audiometers, with Type 1 having the
most features and Type 4 having the fewest features. A Type 1
audiometer is a full-featured diagnostic audiometer. A Type
1 audiometer has earphones, bone vibrator, loud speakers,
masking noise, and other features. A Type 4 audiometer is FIGURE 3.4 Etymotic model ER-3A insert earphones.
simply a screening device with earphones, but none of the
other special features.
due to their ease of calibration and the lack of other types
Type 1 (full-featured, diagnostic audiometer) has the
of commercially available earphones. In the past few years,
ability to assess puretone AC thresholds for frequencies
insert earphones and circumaural earphones have become
ranging from 125 to 8,000 Hz and BC thresholds for fre-
available and provide some useful applications for puretone
quencies ranging from 250 to 6,000 Hz. If an audiometer
assessment.
has extended high-frequency capability, air-conduction
Insert earphones are coupled to the ear by placing a
thresholds can be extended to 16,000 Hz. Maximum output
probe tip, typically a foam plug, into the ear canal. The com-
levels for AC testing are as high as 120 dB HL for frequen-
mercially available model that has a standardized calibra-
cies where hearing thresholds are most sensitive. By contrast,
tion method for audiology is the Etymotic model ER-3A,
distortion produced by bone oscillators at high intensities
which is illustrated in Figure 3.4. These earphones have
limits maximum output levels for BC thresholds to values
gained popularity in the past few years because they offer
nearly 50 dB lower than those for AC thresholds for the same
distinct advantages over supra-aural earphones. One major
frequency.
advantage is that insert earphones yield higher levels of
interaural attenuation than supra-aural earphones (Killion
TRANSDUCERS and Villchur, 1989). Interaural attenuation represents the
decibel reduction of a sound as it crosses the head from the
Earphones
test ear to the nontest ear. The average increase in interau-
Earphones are generally used to test puretone AC thresholds. ral attenuation is roughly 20 dB. This reduces the need for
A pair of supra-aural earphones is illustrated in Figure 3.3. masking the nontest ear and decreases the number of mask-
For decades, supra-aural earphones, ones in which the cush- ing dilemmas, situations for which thresholds cannot be
ion rests on the pinna, were the only choice for clinical audi- assessed, because the presentation level of the masking noise
ology. The popularity of supra-aural phones was mainly is possibly too high. (See Chapter 6 for a comprehensive
review of masking.) Another important advantage of insert
earphones over supra-aural earphones is lower test–retest
variability for thresholds obtained at 6 and 8 kHz; variability
for other frequencies is comparable. Given that thresholds
for 6 and 8 kHz are important for documenting changes in
hearing due to noise exposure and for identifying acoustic
tumors, lower variability should increase the diagnostic pre-
cision. A third advantage that insert earphones offer is elim-
ination of collapsed ear canals (Killion and Villchur, 1989).
In about 4% of clients, supra-aural earphones cause the ear
canal to narrow or be closed off entirely when the cushion
presses against the pinna, collapsing the ear canal (Lynne,
1969), resulting in false hearing thresholds, usually in the
high frequencies (Figure 3.5) (Ventry et al., 1961). Because
insert earphones keep the ear canal open, collapsed canals
FIGURE 3.3 Telephonics model TDH-49, an example of are eliminated. A fourth advantage of insert earphones is
supra-aural earphones. that they can be easily used with infants and toddlers who
CHAPTER 3 • Puretone Evaluation 33
Occlusion Symbol
0 80.5%
91.6%
94.4%
20 98.3%
dB hearing level (re: ANSI 2010)
99.2%
100%
40
60
cannot or will not tolerate supra-aural earphones. A fifth earphone type (Voss et al., 2000; Voss and Herman, 2005).
advantage of insert earphones is the option of conducting For these cases, measuring the SPL at the eardrum to
middle-ear testing and otoacoustic emission testing with- specify the level presented to an individual patient would
out changing the earphones; some recently introduced improve the accuracy of hearing thresholds. The probe-
diagnostic instruments use this approach. Although insert tube microphones necessary for these types of measures
earphones offer a hygienic advantage over supra-aural ear- already exist, and hopefully, this technology will become
phones, because the foam tips that are placed into a client’s routinely available for use in diagnostic audiometers (see
ear canal are disposable, the replacement cost of those tips Scheperle et al., 2011 for a discussion of calibration in
is prohibitive for many applications. In addition to higher the ear canal).
costs, insert earphones also yield errant thresholds in persons
with eardrum perforations, including pressure-equalization Speakers
tubes (Voss et al., 2000). (See Figure 3.12 for additional AC thresholds can be measured using speakers as the trans-
information about perforations.) Insert earphones also have ducer. Thresholds so obtained are known as sound-field
maximum output levels that are lower than those produced thresholds. Sound-field thresholds are unable to provide
by supra-aural earphones for some frequencies. Because of ear-specific sensitivity estimates. In cases of unilateral
these differences, many diagnostic clinics keep both ear- hearing losses, the listener’s better ear determines thresh-
phone types on hand and switch between them depending old. This limitation and others dealing with control over
on the application. stimulus level greatly limit clinical applications involving
Circumaural earphones, a third type, have cushions sound-field thresholds. Applications for sound-field thresh-
that encircle the pinna. ANSI (2010) describes reference olds are screening infant hearing or demonstrating to the
equivalent threshold SPL values (SPL values corresponding parents their child’s hearing ability. Sound-field thresholds
to 0 dB HL) for Sennheiser model HDA200 and Koss model may also be desirable for a person wearing a hearing aid or
HV/1A earphones. These earphones and the Etymotic ER-2 cochlear implant.
insert earphones are the only ones in the current standard In sound-field threshold measures, the orientation of
that have reference values covering the extended high fre- the listener to the speaker has a large effect on stimulus level
quencies (8 to 20 kHz). presented at the eardrum. A person’s head and torso as well
Current standards for earphone calibration specify as the external ear (e.g., pinna, ear canal, concha) affect
the level based on measures obtained with the earphone sound levels (Shaw, 1974). Differences in SPL at the eardrum
attached to an acoustic coupler or artificial ear. These coup- are substantial for speaker locations at different distances
lers are designed to approximate the ear canal volume of an and different angles relative to the listener. For this reason,
average person. Given that some clients (e.g., infants) have sound-field calibration takes into consideration these fac-
very small or very large ear canals (e.g., some postsurgi- tors. A mark is usually made on the ceiling (or floor) of the
cal clients and persons with perforated eardrums), coupler room to indicate the location of the listener during testing.
measures may produce erroneous results, regardless of the Even at the desired location, stimulus level at the eardrum
34 SECTION I • Basic Tests and Procedures
for some frequencies can vary as much as 20 dB or more air–bone gaps and bone–air gaps are equipment miscalibra-
by simply having the listener move his or her head (Shaw, tion, test–retest variability, and individual differences in
1974). Calibration assumes the listener will always be fac- anatomy that cause thresholds to deviate from the group-
ing the same direction relative to the sound source (ANSI, mean data used to derive normative values for relating AC
2010). Furniture and other persons in the sound field also and BC thresholds.
affect the stimulus level at a listener’s eardrum (Morgan For threshold measurements bone vibrators are typi-
et al., 1979). All of these factors add to the challenge of cally placed behind the pinna on the mastoid process or
obtaining accurate sound-field thresholds. on the forehead. Although forehead placement produces
Another important consideration in sound-field slightly lower intrasubject and intersubject threshold dif-
threshold measures is the stimulus type. Thresholds cor- ferences (Dirks, 1994), placement on the mastoid process
responding to different frequencies are desired for plotting is preferred by 92% of audiologists (Martin et al., 1998).
an audiogram, but puretones can exhibit large differences Mastoid placement is preferred mainly because it produces
in level at different positions in a testing suite as a result between 8 and 14 dB lower thresholds than forehead place-
of standing waves. Standing waves occur when direct sound ment for the same power applied to the vibrator, depend-
from the speaker interacts with reflections, resulting in ing on the frequency (ANSI, 2010). The median difference
regions of cancellation and summation. Differences in is 11.5 dB. Given that the maximum output limits for bone
stimulus level due to standing waves are minimized by using vibrators with mastoid placement are as much as 50 dB
narrowband noise or frequency-modulated (FM) tones as lower than that for AC thresholds, forehead placement
the stimulus (Morgan et al., 1979). FM tones, also known yields an even larger difference. The inability to measure
as warbled tones, are tones that vary in frequency over a BC thresholds for higher levels means that a comparison of
range that is within a few percent of the nominal frequency. AC and BC thresholds is ambiguous in some cases. That is,
This variation occurs several times per second. Under ear- when BC thresholds indicate no response at the limits of the
phones, thresholds obtained with these narrowband stimuli equipment (e.g., 70 dB HL) and AC thresholds are poorer
are nearly identical to thresholds obtained with puretones, than the levels where no response was obtained (e.g., 100 dB
with some exceptions in persons with steeply sloping HL), the audiologist cannot establish from these thresholds
hearing loss configurations. FM tones and narrowband whether the loss is purely sensory/neural or whether it has a
noise are the preferred stimuli for sound-field threshold conductive component.
measures.
correspond to the right and left ears, respectively. Prior to duration tone is recommended for manual presentation
placement of earphones, clients are asked to remove jewelry (ASHA, 2005). The duration is determined by the amount
such as earrings and glasses if they will interfere with the of time the interrupter switch is held down. Pulsed tones
placement of the earphone. This is particularly relevant for are achieved by selecting this option on the audiometer’s
supra-aural earphones. front panel. If pulsed tones are selected, then the audio-
For circumaural and supra-aural earphones, the dia- meter alternately presents the tone followed by a short silent
phragm of the earphone should be centered over the ear interval (typically 225 ms on followed by 225 ms off) for as
canal. The examiner should view each ear while the phone long as the interrupter switch is depressed. The minimum
is being placed. Immediately after placement, the headband duration for a single pulse of the tone is critical. Numerous
is tightened enough to make the earphone perpendicular to psychoacoustic studies have shown that tonal durations
the floor when the examinee is sitting upright. between roughly 200 ms and 1 second or more yield nearly
The first step in placement of insert earphones is to identical thresholds (Watson and Gengel, 1969). By con-
attach a spring-loaded clip that holds the transducer in trast, the same studies show that durations less than 200 ms
place to the examinee’s clothing. The clip can be attached to result in higher thresholds. For this reason, audiometers are
clothing near the shoulder (or behind a child’s neck) to keep designed to have a nominal pulse duration of 225 ms (ANSI,
the plug from being pulled out of the ear. In some newer 2010). Pulsed and manually presented tones presented from
implementations that combine middle-ear and otoacoustic audiometers that maintain tonal durations between 200 ms
emission measurements, the earphone is attached to a head- and 2 seconds yield nearly identical thresholds, as the psy-
band. For both types of support, the audiologist compresses choacoustic studies suggest. However, pulsed tones are pre-
the foam plug and inserts it into the ear canal so that its ferred for two reasons. Most patients prefer pulsed tones
outer edge lines up with the tragus. (Burk and Wiley, 2004), and pulsed tones also reduce the
number of presentations required to find threshold in per-
Placement of the Bone-conduction Vibrator sons with cochlear hearing loss who have tinnitus (Mineau
Although some recommend forehead placement (Dirks, and Schlauch, 1997). Apparently, pulsed tones help patients
1994), typically audiologists place the BC oscillator on the to distinguish the puretone signal from the continuous
most prominent part of the mastoid process. While hold- or slowly fluctuating noises generated from within their
ing the oscillator against the mastoid process with one hand, auditory system (tinnitus), thereby reducing false-positive
the headband is fit over the head to hold the oscillator in responses. False-positive responses can lengthen test time
place using the other hand. The oscillator surface should (Mineau and Schlauch, 1997), which is costly to an audiology
be set directly against the skin, not touching the pinna, and practice.
with no hair or as little hair as possible between the oscilla- Thresholds typically are obtained using a modified
tor and the skin. Some audiologists play a continuous low- Hughson–Westlake down-up procedure, which is a specific
frequency tone while moving the oscillator slightly side to implementation of a method-of-limits procedure (Carhart
side, asking the listener to report the location at which the and Jerger, 1959; Hughson and Westlake, 1944). The exam-
tone is the strongest. iner begins the threshold-finding procedure by presenting
a tone at 30 dB HL (ASHA, 2005). If the listener responds,
Audiometric Procedure for Threshold Measurement the level of the tone is decreased in 10-dB steps until the lis-
The ASHA Guideline (2005) recommends starting a thresh- tener no longer responds. If the listener does not respond to
old search from either well below threshold or using a supra- this initial 30-dB tone, the examiner raises the tone in 20-dB
threshold tone that familiarizes the participant with the steps until a response is obtained. After every response to a
stimulus. Most clinicians prefer the familiarization method. tone, the level of the tone is decreased in 10-dB steps until
For the familiarization approach, testing usually begins at there is no response. For subsequent presentations when
1,000 Hz at 30 dB HL unless prior knowledge of the exami- there is no response, the examiner raises the level of the tone
nee’s hearing suggests otherwise (ASHA, 2005). At 1,000 Hz, in 5-dB steps until a response is obtained. Following this
an examinee is more likely to have residual hearing than at a “down-10/up-5” rule, the tester continues until the thresh-
higher frequency, and test–retest reliability is excellent. Test- old is bracketed a few times, and a threshold estimate is
ing begins with an examinee’s self-reported better ear. If the obtained. ASHA (2005) recommends that threshold should
examinee believes both ears are identical, testing begins by correspond to the level at which responses were obtained
convention with the right ear. The better ear is tested first for two ascending runs, which is what most clinicians based
to provide a reference to know whether masking needs to their thresholds on even when the ASHA (1978)Guideline
be delivered to obtain a valid estimate of threshold for the recommended that thresholds be based on three ascending
poorer ear. runs. Research based on computer simulations of clinical
Tonal duration is an important factor in a puretone procedures (Marshall and Hanna, 1989) supports the cli-
test. On most audiometers, the option exists to select nician’s position and that of the ASHA (2005) guideline.
either pulsed or manual presentation. A 1- to 2-second The computer simulations of thresholds based on three
CHAPTER 3 • Puretone Evaluation 37
ascending runs showed only a minimal reduction of the vari- TESTING CHILDREN YOUNGER THAN AGE
ability when compared to thresholds based on two ascend- 5 YEARS AND PERSONS WITH SPECIAL NEEDS
ing runs. Listeners who produce inconsistent responses are
an exception, and for these listeners, additional measure- For most children younger than age 5 years, audiologists
ments can be made to confirm the threshold estimate. have special procedures that they employ to measure PT
After a threshold is measured at 1,000 Hz, the next thresholds. Some of these same procedures are also appro-
frequencies that are examined depend on the goal, but the priate for persons older than 5 years who have cognitive def-
higher frequencies are typically tested prior to the lower icits. Chapter 24 on pediatric hearing assessment describes
frequencies. For diagnostic audiometry, thresholds are mea- these procedures and their interpretation.
sured at octave intervals between 250 and 8,000 Hz, along
with 3,000 and 6,000 Hz. Intra-octave thresholds between
500 and 2,000 Hz should be measured when thresholds dif-
Audiometric Interpretation
fer by 20 dB or more between two adjacent octaves. ASHA PT thresholds are displayed in tabular or graphical for-
(2005) also recommends that 1,000 Hz be tested twice as a mats. The tabular format is useful for recording the results
reliability check. Refer to the ASHA (2005) guidelines for of serial monitoring of thresholds, as in a hearing conser-
specifics about the recommended protocol and Chapter 6 vation program, but in many applications, thresholds are
for details about the use of masking noise to eliminate the plotted on an audiogram. ASHA (1990), in a publication
participation of the nontest ear. Masking noise is needed entitled Guidelines for Audiometric Symbols, suggests a stan-
whenever the threshold difference between ears is equal to dardized form for the audiogram. Although other formats
or exceeds the lowest possible values for interaural attenu- for plotting audiograms are acceptable, it is helpful to use
ation. For BC testing, masking is needed to verify results a standardized format for ease of interpretation across clin-
anytime an air–bone gap in the test ear of greater than ics. The audiogram consistent with that recommended in
10 dB is observed. For AC testing, masking is needed when the ASHA guidelines (1990) is shown in Figure 3.7 along
the difference between the AC threshold in the test ear with recommended symbols. This audiogram only covers
and the BC threshold of the nontest ear is greater than or the conventional frequencies. Thresholds for extended high
equal to 40 dB for supra-aural earphones, and considerably frequencies are plotted often in units of dB SPL, because
more for insert earphones, especially in the low frequencies average extended high-frequency thresholds vary over a
(Killion and Villchur, 1989). Specific recommendations for wide range with the age of the listener, making dB SPL a
insert earphones cannot be made until a study with a larger better reference than dB HL for comparing thresholds to
sample size is completed. norms for listeners of different ages. Conversion between
Key to symbols
RE LE RE LE
0 (no resp)
AC
20 AC
Masked
BC
Mastoid
dB hearing level
40
BC
Mastoid
masked
BC
60 Forehead
masked
BC
Forehead
120
250 500 1,000 2,000 4,000 8,000
Frequency (Hz)
FIGURE 3.7 Recommended audiogram and symbols (ASHA, 1990) with a sensory/neural
hearing loss. RE and LE represent the right ear and left ear, respectively. The word “response”
is abbreviated “resp.”
38 SECTION I • Basic Tests and Procedures
TA B L E 3 . 2
0
Classification of Degree of Hearing Loss
Calculated from the Average of Thresholds 20
for 500, 1,000, and 2,000 Hza
Northern Jerger and 40
dB hearing level
Degree of and Downs Goodman Jerger
Loss (2002) (1965) (1980)
60
None <16 <26 <21
Slight 16–25
80
Mild 26–30 26–40 21–40
Moderate 30–50 41–55 41–60 Right Left
Moderately 56–70 100 Air
Bone
severe
Severe 51–70 71–90 61–80
120
Profound >70 >90 >80 250 500 1,000 2,000 4,000 8,000
a
Although all three references cited differ in the value accepted Frequency (Hz)
as a profound loss, a loss of 90 dB HL or more is widely accepted FIGURE 3.8 A bilateral conductive hearing loss. The
as representing a qualitative as well as a quantitative boundary
plotted values represent the average loss reported by
between hearing and deafness.
Fria et al. (1985) in a group of children with otitis media.
units of dB SPL and dB HL can be accomplished for three successful at achieving this goal, because (1) handicap is
different earphone models by consulting reference levels dependent on many factors related to an individual’s needs
published in ANSI (2010). and abilities, (2) only some of the speech frequencies are
Audiograms are often classified by categories based on assessed using this three-frequency average (speech frequen-
the degree of hearing loss. A number of authors have pub- cies range from 125 to 6,000 Hz), and (3) identical amounts
lished systems for classifying hearing loss based on the aver- of hearing loss sometimes result in large differences in the
age AC thresholds for three frequencies. The frequencies ability to understand speech and, as a consequence, the
used for this purpose are usually 500, 1,000, and 2,000 Hz, degree of disability associated with the loss. Despite these
often referred to as the three-frequency puretone average limitations, many audiologists use these categories routinely
(PTA). Table 3.2 shows the categories for the degree of loss to summarize the amount of loss in different frequency
based on this PTA for three different authors (Goodman, regions of an audiogram when describing results to other
1965; Jerger and Jerger, 1980; Northern and Downs, 2002). professionals or to a client during counseling.
The first category is normal hearing. Note that none of the Another factor in audiometric classification is the
three authors agree on the upper limit for normal, which type of hearing loss. The type of hearing loss is determined
ranges from 15 to 25 dB HL. Northern and Downs (2002) by comparing the amount of hearing loss for AC and BC
suggest using 15 dB HL as the upper limit for normal hear- thresholds at the same frequency. A sensory/neural hear-
ing for the three-frequency PTA for children between 2 ing loss has an equal amount of loss for AC and BC thresh-
and 18 years of age and a higher limit for adults. A 15 dB olds (as shown in Figure 3.7). By contrast, a conductive loss
HL upper limit for normal hearing may produce a signifi- has lower BC thresholds than AC thresholds (as shown in
cant number of false positives when applied to thresholds Figure 3.8). Conductive-loss magnitude is described by the
for individual audiometric frequencies, even in children decibel difference between AC and BC thresholds. This dif-
(Schlauch and Carney, 2012). Regardless of the value used ference is known as the air–bone gap, a value that has a max-
as an upper limit for normal hearing, keep in mind that an imum of about 65 dB† (Rosowski and Relkin, 2001). Due
ear-related medical problem can still exist even though all to test–retest differences, an air–bone gap needs to exceed
thresholds fall within the defined normal range. For exam- 10 dB before it is considered significant. A mixed hearing
ple, the presence of a significant air–bone gap might indi- loss shows a conductive component and a sensory/neural
cate the presence of middle-ear pathology even though all component. In other words, a mixed loss has an air–bone
AC thresholds fall within normal limits.
The original intent of classification system for severity
of loss based on a three-frequency PTA was to express, in †
Physiologic models suggest that the maximum air–bone gap occurs
a general way, the degree of handicap associated with the when there is an intact tympanic membrane and a disarticulated ossic-
magnitude of the loss. These categories are only somewhat ular chain (Rosowski and Relkin, 2001).
CHAPTER 3 • Puretone Evaluation 39
TA B LE 3 . 3
0 Right Left
Air Criteria for Classifying Audiometric
Bone
20 Configurations
Term Description
dB hearing level
dB hearing level
Several methods have been proposed to assess the sig-
nificance of threshold differences on retest for complete 60
audiograms (Schlauch and Carney, 2007). These methods
usually require that thresholds for more than one frequency 80
contribute to the decision process, although some accept a
large change for a single frequency, such as 20 dB or more, 100
as a significant difference. One of these methods defines a
significant threshold shift by a minimal change in a PTA.
120
For instance, the Occupational Safety and Health Adminis-
tration (1983) defines a notable threshold shift (in their ter-
140
minology, a standard threshold shift) as a 10-dB or greater 250 500 1,000 2,000 4,000 8,000
change in the PTA based on thresholds for 2, 3, and 4 kHz Frequency (Hz)
in either ear. These frequencies were selected because they
include those that are susceptible to damage by occupa- FIGURE 3.10 Mean vibrotactile thresholds for bone
conduction (dashed line) and air conduction (solid line).
tional noise and have stable test–retest reliability. A second
The range of responses is indicated by the shaded
commonly used approach requires threshold differences to region. (Adapted from Boothroyd and Cawkwell (1970)).
occur at adjacent frequencies. One rule that is applicable
to many situations defines a significant threshold shift as
one for which two adjacent thresholds differ by 10 dB or relatively low vibrotactile thresholds are observed for BC
more on retest. This criterion has been applied widely in at 250 and 500 Hz, a false air–bone gap is likely to occur
audiometric studies and is sometimes combined with other in persons with significant sensory/neural losses at these
criteria to arrive at a decision (ASHA, 1994). A third approach frequencies.
recommends repeating threshold measurements during a Boothroyd and Cawkwell (1970) recommend asking the
single session to improve audiometric reliability (National client if they “feel” the stimulus or “hear” the stimulus as a
Institute for Occupational Safety and Health, 1998). This means to differentiate between these two outcomes. Persons
method is paired with a rule or rules defining the crite- with experience with auditory sensations can usually make
rion for a significant threshold shift. The notable difference this distinction.
between this method and the others described earlier is that The values for vibrotactile thresholds illustrated in
the criterion defining a threshold shift must be repeatable to Figure 3.10 are based on only nine listeners. A more detailed
be accepted as significant. study needs to be conducted to specify these ranges more
The examples in this section on the variability of PT precisely for the transducers in current use.
thresholds have assumed a fixed SD of test–retest differences
of ±5 dB for all audiometric frequencies. Although 5 dB is a BONE-CONDUCTION THRESHOLDS:
reasonable average value for many situations, studies show NOT A PURE ESTIMATE OF
that the SD varies with type of earphone, the time between
tests, and even with audiometric frequency (Schlauch and
SENSORY/NEURAL RESERVE
Carney, 2007). The goal of BC testing is to obtain an estimate of sensory/
neural reserve, but BC thresholds sometimes are influenced
by the physiologic properties of the external, middle, and
VIBROTACTILE THRESHOLDS inner ears. The BC vibrator sets the skull into vibration,
In persons with significant hearing losses, sound vibra- which stimulates the cochlea, but this does not happen in
tions produced by earphones and bone vibrators may be isolation. When the skull is vibrated, the middle-ear ossicles
perceived through the sense of touch. Such thresholds are are also set into motion, and this inertial response of the
known as vibrotactile thresholds. ossicular chain contributes to BC thresholds. Changes in
Figure 3.10 illustrates the range of levels found to yield the external and middle ear can modify the contribution
vibrotactile thresholds for a supra-aural earphone and a of the inertial response, which may result in significant
bone vibrator. A threshold occurring within the range of changes in BC thresholds (Dirks, 1994).
possible vibrotactile thresholds is ambiguous; it could be A classic example of a middle-ear problem that influ-
a hearing threshold or a vibrotactile threshold. Because ences BC thresholds is otosclerosis. Otosclerosis frequently
CHAPTER 3 • Puretone Evaluation 41
causes the footplate of the stapes to become ankylosed or of “no trial” silence to confirm that their responses are, in
fixed in the oval window. This disease process and some fact, responses to test tones.
other types of conductive losses (e.g., glue ear) (Kumar et al., In rare cases, patients have tinnitus resulting from blood
2003) reduce the normal inertial response of the ossicles flowing nearby auditory structures. Blood flowing through
to BC hearing. The result is poorer thresholds that form a a vein or artery sometimes produces masking noise or
depressed region of BC hearing known as Carhart’s notch “bruit” that can elevate thresholds for low-frequency tones
(Carhart, 1950). This notch, which typically shows poorer (Champlin et al., 1990). On the audiogram, this form of
BC thresholds between 500 and 4,000 Hz with a maximum tinnitus may produce an apparent sensory/neural loss. The
usually at 2,000 Hz of 15 dB, disappears following successful loss occurs because the tinnitus masks AC and BC thresh-
middle-ear surgery. The finding that BC thresholds improve olds. Bruit, a recordable form of tinnitus resulting from
following middle-ear surgery is strong evidence that these vibrations in the head or neck, is documented by audiolo-
poorer BC thresholds observed in stapes immobilization gists by measuring sound levels in the ear canal (Champlin
are due to a middle-ear phenomenon rather than a change et al., 1990). This problem is treatable when the problem
in the integrity of the cochlea. is caused by a vein. In a case study reported by Champlin
A frequently observed example of middle-ear problems et al. (1990), the patient received some reduction in tinnitus
affecting BC thresholds occurs in persons with otitis media loudness before surgery by applying pressure to her neck.
with effusion. In this group, falsely enhanced BC thresh- Surgical ligation of the vein responsible for the tinnitus was
olds in the low frequencies (1,000 Hz and below) are seen shown to be an effective treatment. Surgery reduced tinnitus
often. The magnitude of the enhancement can be as much loudness, SPLs of the bruit measured in the ear canal were
as 25 dB (Snyder, 1989). Upon resolution of the middle-ear lower, and the audiogram showed significantly improved
problem, these previously enhanced BC thresholds become thresholds.
poorer and return to their premorbid values.
Similarly, enhancement in BC thresholds occurs for
low frequencies with occlusion of the external ear canal by
PSEUDOHYPACUSIS
a supra-aural ear phone. This low-frequency BC enhance- Pseudohypacusis, also known as functional hearing loss and
ment, known as the occlusion effect, must be considered nonorganic hearing loss, is the name applied to intra-test
when occluding the nontest ear to present masking noise and inter-test inconsistencies that cannot be explained by
during BC testing. However, when the masking noise is medical examinations or a known physiologic condition
presented using an insert earphone with the foam plug (Ventry and Chaiklin, 1965). Most persons who present
inserted deeply into the ear canal, the amount of the low- with this condition are feigning a hearing loss for monetary
frequency enhancement is smaller than it is when supra- or psychological gain, but a very small percentage of per-
aural earphones are used to deliver the masking noise (Dean sons have subconscious motivations related to psychologi-
and Martin, 2000). Further, apparent enhancement of BC cal problems (see Chapter 33).
thresholds can occur in cases of superior canal dehiscence Persons presenting with pseudohypacusis are often
(see Chapter 4). identified from inconsistencies in their responses to the
puretones. In addition to general poor reliability during
threshold searches, there is a tendency for the threshold
Special Populations to become poorer as more presentations are made (Green,
1978). Methods of identifying the pseudohypacusis by com-
TINNITUS paring PT thresholds with other measures and the use of
Many people who come for hearing testing experience tinni- special tests are covered in Chapter 33.
tus, the sensation of hearing internal sounds when no sound
is present (see Chapter 35). Tinnitus can interfere with the
AUDITORY NEUROPATHY
perception of test tones, which can lead to a large number
of false-positive responses, and false-positive responses can Auditory neuropathy (or auditory dys-synchrony) is a con-
produce an inaccurate (too sensitive) threshold estimation. dition that may account for 11% of hearing losses found in
Some listeners simply require additional instruction and children at risk for hearing loss (Rance et al., 1999). Infor-
encouragement to wait until they are more certain they have mation about this disorder may be found in Chapters 13
heard a test tone. In some cases, the audiologist can pres- and 19. Many of these children appear to be severely hard of
ent a clearly audible tone at the test frequency to remind hearing because of very poor speech recognition; however,
the listener of the test tone. For more intractable cases, the PT thresholds do not follow any specific pattern. Puretone
examiner can present a series of pulsed tones and ask the hearing thresholds for these children range from minimal to
listener to count the number of tones. It is important with profound losses. Individuals with auditory neuropathy clas-
listeners who are giving false-positive responses to avoid a sically show very inconsistent audiometric responses during
fixed presentation rhythm and to provide irregular intervals a test and between tests.
42 SECTION I • Basic Tests and Procedures
However, the peaked audiogram is also seen in 13% of ears ments that might preserve hearing. Ototoxic drugs typically
with acoustic tumors (Ries et al., 1998). cause reduction in high-frequency hearing before having
any adverse effect on hearing for the speech range. For this
reason, extended high-frequency hearing testing is recom-
NOISE-INDUCED HEARING LOSS AND
mended for ototoxic monitoring test protocols. Several
ACOUSTIC TRAUMA studies have demonstrated the effectiveness of early identifi-
Exposure to intense sound levels can cause permanent or cation of ototoxic hearing loss by monitoring thresholds for
temporary hearing loss due to hair cell damage. When a nar- frequencies higher than 8,000 Hz (Fausti et al., 1992). How-
rowband sound is presented at a level high enough to result ever, for ototoxic drugs that selectively damage inner hair
in damage, a loss occurs at a frequency roughly one-half cells in the cochlea (e.g., carboplatin), PT thresholds may
octave above the frequency of exposure (Henderson and be unaffected even though extensive damage has occurred
Hamernik, 1995). Most people who are exposed to damag- (Lobaranis et al., 2013).
ing noise levels in their work or recreational endeavors are
exposed to broadband sounds, but their losses, especially OTITIS MEDIA
during early stages of NIHL, are characterized by a “notch”
(a drop in hearing) on the audiogram. The greatest hear- Young children are susceptible to temporary, recurring
ing loss typically occurs in the region of 3,000 to 6,000 Hz. middle-ear inflammations (otitis media) that are often
The susceptibility of these frequencies is a result of sound accompanied by fluid in the middle ear (effusion). Otitis
amplification by the external ear (Gerhardt et al., 1987). The media, often referred to as a middle-ear “infection,” may
amplification is mainly a result of the ear canal resonance, be viral or bacterial but is most often serous (noninfected
which increases the level of sound by 20 dB or more. Tem- fluid). Otitis media is the most common medical diagnosis
porary hearing loss is referred to as temporary threshold for children, accounting for 6 million office visits in 1990 for
shift (TTS), and permanent changes are referred to as per- children between the ages of 5 and 15 years (Stoll and Fink,
manent threshold shifts (PTS). 1996). Adults, too, may have otitis media with effusion,
The greater variability of thresholds at 6 and 8 kHz than although the prevalence decreases significantly with age
at other frequencies makes small noise notches associated (Fria et al., 1985). During the active infection, often lasting
with early NIHL difficult to identify. Some frequently used a month or more, a patient’s hearing loss may fluctuate,
rules for quantifying noise notches can produce high false- usually varying between 0 and 40 dB. The average degree
positive rates when decisions are based on a single audio- of hearing loss is approximately 25 dB. Figure 3.8, which
gram (Schlauch and Carney, 2011). Averaging multiple was used earlier in this chapter to illustrate an audiogram
audiograms improves diagnostic accuracy as does clearing for a conductive loss, shows an audiogram derived from the
the ear canals of all earwax, which can result in the appear- average thresholds from a group of children diagnosed with
ance of a high-frequency loss (Jin et al., 2013; Schlauch and otitis media.
Carney, 2012). NIHL can be slowly progressive, as listen-
ers are exposed to high sound levels over months and years TYMPANIC MEMBRANE PERFORATIONS
(Ward et al., 2000), or it can rapidly change, such as noise
trauma after a sudden explosion or impulsive sound (Kerr Tympanic membrane perforations are caused by trauma,
and Byrne, 1975; Orchik et al., 1987). The shooting of a rifle disease, or surgery. The diameter and location of perfora-
can result in a greater loss in the ear closest to the muzzle tion and the involvement of other middle-ear structures
of the gun. In right-handed persons, the left ear is exposed determine the amount of conductive hearing loss, if any.
directly to the muzzle, and the right ear is protected from For instance, a myringotomy and the placement of pressure-
the direct blast by the head. New evidence (Kujawa and equalization tubes represent a physician-induced perforation
Liberman, 2009) suggests that PT thresholds may return to that results in a minimal air–bone gap in successful surgeries.
near normal following noise exposure, whereas functional The measurement of AC thresholds in the presence of
auditory abilities may remain compromised due to the tympanic membrane perforations requires special consider-
noise exposure. (See Chapter 32 for NIHL.) ation. Figure 3.12 shows an audiogram obtained in a single
session from a school-age child who has a tympanic mem-
brane perforation in the left ear and a pressure-equalization
OTOTOXICITY tube in the right ear. Thresholds were measured twice in
Regular monitoring of PT thresholds is particularly impor- each ear, once with supra-aural earphones and again with
tant for patients who take drugs known to be ototoxic. For insert earphones. Note that the low-frequency thresholds
example, certain powerful antibiotics and cancer-fighting obtained from insert earphones were as much as 15 to 25 dB
drugs are known to cause cochlear and vestibular damage in poorer than the ones obtained with supra-aural earphones.
many patients. Monitoring hearing sensitivity during treat- This outcome is typical and is predicted because insert
ment could allow a physician to consider alternative treat- earphones are more susceptible to calibration problems
44 SECTION I • Basic Tests and Procedures
Within the past few years, a new generation of automated testing, BC thresholds are measured. Although it would be
audiometers has been developed (Margolis et al., 2010). useful to have puretone BC thresholds prior to AC thresh-
The new automated audiometers are capable of measuring olds to know how much masking noise can be presented
masked AC and BC thresholds, as well as WRSs, with only a safely, this advantage is outweighed by the inconvenience
single placement of the earphones and BC oscillator. Bekesy of having to enter the booth multiple times to reposition
audiometry is still used along with some other automated the BC vibrator and earphones. Valid, masked AC thresh-
methods (Laroche and Hetu, 1997), including ones that olds can be obtained successfully from most clients before
implement the threshold-finding procedure used in manual obtaining BC thresholds.
puretone audiometry (Margolis et al., 2010). Computer- A few clinics begin with immittance testing, which usu-
based rules control the presentation of stimuli, examinee ally includes a tympanogram and acoustic reflex thresholds.
responses, and the plotting of thresholds. The goal is to If the case history does not indicate a middle-ear problem
automate threshold collection for routine cases, which will and these tests of middle-ear function are normal, then BC
free audiologists to perform more complex measures or to thresholds may not be performed, and the loss, if present, is
work with difficult-to-test populations. assumed to be a sensory/neural loss. A possible risk of this
strategy is that, in rare instances, persons with middle-ear
Calibration problems have normal immittance measures. In this situa-
tion, a conductive loss would be missed. This approach also
Clinical data require accurate stimulus specification, or the adds the expense of immittance testing for each client. Stud-
results are meaningless. When most persons think of cali- ies should be done using an evidence-based practice model
bration of audiometers, the obvious examples include the to determine whether the assessment of middle-ear status of
accuracy of puretone frequency and level. However, pure- each client using immittance or wideband reflectance (see
tone calibration involves much more, including an assess- Chapter 9) is justified. Another time-saving strategy might be
ment of attenuator linearity, harmonic distortion, rise and to measure BC thresholds at two frequencies, a low and a high
fall times, and more. Consult ANSI (2010) and Chapter 2 frequency, and if an air–bone gap is not observed, BC thresh-
on calibration in this book to learn more about this topic. olds are not measured for other frequencies. A low frequency,
such as 500 Hz, would assess stiffness-related middle-ear
Puretone Thresholds and the pathologies. A high frequency, such as 4,000 Hz, would iden-
tify mass-related middle-ear pathologies and collapsed canals.
Audiologic Test Battery Since this method requires placement of the BC vibrator, the
PT thresholds are measured on nearly everyone entering a amount of time actually saved would be limited.
diagnostic audiology clinic, but the test sequence and the Despite the observed variability, it seems that it is possible
extent of the measurements often differ across clinics. Most for audiologists to obtain important diagnostic information
of these differences in protocol are implemented to save test- about the degree, type, and configuration of hearing losses
ing time, which contributes to the cost of running a clinic. using a variety of valid, evidence-based puretone audiometric
ASHA’s guide to manual PT threshold audiometry (2005) methods. Although at first glance, the puretone test proce-
makes no recommendation concerning the puretone test dure may appear elementary, it is clear that well-informed test
sequence. In 2000, the Joint Audiology Committee on Prac- procedures using appropriate and calibrated test equipment
tice Algorithms and Standards recommended an algorithm provide a necessary part of the complete audiologic test
that listed puretone AC testing (with appropriate masking battery and form the basis for clinical decision making.
applied) followed by puretone BC testing with appropriate
masking. They acknowledged that the assessment process
may vary “based on patient need and the assessment setting.”
FOOD FOR THOUGHT
Furthermore, they stated that “decision-making . . . occurs(s) 1. Given the known test–retest variability of PT thresholds,
throughout this process.” what is the threat to the quality of a hearing conservation
Based on informal surveys of clinicians in a variety of program if the tester chooses not to make multiple esti-
settings, it seems that there is considerable variability in mates of baseline audiograms?
test protocols among clinics. In many clinics, BC thresh- 2. Think about examples of auditory pathology where the
olds are not usually obtained from persons with normal AC PT threshold might be misleading, when it might not
thresholds (near 0 dB HL) unless the case history or risk reflect the full nature of the underlying cochlear injury.
of middle-ear problems suggests otherwise. BC threshold 3. Consider several cases where the ear canal volume (the
testing is also omitted in some clinics for returning patients volume under the earphone) might significantly affect
with pure sensory/neural losses if their AC thresholds match the resulting PT threshold. Consider the client’s total vol-
those of the prior visit. A common alternative test sequence ume of the outer ear, perforations of the eardrum, and
is to begin with puretone AC thresholds followed by supra- possible occlusions in the ear canal. What effect will these
threshold word-recognition testing. After word-recognition have on the resulting thresholds?
46 SECTION I • Basic Tests and Procedures
KEY REFERENCES Gerhardt KJ, Rodriguez GP, Hepler EL, Moul ML. (1987) Ear canal
volume and variability in patterns of temporary threshold
A full list of references for this chapter can be shifts. Ear Hear. 8, 316–321.
Goodman A. (1965) Reference zero levels for pure tone audiom-
found at http://thePoint.lww.com. Below are the key refer-
eter. Am Speech Hear Assoc. 7, 262–263.
ences for this chapter.
Henderson D, Hamernik RP. (1995) Occupational Medicine: State
American National Standards Institute. (1999) Maximum Permissi- of the Art Reviews, Biologic Bases of Noise-Induced Hearing Loss.
ble Ambient Noise for Audiometric Test Rooms. ANSI S3.1–1999. Philadelphia, PA: Hanley & Belfus, Inc.
New York, NY: American National Standards Institute, Inc. Hughson W, Westlake H. (1944) Manual for program outline for
American National Standards Institute. (2010) Specifications rehabilitation of aural casualties both military and civilian.
for Audiometers. ANSI S3.6–2010. New York, NY: American Trans Am Acad Ophthalmol Otolaryngol. (suppl 48), 1–15.
National Standards Institute, Inc. Jerger J, Jerger S. (1980) Measurement of hearing in adults. In:
American Speech-Language Hearing Association. (1990) Guide- Paperella MM, Shumrick DA, eds. Otolaryngology. 2nd ed.
lines for audiometric symbols. ASHA. 32, 25–30. Philadelphia, PA: W.B. Saunders.
American Speech-Language-Hearing Association. (1994) Guide- Jin SH, Nelson PB, Schlauch RS, Carney E. (2013) Hearing conser-
lines for the audiologic management of individuals receiving vation program for marching band members: Risk for noise-
cochleotoxic drug therapy. ASHA. 36, 11–19. induced hearing loss? Am J Audiol. 22, 26–39.
American Speech-Language-Hearing Association. (2005) Guide- Joint Audiology Committee on Practice Algorithms and Stan-
lines for manual pure-tone threshold audiometry. Available dards. (2000) Clinical practice guidelines and statements.
online at: www.asha.org/policy/html/GL2005-00014.html. Audiol Today. Special Issue.
Boothroyd A, Cawkwell S. (1970) Vibrotactile thresholds in pure Killion MC, Villchur E. (1989) Comments on “Earphones in audi-
tone audiometry. Acta Otolaryngol. 69, 381–387. ometry.” J Acoust Soc Am. 85, 1775–1778.
Browning GG. (1987) Is there still a role for tuning-fork tests? Br Kujawa SG, Liberman MC. (2009) Adding insult to injury: cochlear
J Audiol. 21, 161–163. nerve degeneration after “temporary” noise-induced hearing
Carhart R. (1945) An improved method of classifying audiograms. loss. J Neurosci. 29 (45), 14077–14085.
Laryngoscope. 5, 1–15. Kumar M, Maheshwar S, Mahendran A, Oluwasamni A, Clayton
Carhart R. (1950) Clinical application of bone conduction. Arch MI. (2003) Could the presence of a Carhart notch predict
Otolaryngol. 51, 789–807. the presence of glue at myringotomy? Clin Otolaryngol. 28,
Carhart R, Jerger J. (1959) Preferred method for clinical determina- 183–186.
tion of pure-tone thresholds. J Speech Hear Disord. 24, 330–345. Laroche C, Hetu R. (1997) A study of the reliability of automatic
Carhart R, Porter LS. (1971) Audiometric configuration and audiometry by the frequency scanning method (AUDIOSCAN).
prediction of threshold for spondees. J Speech Hear Res. 14, Audiology. 36, 1–18.
486–495. Lloyd LL, Kaplan H. (1978) Audiometric Interpretation: A Manual
Champlin CA, Muller SP, Mitchell SA. (1990) Acoustic measure- for Basic Audiometry. Baltimore, MD: University Park Press.
ments of objective tinnitus. J Speech Hear Res. 33, 816–821. Lobarinas E, Salvi R, Ding D. (2013) Insensitivity of the audio-
Chandler JR. (1964) Partial occlusion of the external auditory gram to carboplatin induced inner hair cell loss in chinchil-
meatus: Its effect upon air and bone conduction hearing acuity. las. Hear Res. 302, 113–120. http://dx.doi.org/10.1016/j.jeares.
Laryngoscope. 74, 22–54. 2013.03.012
Committee on Hearing and Equilibrium. (1995) Committee on Magdziarz DD, Wiet RJ, Dinces EA, Adamiec LC. (2000) Normal
Hearing and Equilibrium guidelines for the diagnosis and eval- audiologic presentations in patients with acoustic neuroma:
uation of Ménière’s disease. Otolaryngol Head Neck Surg. 113, an evaluation using strict audiologic parameters. Otolaryngol
181–185. Head Neck Surg. 122, 157–162.
Dean MS, Martin FN. (2000) Insert earphone depth and the Margolis RH, Eikelbloom RH, Johnson C, Ginter SM, Swanepoel
occlusion effect. Am J Audiol. 9, 131–134. DW, Moore, BCJ. (2013) False air-bone gaps at 4 kHz in listen-
Dirks D. (1994) Bone-conduction thresholds testing. In: Katz J, ers with normal hearing and sensorineural hearing loss. Int J
ed. Handbook of Clinical Audiology. 4th ed. Baltimore, MD: Audiol. 52, 526–532. Early Online: 1–7.
William & Wilkins; pp 132–146. Margolis RH, Glasberg BR, Creeke S, Moore BCJ. (2010) AMTAS®:
Dubno JR, Eckert MA, Lee FS, Matthews LJ, Schmiedt RA. (2013) Automated method for testing auditory sensitivity: Validation
Classifying human audiometric phenotypes of age-related studies. Int J Audiol. 49 (3), 185–194.
hearing loss from animal models. J Assoc Res Otolaryngol. 14, Marshall L, Hanna TE. (1989) Evaluation of stopping rules for
687–701. audiological ascending test procedures using computer simu-
Dubno JR, Lee F, Klein A, Matthews L, Lam CF. (1995) Confidence lations. J Speech Hear Res. 32, 265–273.
limits for maximum word-recognition scores. J Speech Hear Marshall L, Jesteadt W. (1986) Comparison of pure-tone audibil-
Res. 38, 490–502. ity thresholds obtained with audiological and two-interval
Fausti SA, Henry JA, Schaffer HI, Olson DJ, Frey RH, McDonald forced-choice procedures. J Speech Hear Res. 29, 82–91.
WJ. (1992) High frequency audiometric monitoring for early Martin FN. (1994) Introduction to Audiology. 5th ed. Boston, MA:
detection of ototoxicity. J Infect Dis. 165, 1026–1032. Allyn and Bacon.
Fria TJ, Cantekin EI, Eichler JA. (1985) Hearing acuity of chil- Mineau SM, Schlauch RS. (1997) Threshold measurement for
dren with otitis media with effusion. Arch Otolaryngol. 111, patients with tinnitus: Pulsed or continuous tones. Am J
10–16. Audiol. 6, 52–56.
CHAPTER 3 • Puretone Evaluation 47
Moore BC. (2004) Dead regions in the cochlea: Conceptual foun- Schlauch RS, Carney E. (2011) Are false-positive rates leading to an
dations, diagnosis, and clinical applications. Ear Hear. 25 (2), overestimation of noise-induced hearing loss? J Speech Lang
98–116. Hear Res. 54 (2), 679–692.
Morgan DE, Dirks DD, Bower DR. (1979) Suggested threshold Schlauch RS, Carney E. (2012) The challenge of detecting mini-
sound pressure levels for frequency-modulated (warble) tones mal hearing loss in audiometric surveys. Am J Audiol 21 (1),
in the sound field. J Speech Hear Disord. 44, 37–54. 106–119.
National Institute for Occupational Safety and Health. (1998) Cri- Schlauch RS, Levine S, Li Y, Haines S. (1995) Evaluating hearing
teria for a Recommended Standard: Occupational Noise Expo- threshold differences between ears as a screen for acoustic neu-
sure: Revised Criteria. Cincinnati, OH: National Institute for roma. J Speech Hear Res. 38, 1168–1175.
Occupational Safety and Health, US Department of Health Schuknecht HF. (1974) Pathology of the Ear. Cambridge, MA:
and Human Services Report; pp 98–126. Harvard University Press.
National Institute on Deafness and Other Communication Dis- Shaw EAG. (1974) The external ear. In: Kleidel WD, Neff WD,
orders. (2005) Presbycusis. Available online at: http://www. eds. Handbook of Sensory Physiology. Berlin: Springer;
nidcd.nih.gov/health/hearing/presbycusis.asp pp 455–490.
Northern JL, Downs MP. (2002) Hearing in Children. 5th ed. New Snyder JM. (1989) Audiometric correlations in otology. In: Cum-
York, NY: Lippincott Williams & Wilkins. mings CW, Fredrickson JM, Harker LS, et al., eds. Otolaryngology
Occupational Safety and Health Administration. (1983) Occu- Head and Neck Surgery: Update. St Louis, MO: Mosby.
pational noise exposure: hearing conservation amendment. Stahle J, Klockhoff I. (1986) Diagnostic procedures, differential
Occupational Safety and Health Administration, 29 CFR 1910. diagnosis, and general conclusions. In: Pfaltz CR, ed. Con-
95; 48 Federal Register, 9738–9785. troversial Aspects of Ménière’s Disease. New York, NY: Georg
Pavlovic CV, Studebaker GA, Sherbecoe RL. (1986) An articulation Thieme.
index based procedure for predicting the speech recognition Studebaker G. (1967) Intertest variability and the air-bone gap.
performance of hearing-impaired subjects. J Acoust Soc Am. J Speech Hear Disord. 32, 82–86.
80, 50–57. Ventry IM, Chaiklin JB. (1965) Multidisciplinary study of func-
Pfaltz CR, Matefi L. (1981) Ménière’s disease – or syndrome? tional hearing loss. J Audiol Res. 5, 179–272.
A critical review of diagnose criteria. In: Vosteen KH, Ventry IM, Chaiklin JB, Boyle WF. (1961) Collapse of the ear canal
Schuknecht H, Pfaltz CR, et al., eds. Ménière’s Disease. New during audiometry. Arch Otolaryngol. 73, 727–731.
York, NY: Thieme. Voss SE, Herman BS. (2005) How does the sound pressure gen-
Rance G, Beer DE, Cone-Wesson B, Shepherd RK, Dowell RC, erated by circumaural, supraaural and insert earphones differ
King AM, et al. (1999) Clinical findings for a group of infants for adult and infant ears. Ear Hear. 26, 636–650.
and young children with auditory neuropathy. Ear Hear. 20 Voss SE, Rosowski JJ, Merchant SN, Thornton AR, Shera CA,
(3), 238–252. Peake WT. (2000) Middle ear pathology can affect the ear-
Ries DT, Rickert M, Schlauch RS. (1998) The peaked audiometric canal sound pressure generated by audiologic earphones. Ear
configuration in Ménière’s disease: Disease related? J Speech Hear. 21, 265–274.
Lang Hear Res. 42, 829–843. Ward WD, Royster JD, Royster LH. (2000) Auditory and nonaudi-
Rosowski JJ, Relkin EM. (2001) Introduction to analysis of middle- tory effects of noise. In: Berger EH, Royster LH, Royster JD,
ear function. In: Jahn AF, Santos-Sacchi J, eds. Physiology of the Driscoll DP, Layne M, eds. The Noise Manual. 5th ed. Fairfax,
Ear. 2nd ed. San Diego, CA: Singular. VA: AIHA Press; pp. 123–147.
Scheperle A, Goodman SS, Neely S. (2011) Further assessment of Watson CS, Gengel RW. (1969) Signal duration and signal fre-
forward pressure level for in situ calibration. J Acoust Soc Am. quency in relation to auditory sensitivity. J Acoust Soc Am. 46,
130, 3882–3892. 989–997.
Schlauch RS, Arnce KD, Olson LM, Sanchez S, Doyle TN. (1996) Wilson RH, Margolis RH. (1983) Measurements of auditory
Identification of pseudohypacusis using speech recognition thresholds for speech stimuli. In: Konkle DF, Rintelmann WF,
thresholds. Ear Hear. 17, 229–236. eds. Principles of Speech Audiometry. Baltimore, MD: University
Schlauch RS, Carney E. (2007) A multinomial model for identify- Park Press.
ing significant pure-tone threshold shifts. J Speech Hear Res.
150, 1391–1403.
C H A P T ER 4
James R. Steiger
49
50 SECTION I • Basic Tests and Procedures
and the hearing loss was caused by a tympanic membrane the phenomenon in hearing-impaired listeners, and his
disorder blocking the pathway of air-conducted sound. In work was carried on by both Bonafant and Schmalz in
contrast, if the tone was not heard by bone conduction, the 1840s. It was Schmalz who first wrote extensively on
he concluded that the listener’s hearing loss was caused by the diagnostic implications of what would become known
a cochlear nerve disorder. as the Weber tuning fork test. Listeners with unilateral
CHL hear bone-conducted tones louder in the impaired
ear because of the occlusion effect, whereas listeners with
The Rinne Tuning Fork Test unilateral SNHL hear bone-conducted tones softer in
In 1855, Heinrich Adolf Rinne (1819 to 1868) described the impaired ear because of sensory/neural disorder. The
the tuning fork test that bears his name (Feldmann, 1970). Weber test is used to this day, with bone-conducted tones
Rinne noted that the intensity of air-conducted tones was typically applied to the forehead by either tuning forks or
greater than that of bone-conducted tones, owing to the bone vibrators.
relatively lesser density of air in contrast to the greater
density of bone. Most people, including normally hear-
ing listeners and listeners with SNHL, therefore hear air-
APPARATUS
conducted tones louder than bone-conducted tones. In Bone vibrators are transducers composed of diaphragms
contrast, listeners with CHL hear bone-conducted tones encased in plastic. During bone conduction evaluation
louder than air-conducted tones for two reasons. First, the circular vibrating plastic surface of the bone vibrator
listeners with CHL have outer ear occlusions or middle is held in contact with the patient’s skull by the tension
ear disorders that attenuate air-conducted tones. And of a metal band. The American National Standards Insti-
second, outer or middle ear disorder can effectively trap tute (ANSI, 2004) specifies bone vibrator surface diam-
bone-conducted tones that would otherwise radiate out eter, metal band tension, and the output characteristics
of the ear canal; thus occlusions effectively intensify of audiometers and transducers including bone vibra-
bone-conducted tones. This is the so-called occlusion tors. As stated above, the clinical apparatus is calibrated
effect. so that patients’ air and bone conduction thresholds
Rinne’s procedure was straightforward; listeners held may be compared to the same 0 dB HL normal-hearing
the tuning fork with their teeth (dense bone conduction threshold referent. Figure 4.1 shows a common B-71
transmission) allowing vibrations to attenuate until no lon- bone vibrator and also that same bone vibrator properly
ger audible. Then the still vibrating tuning fork was moved placed on the mastoid of a KEMAR manikin. Audiomet-
in front of the ear canal (less dense air conduction trans- ric evaluations are conducted in test rooms compliant
mission). If the tuning fork was audible by air conduction, with ANSI S3.1-1999 (ANSI, 2003). The standard speci-
the listener had normal hearing or an SNHL. If not audible fies maximum permissible ambient noise levels allow-
by air conduction, the listener had a CHL. Today the bone able for audiometric threshold testing as low as 0 dB HL,
conduction tuning fork placement is more likely to be on including when the ears are uncovered as during bone
the mastoid rather than the teeth. conduction threshold evaluations. A full discussion of
Audiologists to this day diagnose site of lesion by com- calibration can be found in Chapter 2 of this book.
paring the air and bone conduction thresholds of their
patients. Audiometers, however, are calibrated so that 0 dB
HL is a normal threshold referent for both air and bone
EVALUATION PROCEDURES
conduction; air and bone conduction thresholds are there- Common practice is to begin with the bone vibrator on the
fore similar for normally hearing listeners as well as listen- better hearing ear mastoid or the right mastoid if a better
ers with SNHL. For listeners with CHL, bone conduction hearing ear is not known or suspected. Evaluation guide-
thresholds are better than air conduction thresholds for the lines were published by the American Speech-Language-
reasons stated above. Hearing Association (ASHA) (2005), including a protocol
for tone presentation, patient response modes, and the defi-
nition of threshold. Bone conduction thresholds should be
Weber Tuning Fork Test tested at several frequencies and traditionally in this order:
In 1827, both the German physician C.T. Tourtual and the 1,000, 2,000, 3,000, and 4,000 Hz, a retest of 1,000, 500, and
English physicist Charles Wheatstone described lateraliza- 250 Hz (ASHA, 1990). Because bone-conducted signals
tion of bone-conducted stimuli to an occluded ear due to may reach either mastoid at similar intensities, contralateral
the occlusion effect (Feldmann, 1970). Wheatstone expe- ear masking may be necessary to obtain ear-specific bone
rienced the occlusion effect when manually occluding an conduction thresholds. A full discussion of threshold evalu-
ear while listening to a vibrating tuning fork in contact ation, masking procedures, and the symbols used for record-
with his skull. Tourtual used as his stimulus a pocket watch ing thresholds on an audiogram can be found in Chapters 3
held in his mouth. In the 1800s Heinrich Weber observed and 6 of this book.
CHAPTER 4 • Bone Conduction Evaluation 51
FIGURE 4.1 B-71 bone vibrator and also that same bone
vibrator properly placed on the mastoid of a KEMAR manikin.
(From Vento B, Durrant JD. (2009) Assessing bone conduc-
tion thresholds in clinical practice. In: Katz J, Medwetsky L,
Burkard R, Hood L, eds. Handbook of Clinical Audiology.
Philadelphia, PA: Lippincott Williams and Wilkins, http://lww.
B
com by permission.)
Outer, Middle, and Inner Ear Tonndorf, 1968). Vibration of the mandible may also add
to the sound wave radiation into the ear canal (Stenfelt and
Components of Bone Conduction Goode, 2005). These sound waves propagate through the
Bone-conducted stimuli cause complex skull vibrations middle ear and finally to the inner ear; thus, the complex
in several directions and with several resonances and phenomenon of bone conduction hearing includes exploi-
antiresonances (Stenfelt and Goode, 2005), the details tation of the air conduction pathway.
of which are beyond the scope of this chapter. Bone- The outer ear component may play little role in the
conducted energy ultimately arrives at the cochlea by normal unoccluded ear canal, but its role is magnified by
various transmission routes, which in turn cause basilar the occlusion effect (Stenfelt and Goode, 2005). Normally
membrane-traveling waves to propagate from the stiffer the outer ear canal acts as a high-pass filter (Tonndorf,
basilar membrane base toward the more compliant basi- 1968), that is, high-frequency energy is passed into the
lar membrane apex as occurs for air conduction (Bekesy, middle ear whereas low-frequency energy escapes through
1960). Bone conduction transmission routes can be dis- the ear canal opening. Outer ear canal occlusion traps this
cussed in terms of outer ear, middle ear, and inner ear low-frequency energy and thereby enhances bone con-
components. duction hearing up to 20 dB in the lower audiometric test
frequencies (Stenfelt and Goode, 2005).
OUTER EAR COMPONENT OF BONE
CONDUCTION MIDDLE EAR COMPONENT OF BONE
The outer ear bone conduction component arises from
CONDUCTION
vibration of the bony and especially the cartilaginous walls The middle ear component of bone conduction is the iner-
of the outer ear canal that, in turn, causes sound waves to tial lag of the ossicles (Barany, 1938). Middle ear ossicles are
radiate into the outer ear canal (Stenfelt and Goode, 2005; not directly attached to the skull, but are instead suspended
52 SECTION I • Basic Tests and Procedures
by ligaments and tendons and attached at either end to the one side of the scala media and the greater fluid volume
elastic tympanic and oval window membranes. The ossicles of the scala vestibuli, vestibule, and semicircular canals on
are free to move out of phase with skull vibrations and will the other side of the scala media (Stenfelt and Goode, 2005;
do so because of inertia, much as coffee would lag and spill Tonndorf, 1968).
from a cup moved precipitously. Middle ear ossicles vibrate Inertia of the inner ear fluids contributes to inner
relative to the skull in a like manner as during air conduc- ear bone conduction hearing, especially below 1,000 Hz
tion hearing, and thus energy is propagated into the inner (Stenfelt and Goode, 2005). Cochlear fluids and windows
ear. The middle ear component occurs mainly at and above are free to vibrate out of phase with skull vibrations and
1,500 Hz and is especially significant near 2,000 Hz, the will do so because of inertia, compared earlier to coffee
approximate resonant frequency of the middle ear (Stenfelt that will lag and spill from a moving cup. Cochlear fluid
et al., 2003). Finally, some have proposed that bone conduc- movement, in turn, displaces the basilar membrane and
tion energy radiates from the walls of the middle ear into initiates traveling waves. The spiral lamina may also be
the middle ear space and sets the tympanic membrane into flexible and lag skull vibrations, thereby contributing to
vibration, but that hypothesis has been challenged (Stenfelt bone conduction hearing, especially at higher frequencies
and Goode, 2005). (Stenfelt et al., 2003).
Finally, bone conduction energy can travel in nonosse-
ous skull contents, such as the brain, membranes, and flu-
INNER EAR COMPONENT OF BONE ids, and reach the cochlea through the cochlear and/or ves-
CONDUCTION tibular aqueducts (de Jong et al., 2011; Stenfelt and Goode,
The inner ear component of bone conduction involves sev- 2005). The nonosseous skull contents route may contribute
eral contributing factors, including cochlear compression or little to normal bone conduction hearing, but it plays a role
distortion, cochlear fluid inertia, osseous spiral lamina iner- in some inner ear disorders as discussed below.
tia, and sound pressure transmission through skull contents
(brain, membranes, and fluid). Inner ear bone conduction
has been described as resulting from alternate compressions
TRANSMISSION ROUTE MODEL
and expansions (Herzog and Krainz, 1926) or distortions Figure 4.2 illustrates the air and bone conduction transmis-
(Tonndorf, 1968) of the bony cochlear capsule. In turn, sion routes. Note the orderly air conduction route through
cochlear fluids are displaced and basilar membrane-travel- the outer, middle, and inner ear in contrast with more
ing waves are initiated. One factor making cochlear fluid dis- complex bone conduction hearing involving concomitant
placement possible is the out-of-phase and disproportion- outer, middle, and inner ear components bilaterally.
ate yielding of the round and oval cochlear windows, which
creates alternating spaces for fluid displacement. Cochlear
fluid movement, in turn, displaces the basilar membrane EXAMPLES OF DIAGNOSES
and initiates traveling waves. Also, the cochlear and vestibu-
lar aqueducts may serve as outlets for fluid displacement
Normal-hearing Sensitivity
allowing for bone conduction hearing when the oval win- Normally hearing patients are without disorder that would
dow is fixed as in otosclerosis (Stenfelt and Goode, 2005; hinder energy from reaching the inner ear by the air conduc-
Tonndorf, 1968). Fluid displacement is further enabled by tion route. Owing to calibration, air conduction thresholds
a fluid volume differential between the scala tympani on will be near 0 dB HL. Similarly, normally hearing patients
Bone
vibrator
placement
Right
mastoid
Cerebrospinal
fluid
FIGURE 4.2 Transmission routes for air conduction, right ear example (narrow arrows) and bone con-
duction, right mastoid example (bold arrows). Note: Higher intensity air-conducted signals can activate
the bone conduction transmission route.
CHAPTER 4 • Bone Conduction Evaluation 53
0 0
20 20
40 40
dB hearing level
dB hearing level
60 60
80 80
100 100
120 120
250 500 1,000 2,000 4,000 8,000 250 500 1,000 2,000 4,000 8,000
Frequency (Hz) Frequency (Hz)
FIGURE 4.3 Audiogram depicting normal-hearing sen- FIGURE 4.4 Audiogram depicting CHL from osteomas.
sitivity. Air conduction: presurgery thresholds represented with
filled circles and postsurgery thresholds represented with
open circles. Bone conduction: presurgery thresholds
are without disorder that would hinder any of the bone connected with a dotted line, postsurgery thresholds
conduction components. Again owing to calibration, bone not connected. (Modified from Pinsker OT. (1972) Otologi-
conduction thresholds will be near 0 dB HL and be similar cal correlates of audiology. In: Katz J, ed. Handbook of
to air conduction thresholds (±10 dB). Figure 4.3 shows Clinical Audiology. Baltimore, MD: Williams and Wilkins,
an example audiogram of a patient with normal-hearing http://lww.com by permission.)
sensitivity.
neither hair cell nor neuron damage is the cause (Hall and
CHL with Air–Bone Gaps of Outer Croutch, 2008). In such cases, bone conduction thresholds
Ear Origin underestimate sensory/neural reserve. Figure 4.4 shows
Outer ear occlusive disorder may hinder air conduction an audiogram example involving occlusive osteomas; the
energy from reaching the inner ear. In such cases, air con- presurgery audiogram shows CHL, but with bone con-
duction thresholds would be poorer than 0 dB HL, at least at duction thresholds that underestimated the true sensory/
some frequencies, to a degree dictated by the occlusive disor- neural reserve revealed by postsurgery bone conduction
der. Bone conduction thresholds, in contrast, may be unaf- thresholds. Cerumen impaction that loads (adds mass to)
fected if the outer ear bone conduction component is not the tympanic membrane can cause pseudoSNHL (Hall and
hindered. Bone conduction thresholds would be near 0 dB Croutch, 2008; Tonndorf, 1968). Anderson and Barr (1971)
HL and accurately reflect sensory/neural hearing (sensory/ reported pseudoSNHL with partial cerumen occlusion of
neural reserve or true sensory/neural capability). The result the outer ear canal, though they attributed it to earphone
is a hallmark of CHL: air conduction thresholds >10 dB HL artifact.
lower (poorer) than normal bone conduction thresholds.
The maximum air–bone gap is approximately 60 dB; higher CHL with Air–Bone Gaps of Middle
intensity air-conducted sound waves set the skull into vibra-
tion and induce bone conduction hearing thus limiting the
Ear Origin
maximum difference between air and bone conduction Middle ear disorder may hinder air conduction energy
thresholds (Bekesy, 1960). from reaching the inner ear. In such cases, air conduction
However, bone conduction thresholds are often thresholds would be poorer than 0 dB HL, at least at some
improved by occlusive disorders due to the occlusion frequencies, to a degree dictated by the middle ear disor-
effect. In such cases, bone conduction thresholds over- der. Bone conduction thresholds, in contrast, may be unaf-
estimate sensory/neural reserve. In rarer cases, outer ear fected if the middle ear bone conduction component is not
occlusive disorder may interfere with the outer or middle significantly hindered. Bone conduction thresholds would
ear bone conduction components and thus lower bone therefore be near 0 dB HL and accurately reflect sensory/
conduction thresholds, a so-called pseudoSNHL because neural reserve or be improved due to the occlusion effect.
54 SECTION I • Basic Tests and Procedures
0 0
20 20
40 40
dB hearing level
dB hearing level
60 60
80 80
100 100
120 120
250 500 1,000 2,000 4,000 8,000 250 500 1,000 2,000 4,000 8,000
Frequency (Hz) Frequency (Hz)
FIGURE 4.5 Audiogram representing the mean CHL FIGURE 4.6 Audiogram depicting CHL from otosclero-
from a group of children. (Modified from Schlauch RS, sis. Air conduction: presurgery thresholds represented
Nelson P. (2009) Puretone evaluation. In: Katz J, with filled circles and postsurgery thresholds repre-
Medwetsky L, Burkard R, Hood L, eds. Handbook of sented with open circles. Bone conduction: presurgery
Clinical Audiology. Philadelphia, PA: Lippincott Williams thresholds connected with a dotted line, postsurgery
and Wilkins, http://lww.com by permission.) thresholds not connected. (Modified from Dirks D. (1985).
Bone-conduction testing. In: Katz J, ed. Handbook of
Clinical Audiology. Baltimore, MD: Williams and Wilkins,
The resulting air–bone gaps can be as great as approxi- http://lww.com by permission.)
mately 60 dB. Air–bone gaps can be seen with many middle
ear disorders; an example of a CHL audiogram is shown in
Figure 4.5. media, and Kumar et al. (2003) reported 2,000 Hz bone
However, middle ear disorder often hinders the mid- conduction notches in patients with glue ear. Apparently,
dle ear contribution to bone conduction and thus lowers effusion may produce stiffening or loading effects, thus
(makes poorer) bone conduction thresholds. The result is hindering the middle ear bone conduction component
a pseudoSNHL in addition to the CHL, with bone conduc- (Tonndorf, 1968). An example of CHL caused by otitis
tion thresholds underestimating sensory/neural reserve. media is shown in Figure 4.7; note the pretreatment pseu-
For example, ossicular fixation caused by otosclerosis pre- doSNHL and CHL, and the resolution of the same post-
dictably manifests as lower (poorer) than normal bone con- treatment.
duction thresholds at the approximate middle ear resonant It should be noted here that true SNHL with otitis
frequency of 2,000 Hz (Carhart, 1950). The air–bone gaps media has also been proposed. Pathogens in middle ear effu-
one expects with CHL may therefore be reduced or oblit- sion may pass through the round window and cause dam-
erated at and near 2,000 Hz. For stapedial fixation caused age to cochlear hair cells, and because of the round window
by otosclerosis, this pseudoSNHL is known as the Carhart proximity to the basal turn of the cochlea, high-frequency
notch. An audiogram example is shown in Figure 4.6; note SNHL might result (Paparella et al., 1984).
the Carhart notch and CHL presurgery and the restoration
of normal air and bone conduction thresholds postsurgery
consistent with restoration to more normal middle ear
SNHL
resonance. Neither sensory nor neural disorders hinder energy from
The middle ear bone conduction component can be reaching the inner ear by the air conduction route or by
affected by other disorders as well. For example, Dirks any of the bone conduction routes. Changes in air and bone
and Malmquist (1969) reported pseudoSNHL in addi- conduction thresholds are affected only by the damage to
tion to CHL in a case of mallear fixation. Similarly, pseu- sensory/neural structures which will lower (make poorer)
doSNHL in addition to CHL has been reported for sub- air and bone conduction thresholds similarly. Air and bone
jects with otitis media (Carhart, 1950; Hall and Croutch, conduction thresholds will therefore be similar (±10 dB).
2008). Yasan (2007) reported 1,000 Hz and in some cases Figure 4.8 shows an example audiogram of a patient with
2,000 Hz bone conduction notches in patients with otitis SNHL from presbycusis.
CHAPTER 4 • Bone Conduction Evaluation 55
0 0
20 20
40 40
dB hearing level
dB hearing level
60 60
80 80
100 100
120 120
250 500 1,000 2,000 4,000 8,000 250 500 1,000 2,000 4,000 8,000
Frequency (Hz) Frequency (Hz)
FIGURE 4.7 Audiograms depicting CHL from otitis FIGURE 4.8 Audiogram depicting a sensory/neural
media. Air conduction: pretreatment thresholds repre- hearing loss from presbycusis. (Modified from Harrell
sented with filled circles and post-treatment thresholds RW, Dirks D. (1994) In: Katz J, ed. Handbook of Clinical
represented with open circles. Bone conduction: pre- Audiology. Philadelphia, PA: Lippincott, Williams and
treatment thresholds connected with a dotted line, post- Wilkins, http://lww.com by permission.)
treatment thresholds not connected. (Modified from Hall
CM, Croutch C. (2008) Pseudosensory-neural hearing
loss. Hear Rev. 16(1), 18–22, by permission.) typically manifesting in the lower test frequencies con-
sistent with a mathematical model analysis based on the
anatomical dimensions of the inner ear (Merchant et al.,
2007). Bone conduction thresholds, in contrast, may be
MHL with Air–Bone Gaps of Outer improved by skull content sound pressure transmissions
or Middle Ear Origin
Patients may have an MHL. An MHL audiogram will there-
fore show evidence of both the SNHL (affected air and 0
bone conduction thresholds) and the CHL (air–bone gaps
>10 dB). Bone conduction thresholds may accurately reflect
sensory/neural reserve or a pseudoSNHL may underesti- 20
mate sensory/neural reserve. An MHL audiogram example
is shown in Figure 4.9. 40
dB hearing level
0 0
20 20
40 40
dB hearing level
dB hearing level
60 60
80 80
100 100
120 120
250 500 1,000 2,000 4,000 8,000 250 500 1,000 2,000 4,000 8,000
Frequency (Hz) Frequency (Hz)
FIGURE 4.10 Audiogram depicting hearing loss from FIGURE 4.11 Audiogram depicting hearing loss from a
superior semicircular canal dehiscence. large vestibular aqueduct.
through the dehiscence, that is, augmentation of the inner Intracranial Hypertension with
ear bone conduction component. The term air–bone gap
of inner ear origin has been used to describe these findings
PseudoSNHL
(Attias et al., 2012). This is a useful term as it emphasizes Intracranial hypertension with abnormal cerebrospinal
that the air–bone gaps do not reflect either a CHL or MHL. fluid flow has been associated with a number of neurologic
Figure 4.10 shows an example audiogram from a patient conditions including syringohydromyelia, Chiari malfor-
with SSCD. mations, trauma, tumors, arachnoiditis, subarachnoid hem-
orrhages, meningitis, and multiple sclerosis (Steiger et al.,
2007). Resulting audiologic symptoms may include whoosh-
Large Vestibular Aqueducts ing pulsatile tinnitus and low-frequency pseudoSNHL. The
with SNHL and Air–Bone Gaps pulsatile tinnitus may arise from circle of Willis blood flow
of Inner Ear Origin or pulsations of the walls of the dural sinuses (Rudnick and
Sismanis, 2005), which travel through the cochlear or vestib-
Large vestibular aqueducts (LVA) cause SNHL with air–
ular aqueducts to the cochlea (Marchbanks et al., 2005). The
bone gaps of inner ear origin (Attias et al., 2012; Jackler and
pseudoSNHL might be attributable to masking from the
De La Cruz, 1989). The cause of SNHL with LVA is unclear;
pulsatile tinnitus (Rudnick and Sismanis, 2005; Steiger et al.,
it may result from traumatic endolymph pressure from the
2007) or from elevated cochlear fluid pressure stiffening
endolymphatic duct and sac that damages hair cells, or by
the basilar, oval, and round window membranes (Sismanis,
endolymph electrolyte content that is harmful to the hair
1987). Stiffened cochlear membranes, in turn, may interfere
cells or stria vascularis (Campbell et al., 2011; Jackler and
with cochlear fluid motion and thus hinder the inner ear
De La Cruz, 1989; Levinson et al., 1989). The audiomet-
bone conduction component. Figure 4.12 shows an audio-
ric findings for LVA may also be influenced by the third-
gram of a patient with intracranial hypertension.
window effect similar to SSCD. Air–bone gaps of inner ear
origin are possible: Air conduction thresholds may be low-
ered (made poorer) because some energy is shunted away TECHNICAL CLINICAL CAVEATS
from the cochlea through the LVA, whereas bone conduc-
tion thresholds may be unaffected or improved by sound
Vibrotactile Responses
pressure transmissions through the skull contents to the It is possible for a patient to feel bone conductor diaphragm
LVA, (Attias et al., 2012; Merchant et al., 2007). Therefore, vibrations during bone conduction evaluation, especially at
as with SSCD, the air–bone gaps seen with LVA do not high intensities and at lower test frequencies (Nober, 1964).
reflect outer ear occlusion or middle ear disorder as in When a patient responds to stimuli felt but not heard, the
CHL or MHL. Figure 4.11 shows an example audiogram responses are called vibrotactile. Vibrotactile responses must
from a patient with LVA. not be recorded as auditory thresholds as two possible errors
CHAPTER 4 • Bone Conduction Evaluation 57
0
Interaural Attenuation, Masking,
and the Occlusion Effect
20
Audiologists typically target a test ear for evaluation while
being vigilant for the possibility of the patient hearing in
the nontest ear. This vigilance is warranted during bone-
40
dB hearing level
TABL E 4 .1
to record evidence of changes in the middle ear bone con- Threshold Accuracy and the
duction component, for example, the middle ear resonance
changes that are likely to occur with otosclerosis. Not sur-
Air–Bone Gap
prisingly, audiometers in most clinics are calibrated for Throughout this chapter significant air–bone gaps were
mastoid placement. defined as >10 dB. However, patient response variability
Forehead placement can be used if correction factors can result in underestimated or exaggerated air–bone gaps
from Table 4.1 are applied or if the audiometer is calibrated and even bone–air threshold gaps. Studebaker (1967), for
for forehead transducer placement. Audiologists who prefer example, calculated the standard deviation of air–bone gaps
forehead transducer placement should consider purchas- at 5 dB and noted that air–bone threshold gaps of ≥15 dB
ing a specifically designed bone vibrator and headband. can sometimes be seen in the absence of CHL. Similarly,
Forehead placement has advantages, including low intra- Margolis (2008) calculated a hypothetical air–bone threshold
subject and intersubject variability because of the less vari- gap distribution based on the independent variability of air
able forehead placement surface and more uniform non- and bone conduction thresholds. Significant air–bone and
pneumatized forehead bone (Dirks, 1964). Also, vibrations even bone–air threshold gaps were predicted, of course, with
from forehead placement are perpendicular to middle ear greater threshold gaps occurring less frequently. Moreover,
ossicular motion and may not engage the middle ear bone Margolis reported apparent tester bias; when testing patients
conduction mechanism as with mastoid placement. The with SNHL an expert audiologist measured more air–bone
resulting forehead bone conduction thresholds should be threshold gaps ≤5 dB than the distribution predicted. Audi-
relatively unaffected by the changes in middle ear resonance ologists should not rigidly adhere to untenable expectations
and, in cases of ossicular fixation, reflect a truer measure of regarding air–bone threshold gaps.
cochlear reserve than bone conduction thresholds obtained
during mastoid transducer placement. Figure 4.13 shows an
example audiogram with forehead and mastoid bone vibra- CONCLUSION
tor placement for a patient with ossicular fixation. Bone conduction threshold evaluation is an integral com-
ponent of the basic audiologic examination. When bone
conduction thresholds are interpreted with an understand-
ing of air and bone conduction hearing, more accurate site-
0 of-lesion and etiology diagnoses can be made. It is hoped
that with this chapter the author has informed and moti-
20
vated readers to that end.
120
250 500 1,000 2,000 4,000 8,000 KEY REFERENCES
Frequency (Hz)
A full list of references for this chapter can be
FIGURE 4.13 Audiogram depicting hearing loss from
found at http://thePoint.lww.com. Below are the key refer-
ossicular (mallear) fixation. Air conduction: presurgery
ences for this chapter.
thresholds represented with filled circles and postsur-
gery thresholds represented with open circles. Bone American National Standards Institute. (2003) Maximum Permis-
conduction: presurgery forehead placement thresholds sible Ambient Noise Levels for Audiometric Test Rooms (ANSI
connected with a dotted line, presurgery mastoid place- S3.1-1999; Rev. ed.). New York, NY: Author.
ment thresholds not connected. (Modified from Dirks D. American National Standards Institute. (2004) Specifications for
(1985) Bone-conduction testing. In Katz J, ed. Hand- Audiometers (ANSI S3.6-2004). New York, NY: Author.
book of Clinical Audiology. Baltimore, MD: Williams and American Speech-Language-Hearing Association. (1990) Guide-
Wilkins, http://lww.com by permission.) lines for Audiometric Symbols. Rockville, MD: Author.
CHAPTER 4 • Bone Conduction Evaluation 59
American Speech-Language-Hearing Association. (2005) Guide- Kumar M, Maheshwar A, Mahendran S, Oluwasamni, Clayton
lines for Manual Pure-Tone Threshold Audiometry. Rockville, MI. (2003) Could the presence of a Carhart notch predict the
MD: Author. presence of glue at myringotomy? Clin Otolaryngol. 28(3),
Anderson H, Barr B. (1971) Conductive high-tone hearing loss. 183–186.
Arch Otolaryngol. 93(6), 599–605. Levinson MJ, Parisier SC, Jacobs M, Edelstein DR. (1989) The large
Attias J, Ulanovski D, Shemesh R, Kornreich L, Nageris B, Preis M, vestibular aqueduct syndrome in children: a review of 12 cases
Peled M, Efrati M, Raveh E. (2012) Air-bone gap component and the description of a new clinical entity. Arch Otolaryngol.
of inner-ear origin in audiograms of cochlear implant candi- 115, 54–58.
dates. Otol Neurotol. 33, 512–517. Marchbanks RJ, Burge DM, Martin AM, Bateman DE, Pickard J,
Barany E. (1938) A contribution to the physiology of bone con- Brightwell AP. (2005) The relationship between intracranial
duction. Acta Otolaryngol. (suppl 26), 1–4. pressure and tympanic membrane displacement. Br J Audiol.
Bekesy G. (1960) Experiments in Hearing. New York, NY: McGraw 24(2), 123–129.
Hill Book Co. Margolis RH. (2008) The vanishing air-bone gap: audiology’s
Boothroyd A, Cawkwell S. (1970) Vibrotactile thresholds in pure dirty little secret. Audiology Online. Available online at: http://
tone audiometry. Acta Otolaryngol. 69(1–6), 381–387. www.audiologyonline.com/articles/vanishing-air-bone-gap-
Campbell AP, Adunka OF, Zhou B, Qaqish BF, Buchman CA. (2011) audiology-901&referer=www.clickfind.com.au
Large vestibular aqueduct syndrome. Laryngoscope. 121, 352– Martin FN, Champlin CA, Chambers JA. (1998) Seventh survey of
357. audiometric practices in the United States. J Am Acad Audiol.
Carhart R. (1950) Clinical application of bone conduction audi- 9(2), 95–104.
ometry. Arch Otolaryngol. 51, 798–808. Merchant SN, Rosowski JJ, McKenna MJ. (2007) Superior semi-
Chien WW, Janky K, Minor LB, Carey JP. (2012) Superior semicir- circular canal dehiscence mimicking otosclerotic hearing loss.
cular canal dehiscence size: multivariate assessment of clinical Adv Otorhinolaryngol. 65, 137–145.
impact. Otol Neurotol. 33, 810–815. Nober EH. (1964) Pseudoauditory bone conduction thresholds.
de Jong M, Perez R, Adelman C, Chordekar S, Rubin M, Kirksunov J Speech Hear Disord. 29, 469–476.
L, Sohmer H. (2011) Experimental confirmation that vibra- Paparella MM, Morizono T, Le CT, Mancini F, Sipilä P, Choo YB,
tions at soft tissue conduction sites induce hearing by way of a Lidén G, Ki CS. (1984) Sensory-neural hearing loss in otitis
new mode of auditory stimulation. J Basic Clin Physiol Phar- media. Ann Otol Rhinol Laryngol. 93, 623–629.
macol. 22(3), 55–58. Rudnick E, Sismanis A. (2005) Pulsatile tinnitus and sponta-
Dirks D. (1964) Factors related to bone conduction reliability. Arch neous cerebrospinal fluid rhinorrhea: indicators of benign
Otolaryngol. 79, 551–558. intracranial hypertension syndrome. Otol Neurotol. 26(2),
Dirks D, Malmquist C. (1969) Comparison of frontal and mastoid 166–168.
bone conduction thresholds in various conduction lesions. Sismanis A. (1987) Otologic manifestations of benign intracranial
J Speech Hear Res. 12, 725–746. hypertension syndrome. Laryngoscope. 97(8, Pt 2, suppl 42),
Feldmann H. (1970) A history of audiology: a comprehensive 1–17.
report and bibliography from the earliest beginnings to the Steiger JR, Saccone P, Watson KN. (2007) Assessment of objective
present. In: Tonndorf J, ed. Translations of the Beltone Institute pulsatile tinnitus in a patient with syringohydromyelia. J Am
for Hearing Research. Chicago, IL: The Beltone Institute for Acad Audiol. 18(3), 197–206.
Hearing Research; pp 11–111. Stenfelt S, Goode RL. (2005) Bone conducted sound: physiological
Frank T. (1982) Forehead versus mastoid threshold differences and clinical aspects. Otol Neurotol. 26, 1245–1261.
with a circular tipped vibrator. Ear Hear. 3, 91–92. Stenfelt S, Puria S, Hate N, Goode RL. (2003) Basilar membrane
Hall CM, Croutch C. (2008) Pseudosensory-neural hearing loss. and osseous spiral lamina motion in human cadavers with air
Hear Rev. 16(1), 18–22. and bone conduction stimuli. Hear Res. 181, 131–143.
Herzog H, Krainz W. (1926) Das knochenleitungsproblem. Z Hals Studebaker GA. (1967) Clinical masking of the nontest ear. J Speech
Usw Heilk. 15, 300–306. Hear Disord. 32, 360–371.
Hulecki LR, Small SA. (2011) Behavioral bone conduction thresh- Tonndorf J. (1968) A new concept of bone conduction. Arch Oto-
olds for infants with normal hearing. J Am Acad Audiol. 22, laryngol. 87, 49–54.
81–92. Yasan H. (2007) Predictive role of Carhart’s notch in pre-operative
Jackler RK, De La Cruz A. (1989) The large vestibular aqueduct assessment for middle-ear surgery. J Laryngol Oto. 121, 219–
syndrome. Laryngoscope. 99(12), 1238–1243. 221.
C H A P T ER 5
Speech Audiometry
61
62 SECTION I Ş #BTJD5FTUTBOE1SPDFEVSFT
.FUIPEPǨ1SFTFOUBUJPO 40
Conductive
c
effects because the task difficulty is too great to show subtle
100 m
Normal changes in performance, whereas scores above 80% are often
1
2
affected by ceiling effects because the task difficulty is too
80 easy to be sensitive to performance changes. For an individ-
Cochlear
ual with a steep slope, the measurements should be made in
3
small (dB) steps to obtain valid results, whereas a shallow
60
function allows for larger step sizes to obtain valid results.
When selecting test material, it is best to choose stimuli
40 that produce a steep function, which suggests the materials
are homogeneous with respect to the task (Wilson and
Margolis, 1983).
Percent correct recognition
20
0 3FTQPOTF.PEF
The response mode for speech audiometry is generally verbal.
100 However, for SDT the response mode can be similar to that
4 Retrocochlear
e a
of puretone thresholds, where patients can push a button or
80 raise their hand when they hear the speech stimuli. A written
response is generally avoided because of the increased test
Rollover time and reliance on the patient’s ability to write and spell.
60 5 For testing children or nonverbal individuals, see Chapters 24
and 31.
40
SPEECH RECOGNITION
20
6 THRESHOLD
0 4UJNVMJ
0 20 40 60 80 100 Spondaic words are generally used for obtaining SDTs and
Presentation level (dB HL) SRTs and are recommended by ASHA (1988). Spondaic
FIGURE 5.2 Psychometric functions of word recogni- (adjective) words or spondees (noun) are two-syllable words
tion performance illustrating various types of hearing with equal stress on both syllables. Lists of spondaic words
loss can be seen in both panels as a function of percent for assessing hearing loss for speech were first developed at
correct (ordinate) and presentation level (abscissa). The the Harvard Psychoacoustic Laboratories (PAL) by Hudgins
top panel illustrates a sample psychometric function for et al. (1947). Criteria for selection of words included a high
a listener with normal hearing (open circles), conduc- level of word familiarity, phonetic dissimilarity, and homo-
tive hearing loss (curve #1), and cochlear hearing loss geneity with respect to audibility. Of the original 42 spond-
(curves #2 and #3). The bottom panel shows possible ees identified by Hudgins et al. (1947), 36 of the most famil-
psychometric functions for retrocochlear hearing loss iar were used in the development of the Central Institute
(curves #4, #5, and #6). (Adapted from Department of
for the Deaf (CID) W-1 and W-2 tests (Hirsh et al., 1952).
Veterans Affairs (1997).)
Currently, ASHA (1988) recommends the use of 15 of the
original 36 spondees used in the CID W-1 and W-2 tests
as demonstrated by the example functions drawn in for obtaining SRTs. These 15 words, shown in Table 5.1, are
Figure 5.2. In the top panel of Figure 5.2, curve #2 shows the most homogeneous with respect to audibility (Young
a function that reaches maximum performance (88%) at et al., 1982), as is the list of 20 easily pictured spondees for
80 dB HL and plateaus through 100 dB HL. In the bottom use with children (Frank, 1980).
panel of Figure 5.2, curve #4 shows a function that reaches
maximum performance (85%) at approximately 60 dB HL,
and then performance decreases as level is increased, which
3FDPNNFOEFE4355FTUJOH1SPUPDPM
is depicted by a rollover in the shape of the function. The SRT measurement involves four steps: (1) instructions;
Also of importance when describing performance in (2) familiarization; (3) initial and test phase for the descend-
terms of the psychometric function is the slope of the func- ing technique; and (4) calculation of threshold. Wilson
tion. The slope of the function is typically calculated from et al. (1973) described these steps, which were subsequently
the dynamic portion of the function that ranges between set forth by ASHA (1988) as a guideline for determining
20% and 80%. Scores below 20% are often affected by floor an SRT.
64 SECTION I Ş #BTJD5FTUTBOE1SPDFEVSFT
use of an SRT (Wilson and Margolis, 1983). In addition, only recognition performance in quiet, it is important to note that
58% of audiologists complete the familiarization step of the empirical data (Bilger, 1984) support that speech recognition
test protocol, and 60% do not follow the recommended performance is a single construct and performance at one
ASHA (1988) protocol but, instead, determine an SRT using level of linguistic complexity (e.g., sentences) can be predicted
a modified Hughson–Westlake procedure with two out of by performance at another level (e.g., monosyllabic words).
three criteria (Martin et al., 1998). These observations are The systematic relationship between recognition per-
of concern because the SRT is a valid and reliable proce- formances at various levels of linguistic complexity by
dure when standardized recorded materials are used with adults with acquired hearing losses was demonstrated by
a specified testing procedure. The SRT is also particularly Olsen et al. (1997). Performance for phonemes, words in
useful when assessing response reliability in an individual isolation, and words in sentences was measured for 875 lis-
who appears to be malingering (see Chapter 33). teners with sensory/neural hearing loss. They found that
the scores for words in isolation and in sentences were pre-
dictable from the phoneme recognition scores, with mean
SPEECH RECOGNITION IN QUIET prediction errors of only 6% and 12%, respectively. Thus,
The purpose of speech recognition testing in quiet is to assess for example, a person scoring 60% correct on a phoneme
how well a person can understand speech in a quiet environ- recognition task would be predicted to score 22% (±6%) for
ment when the level of the speech is loud enough to obtain the recognition of words in isolation and 42% (±12%) for
a maximum SRS (PBmax). The level necessary for a person the recognition of words in sentences.
with hearing loss to perform maximally is highly variable
from person to person and is dependent on the materials
used to obtain the SRS (Jerger and Hayes, 1977). We feel
.POPTZMMBCJD8PSET
that it is unfortunate that, in most audiology clinics, speech Historically, word lists such as the Northwestern University
recognition testing is assessed only at one presentation level Auditory Test Number 6 (NU No. 6; Tillman and Carhart,
(Wiley et al., 1995). The majority of audiologists select a 1966), the CID Auditory Test W-22 (CID W-22; Hirsh et al.,
single presentation level 30 to 40 dB SL re: SRT, meaning 1952), and the Phonetically Balanced 50 (PB-50; Egan, 1948)
that the materials are presented 30 to 40 dB above the SRT have been used to assess word recognition performance in a
(Martin et al., 1998; Wiley et al., 1995). Kamm et al. (1983) quiet background during audiologic evaluations.
found that speech recognition testing at 40 dB SL re: SRT The initial work of Egan (1944) outlined six princi-
did not approximate maximal performance for 40% of their pal criteria that the Psychoacoustics Lab at Harvard used
25 subjects with hearing loss. Evidence suggests that evaluat- to develop the PB-50 word lists. The six criteria were (1)
ing speech recognition abilities at more than one level cap- monosyllabic structure, (2) equal average difficulty of lists,
tures a portion of the psychometric function and allows a (3) equal range of difficulty of lists, (4) equal phonetic
better estimation of performance at PBmax. A procedure sug- composition of lists, (5) representative sample of American
gested by Wilson (2005, Personal communication) suggests English, and (6) familiar words. According to Hood and
the use of at least two levels with 25 words presented at each Poole (1980), it was assumed by Egan that meeting criteria
level. For persons with normal hearing or mild hearing loss 1, 4, 5, and 6 would ensure criteria 2 and 3. Further work
as evidenced by a puretone average (PTA) of ≤35 dB HL for to revise the PB-50 word lists by Hirsh et al. (1952) and
500, 1,000, and 2,000 Hz, the first level should be 50 dB HL Tillman et al. (1963) utilized the six criteria to create the
followed by the second level of 70 dB HL. For persons with W-22 word lists and the NU No. 6 word lists, respectively.
greater hearing loss, the first level should be 10 dB greater
than their PTA of 500, 1,000, and 2,000 Hz, and the second
1#
level should be 20 dB greater than the first level. If you are
unable to raise the second level 20 dB greater than the first The initial use of monosyllabic words for speech recogni-
level because of loudness discomfort issues, raise the second tion testing is attributed to Egan (1948) who worked in the
level as high as possible over the first level. PAL at Harvard University. His original pool of 1,000 words
Several types of materials are used to assess speech rec- was divided into 20 lists of 50 words, which collectively are
ognition ability in quiet such as sentences, nonsense syllables, known as the PAL PB-50 word lists. Each list was consid-
and the most commonly used stimuli, monosyllabic words. ered to be phonetically balanced such that the 50 words that
Previous research has shown that nonsense syllables are the composed a list were a proportionally correct representa-
most difficult of the three materials mentioned above for indi- tion of the phonetic elements in English discourse.
viduals to recognize, whereas sentences are the easiest. Recog-
nition performance of monosyllabic words falls on the per-
$*%8
formance continuum somewhere between nonsense syllables
and sentences. Although monosyllables are the most com- Hirsh et al. (1952) had five judges rate the familiarity of the
monly used stimuli in clinical settings for measuring speech 1,000 monosyllabic words selected by Egan for the PB-50
66 SECTION I Ş #BTJD5FTUTBOE1SPDFEVSFT
word lists, and 120 of the PB-50s were selected along with 80
TA B LE 5 . 2
other words to compose the new word lists. These 200 very
common words were selected and phonetically balanced Critical Difference Ranges (95%) for
into four 50-word lists known as the CID W-22 word lists. Select Percent Correct Scores as a
The CID W-22 word lists were recorded onto magnetic tape Function of Number of Test Items
as spoken by Ira Hirsh who monitored his voice on a VU
meter stating the carrier phrase “You will say” and letting % Correct 8PSET 8PSET 8PSET
each target word fall naturally at the end of the phrase. The 0 0–20 0–8 0–4
CID W-22 word lists are some of the most popular word 10 0–50 2–24
lists used by audiologists for measuring suprathreshold 20 0–60 4–44 8–36
word recognition ability in quiet. 30 10–70 14–48
40 10–80 16–64 22–58
/6/0 50 10–90 32–68
60 20–90 36–84 42–78
Lehiste and Peterson (1959) devised lists of CNCs (consonant– 70 30–90 52–86
syllable nucleus [vowel]–consonant) that were phonemi- 80 40–100 56–96 64–92
cally balanced versus phonetically balanced. That is, lists 90 50–100 76–98
that were developed to be phonetically balanced did not 100 80–100 92–100 96–100
take into account the position of the sound in a word and
From Thornton and Raffin (1978).
how the acoustic realization of the sound would be affected
by coarticulatory factors. Lehiste and Peterson argued that phoneme scoring (Boothroyd, 1968). In a 25-word list of
phonemic balancing could be accomplished by allowing for monosyllables, you have 25 items to score using whole-word
the frequency of occurrence of each initial consonant, vowel scoring, whereas you would have 50 to 75 possible items to
nucleus, and final consonant to be similar across CNC word score using phoneme scoring.
lists. The Lehiste and Peterson lists were condensed into
four lists of 50 words known today as the NU No. 6.
Historically, 50 words were included in each test list to
4FOUFODF5FTUT
facilitate phonetic balancing and to allow for a simple con- Sentence-level tests were developed at Bell Laboratories
version from number correct to percent correct following (Fletcher and Steinberg, 1929) and were used during World
testing. Studies have examined the benefits of abbreviat- War II to evaluate military communication equipment
ing the number of words used per list from 50 to 25 with (Hudgins et al., 1947). Until the development of the CID
mixed results in terms of test–retest reliability (Beattie et al., Everyday Sentences (Silverman and Hirsh, 1955), no sentence
1978; Elpern, 1961). The most important work regarding test had received clinical acceptance. The CID sentences con-
this issue of half versus full lists was the examination of sist of 10 lists of 10 sentences each with 50 key words in each
speech recognition data as a binomial variable by Thornton list. Interrogative, imperative, and declarative sentences are
and Raffin (1978). As discussed in the earlier section on included. Responses can be spoken or written and are scored
psychometric functions, performance ability between 20% as the percentage of key words correctly recognized.
and 80% is the most variable, whereas performance ability is The basis for the use of sentences in the clinical assess-
least variable at either extreme of the function (Egan, 1948). ment of speech recognition abilities is that sentences pro-
The results of Thornton and Raffin (1978) support these vide a more “realistic” listening condition for everyday com-
early views on performance using the binomial distribution munication than does the use of isolated words or nonsense
to mathematically model word recognition performance. It syllables (Bess, 1983; Silverman and Hirsh, 1955). Although
indicates that the accuracy between scores for the same lis- sentences may have greater face validity than other stim-
tener depends on the number of words used per list and the uli, they also provide semantic, syntactic, and lexical clues
listener’s level of performance. In addition, Thornton and (i.e., extrinsic redundancies). Thus it is difficult to distin-
Raffin created a table of the lower and upper limits of the guish individuals who do well on a speech recognition task
95% critical differences for percentage scores as a function because they have good speech recognition skills or because
of test items. Table 5.2 shows the critical differences a retest they make good use of top-down (linguistic, cognitive) pro-
score would need to exceed to be considered statistically cessing skills. Another complication of the use of sentence
different for the original test score. As seen in Table 5.2, as materials is that, as length exceeds seven to nine syllables,
the number of items increases, the range decreases, suggest- memory constraints, particularly in the elderly, may impact
ing that as the set size increases, the variability in the scores performance (Miller, 1956). Despite these potential limita-
decreases, allowing for the detection of more subtle differ- tions, several clinically useful sentence tests have been devel-
ences in performance. One way to increase set size without oped. Because the ability to use context is preserved even in
increasing test time is to move from whole-word scoring to older adults with hearing loss, for most patient populations,
CHAPTER 5 Ş 4QFFDI"VEJPNFUSZ 67
sentence tests are typically too easy (ceiling effect) and, netic context. Contrasts include intonation; vowel height
therefore, fail to distinguish among levels of difficulty. How- and place; and initial and final consonant voicing, contin-
ever, they are well suited as adaptive noise procedures (see uance, and place. In addition to minimizing the effects of
“Speech Recognition in Noise” section) instead of supra- lexical context and word familiarity on performance, the
threshold quiet procedures. An exception to this trend is the use of nonsense syllables allows for detailed examination
use of sentence tests in quiet for individuals with severe-to- of phonetic errors. Despite these advantages, nonsense syl-
profound hearing losses. lables lack face validity with regard to being representative
For the profoundly impaired patient population, the of everyday speech communication.
City University of New York (CUNY) Sentences (Boothroyd Minimization of lexical context and word familiar-
et al., 1988), which consist of 72 sets of topic-related sen- ity effects, while allowing for the analysis of errors and
tences, were designed to assess the use of cochlear implants confusions, can also be accomplished through the use of
and tactile aids as supplements to speech reading. Each sen- closed-set tests using real word stimuli. Classic tests of pho-
tence in a set is about one of 12 topics: food, family, work, neme recognition include the Modified Rhyme Test (MRT;
clothes, animals, homes, sports/hobbies, weather, health, House et al., 1955; Kruel et al., 1968) and its variations (e.g.,
seasons/holidays, money, or music. Each set contains four Rhyming Minimal Contrasts Test (Griffiths, 1967) and the
statements, four questions, and four commands and one California Consonant Test (CCT; Owens and Schubert,
sentence of each length from 3 to 12 words, for a total of 1977). The MRT consists of 50 test items each with six
102 words per set. Performance is scored as the number response alternatives. Twenty-five of the items differ by
of words correct. Original recordings were on laser-video the initial consonant (i.e., bent, went, sent, tent, dent, and
disc and were presented via the Computer Assisted Speech rent), and the other 25 items differ by the final consonant
Perception Software (CASPER; Boothroyd, 1987) program. (i.e., peas, peak, peal, peace, peach, and peat). The CCT also
The CUNY Sentences are being converted to DVD format consists of 100 items but uses a four-choice, rather than a
with upgrades to the CASPER software as part of current six-choice, response format in assessing the perception of
work at the Rehabilitation Engineering Research Center 36 initial consonant items and 64 final consonant items. The
(RERC) on Hearing Enhancement at Gallaudet University perception of medial vowels as well as initial and final con-
(http://www.hearingresearch.org/). sonants was added in the University of Oklahoma Closed
Response Speech Test by Pederson and Studebaker (1972).
/POTFOTF4ZMMBCMF5FTUT1IPOFNF A closed-set format is also used in the Speech Pattern
Contrast (SPAC) test (Boothroyd, 1984), which was designed
3FDPHOJUJPO5FTUT to assess the ability to perceive both suprasegmental (i.e.,
The effects of lexical context and word familiarity on test stress and intonation) and segmental phonologically (i.e.,
performance can be minimized by the use of nonsense sylla- vowel height and place, initial and final consonant voicing,
ble and/or closed-set phoneme recognition tests. Nonsense continuance, and place) relevant distinctions. Test length of
syllables were one of the first materials used to assess speech SPAC is minimized by combining two segmental contrasts
recognition ability during the development of telephone cir- in one subset (e.g., final consonant voicing and continu-
cuits at Bell Telephone Laboratories (Fletcher and Steinberg, ance) with four items (e.g., seat-seed-cease-sees). Although
1929). However, clinical use of nonsense syllables for those the SPAC as well as other speech feature tests and NSTs are
with hearing loss did not occur until the 1970s when two not routinely used in clinical audiology, the information
carefully developed tests became available—the CUNY provided about the details of an individual’s speech percep-
Nonsense Syllable Test (CUNY-NST; Levitt and Resnick, tion ability can be quite useful when assessing the need for
1978) and the Nonsense Syllable Test (NST; Edgerton and and the benefits of hearing aids and cochlear implants for
Danhauer, 1979). The CUNY-NST is a closed-set test con- both children and adults.
sisting of seven subtests, each of which has seven to nine
consonant–vowel (CV) or vowel–consonant (VC) syllables.
The CUNY-NST assesses perception of the consonants most
SPEECH RECOGNITION IN NOISE
likely to be confused by individuals with hearing loss using The most common complaint expressed by adults with
three vowel contexts. The Edgerton–Danhauer NST is an hearing loss is the inability to understand a speaker when
open-set test consisting of 25 nonsense bisyllabic CVCV listening in an environment of background noise. In 1970,
items, allowing for assessment of the perception of 50 con- Carhart and Tillman suggested that an audiologic evaluation
sonant and 50 vowel stimuli. More recently, Boothroyd et al. should include some measure of the ability of an individual
(1988) described the Three Interval Forced Choice Test of to understand speech when in a background of speech noise.
speech pattern contrast perception (THRIFT), an NST that Prior to the revival of the directional microphone in the late
can be used with children 7 years of age or older (Hnath- 1990s, however, the information gained from a speech-in-
Chisolm et al., 1998). The THRIFT measures the perception noise task for most rehabilitative audiologists was not perti-
of nine speech pattern contrasts presented in varying pho- nent to the selection of amplification because of the fact that
SECTION I Ş #BTJD5FTUTBOE1SPDFEVSFT
most hearing aids were mainly selected based on gain, slope, for separation between individuals with normal hearing
and output curves. Thus in the technology-driven field of and those with hearing loss (Beattie, 1989; McArdle et al.,
audiology, speech-in-noise testing failed to gain a place in 2005b). Typically, individuals with sensory/neural hear-
the traditional audiologic evaluation. The revolution of ing loss require the signal to be 10 to 12 dB higher than
digital hearing aids and their multitude of features, such the noise to obtain 50% performance on the psychometric
as directional microphones, noise reduction strategies, and function, whereas individuals with normal hearing on aver-
digital signal processing strategies, have created an impor- age obtain 50% performance at an S/N of 2 to 6 dB. McArdle
tant reason for utilizing speech-in-noise tasks on a routine et al. (2005a, 2005b) found mean performance on the
basis when evaluating an individual with hearing loss. Words-in-Noise (WIN) test (Wilson, 2003) to be 12.5 and
For the past 40 years, researchers have observed that lis- 6 dB S/N for 383 listeners with hearing loss and 24 listeners
teners with hearing loss show a greater disadvantage when with normal hearing, respectively. Similarly, under similar
listening in a competing speech background compared with experimental conditions, Dirks et al. (1982) and Beattie
listeners with normal hearing, such that the S/N needed for (1989) who used CID W-22 word lists in noise found 50%
the listener with hearing loss is 10 to 15 dB greater than that points of 12 and 11.3 dB S/N, respectively, for listeners with
needed by listeners with normal hearing (e.g., Carhart and hearing loss.
Tillman, 1970). Plomp (1978) reported that for every 1-dB Several studies have examined the possibility of predict-
increase in signal over the competing noise, a listener with ing the ability of an individual to understand speech-in-noise
hearing loss would receive, on average, an improvement of using puretone audiograms and SRSs in quiet without suc-
3% in terms of ability to recognize the signal. Thus, a 10-dB cess (Beattie, 1989; Carhart and Tillman, 1970; Cherry, 1953;
improvement in S/N should add 30% in terms of intelligi- Dirks et al., 1982; Killion and Niquette, 2000; Plomp, 1978;
bility as measured by open-set, speech recognition tests for Wilson, 2003). The data in Figure 5.3 were compiled from
listeners with hearing loss. two studies (McArdle et al., 2005a, 2005b). In the figure, per-
The addition of background noise to a speech recog- formance on a word recognition in quiet task at 80 dB HL
nition task has been shown to improve the sensitivity and is graphed on the ordinate as a function of 50% points on
validity of the measurement (Beattie, 1989; Sperry et al., the WIN test along the abscissa. The same words spoken by
1997). In terms of improving sensitivity, the addition of the same speaker were used for both the recognition task in
multiple S/Ns increases the difficulty of the task and allows quiet and in noise. The shaded area of the figure represents
100
176 (45.5%)
107 (27.6%)
80
Percent correct recognition at 80 dB HL
60 104 (26.9%)
the range of performance by 24 listeners with normal hear- Recognition performance is scored as the percentage of LP
ing on the WIN. and HP words correctly perceived. By providing both LP
Two main observations can be seen in the data in Fig- and HP scores, the SPIN test not only allows for the assess-
ure 5.3: (1) only 5 out of 387 listeners with hearing loss per- ment of the acoustic-phonetic components of speech, but
formed in the normal range on both the recognition task in also examines the ability of an individual to utilize linguistic
quiet and in noise; and (2) 45.5% of the 387 listeners with context.
hearing loss had word recognition scores in quiet at 80 dB In the 1980s, two additional tests designed to assess rec-
HL that were ≥90% correct. Thus, it is of interest to note ognition of everyday speech based on correct word recogni-
that although 73% of the listeners with hearing loss had tion performance in sentence length stimuli were developed.
word recognition scores in quiet ≥80%, the overwhelming The Connected Speech Test (CST; Cox et al., 1987), which
majority of these listeners displayed abnormal performance was developed as a criterion measure in studies of hearing
on a word recognition task in noise. This finding suggests aid benefit, consists of 48 passages of conversationally pro-
that speech-in-noise testing may be considered a stress test duced connected speech. Each passage is about a familiar
of auditory function (Wilson, 2013, Personal communi- topic and contains 10 sentences. Sentence length varies from
cation). In addition, it is clear that word recognition abil- 7 to 10 words, and there is a total of 25 key words in each
ity in noise is not easily predicted by word recognition in passage. Sentences are presented at an individually deter-
quiet for listeners with hearing loss other than to say that mined S/N, and performance is scored as the number of key
listeners with poor recognition ability in quiet also perform words correct.
poorly on word recognition tasks in noise. Because we are The most recent application of sentence length stimuli
unable to predict the ability of an individual to understand is in tests that are scored in terms of the decibel-to-noise
speech in a noisy background, audiologists should use the ratio required to achieve 50% correct performance. The two
tests available for quantifying the S/N needed by the listener most common tests are the Hearing in Noise Test (HINT;
to understand speech in noise. Several materials, described Nilsson et al., 1994) and the Quick Speech-in-Noise (Quick-
in the following section, have been developed to measure SIN) test (Killion et al., 2004). The two tests vary in the
speech-in-noise performance. type of sentences and type of noise used. The HINT uses
the Bamford–Kowal–Bench (BKB) Standard Sentence Lists
(Bench et al., 1979) that were compiled from the utterances
.BUFSJBMT of hearing-impaired children and contain straightforward
Initially, efforts in speech-in-noise testing were focused on vocabulary and syntax. Sentences are presented in sets of
sentence-level materials to make the task more of a real- 10 sentences, and the listener must repeat the entire sen-
world experience; however, normal everyday sentences tence correctly to receive credit. The noise used is speech-
were too easy, and further manipulation of the sentences spectrum noise that is held constant while the signal is varied
was needed to obtain the 50% correct point of perfor- to find the 50% correct point. The QuickSIN uses the Har-
mance for a listener on a speech-in-noise task. Speaks and vard Institute of Electrical and Electronics Engineers (IEEE,
Jerger (1965) developed the Synthetic Sentence Identifica- 1969) sentences, which are a collection of low-context,
tion (SSI) test to minimize the effect of contextual cues that meaningful sentences, whose phonetic balance is similar to
often made it easy to understand sentence-level materials that of English. In the QuickSIN, there are six sentences per
even in a background of noise. The stimuli are called syn- list, and each sentence contains five key words. All sentences
thetic sentences because they are not actual sentences, but are presented in multitalker babble with the five key words
rather they contain normal English phonemes and syntax in each sentence scored as correct or incorrect. Recently, the
but no semantic context. An example of a sentence is “Small BKB-SIN test (Etymotic Research, 2005) was developed for
boat with a picture has become.” The task of the listener is use with children (ages ≥5), cochlear implant patients, and
to select which one of 10 sentences displayed on a response adults for whom the QuickSIN test is too difficult.
form is perceived when presented against a competing story More recently, monosyllabic and digit materials in mul-
describing the life of Davy Crockett. The competing story titalker babble have been developed at the Auditory Research
can be presented either contralaterally or ipsilaterally. Lab of the James H. Quillen Veterans Affairs Medical Center
Another interesting sentence-level test, the Speech Per- (Wilson, 2003; Wilson and Strouse, 2002; Wilson and Weak-
ception in Noise (SPIN) test (Kalikow et al., 1977), varies ley, 2004). The word and digit materials have been shown to
the amount of semantic context that leads to the last word be sensitive to the different recognition abilities of normal-
of each sentence, which is a monosyllabic target word. The hearing and hearing-impaired adults in multitalker babble
SPIN test has eight forms of 50 sentences each that are pre- (Wilson et al., 2003; Wilson and Weakley, 2004). McArdle
sented at a fixed S/N of 8 dB. The target word in 25 of the et al. (2005b) examined the effect of material type (i.e., digits,
sentences has low predictability (LP) given the limited clues words, and sentences) on S/N loss for young listeners with
from the preceding context, and the other 25 have high normal hearing and older listeners with hearing impair-
predictability (HP) from the preceding sentence context. ment. The three speech-in-noise tests that were examined
70 SECTION I Ş #BTJD5FTUTBOE1SPDFEVSFT
24
20
(29–40%)
12
(43–60%)
include: (1) QuickSIN (Etymotic Research, 2001); (2) WIN the construct was described by Carhart et al. (1969) and
test (Wilson and Strouse, 2002; Wilson, 2003); and (3) digit termed perceptual masking. Energetic masking is described
triplets-in-multitalker babble (Wilson and Weakley, 2004). in the literature as peripheral masking, such that a stimulus
As expected, the younger listeners performed better than the interferes with the perception of a second stimulus making
older listeners on all three tasks. For the older listeners with the first stimulus a “masker.” Nonenergetic masking, or
hearing loss, the S/N required for 50% recognition of each informational masking, occurs when the target stimulus is
material type presented was −4, 12.4, and 11.7 dB S/N for similar to the masking stimulus, creating uncertainty for the
digits, words, and sentences, respectively. Figure 5.4 shows a listener as to whether he or she is hearing the target or the
bivariate plot of the 50% points for the older listeners with masker. Informational masking can occur at different pro-
hearing loss on both the QuickSIN (abscissa) and the WIN cessing levels (e.g., phonetic, semantic) and is greater for a
(ordinate). The diagonal line in Figure 5.4 represents equal speech masker than noise, especially when the talker is the
performance on both QuickSIN and the WIN. As can be same gender or, even worse, the same talker for both the tar-
seen, mean performance, as indicated by the bold filled cir- get and the masker (Brungart, 2001). Informational mask-
cle, is close to the diagonal line, suggesting that either the use ing has a greater effect when the masker is a single speaker
of monosyllabic words or IEEE sentences in this population versus a background of multiple talkers since once you add
provided a similar measure of performance in noise. More more than a couple of speakers the background “informa-
importantly, the performance difference at the 50% point tion” in the masker becomes hard to distinguish. Most com-
between normal-hearing listeners and hearing-impaired lis- mercially available speech-in-noise tests involve multitalker
teners was 7.6 dB for both words and sentences, suggesting babble, which decreases the effects of informational mask-
that words and sentences in a descending speech-in-noise ing but future studies in this area are warranted.
task were equally sensitive to the effects of hearing loss. For
a more in-depth discussion of the use of words or sentences CONSIDERATIONS FOR SPEECH
in speech-in-noise testing, see Wilson and McArdle (2005). AUDIOMETRY IN CHILDREN AND
A new body of literature has evolved in the area of
speech-in-noise testing focused on informational masking,
OTHER SPECIAL POPULATIONS
which is defined as nonenergetic masking that increases Speech stimuli are used for the behavioral assessment of the
threshold as a result of uncertainty (Wilson et al., 2012). auditory function of a child from birth onward. With very
Although the term informational masking is more recent, young infants, speech stimuli might be used to elicit a startle
CHAPTER 5 Ş 4QFFDI"VEJPNFUSZ 71
response, and as the infant develops, SDTs and SRTs can be on speech perception performance in the same way that the
obtained using a variety of behavioral techniques, such as developing linguistic abilities of a child are important to con-
visual response audiometry or play audiometry. The tech- sider in assessment. Although recorded materials are avail-
nique used will be dependent on the motor capabilities of able in languages such as Spanish (Wesilender and Hodgson,
the child. In addition to considering the motor capacity 1989), unless the audiologist speaks Spanish, errors could
of the child for responding (e.g., head turn, picture point- be made in determining correct from incorrect responses.
ing), the phonologic, receptive, and expressive language Wilson and Strouse (1999) suggest the use of a multimedia
skills of the child need to be considered during speech rec- approach similar to that used by McCullough et al. (1994)
ognition testing. For example, by the time a child can func- with nonverbal patients. Stimulus words are presented in the
tion at about a 5-year-old level, conventional SRTs can be patient’s native language, and the person responds by select-
obtained as long as the spondee words used are within the ing the perceived word from a closed set of alternatives shown
receptive vocabulary of the child (ASHA, 1988). Similarly, on a touchscreen monitor. Scoring could be done automati-
several suprathreshold pediatric speech recognition tests, cally through a software program.
such as the Word Intelligibility Picture Identification (WIPI)
test (Ross and Lerman, 1970) and the Northwestern Uni- CLINICAL FUNCTIONS OF SPEECH
versity Children’s Perception of Speech (NU-CHIPS) test
(Elliot and Katz, 1980), are comprised of words expected to
RECOGNITION MEASURES
be within the receptive vocabulary of a child. One of the historical purposes for the use of speech recog-
A variety of speech recognition tests are available for nition testing in the clinical test battery was as a diagnos-
use with children. For example, both the WIPI and NU- tic tool for determining the location of peripheral auditory
CHIPS use monosyllabic words presented in a closed-set pathology. Figure 5.2 illustrates typical psychometric func-
format. Other test paradigms allow for the assessment of the tions obtained in quiet for the recognition of monosyllabic
perception of speech feature contrasts (e.g., Imitative Test of words by listeners with normal auditory function as well
Speech Pattern Contrast Perception [IMSPAC]; Kosky and as those with conductive, sensory (cochlear), and neural
Boothroyd, 2003; Visually Reinforced Audiometry Speech (retrocochlear) hearing losses. For normal-hearing listen-
Pattern Contrast Perception [VRASPAC]; Eisenberg et al., ers, regardless of word recognition materials used, when the
2004), syllabic pattern and stress (e.g., Early Speech Percep- presentation level is about 30 dB higher than the dB level
tion [ESP] test; Moog and Geers, 1990), lexically easy versus needed for 50% performance (i.e., SRT), a score of 90% or
lexically hard words (e.g., the Lexical Neighborhood Test better can be expected. For individuals with hearing loss,
[LNT]; Kirk et al., 1995), and words in sentences presented when listening at a moderate level, scores may range any-
in quiet (e.g., BKB sentences; Bamford and Wilson, 1979) where from 100% correct to 0% correct. Because of this
and in noise (e.g., BKB-SIN test; Etymotic Research, 2005), wide dispersion of speech recognition performance across
a task which requires word and sentence recognition in both individuals with various types of hearing loss, speech rec-
quiet and noise (e.g., Pediatric Speech Intelligibility [PSI] ognition testing provides only limited diagnostic informa-
test; Jerger and Jerger, 1984). tion if testing is done at only one intensity level (see, for
In addition to children, special consideration also needs discussion, Bess, 1983; Penrod, 1994). When testing is com-
to be given to the assessment of speech perception abilities pleted at several intensity levels, however, certain patterns
in profoundly hearing-impaired adults, nonverbal patients, of performance can be expected with certain hearing losses
and multilingual patients (Wilson and Strouse, 1999). Pro- (Wilson and Strouse, 1999).
foundly hearing-impaired adults typically obtain scores of Individuals with conductive hearing loss tend to exhibit
zero on standard speech recognition tests. As a result, batter- little difficulty on speech recognition tests, with perfor-
ies such as the Minimal Auditory Capabilities (MAC) battery mance typically at 90% or better when testing is conducted
have been developed (Owens et al., 1985). Tasks included in at moderate SLs (curve #1 of Figure 5.2). A patient with a
the MAC battery involve discrimination of syllabic number, sensory/neural hearing loss will generally have poorer SRSs
noise versus voice, and statements versus questions; recog- than would a person with the same degree of hearing loss
nition of spondaic words and consonants and vowels in real due to conductive pathology. Although a very wide range
words in closed-set tasks; and more standard open-set rec- of scores are found across patients with cochlear as well as
ognition of words in isolation and sentences. retrocochlear hearing losses, SRSs tend to be poorest among
Nonverbal patients are often encountered in medical set- those with retrocochlear pathology. Although some individ-
tings where patients may have medical conditions such as lar- uals with cochlear losses will demonstrate a slight decrease in
yngectomies or cerebral vascular accidents. For these patients, recognition performance when intensity levels are increased
written responses or picture pointing tasks may be appropri- beyond the initial level needed for obtaining maximum
ate. Increases in the ethnic diversity of the US population can performance (curve #3 of Figure 5.2), marked decreases in
result in the audiologist assessing a patient who speaks little performance with increasing intensity after maximum per-
to no English. Limited knowledge of English could impact formance is achieved are typically characteristic of a neural
72 SECTION I Ş #BTJD5FTUTBOE1SPDFEVSFT
loss (curves #4 and #5 of Figure 5.2). The phenomenon of can provide insight into the use of appropriate amplification
reduced SRSs with increasing intensity that occurs with ret- and/or cochlear implant speech processing strategies.
rocochlear pathology is referred to as the “rollover” effect In addition to testing in noise, Brandy (2002) points
(Bess et al., 1979; Dirks et al., 1977). In addition to rollover, out that audiologists can gain insight into the (re)habili-
retrocochlear pathology would be suspected in the presence tative needs of patients through recording incorrect word
of a significant discrepancy in SRSs between two ears or responses, with subsequent examination of speech feature
lower than expected performance at all presentation levels error patterns (e.g., fricatives, stops, glides). Other reha-
(curve #6 of Figure 5.2). bilitative applications of speech audiometry include the
Assessment of the central auditory system also uses use of materials that allow for the assessment of use of
measures of speech recognition performance. Tasks can be linguistic context (Flynn and Dowell, 1999) and auditory–
presented either monaurally or binaurally. Monaural tasks visual performance (Boothroyd, 1987) and for the deter-
use distorted, degraded, or low-redundancy speech stimuli mination of most comfortable and uncomfortable listen-
to reduce extrinsic redundancies. Methods of degrada- ing levels (Punch et al., 2004). Information obtained with
tion include filtering (Bocca et al., 1955), time compression a variety of materials presented in a variety of paradigms
(Wilson et al., 1994), and reverberation (Nabelek and can be useful in determining optimal device settings, start-
Robinson, 1982). Binaural tests were designed to assess the ing points for therapeutic intervention, and directions for
ability of the central auditory nervous system to integrate patient counseling.
or resynthesize the different parts of a signal that are pre-
sented to each of the two ears. For example, in the Binaural
Fusion test (Matzker, 1959), a low-pass filtered version of
FOOD FOR THOUGHT
a word is presented to one ear, whereas a high-pass filtered 1. Given the use of SRTs as a verification for puretone
version of the same word is presented to the opposite ear. A thresholds in every patient has been questioned since
normal-functioning central auditory nervous system is able 1983, what is the cost benefit of measuring SRTs in a busy
to integrate the information from each ear and respond with practice? Might the time be better spent gathering other
the correct target word. On the other hand, binaural dich- information about the patient’s auditory functioning?
otic tasks involve the presentation of different speech signals 2. Determining the presentation level for word recognition
simultaneously to both ears. The patient must repeat either testing in quiet historically has been 40 dB SL, re: SRT.
or both of the signals depending on the test used. Com- Given the evidence for this level is based on listeners with
mon clinical dichotic tests include Dichotic Digits (Kimura, normal hearing, what is most appropriate for determin-
1961), the Staggered Spondaic Word test (Katz, 1962), and ing the presentation level(s) for listeners with hearing
the Dichotic Sentence Identification test (Fifer et al., 1983). loss? Additionally in an audiologic evaluation, what are
The interpretation of performance on tests designed to assess the benefits of using 25 words at each of two or three
auditory processing abilities is beyond the scope of the pres- presentation levels versus 50 words at one presentation
ent chapter and is discussed in detail in Chapters 27 and 29. level?
In addition to diagnostic applications, speech rec- 3. Speech recognition in quiet has been performed by audi-
ognition testing has an important role in estimating the ologists since the 1950s. Given that the most common
adequacy and effectiveness of communication and in the complaint of listeners with hearing loss is their difficulty
planning and evaluation of (re)habilitative efforts, includ- communicating in noisy situations, should the stan-
ing the selection and fitting of hearing aids and cochlear dard comprehensive audiometric battery be modified to
implants. For example, many audiologists label speech rec- include speech-in-noise measures?
ognition performance for monosyllabic words presented in
quiet performance as “excellent,” “good,” “fair,” or “poor” in
an attempt to link performance to the adequacy of commu- KEY REFERENCES
nication in everyday settings. However, research designed A full list of references for this chapter can be
to demonstrate systematic relationships between recogni- found at http://thePoint.lww.com. Below are the key refer-
tion performance in quiet and actual everyday communica- ences for this chapter.
tion has been largely unsuccessful (Davis, 1948; High et al.,
1964). A better estimate of the impact of a hearing loss on American National Standards Institute (ANSI). (1996) Specifica-
tions for Audiometers. S3.6–1996. New York,NY: American
daily communication might be obtained with the use of
National Standards Institute.
speech-in-noise tests such as the WIN, QuickSIN, or HINT.
American National Standards Institute. (2004) Specifications for
As Wilson and McArdle (2005) discuss, speech-in-noise Audiometers. S3.6–2004. New York, NY: American National
testing allows for the assessment of the most common com- Standards Institute.
plaint of patients—the inability to understand speech in American Speech-Language-Hearing Association. (1988) Guide-
background noise; and thus, test results provide important lines for determining threshold level for speech. ASHA. 30,
information for use in counseling. Furthermore, test results 85–89.
CHAPTER 5 Ş 4QFFDI"VEJPNFUSZ 73
American Speech-Language-Hearing Association. (2000) Audiology Cox RM, Alexander GC, Gilmore C. (1987) Development of the
Survey. Rockville, MD: American Speech-Language-Hearing Connected Speech Test (CST). Ear Hear. 8, 119S–126S.
Association. Davis H. (1948) The articulation area and the social adequacy
Asher WJ. (1958) Intelligibility tests: A review of their standardization, index for hearing. Laryngoscope. 58, 761–778.
some experiments, and a new test. Speech Monogr. 25, 14–28. Department of Veterans Affairs. (1997) The Audiology Primer for
Bamford J, Wilson I. (1979) Methodological considerations and Students and Health Care Professionals. Mountain Home, TN:
practical aspects of the BKB sentence lists. In: Bench J, Bamford Veterans Affairs Medical Center.
JM, eds. Speech-Hearing Tests and the Spoken Language of Hear- Dirks D, Kamm D, Bower D, Betsworth A. (1977) Use of perfor-
ing Impaired Children. London: Academic Press; pp 148–187. mance intensity function in diagnosis. J Speech Hear Disord.
Beattie RC. (1989) Word recognition functions for the CID W-22 27, 311–322.
Test in multitalker noise for normally hearing and hearing- Dirks DD, Morgan DE, Dubno JR. (1982) A procedure for quanti-
impaired subjects. J Speech Hear Disord. 54, 20–32. fying the effects of noise on speech recognition. J Speech Hear
Beattie RC, Svihovec DA, Edgerton BJ. (1978) Comparison of Disord. 47, 114–123.
speech detection and spondee thresholds and half- versus full- Edgerton BJ, Danhauer JL. (1979) Clinical Implications of Speech
list intelligibility scores with MLV and taped presentation of Discrimination Testing Using Nonsense Stimuli. Baltimore,
NU-6. J Am Audiol Soc. 3, 267–272. MD: University Park Press.
Bench J, Kowal A, Bamford J. (1979) The BKB (Bamford-Kowal- Egan JP. (1944) Articulation Testing Methods, II. OSRD Report No.
Bench) sentence lists for partially-hearing children. Br J Audiol. 3802. Cambridge, MA: Psychoacoustic Laboratory, Harvard
13, 108–112. University.
Bess FH. (1983) Clinical assessment of speech recognition. In: Egan JP. (1948) Articulation testing methods. Laryngoscope. 58,
Konkle DF, Rintelmann WF, eds. Principles of Speech Audiom- 955–991.
etry. Baltimore, MD: University Park Press; pp 127–201. Eisenberg LS, Martinez AS, Boothroyd A. (2004) Perception of
Bess FH, Josey AF, Humes, LE. (1979). Performance intensity func- phonetic contrasts in infants. In: Miyamoto RT, ed. Cochlear
tions in cochlear and eighth nerve disorders. Am J Otolaryngol. Implants: International Congress Series 1273. Amsterdam:
1, 27–31. Elsevier; pp 364–367.
Bilger RC. (1984) Speech recognition test development. In: Elkins E, Elliot L, Katz D. (1980) Northwestern University Children’s Percep-
ed. Speech Recognition by the Hearing Impaired. ASHA Reports tion Speech (NU-CHIPS). St. Louis, MO: Auditec.
14. Rockville, MD: ASHA; pp 2–7. Elpern BS. (1961) The relative stability of half-list and full-list
Bocca E, Calaero C, Cassinari V, Migilavacca F. (1955) Testing discrimination tests. Laryngoscope. 71, 30–36.
“cortical” hearing in temporal lobe tumors. Acta Otolaryngol. Etymotic Research. (2001) QuickSINTM (Compact Disc). Elk Grove
45, 289–304. Village, IL: Etymotic Research.
Boothroyd A. (1968) Developments in speech audiometry. Sound. Etymotic Research. (2005) BKB-SINTM (Compact Disc). Elk Grove
2, 3–10. Village, IL: Etymotic Research.
Boothroyd A. (1984) Auditory perception of speech contrasts by Feldmann H. (1970) A history of audiology: A comprehensive report
subjects with sensorineural hearing loss. J Speech Hear Res. 27, and bibliography from the earliest beginnings to the present.
134–144. Transl Beltone Inst Hear Res. 22, 1–111. [Translated by J. Tonndorf
Boothroyd A. (1987) CASPER: Computer Assisted Speech Perception from Die geschichtliche entwicklung der horprufungsmethoden,
Evaluation and Training. Proceedings of the 10th Annual Confer- kuze darstellung and bibliographie von der anfongen bis zur
ence on Rehabilitation Technology. Washington, DC: Association gegenwart. In: Leicher L, Mittermaiser R, Theissing G, eds.
for the Advancement of Rehabilitation Technology. Zwanglose Abhandungen aus dem Gebeit der Hals-Nasen-Ohren-
Boothroyd A, Hnath-Chisolm T, Hanin L, Kishon-Rabin L. (1988) Heilkunde. Stuttgart, Germany: Georg Thieme Verlag; 1960.]
Voice fundamental frequency as an aid to the speechreading of Fifer RC, Jerger JF, Berlin CI, Tobey EA, Campbell JC. (1983)
sentences. Ear Hear. 9, 335–341. Development of a dichotic sentence identification test for
Brandy WT. (2002) Speech audiometry. In: Katz J, ed. Handbook of hearing-impaired adults. Ear Hear. 4, 300–305.
Clinical Audiology. 4th ed. Baltimore, MD: Lippincott Williams Finney DJ. (1952) Statistical Method in Biological Assay. London:
& Wilkins; pp 96–110. C. Griffen.
Brungart DS. (2001) Informational and energetic masking effects Fletcher H, Steinberg J. (1929) Articulation testing methods. Bell
in the perception of two simultaneous talkers. J Acoust Soc Am. Syst Techn J. 8, 806–854.
109, 1101–1109. Flynn MC, Dowell RC. (1999) Speech perception in a communi-
Carhart R. (1951) Basic principles of speech audiometry. Acta cative context: An investigation using question/answer pairs.
Otolaryngol. 40, 62–71. J Speech Lang Hear Res. 42, 540–552.
Carhart R, Tillman TW. (1970) Interaction of competing speech Frank T. (1980) Clinically significance of the relative intelligibility
signals with hearing losses. Arch Otolaryngol. 91, 273–279. of pictorially represented spondee words. Ear Hear. 1, 46–49.
Carhart R, Tillman TW, Greetis ES. (1969) Perceptual masking in Griffiths JD. (1967) Rhyming minimal contrasts: A simplified
multiple sound backgrounds. J Acoust Soc Am. 45, 694–703. diagnostic articulation test. J Acoust Soc Am. 42, 236–241.
Cherry EC. (1953) Some experiments on the recognition of speech High WS, Fairbanks G, Glorig A. (1964) Scale for self-assessment
with one and with two ears. J Acoust Soc Am. 25, 975–979. of hearing handicap. J Speech Hear Disord. 29, 215–230.
Conn MJ, Dancer J, Ventry IM. (1975) A spondee list for determin- Hirsh IJ, Davis H, Silverman SR, Reynolds EG, Eldert E, Benson
ing speech reception threshold without prior familiarization. RW. (1952) Development of materials for speech audiometry.
J Speech Hear Disord. 40, 388–396. J Speech Hear Disord. 17, 321–337.
74 SECTION I Ş #BTJD5FTUTBOE1SPDFEVSFT
Hirsh IJ, Palva T, Goodman A. (1954) Difference limen and recruit- Matzker J. (1959) Two new methods for the assessment of central
ment. AMA Arch Otolaryngol. 60, 525–540. auditory functions in cases of brain disease. Ann Otol Rhinol
Hnath-Chisolm T, Laipply E, Boothroyd A. (1998) Age-related Laryngol. 68, 1185–1197.
changes on speech perception capacity. J Speech Hear Res. 41, McArdle R, Chisolm TH, Abrams HB, Wilson RH, Doyle PJ.
94–106. (2005a) The WHO-DAS II: measuring outcomes of hearing
Hood JD, Poole JP. (1980) Influence of the speaker and other fac- aid intervention for adults. Trends Amplif. 9, 127–143.
tors affecting speech intelligibility. Audiology. 19, 434–455. McArdle R, Wilson RH, Burks CA. (2005b) Speech recognition
House AS, Williams CE, Hecker MHL, Kryter KD. (1955) Artic- in multitalker babble using digits, words, and sentences. J Am
ulation-testing methods: Consonantal differentiation with a Acad Audiol. 16, 726–739.
closed-response set. J Acoust Soc Am. 37, 158–166. McCullough JA, Wilson RH, Birck JD, Anderson LG. (1994) A mul-
Hudgins CV, Hawkins JE Jr, Karlin JE, Stevens SS. (1947) The timedia approach for estimating speech recognition in multi-
development of recorded auditory tests for measuring hearing lingual clients. Am J Audiol. 3, 19–22.
loss for speech. Laryngoscope. 57, 57–89. Miller GA. (1956) The magical number seven, plus or minus two:
Institute of Electrical and Electronics Engineers. (1969) IEEE rec- Some limits on our capacity for processing information. Psychol
ommended practice for speech quality measurements. IEEE Rev. 63, 81–97.
Trans Audio Electroacoust. 17, 227–246. Moog JS, Geers AE. (1990) Early Speech Perception Test for Profoundly
Jerger J, Hayes D. (1977) Diagnostic speech audiometry. Arch Deaf Children. St. Louis, MO: Central Institute for the Deaf.
Otolaryngol. 103, 216–222. Nabelek A, Robinson P. (1982) Monaural and binaural speech per-
Jerger S, Jerger J. (1984) Pediatric Speech Intelligibility Test. St. Louis, ception in reverberation for listeners of various ages. J Acoust
MO: Auditec. Soc Am. 71, 1242–1248.
Kalikow DN, Stevens KN, Elliot LL. (1977) Development of a Nilsson M, Soli S, Sullivan J. (1994) Development of the Hearing in
test of speech intelligibility in noise using sentence materials Noise Test for the measurement of speech reception thresholds
with controlled word predictability. J Acoust Soc Am. 61, in quiet and in noise. J Acoust Soc Am. 95, 1085–1099.
1337–1351. Olsen WO, Van Tassell DJ, Speaks CE. (1997) Phoneme and word
Kamm CA, Morgan DE, Dirks DD. (1983) Accuracy of adaptive recognition for words in isolation and in sentences. Ear Hear.
procedure estimates of PB-max level. J Speech Hear Disord. 48, 18, 175–188.
202–209. Owens E, Kessler DT, Raggio MW, Schubert ED. (1985) Analysis and
Katz J. (1962) The use of staggered spondaic words for assessing revision of the Minimum Auditory Capabilities (MAC) battery.
the integrity of the central auditory nervous system. J Audit Ear Hear. 6, 280–290.
Res. 2, 327–337. Owens E, Schubert ED. (1977) Development of the California
Killion MC, Niquette PA. (2000) What can the pure-tone audio- Consonant Test. J Speech Hear Res. 20, 463–474.
gram tell us about a patient’s SNR loss? Hear J. 53, 46–53. Pederson OT, Studebaker GA. (1972) A new minimal-contrast
Killion MC, Niquette PA, Gudmundsen GI, Revit LJ, Banerjee closed-response-set speech test. J Audit Res. 12, 187–195.
S. (2004). Development of a quick speech-in-noise test for Penrod JP. (1994) Speech threshold and word recognition/discrim-
measuring signal-to-noise ratio loss in normal-hearing and ination testing. In: Katz J, ed. Handbook of Clinical Audiology.
hearing-impaired listeners. J Acoust Soc Am. 116, 2395–2405. 4th ed. Baltimore, MD: Williams & Wilkins; pp 147–164.
Kimura D. (1961) Some effects of temporal lobe damage on audi- Plomp R. (1978) Auditory handicap of hearing impairment and the
tory perception. Can J Psychol. 15, 157–1165. limited benefit of hearing aids. J Acoust Soc Am. 63, 533–549.
Kirk KI, Pisoni DB, Osberger MJ. (1995) Lexical effects of unspo- Punch J, Joseph A, Rakerd B. (2004) Most comfortable and uncom-
ken word recognition by pediatric cochlear implant users. Ear fortable loudness levels: six decades of research. Am J Audiol.
Hear. 16, 470–481. 13, 144–157.
Konkle DF, Rintelmann WF. (1983) Introduction to speech audi- Roeser RJ, Clark JL. (2008). Live voice speech recognition audio-
ometry. In: Konkle DF, Rindtelman WF, eds. Principles of metry – stop the madness! Audiol Today. 20,32–33.
Speech Audiometry. Baltimore, MD: University Park Press; Ross M, Lerman J. (1970) A picture identification task for hearing-
pp 1–10. impaired children. J Speech Hear Res. 13, 44–53.
Kosky C, Boothroyd A. (2003) Validation of an on-line implemen- Silverman SR, Hirsh IJ. (1955) Problems related to the use of speech
tation of the Imitative Test of Speech Pattern Contrast Percep- in clinical audiometry. Ann Otol Rhinol Laryngol. 64, 1234–1244.
tion (IMSPAC). J Am Acad Audiol. 14, 72–83. Speaks C, Jerger J. (1965) Performance-intensity characteristics of
Kruel EJ, Bell DW, Nixon JC. (1969) Factors affecting speech dis- synthetic sentences. J Speech Hear Res. 9, 305–312.
crimination test difficulty. J Speech Hear Res. 12, 281–287. Sperry JL, Wiley TL, Chial MR. (1997) Word recognition per-
Kruel EJ, Nixon JC, Kryter KD, Bell DW, Lang JS, Schubert ED. formance in various background competitors. J Am Acad
(1968) A proposed clinical test of speech discrimination. Audiol. 8, 71–80.
J Speech Hear Res. 11, 536–552. Thornton AR, Raffin MJM. (1978) Speech-discrimination scores
Lehiste I, Peterson G. (1959) Linguistic considerations in the study modeled as a binomial variable. J Speech Hear Res. 21, 507–518.
of speech intelligibility. J Acoust Soc Am. 31, 280–286. Tillman TW, Carhart R. (1966) An Expanded Test for Speech Dis-
Levitt H, Resnick S. (1978) Speech reception by the hearing crimination Utilizing CNC Monosyllabic Words. Northwestern
impaired. Scand Audiol. 6 (suppl), 107–130. University Auditory Test No. 6. Brooks Air Force Base, TX: US
Martin FN, Champlin CA, Chambers JA. (1998) Seventh survey of Air Force School of Aerospace Medicine.
audiometric practices in the United States. J Am Acad Audiol. Tillman TW, Carhart R, Wilber L. (1963) A Test for Speech Discrim-
9, 95–104. ination Composed of CNC Monosyllabic Words. Northwestern
CHAPTER 5 Ş 4QFFDI"VEJPNFUSZ 75
University Auditory Test No. 4. Technical Documentary Report Wilson RH, Morgan DE, Dirks DD. (1973) A proposed SRT pro-
No. SAM-TDR-62–135. Brooks Air Force Base, TX: US Air cedure and its statistical precedent. J Speech Hear Disord. 38,
Force School of Aerospace Medicine. 184–191.
Tillman TW, Jerger JF. (1959) Some factors affecting the spondee Wilson RH, Preece JP, Salamon DL, Sperry JL, Bornstein SP. (1994)
threshold in normal-hearing subjects. J Speech Hear Res. 2, Effects of time compression and time compression plus rever-
141–146. beration on the intelligibility of Northwestern University
Wesilender P, Hodgson WR. (1989) Evaluation of four Spanish Auditory Test No. 6. J Am Acad Audiol. 5, 269–277.
word recognition ability lists. Ear Hear. 10, 387–392. Wilson RH, Preece JP, Thornton AR. (1990) Clinical use of the
Wiley TL, Stoppenbach DT, Feldhake LI, Moss KA, Thordardottir compact disc in speech audiometry. ASHA. 32, 3247–3251.
ET. (1995) Audiologic practices: What is popular versus what Wilson RH, Strouse A. (1999) Auditory measures with speech sig-
is supported by evidence. Am J Audiol. 4, 26–34. nals. In: Musiek FE, Rintelmann WF, eds. Contemporary Per-
Wilson RH. (2003) Development of a speech in multitalker babble spectives in Hearing Assessment. Needham Heights, MA: Allyn
paradigm to assess word-recognition performance. J Am Acad & Bacon; pp 21–66.
Audiol. 14, 453–470. Wilson RH, Strouse A. (2002) Northwestern University Audiology
Wilson RH, Abrams HB, Pillion AL. (2003) A word-recognition Test #6 in multitalker bubble: A preliminary report. J Rehabil
task in multitalker babble using a descending presentation Res Dev. 39, 105–113.
mode from 24 dB to 0 dB in signal to babble. J Rehabil Res Dev. Wilson RH, Trivette CP, Williams DA, Watts KA. (2012). The
40, 321–328. effects of energetic and informational masking on the Words-
Wilson RH, Margolis RH. (1983) Measurements of auditory in-Noise Test (WIN). J Am Acad Audiol. 23, 522–533.
thresholds for speech stimuli. In: Konkle DF, Rintelmann WF, Wilson RH, Weakley DG. (2004) The use of digit triplets to evalu-
eds. Principles of Speech Audiometry. Baltimore, MD: Univer- ate word-recognition abilities in multitalker babble. Semin
sity Park Press; pp 79–126. Hear. 25, 93–111.
Wilson RH, McArdle R. (2005) Speech signals used to evaluate the Young L, Dudley B, Gunter MB. (1982) Thresholds and psycho-
functional status of the auditory system. J Rehabil Res Dev. 42 metric functions of the individual spondaic words. J Speech
(suppl 2), 79–94. Hear Res. 25, 586–593.
C H A P T ER 6
Clinical Masking
William S. Yacullo
In the first edition of this text, Sanders (1972) wrote the fol- a masking stimulus must be applied to the nontest ear to
lowing introduction to his chapter on clinical masking: eliminate its participation.
Of all the clinical procedures used in auditory assess-
ment, masking is probably the most often misused
and the least understood. For many clinicians the Air-Conduction Testing
approach to masking is a haphazard hit-or-miss bit Cross hearing occurs when a stimulus presented to the test
of guesswork with no basis in any set of principles. ear “crosses over” and is perceived in the nontest ear. There
(p 111) are two parallel pathways by which sound presented through
an earphone (i.e., an air-conduction transducer) can reach
Unfortunately, this statement may still hold true today.
the nontest ear. Specifically, there are both bone-conduction
The principles of clinical masking are difficult for many
and air-conduction pathways between an air-conduction
beginning clinicians to understand. Although the clinician
signal presented at the test ear and the sound reaching the
can apply masking formulas and procedures appropriately
nontest ear cochlea (Studebaker, 1979). First, the earphone
in most clinical situations, a lack of understanding of the
can vibrate with sufficient force to cause deformations of
underlying theoretical constructs becomes evident dur-
the bones of the skull. An earphone essentially can function
ing cases where modification of a standard procedure is
as a bone vibrator at higher sound pressures. Because both
required. A lack of understanding of the underlying con-
cochleas are housed within the same skull, the outcome is
cepts of masking often leads to misuse of clinical procedures.
stimulation of the nontest ear cochlea through bone con-
Theoretical and empirical bases of masking provide a
duction. Second, sound from the test earphone can travel
strong foundation for the understanding of applied clini-
around the head to the nontest ear, enter the opposite ear
cal masking procedures. It will become evident throughout
canal, and finally reach the nontest ear cochlea through an
this chapter that there is not a single “correct” approach to
air-conduction pathway. Because the opposite ear typically
clinical masking. Any approach to clinical masking that is
is covered during air-conduction testing, sound attenuation
based on sound theoretical constructs and verified through
provided by the earphone will greatly minimize or elimi-
clinical experience is correct. One approach will not meet
nate the contribution of the air-conduction pathway to the
all clinical needs. A strong foundation in the underlying
process of cross hearing. Consequently, cross hearing dur-
concepts of clinical masking serves three purposes. First,
ing air-conduction testing is considered primarily a bone-
it allows the clinician to make correct decisions about the
conduction mechanism.
need for masking. Second, it allows the clinician to make a
Cross hearing is the result of limited interaural attenu-
well-informed decision when selecting a specific approach
ation (IA). IA refers to the “reduction of energy between
to clinical masking. Finally, it allows the clinician to apply
ears.” Generally, it represents the amount of separation or
and modify a clinical masking procedure appropriately.
the degree of isolation between ears during testing. Specifi-
cally, it is the decibel difference between the hearing level
THE NEED FOR MASKING (HL) of the signal at the test ear and the HL reaching the
nontest ear:
A major objective of the basic audiologic evaluation is
assessment of auditory function of each ear. There are situ- IA = dB HL Test Ear – dB HL Nontest Ear
ations during both air-conduction and bone-conduction
testing when this may not occur. Although a puretone or Consider the following hypothetical examples pre-
speech stimulus is being presented through a transducer to sented in Figure 6.1. You are measuring puretone air-
the test ear, the nontest ear can contribute partially or totally conduction threshold using traditional supra-aural
to the observed response. Whenever it is suspected that the earphones. A puretone signal of 90 dB HL is presented to the
nontest ear is responsive during evaluation of the test ear, test ear. Because of limited IA, a portion of the test signal can
77
78 SECTION I • Basic Tests and Procedures
contact with the skin covering the cranial skull. Insert ear-
phones are coupled to the ear by insertion into the ear canal.
Generally, IA increases as the contact area of the trans-
ducer with the skull decreases (Zwislocki, 1953). More
specifically, IA is greater for supra-aural than circumaural
earphones. Furthermore, IA is greatest for insert earphones
(Killion et al., 1985; Sklare and Denenberg, 1987), partly
because of their smaller contact area with the skull. (The
reader is referred to Killion and Villchur, 1989; Zwislocki
A et al., 1988, for a review of advantages and disadvantages of
earphones in audiometry.) Because supra-aural and insert
earphones are most typically used during audiologic testing,
they will be the focus of this discussion.
There are different approaches to measuring IA for air-
conducted sound (e.g., “masking” method, “compensation”
method, method of “best beats”; the reader is referred to
Zwislocki, 1953, for discussion). The most direct approach,
however, involves measurement of transcranial thresholds
(Berrett, 1973). Specifically, IA is measured by obtaining
unmasked air-conduction (AC) thresholds in subjects with
B unilateral, profound sensory/neural hearing loss and then
FIGURE 6.1 Interaural attenuation (IA) is calculated as calculating the threshold difference between the normal and
the difference between the hearing level (HL) of the sig- impaired ears:
nal at the test ear and the HL reaching the nontest ear
cochlea. A puretone signal of 90 dB HL is being presented IA = Unmasked AC Impaired Ear – Unmasked AC Normal Ear
to the test ear through traditional supra-aural earphones.
Example A: If IA is 40 dB, then 50 dB HL is reaching the For example, if unmasked air-conduction thresholds are
nontest ear cochlea. Example B: If IA is 80 dB, then 10 obtained at 60 dB HL in the impaired ear and 0 dB HL in
dB HL is reaching the nontest ear cochlea. (From Yacullo the normal ear, then IA is calculated as 60 dB:
WS. (1996) Clinical Masking Procedures. 1st ed. Boston,
MA: Allyn & Bacon, © 1996, p 3. Adapted by permission of IA = 60 dB HL – 0 dB HL
Pearson Education, Inc., Upper Saddle River, NJ.) = 60dB
There is the assumption that air- and bone-conduction
thresholds are equal (i.e., no air-bone gaps) in the ear with
reach the nontest ear cochlea. If IA is 40 dB, then 50 dB HL normal hearing.
theoretically is reaching the nontest ear: Figure 6.2 illustrates the expected unmasked puretone
air-conduction thresholds in an individual with normal
IA = dB HL Test Ear – dB HL Nontest Ear hearing in the left ear and a profound sensory/neural hearing
= 90 dB HL − 50 dB HL loss in the right ear. Unmasked bone-conduction thresholds,
regardless of bone vibrator placement, are expected at HLs
= 40 dB
consistent with normal hearing in the left ear. If appropriate
If IA is 80 dB, then only 10 dB HL is reaching the nontest contralateral masking is not used during air-conduction test-
ear. It should be apparent that a greater portion of the test ing, then a shadow curve will result in the right ear. Because
signal can reach the nontest ear when IA is small. Depend- cross hearing is primarily a bone-conduction mechanism,
ing on the hearing sensitivity in the nontest ear, cross hear- unmasked air-conduction thresholds in the right ear will
ing can occur. “shadow” the bone-conduction thresholds in the left (i.e.,
IA during earphone testing is dependent on three fac- better) ear by the amount of IA. For example, if IA for air-
tors: Transducer type, frequency spectrum of the test sig- conducted sound is equal to 60 dB at all frequencies, then
nal, and individual subject. There are three major types of unmasked air-conduction thresholds in the right ear theo-
earphones currently used during audiologic testing: Supra- retically will be measured 60 dB above the bone-conduction
aural, circumaural, and insert (American National Stan- thresholds in the better ear. The shadow curve does not
dards Institute/Acoustical Society of America [ANSI/ASA], represent true hearing thresholds in the right ear. Rather, it
2010). Supra-aural earphones use a cushion that makes reflects cross-hearing responses from the better (i.e., left) ear.
contact solely with the pinna. Circumaural earphones use When using supra-aural earphones, IA for puretone
a cushion that encircles or surrounds the pinna, making air-conducted signals varies considerably, particularly across
CHAPTER 6 • Clinical Masking 79
subjects, ranging from about 40 to 85 dB (Berrett, 1973; ified length, a nipple adaptor, and a disposable foam eartip.
Chaiklin, 1967; Coles and Priede, 1970; Killion et al., 1985; A major advantage of the 3A insert earphone is increased
Sklare and Denenberg, 1987; Smith and Markides, 1981; IA for air-conducted sound, particularly in the lower fre-
Snyder, 1973). Your assumption about IA will influence the quencies (Hosford-Dunn et al., 1986; Killion et al., 1985;
decision about the need for contralateral masking. The use Sklare and Denenberg, 1987; Van Campen et al., 1990). This
of a smaller IA value assumes that there is smaller separa- is clearly illustrated in the results of a study by Killion et al.
tion between ears. Consequently, contralateral masking will (1985) (Figure 6.3).
be required more often. When making a decision about the Increased IA with 3A insert earphones is the result of
need for contralateral masking during clinical practice, a two factors: (1) Reduced contact area of the transducer with
single value defining the lower limit of IA is recommended the skull and (2) reduction of the occlusion effect (OE).
(Studebaker, 1967a). Zwislocki (1953) evaluated IA for three types of earphones:
Based on currently available data, a conservative estimate circumaural, supra-aural, and insert. Results suggested that
of IA for supra-aural earphones is 40 dB at all frequencies. IA for air-conducted sound increased as the contact area of
Although this very conservative estimate will take into the earphone with the skull decreased. When an acoustic
account the IA characteristics of all individuals, it will result signal is delivered through an earphone, the resultant sound
in the unnecessary use of masking in some instances. pressure acts over a surface area of the skull determined by
Commonly used insert earphones are the Etymotic the earphone cushion. The surface area associated with a
Research ER-3A (Killion, 1984) and the E-A-RTONE 3A small eartip will result in a smaller applied force to the skull,
(E-A-R Auditory Systems, 1997). The ER-3A and the E-A- resulting in reduced bone-conduction transmission.
RTONE 3A insert earphones are considered functionally Chaiklin (1967) has also suggested that IA may be
equivalent because they are built to identical specifications increased in the low frequencies with a deep insert because
(Frank and Vavrek, 1992). Each earphone consists of a of a reduction of the OE. ANSI/ASA (2010) defines the OE
shoulder-mounted transducer, a plastic sound tube of spec- as an increase in loudness for bone-conducted sound at
80 SECTION I • Basic Tests and Procedures
5 dB) to the average values reported for the 3A insert ear- ognition thresholds. Rather, a different response task when
phone (Killion et al., 1985). measuring different speech thresholds in each ear (i.e., SDT
IA for speech is typically measured by obtaining speech in one ear and SRT in the other) can affect the measured
recognition thresholds (SRTs) in individuals with unilateral, IA for speech. Comparison of SRTs between ears or SDTs
profound sensory/neural hearing loss. Specifically, the dif- between ears generally should result in the same measured
ference in threshold between the normal ear and impaired IA. Smith and Markides (1981) measured IA for speech in
ear without contralateral masking is calculated: 11 subjects with unilateral, profound hearing loss. IA was
calculated as the difference between the SDT in the better
IA = Unmasked SRTImpaired Ear – SRTNormal Ear ear and the unmasked SDT in the poorer ear. The range
of IA values was 50 to 65 dB. It is interesting to note that
Recall that SRT represents the lowest HL at which speech the lowest IA value reported for speech using a detection
is recognized 50% of the time (ANSI/ASA, 2010; Ameri- task in each ear was 50 dB, a value comparable to the lowest
can Speech-Language-Hearing Association [ASHA], 1988). minimum reported IA value (i.e., 48 dB) for spondaic words
IA for spondaic words presented through supra-aural ear- (e.g., Martin and Blythe, 1977; Snyder, 1973).
phones varies across subjects and ranges from 48 to 76 dB There is also some evidence that it may be appropri-
(Martin and Blythe, 1977; Sklare and Denenberg, 1987; Sny- ate to use a more conservative estimate of IA when making
der, 1973). Again, a single value defining the lower limit of a decision about the need for contralateral masking during
IA is recommended when making a decision about the need assessment of suprathreshold speech recognition. Although
for contralateral masking (Studebaker, 1967a). A conserva- IA for the speech signal remains constant during measure-
tive estimate of IA for spondees, therefore, is 45 dB when ment of threshold or suprathreshold measures of speech
using supra-aural earphones (Konkle and Berry, 1983). The recognition (i.e., the decibel difference between the level of
majority of audiologists measure SRT using a 5-dB step size the speech signal at the test ear and the level at the non-
(Martin et al., 1998). Therefore, the IA value of 48 dB is typ- test ear cochlea), differences in the performance criterion
ically rounded down to 45 dB. for each measure must be taken into account when select-
There is considerable evidence that speech can be ing an appropriate IA value for clinical use. SRT is defined
detected at a lower HL than that required to reach SRT. relative to a 50% response criterion. However, suprathresh-
Speech detection threshold (SDT) is defined as the lowest old speech recognition performance can range from 0% to
HL at which speech can be detected or “discerned” 50% 100%.
of the time (ASHA, 1988). The SRT typically requires an Konkle and Berry (1983) provide an excellent ratio-
average of about 8 to 9 dB greater HL than that required nale for the use of a more conservative estimate of IA when
for the detection threshold (Beattie et al., 1978; Chaiklin, measuring suprathreshold speech recognition. They suggest
1959; Thurlow et al., 1948). Given this relationship between that the fundamental difference in percent correct criterion
the two speech thresholds, Yacullo (1996) has suggested requires the specification of nontest ear cochlear sensitiv-
that a more conservative value of IA may be appropriate ity in a different way than that used for threshold measure-
when considering the need for contralateral masking dur- ment. If suprathreshold speech recognition materials are
ing measurement of SDT. presented at an HL equal to the SRT, then a small percent-
Consider the following hypothetical example. You are age of the test items can be recognized. It should be noted
measuring speech thresholds in a patient with normal hear- that the percentage of test words that can be recognized
ing in the right ear and a profound, sensory/neural hear- at an HL equal to SRT is dependent on the type of speech
ing loss in the left ear. If the patient exhibits the minimum stimuli, as well as on the talker and/or recorded version of
reported IA value for speech of 48 dB, then an SRT of 0 dB a speech recognition test. Regardless of the type of speech
HL would be measured in the right ear and an unmasked stimulus (e.g., meaningful monosyllabic words, nonsense
SRT of 48 dB HL would be measured in the left ear. If an syllables, or sentences) and the specific version (i.e., talker/
unmasked SDT is subsequently measured in the left ear, it is recording) of a speech recognition test, 0% performance
predicted that the threshold would occur at an HL of about may not be established until an HL of about −10 dB rela-
8 to 9 dB lower than the unmasked SRT. An unmasked SDT tive to the SRT. Konkle and Berry (1983) recommend that
would be expected to occur at about 39 to 40 dB HL. Com- the value of IA used for measurement of suprathreshold
parison of the unmasked SDT in the impaired ear with the speech recognition should be estimated as 35 dB. That is,
SRT in the normal ear theoretically would result in mea- the IA value of 45 dB (rounded down from 48 dB) based
sured IA of approximately 39 to 40 dB. When an unmasked on SRT measurement is adjusted by subtracting 10 dB. This
SDT is measured and the response is compared to the SRT adjustment in the estimate of IA reflects differences in per-
in the nontest ear, a more conservative estimate of IA for cent correct criterion used for speech threshold and supra-
speech may be appropriate. threshold measurements.
It should be noted that the actual IA for speech does not The majority of audiologists use an IA value of 40 dB
change during measurement of speech detection and rec- for all air-conduction measurements, both puretone and
82 SECTION I • Basic Tests and Procedures
speech, when making a decision about the need for con- into vibration, both cochleas can be potentially stimulated.
tralateral masking (Martin et al., 1998). The use of a single Consequently, an unmasked bone-conduction threshold
IA value of 40 dB for both threshold and suprathreshold can reflect a response from either cochlea or perhaps both.
speech audiometric measurements can be supported. Given Although a bone vibrator may be placed at the side of the
the smallest reported IA value of 48 dB for spondaic words, test ear, it cannot be assumed that the observed response is in
a value of 40 dB is somewhat too conservative during mea- fact from that ear.
surement of SRT. However, it should prove adequate during Consider the following example. You have placed a bone
measurement of SDT and suprathreshold speech recogni- vibrator at the right mastoid process. A puretone signal of
tion when a more conservative estimate of IA (by approxi- 50 dB HL is presented. If IA is considered to be 0 dB, then it
mately 10 dB) may be appropriate. should be assumed that a signal of 50 dB HL is potentially
Unfortunately, there are only very limited data available reaching both cochleas. It should be apparent that there is
about IA for speech when using insert earphones. Sklare essentially no separation between the two cochleas during
and Denenberg (1987) reported IA for speech (i.e., SRT unmasked bone-conduction audiometry.
using spondaic words) in seven adults with unilateral, pro- Based on currently available data, a conservative estimate
found sensory/neural hearing loss using ER-3A insert ear- of IA for bone-conducted sound is 0 dB at all frequencies.
phones. IA ranged from 68 to 84 dB. It should be noted that
the smallest reported value of IA for spondaic words (i.e.,
68 dB) is 20 dB greater when using 3A insert earphones with WHEN TO MASK
deeply inserted foam eartips (Sklare and Denenberg, 1987) Contralateral masking is required whenever there is the pos-
than when using supra-aural earphones (i.e., 48 dB) (Mar- sibility that the test signal can be perceived in the nontest ear.
tin and Blythe, 1977; Snyder, 1973). Therefore, a value of IA is one of the major factors that will be considered when
60 dB represents a very conservative estimate of IA for evaluating the need for masking. The basic principles under-
speech when using 3A insert earphones. This value is derived lying the decision-making processes of when to mask during
by adding a correction factor of 20 dB to the conservative IA puretone and speech audiometry will now be addressed.
value used with supra-aural earphones (i.e., 40 dB) for all
threshold and suprathreshold measures of speech.
Based on currently available data, conservative estimates Puretone Audiometry: Air
of IA for all threshold and suprathreshold measures of speech Conduction
are 40 dB for supra-aural earphones and 60 dB for 3A insert
When making a decision about the need for masking dur-
earphones with deeply inserted foam eartips.
ing puretone air-conduction testing, three factors need to
be considered: (1) IA, (2) unmasked air-conduction thresh-
Bone-Conduction Testing old in the test ear (i.e., HL at the test ear), and (3) bone-
conduction hearing sensitivity (i.e., threshold) in the non-
There are two possible locations for placement of a bone
test ear. Recall that when cross hearing occurs, the nontest
vibrator (typically, the Radioear B-71) during puretone
ear is stimulated primarily through the bone-conduction
threshold audiometry: The mastoid process of the tempo-
mechanism. When a decision is made about the need
ral bone and the frontal bone (i.e., the forehead). Although
for contralateral masking, the unmasked air-conduction
there is some evidence that a forehead placement produces
threshold in the test ear (ACTest Ear) is compared to the bone-
more reliable and valid thresholds than a mastoid place-
conduction threshold in the nontest ear (BCNontest Ear). If
ment (see Dirks, 1994, for further discussion), the major-
the difference between ears equals or exceeds IA, then air-
ity (92%) of audiologists in the United States continue to
conduction threshold in the test ear must be remeasured
place a bone-conduction transducer on the mastoid process
using contralateral masking. The rule for when to mask dur-
(Martin et al., 1998).
ing puretone air-conduction testing can be stated as follows:
IA is greatly reduced during bone-conduction audiom-
Contralateral masking is required during puretone air-
etry. IA for bone-conducted sound when using a bone vibra-
conduction audiometry when the unmasked air-conduction
tor placed at the forehead is essentially 0 dB at all frequencies;
threshold in the test ear equals or exceeds the apparent bone-
IA when using a mastoid placement is approximately 0 dB at
conduction threshold (i.e., the unmasked bone-conduction
250 Hz and increases to about 15 dB at 4,000 Hz (Studebaker,
threshold) in the nontest ear by a conservative estimate of IA:
1967a). Regardless of the placement of a bone vibrator (i.e.,
mastoid vs. forehead), it is generally agreed that IA for bone- AC Test Ear − BC Nontest Ear ≥ IA
conducted sound at all frequencies is negligible and should
be considered 0 dB (e.g., Dirks, 1994; Hood, 1960; Sanders This rule is consistent with the guidelines for manual
and Rintelmann, 1964; Studebaker, 1967a). When a bone puretone threshold audiometry recommended by ASHA
vibrator, regardless of its location, sets the bones of the skull (2005).
CHAPTER 6 • Clinical Masking 83
FIGURE 6.4 Audiogram illustrating the need for contralateral masking during
puretone air-conduction audiometry. See text for discussion.
Note that the term “apparent” bone-conduction thresh- First consider the need for contralateral masking
old is considered when making a decision about the need assuming that air-conduction thresholds were measured
for masking. Remember that an unmasked bone-conduc- using supra-aural earphones. A conservative estimate of IA
tion threshold does not convey ear-specific information. It is 40 dB. We will use the following equation when making a
is assumed that the bone-conduction response can originate decision about the need for contralateral masking:
from either or both ears. Therefore, the unmasked bone-
conduction response is considered the apparent or possible AC Test Ear – BC Nontest Ear ≥ IA
threshold for either ear.
Consider the unmasked puretone audiogram* pre- Because it is not possible to measure bone-conduction
sented in Figure 6.4. Because IA for bone-conducted sound threshold at 8,000 Hz, it is necessary to predict an unmasked
is considered 0 dB, unmasked bone-conduction thresholds threshold given the findings at other test frequencies. In this
are traditionally obtained at only one mastoid process. particular example, unmasked bone-conduction threshold
During air-conduction threshold testing, the potential for at 8,000 Hz will probably have a similar relationship with
cross hearing is greatest when there is a substantial differ- the air-conduction thresholds in the better (i.e., left) ear.
ence in hearing sensitivity between the two ears and when a Because there is no evidence of air-bone gaps at the adja-
stimulus is presented at higher HLs to the poorer ear. Con- cent high frequencies, we will assume that a similar relation-
sequently, there is greater potential for cross hearing when ship exists at 8,000 Hz. Therefore, our estimate of unmasked
measuring puretone thresholds in the right ear. bone-conduction threshold is 45 dB HL.
It will be necessary to remeasure puretone thresholds at
*The puretone audiogram and audiometric symbols used throughout all test frequencies in the right ear using contralateral mask-
this chapter are those recommended in ASHA’s (1990) most recent ing because the difference between ears equals or exceeds
guidelines for audiometric symbols (see Chapter 3). our estimate of IA.
84 SECTION I • Basic Tests and Procedures
the presence of an air-bone gap in the test ear (Air-Bone mal distribution of the relationship between air- and bone-
GapTest Ear) of 15 dB or greater: conduction thresholds in individuals without significant
air-bone gaps, then an air-bone difference of ±10 dB is not
Air-Bone Gap Test Ear ≥ 15 dB
unexpected.
where If unmasked bone-conduction thresholds suggest air-
bone gaps of 10 dB or less, then contralateral masking is not
Air-Bone Gap = AC Test Ear − Unmasked BC required. Although unmasked bone-conduction thresholds
do not provide ear-specific information, we have accom-
ASHA (2005) recommends that contralateral masking plished our goal for bone-conduction testing. If unmasked
should be used whenever a potential air-bone gap of 10 dB bone-conduction thresholds suggest no evidence of sig-
or greater exists. When taking into account the variability nificant air-bone gaps, then we have ruled out the presence
inherent in bone-conduction measurements (Studebaker, of a significant conductive component. Consequently, our
1967b), however, a criterion of 10 dB may be too stringent. assumption is that the hearing loss is sensory/neural in
There is a certain degree of variability between air- and nature.
bone-conduction threshold, even in individuals without Figure 6.5 provides three examples of the need for
conductive hearing loss. If we assume that there is a nor- contralateral masking during bone-conduction audiometry.
A B
Frequency (Hz)
Hearing level in decibles (dB HL)
Unmasked air- and bone-conduction thresholds are pro- for contralateral masking during speech audiometry: (1) IA,
vided in each case. (2) presentation level of the speech signal (in dB HL) in the
Example A. Contralateral masking is not required dur- test ear, and (3) bone-conduction hearing sensitivity (i.e.,
ing bone-conduction testing in either ear. When we com- threshold) in the nontest ear.
pare the unmasked bone-conduction thresholds to the Contralateral masking is indicated during speech audi-
air-conduction thresholds in each ear, there are no potential ometry whenever the presentation level of the speech signal
air-bone gaps of 15 dB or greater. For example, consider the (in dB HL) in the test ear (Presentation LevelTest Ear) equals or
thresholds at 2,000 Hz. Comparison of the unmasked bone- exceeds the best puretone bone-conduction threshold in the
conduction threshold to the air-conduction thresholds sug- nontest ear (Best BCNontest Ear) by a conservative estimate of IA:
gests a potential air-bone gap of 5 dB in the right ear and
0 dB in the left ear. Because the unmasked bone-conduction Presentation Level Test Ear − Best BC Nontest Ear ≥ IA
threshold does not suggest the presence of significant air-
bone gaps in either ear, our conclusion is that the hearing Because speech is a broadband signal, it is necessary to
loss is sensory/neural bilaterally. Obtaining masked bone- consider bone-conduction hearing sensitivity at more than
conduction thresholds, although they would provide ear- a single puretone frequency. Konkle and Berry (1983) and
specific information, would not provide additional diag- Sanders (1991) recommend the use of the bone-conduction
nostic information. puretone average of 500, 1,000, and 2,000 Hz or some other
Example B. Comparison of unmasked bone-conduc- average that is predictive of the SRT. ASHA (1988) recom-
tion thresholds to the air-conduction thresholds in the left mends that the puretone bone-conduction thresholds at
ear does not suggest the presence of significant air-bone 500, 1,000, 2,000, and 4,000 Hz should be considered. Mar-
gaps. Consequently, masked bone-conduction thresholds tin and Blythe (1977) suggest that 250 Hz can be eliminated
are not required in the left ear. Our conclusion is that the from any formula for determining the need for contralat-
hearing loss is sensory/neural. eral masking when measuring the SRT. Yet, the nontest ear
However, masked bone-conduction thresholds will be bone-conduction threshold at 250 Hz may be an important
required in the right ear. Comparison of unmasked bone- consideration when measuring the SDT. Olsen and Matkin
conduction thresholds to the air-conduction thresholds (1991) state that the SDT may be most closely related to the
in the right ear suggests potential air-bone gaps ranging best threshold in the 250 to 4,000 Hz range when audio-
from 25 to 35 dB. The unmasked bone-conduction thresh- metric configuration steeply rises or slopes. Therefore, fol-
olds may reflect hearing in the better (i.e., left) ear. Bone- lowing the recommendation of Coles and Priede (1975), the
conduction thresholds in the right ear may be as good as the most conservative approach involves considering the best
unmasked responses. They also may be as poor as the air- bone-conduction threshold in the 250- to 4,000-Hz fre-
conduction thresholds in that ear. Because we do not have quency range.
ear-specific information for bone-conduction thresholds, The examples presented in Figures 6.6 and 6.7 illus-
the loss in the right ear can be either mixed or sensory/neu- trate the need for contralateral masking during threshold
ral. To make a definitive statement about the type of hearing and suprathreshold speech audiometry, respectively. First
loss, it will be necessary to obtain masked bone-conduction consider the audiogram presented in Figure 6.6. Audiom-
thresholds in the right ear. etry was performed using supra-aural earphones. Puretone
Example C. There is evidence that contralateral masking testing (using appropriate contralateral masking dur-
will be required when measuring bone-conduction thresh- ing both air- and bone-conduction audiometry) reveals a
olds in both ears. Comparison of unmasked bone-conduc- severe-to-profound, sensory/neural hearing loss of gradu-
tion thresholds to the air-conduction thresholds suggests ally sloping configuration in the right ear. There is a very
potential air-bone gaps ranging from 30 to 35 dB in each mild, sensory/neural hearing loss of relatively flat config-
ear. As in the previous example, bone-conduction thresh- uration in the left ear. Given the difference between ears
olds in each ear may be as good as the unmasked responses. observed during puretone audiometry, it is anticipated that
They may also be as poor as the air-conduction thresholds contralateral masking may be needed during assessment of
in that ear. To make a definitive statement about the type SRT in the poorer ear.
of hearing loss, it will be necessary to obtain masked bone- There are different approaches that can be used when
conduction thresholds in both ears. determining the need for contralateral masking during
measurement of SRT. The most efficient and recommended
approach involves predicting the speech threshold using
Speech Audiometry the puretone threshold data in the poorer ear and, on that
Because speech audiometry is an air-conduction procedure, basis, determining the need for contralateral masking. For
the rules for when to mask will be similar to those used dur- example, SRT is measured at 20 dB HL in the left ear, a find-
ing puretone air-conduction audiometry. There are three ing consistent with the puretone results. Given the relatively
factors to consider when making a decision about the need low HL at which the SRT was established in the better (i.e.,
CHAPTER 6 • Clinical Masking 87
FIGURE 6.6 Audiogram illustrating the need for contralateral masking during measurement of
speech recognition threshold (SRT). See text for further discussion.
88 SECTION I • Basic Tests and Procedures
FIGURE 6.7 Audiogram illustrating the need for contralateral masking during measurement of supra-
threshold speech recognition. See text for further discussion.
CHAPTER 6 • Clinical Masking 89
left) ear, it is expected that contralateral masking will not be the right ear (80 dB HL) and the measured SRT in the left
required when measuring the SRT. Specifically, the SRT of ear (20 dB HL) equals or exceeds 40 dB, our estimate of
20 dB HL in the left ear does not equal or exceed the best IA for speech:
bone-conduction threshold of 55 dB HL in the nontest ear
STTest Ear – STNontest Ear ≥ IA
by a conservative estimate of IA (40 dB):
Presentation Level Test Ear – Best BC Nontest Ear ≥ IA 80 dB HL – 20 dB HL ≥ 40 dB ? Yes
20 dBHL – 55 dBHL ≥ 40 dB? No An alternative approach that can be used when mak-
ing a decision about the need for contralateral masking
However, if we predict that an SRT will be measured at about during assessment of SRT involves measuring unmasked
80 dB HL in the right ear (based on the puretone average), speech thresholds in both ears. Consider again the example
then contralateral masking will be required because the esti- presented in Figure 6.6. Assume that unmasked SRTs were
mated speech threshold of 80 dB HL equals or exceeds the measured at 65 and 20 dB HL in the right and left ears,
best bone-conduction threshold of 15 dB HL in the nontest respectively. Again there is an indication that contralat-
ear by 40 dB, our estimate of IA for speech: eral masking will be required when measuring the SRT in
the right ear. The presentation level of 65 dB HL (i.e., the
Presentation Level Test Ear – Best BC Nontest Ear ≥ IA
unmasked SRT) in the test ear equals or exceeds the best
bone-conduction threshold of 15 dB HL in the nontest ear
80 dBHL – 15 dBHL ≥ 40 dB? Yes
by 40 dB, our estimate of IA for speech:
Stated differently, the difference between the predicted pre- Presentation Level Test Ear – Best BC Nontest Ear ≥ IA
sentation level in the test ear and the best bone-conduction
threshold in the nontest ear equals or exceeds our estimate 65 dBHL – 15 dBHL ≥ 40 dB ? Yes
of IA. It is important to note, however, that a decision about
the need for contralateral masking during measurement of Similarly, the difference between the unmasked SRT in the
speech threshold must always take into account not only the right ear (65 dB HL) and the measured SRT in the left ear
presentation level at the measured SRT, but also all supra- (20 dB HL) equals or exceeds our estimate of IA (40 dB).
threshold levels used during threshold measurement. This Although this approach can sometimes provide the audi-
will be discussed further in the section addressing selection ologist with a more accurate estimate of the patient’s IA for
of masking levels during speech audiometry. speech (which may be useful when selecting appropriate
During our earlier discussion of the need for contralat- masking levels), it often just increases the number of steps
eral masking during puretone air-conduction audiometry, needed to establish the true SRT in the test ear.
it was indicated that a correct decision about the need for The audiogram presented in Figure 6.7 illustrates the
contralateral masking can be made sometimes by simply need for contralateral masking during assessment of supra-
comparing the air-conduction thresholds of the two ears. threshold speech recognition. Puretone testing reveals
Similarly, a decision about the need for contralateral mask- normal hearing through 1,000 Hz, steeply sloping to a
ing during measurement of speech thresholds can be often severe-to-profound sensory/neural hearing loss in the high
made by comparing speech thresholds in the two ears. frequencies bilaterally. SRTs were measured at 0 dB HL in
Contralateral masking is required during measurement both ears, a finding consistent with the puretone findings.
of speech threshold when the speech threshold in the test ear Contralateral masking was not required during puretone
(STTest Ear) equals or exceeds the speech threshold in the nontest and speech threshold audiometry. Suprathreshold speech
ear (STNontest Ear) by a conservative estimate of IA: recognition will be assessed using the California Consonant
Test (CCT). This is a closed-set word recognition test that is
STTest Ear − STNontest Ear ≥ IA sensitive to the speech recognition difficulties of individu-
als with high-frequency hearing loss (Owens and Schubert,
Consider again the audiogram presented in Figure 6.6. 1977). If we use the recommended sensation level (SL) of
Recall that we predicted that SRT would be measured at 50 dB (Schwartz and Surr, 1979), then presentation level for
about 80 dB HL in the right ear. In this particular example, both ears will be 50 dB HL (i.e., 50 dB relative to the SRT of
comparison of the two speech thresholds (i.e., the mea- 0 dB HL).
sured SRT of 20 dB HL in the left ear and the predicted Let us now consider the need for contralateral mask-
SRT of 80 dB HL in the right ear) would lead us to a cor- ing during assessment of suprathreshold speech recogni-
rect decision about the need for contralateral masking tion. We will consider the need for masking using two types
when measuring SRT in the poorer ear without the need of air-conduction transducers: Supra-aural and 3A insert
to consider bone-conduction hearing sensitivity in the earphones. The advantage of insert earphones will become
nontest ear. The difference between the predicted SRT in apparent.
90 SECTION I • Basic Tests and Procedures
Let us assume that supra-aural earphones are being Consider the following example. Absolute threshold
used during speech audiometry. Contralateral masking will for a 1,000-Hz puretone stimulus is initially determined to
be required when assessing suprathreshold speech recogni- be 40 dB HL. Another sound, white noise, is now presented
tion in both ears because the difference between the presen- simultaneously to the same ear. Absolute threshold for the
tation level of 50 dB HL in the test ear and the best puretone 1,000-Hz signal is redetermined in the presence of the white
bone-conduction threshold of 0 dB HL in the nontest ear noise and increases to 60 dB HL. Sensitivity to the puretone
equals or exceeds 40 dB, our conservative estimate of IA for signal has been affected by the presence of the white noise.
speech: This increase in threshold of one sound in the presence of
another is defined as masking. Because the puretone thresh-
Presentation Level Test Ear – Best BC Nontest Ear ≥ IA old was raised by 20 dB (i.e., a threshold shift of 20 dB), the
white noise has produced 20 dB of masking.
Right Ear 50 dBHL – 0 dBHL ≥ 40 dB ? Yes There are two basic masking paradigms: ipsilateral and
contralateral. In an ipsilateral masking paradigm, the test
Left Ear 50 dBHL – 0 dBHL ≥ 40 dB ? Yes
signal and the masker are presented to the same ear. In a
contralateral masking paradigm, the test signal and masker
A different outcome results if we substitute 3A insert
are presented to opposite ears. Masking is used clinically
earphones for the supra-aural arrangement. Because of
whenever it is suspected that the nontest ear is participating
the greater IA offered by 3A insert earphones, contralat-
in the evaluation of the test ear. Consequently, masking is
eral masking will not be required when assessing supra-
always applied to the nontest or contralateral ear. Masking
threshold speech recognition in either ear. Specifically,
reduces sensitivity of the nontest ear to the test signal. The
the difference between the presentation level of 50 dB
purpose of contralateral masking, therefore, is to raise the
HL in the test ear and the best puretone bone-conduc-
threshold of the nontest ear sufficiently so that its contribu-
tion threshold of 0 dB HL in the nontest ear does not
tion to a response from the test ear is eliminated.
equal or exceed 60 dB, our conservative estimate of IA
for speech:
Presentation Level Test Ear – Best BC Nontest Ear ≥ IA Masking Noise Selection
Standard diagnostic audiometers provide three types of
Right Ear 50 dBHL – 0 dBHL ≥ 60 dB ? No masking stimuli: narrowband noise, speech spectrum noise,
and white noise. Our clinical goal is to select a masker that is
efficient (Hood, 1960). An efficient masker is one that pro-
Left Ear 50 dBHL – 0 dBHL ≥ 60 dB ? No
duces a given effective level of masking with the least overall
sound pressure level.
The example presented in Figure 6.7 illustrates two
To better understand this concept of masker efficiency,
important concepts related to assessment of suprathreshold
let us review the classic masking experiment conducted by
speech recognition. First, it should not be assumed that con-
Fletcher (1940). White noise is a broadband stimulus that
tralateral masking is never required when assessing individ-
contains equal energy across a broad range of frequencies.
uals with symmetrical sensory/neural hearing loss. Second,
Because of its broadband spectrum, it has the ability to
the need for contralateral masking often can be eliminated
mask puretone stimuli across a broad range of frequencies
by using an air-conduction transducer that provides greater
(Hawkins and Stevens, 1950). Fletcher addressed which fre-
IA (i.e., 3A insert earphone).
quency components of broadband noise contribute to the
masking of a tone.
Fletcher (1940) conducted what is known as a centered
MASKING CONCEPTS masking experiment. Initially, a very narrow band of noise
Before proceeding to a discussion of clinical masking pro- was centered around a puretone signal. The bandwidth of
cedures, a brief review of basic masking concepts, including the noise was progressively widened, and the masking effect
masking noise selection and calibration, will be presented. on the puretone signal was determined. Fletcher observed
Generally, masking relates to how sensitivity for one sound that the masked puretone threshold increased as the band-
is affected by the presence of another sound. ANSI/ASA width of the masking noise was increased. However, once the
(2010) defines masking as follows: noise band reached and then exceeded a “critical bandwidth,”
additional masking of the puretone signal did not occur.
The process by which the threshold of hearing for one
This concept of the critical band as first described by
sound is raised by the presence of another (masking)
Fletcher (1940) consists of two components:
sound. The amount by which the threshold of hearing
for one sound is raised by the presence of another 1. When masking a puretone with broadband noise, the
(masking) sound, expressed in decibels (p 7). only components of the noise that have a masking effect
CHAPTER 6 • Clinical Masking 91
on the tone are those frequencies included within a nar- Calibration of Effective
row band centered around the frequency of the tone.
2. When a puretone is just audible in the presence of the
Masking Level
noise, the total noise power present in the narrow band When a masking noise is presented to the nontest ear, we are
of frequencies is equal to the power of the tone. interested in how much masking is produced. Consequently,
masking noise is calibrated in EM level (dB EM).
The first component of the critical band concept has clinical ANSI/ASA (2010) defines EM level for puretones as
implications when selecting an appropriate masker during “the sound pressure level of a band of noise whose geomet-
puretone audiometry. The second component has relevance ric center frequency coincides with that of a specific pure
when calibrating the effective masking (EM) level of the tone that masks the pure tone to 50% probability of detec-
masking stimulus. tion” (p 7). (Reference EM levels, calculated by adding an
White noise is adequate as a masker for puretone appropriate correction value to the reference equivalent
stimuli. However, it contains noise components that do threshold sound pressure level [RETSPL] at each frequency,
not contribute to the effectiveness of the masker. The addi- are provided in the current ANSI/ASA specification for
tional noise components outside the tone’s critical band audiometers.) It is also indicated that, in individuals with
simply add to the overall level (and loudness) of the mask- normal hearing, “the amount of effective masking . . . is equal
ing stimulus. Therefore, the most efficient masker for pur- to the number of decibels that a given band of noise shifts a
etone stimuli is a narrow band of noise with a bandwidth pure-tone threshold . . . when the band of noise and the pure
slightly greater than the critical band surrounding the tone. tone are presented simultaneously to the same ear” (ANSI/
It provides the greatest masking effect with the least overall ASA, 2010, p 7).
intensity. Sanders and Rintelmann (1964) confirmed that Stated differently, effective masking (in dB EM) refers to
narrowband noise was a far more efficient masker for pur-
1. The HL (dB HL) to which puretone threshold is shifted
etone stimuli than white noise. For a given sound pressure
by a given level of noise; and
level (50, 70, and 90 dB SPL), narrowband noise centered at
2. The puretone threshold shift (in dB) relative to 0 dB HL
the frequency of the puretone signal (ranging from 250 to
provided by a given level of noise.
4,000 Hz) consistently produced a greater masking effect
(about 10 to 20 dB) than white noise. Although contralateral masking is used clinically dur-
The masking noise typically used during puretone ing hearing assessment, the following examples of ipsilateral
audiometry, therefore, is narrowband noise centered geo- masking will facilitate an understanding of the concept of
metrically around the audiometric test frequency. ANSI/ EM level.
ASA (2010) specifies the band limits (i.e., the upper and Example 1: A puretone air-conduction threshold is
lower cutoff frequencies) of narrowband masking noise. measured at 0 dB HL in the right ear. A narrowband noise
To minimize the perception of tonality that often is associ- geometrically centered at the test frequency is presented to
ated with very narrow bands of noise, the bands specified the same ear at 50 dB EM. This EM level of 50 dB will shift
by ANSI/ASA are somewhat wider than the critical bands puretone threshold to 50 dB HL.
for EM. The goal is to avoid confusion of the masker with Example 2: A puretone air-conduction threshold is
the signal. measured at 30 dB HL in the right ear. A narrowband noise
Speech spectrum noise (i.e., weighted random noise geometrically centered at the test frequency is presented to
for the masking of speech) is typically used as a masker the same ear at 50 dB EM. This EM level of 50 dB will shift
during speech audiometry. Speech is a broadband stimu- puretone threshold to 50 dB HL.
lus that requires a broadband masker. Although white noise These examples illustrate two important points. First, a
is an adequate masker, it is not the most efficient. Speech given level of EM will shift all unmasked puretone thresh-
spectrum noise is white noise that has been filtered to simu- olds to the same dB HL. Of course, if unmasked puretone
late the long-term average spectrum of speech. The average threshold is greater than a particular level of EM, then no
spectrum of speech contains the greatest energy in the low threshold shift will occur. For example, a masker of 50 dB
frequencies with spectrum level decreasing as a function EM will not have a masking effect if the unmasked puretone
of increasing frequency (Dunn and White, 1940). Speech threshold is 70 dB HL. Second, EM refers to the amount of
spectrum noise has a more limited bandwidth than white threshold shift only relative to 0 dB HL.
noise. It is a more efficient masker than white noise, pro- Speech spectrum noise is also calibrated in EM level.
ducing a masking advantage of 8 dB (Konkle and Berry, Just as HL for speech (dB HL) is specified relative to the
1983). ANSI/ASA (2010) specifies that the spectrum of SRT, EM level is also referenced to the SRT. Specifically, EM
weighted random noise for the masking of speech should for speech refers to the dB HL to which the SRT is shifted
have a sound pressure spectrum level that is constant from by a given level of noise. ANSI/ASA (2010) defines EM level
100 to 1,000 Hz, decreasing at a rate of 12 dB per octave for speech as the “sound pressure level of a specified mask-
from 1,000 to 6,000 Hz. ing noise that masks a speech signal to 50% probability of
92 SECTION I • Basic Tests and Procedures
recognition” (p 8). (If the speech spectrum noise has spec- are also referred to as calculation or formula methods.
tral density characteristics as specified by ANSI/ASA and if Psychoacoustic approaches are considered appropriate for
the sound pressure level of the masker is equal to the RET- threshold measurements, whereas acoustic methods are
SPL for speech, then the masker is calibrated in dB EM.) typically most efficient for suprathreshold measurements.
ANSI/ASA (2010) also states that in individuals with nor-
mal hearing, “the amount of effective masking . . . is equal to
the number of decibels that a masking noise shifts a speech
Puretone Audiometry
recognition threshold . . . when the masking noise and Formulas and equations have been presented for the cal-
speech signal are presented simultaneously to the same ear” culation of minimum and maximum masking levels dur-
(p 8). ing puretone audiometry (Lidén et al., 1959; Martin, 1967,
Consider the following example. SRT is measured at 0 1974; Studebaker, 1962, 1964). A brief discussion of these
dB HL. Speech spectrum noise is then presented to the same formulas will facilitate an understanding of the manner in
ear at 50 dB EM. This EM level of 50 dB will shift the SRT which appropriate levels of masking are selected during
to 50 dB HL. puretone threshold testing.
Calibration of masking noise in EM level greatly sim-
plifies clinical masking procedures. When masking noise is
MINIMUM MASKING LEVEL
calibrated in dB EM, then the decibel value indicated on the
masking level control will indicate the masking effect pro- Lidén et al. (1959) and Studebaker (1964) offered formu-
duced in the nontest ear. This clearly facilitates the selection las for calculating minimum masking level during puretone
of appropriate masking levels during clinical testing. air-conduction audiometry that include consideration of
IA, HL of the test signal, and air-bone gaps in the nontest
ear. Although this “formula” approach to calculating mini-
CLINICAL MASKING PROCEDURES mum masking level is necessary during administration of
All approaches to clinical masking address two basic ques- suprathreshold auditory tests (this approach will be dis-
tions. First, what is the minimum level of noise that is cussed later in the section addressing masking in speech
needed in the nontest ear to eliminate its response to the audiometry), it proves somewhat disadvantageous during
test signal? Stated differently, this is the minimum masking threshold audiometry. First, it can be time consuming. Sec-
level that is needed to avoid undermasking (i.e., even with ond, the clinician may not have all required information to
contralateral masking, the test signal continues to be per- accurately calculate minimum masking level at that point in
ceived in the nontest ear). Second, what is the maximum time. (The reader is referred to Yacullo, 1996 for further dis-
level of noise that can be used in the nontest ear that will not cussion of the derivation of these equations and formulas.)
change the true threshold or response in the test ear? Stated The simplified method described by Martin (1967,
differently, this is the maximum masking level that can be 1974) is recommended for clinical use. Martin has suggested
used without overmasking (i.e., with contralateral mask- that formulas are unnecessary during threshold audiome-
ing, the true threshold or response in the test ear has been try and has simplified the calculation of minimum mask-
changed). Because of limited IA for air-conducted sound, ing level. Specifically, the “initial” masking level (in dB EM)
the masking stimulus presented to the nontest ear can also during air-conduction threshold testing is simply equal
cross over to the test ear and produce masking of the test to air-conduction threshold (in dB HL) of the nontest ear
signal (i.e., overmasking). Stated simply, the purpose of (i.e., ACNontest Ear). It should be noted that the initial masking
clinical masking is to make the test signal inaudible in the level is calculated in the same manner regardless of the air-
nontest ear without affecting the true response to the sig- conduction transducer being used (i.e., supra-aural ear-
nal in the test ear. Therefore, the major goal of any clinical phone or 3A insert earphone).
masking procedure is the avoidance of both undermasking The audiometric data presented in Figure 6.8 will be
and overmasking. used to facilitate an understanding of the calculation of
Studebaker (1979) has identified two major approaches masking levels during puretone threshold audiometry.
to clinical masking: psychoacoustic and acoustic. Psycho- Audiometry was performed using supra-aural earphones.
acoustic procedures are “those based upon observed shifts Unmasked air- and bone-conduction thresholds at 500 Hz
in the measured threshold as a function of suprathreshold are provided; masked air- and bone-conduction thresholds
masker effective levels in the nontest ear” (Studebaker, 1979, are also included for later discussion. Unmasked puretone
p 82). These approaches are also identified as threshold shift air-conduction testing suggests that contralateral mask-
or shadowing procedures. Acoustic procedures are “those ing will be required only when measuring air-conduction
based upon calculating the approximate acoustic levels of threshold in the left ear. Specifically, the unmasked air-
the test and masker signals in the two ears under any given conduction threshold of 65 dB HL in the left ear equals or
set of conditions and on this basis deriving the required exceeds the threshold (both air and bone conduction) in the
masking level” (Studebaker, 1979, p 82). These procedures nontest ear by a conservative estimate of IA (i.e., 40 dB).
CHAPTER 6 • Clinical Masking 93
bone-conduction audiometry, an OE can be created in the Consider again the example presented in Figure 6.8.
nontest ear. The nontest ear consequently can become more Assume that we have subsequently measured a masked air-
sensitive to bone-conducted sound for test frequencies below conduction threshold of 85 dB HL in the left ear. A masked
2,000 Hz, particularly when using supra-aural earphones bone-conduction threshold will also be required in the left
(Berger and Kerivan, 1983; Berrett, 1973; Dean and Martin, ear. Comparison of the unmasked bone-conduction thresh-
2000; Dirks and Swindeman, 1967; Elpern and Naunton, old of 5 dB HL with the masked air-conduction threshold
1963; Goldstein and Hayes, 1965; Hodgson and Tillman, of 85 dB HL in the left ear suggests a potentially significant
1966). During the application of contralateral masking, there air-bone gap (i.e., ≥15 dB). Initial masking level is calcu-
is increased probability that the nontest ear will respond lated in the same manner regardless of the air-conduction
when obtaining a masked bone-conduction threshold in the transducer used for the delivery of the masking stimulus.
test ear. Studebaker (1979) points out that the OE does not The only difference in calculation relates to applying a dif-
actually affect the hearing sensitivity of the occluded ear, but ferent correction factor for the OE when testing in the lower
rather increases the sound pressure level of the signal reach- frequencies. Using the recommended fixed OE values for
ing the cochlea. The reader is referred to Tonndorf (1968, supra-aural earphones, initial masking level during bone-
1972) for further discussion of the contribution of the exter- conduction testing at 500 Hz is calculated as follows:
nal auditory meatus to bone-conduction thresholds.
There is evidence suggesting that the OE is decreased Initial Masking Level = AC Nontest Ear + OE + 10 dB
significantly when using deeply inserted insert earphones = 5 dB HL + 20 dB + 10 dB
(Berger and Kerivan, 1983; Chaiklin, 1967; Dean and = 35 dB EM
Martin, 2000). Berger and Kerivan (1983) and Dean and
Martin (2000) studied the magnitude of the OE in normal- In this particular example, it is appropriate to account for
hearing subjects using E-A-R foam eartips and supra- the OE because there is no evidence of a significant air-bone
aural earphones as occluding devices. Their overall results gap in the nontest (i.e., right) ear. The use of a supra-aural
are remarkably similar. First, the average OEs in the low earphone for delivery of masking in the lower frequencies,
frequencies are greatly reduced when occluding the ear however, will result in greater initial masking levels than
using an E-A-R foam eartip with deep insertion. Second, when using a 3A insert because of a larger OE correction
the advantage of a greatly reduced OE for the E-A-R foam factor.
eartip is lost when a partial or shallow insertion is used.
Third, partial or shallow insertion of an E-A-R foam eartip
yields average OEs that are similar to those measured with MAXIMUM MASKING LEVEL
supra-aural earphones. Different theories have been offered Maximum masking level refers to the maximum level of
to explain the reduced OE for an occluding device that is noise that can be used in the nontest ear that will not shift or
deeply inserted into the ear canal. The reader is referred to change the true threshold in the test ear. Two factors influ-
Berger and Kerivan (1983), Tonndorf (1972), and Yacullo ence maximum masking level during puretone audiometry:
(1996) for further discussion. (1) The bone-conduction threshold of the test ear (BCTest Ear)
The clinician can use either individually determined and (2) IA of the air-conducted masking stimulus (Lidén
(Dean and Martin, 2000; Martin et al., 1974) or fixed OE et al., 1959). Maximum masking level (MMax), based on the
values (i.e., based on average data reported in the literature) original concept described by Lidén et al., can be summa-
when calculating initial masking level. Based on the largest rized using the following equation:
average OEs reported in the literature (Berger and Keri-
van, 1983; Berrett, 1973; Dean and Martin, 2000; Dirks and M Max = BC Test Ear + IA − 5dB
Swindeman, 1967; Elpern and Naunton, 1963; Goldstein
and Hayes, 1965; Hodgson and Tillman, 1966), the follow- If BCTest Ear + IA is just sufficient to produce overmasking,
ing values are recommended for clinical use. then clinically, we want to use a masking level that is some-
When using supra-aural earphones, the following fixed OE what less than the calculated value. Consequently, 5 dB is
values are recommended: 30 dB at 250 Hz, 20 dB at 500 Hz, subtracted from the level that theoretically produces over-
and 10 dB at 1,000 Hz. When using 3A insert earphones with masking. Because we are concerned about an undesired
deeply inserted foam eartips, the following values are recom- masking effect in the test ear, bone-conduction sensitivity
mended: 10 dB at 250 and 500 Hz and 0 dB at frequencies of in that ear must be considered. As a result, overmasking is
1,000 Hz or higher. more of a potential problem when bone-conduction sen-
It should be noted that the OE is decreased or absent sitivity is very good in the test ear. Overmasking, on the
in ears with conductive hearing impairment (Martin et al., other hand, is generally not an issue when bone-conduction
1974; Studebaker, 1979). If the nontest ear exhibits a poten- hearing sensitivity is poor. The poorer the bone-conduction
tial air-bone gap of 20 dB or more, then the OE should not hearing sensitivity is in the test ear, the greater the levels of
be added to the initial masking level at that frequency. masking that can be used without overmasking.
CHAPTER 6 • Clinical Masking 95
The following two points are important to remember. Clearly in this case, our calculation based on the unmasked
First, the equation for maximum masking level is the same bone-conduction threshold (i.e., 60 dB EM) is an underesti-
for both air- and bone-conduction audiometry. Masking mate of the actual maximum level (i.e., 135 dB EM).
noise is always delivered through an air-conduction trans- Whenever an unmasked bone-conduction thresh-
ducer (e.g., insert or supra-aural earphone) regardless of old is used during determination of maximum masking,
the transducer used for measuring puretone threshold (i.e., the resultant value is typically smaller than the masking
air- or bone-conduction transducer). Second, maximum level that will actually result in overmasking. Although
masking level is generally higher when using 3A insert ear- the actual calculation of maximum masking level during
phones because of increased IA, particularly in the lower puretone threshold audiometry is often of limited use,
frequencies. consideration of the maximum level of noise that can be
Consider again the example presented in Figure 6.8. We used in the nontest ear can alert the audiologist to the pos-
will now calculate the maximum masking level that can be sibility of overmasking, particularly in cases of conductive
used during both masked air- and bone-conduction audi- hearing loss when bone-conduction hearing sensitivity is
ometry: very good.
M Max = BC Test Ear + IA − 5 dB
= 80 dB HL + 60 dB − 5 dB RECOMMENDED CLINICAL PROCEDURE
= 135 dB EM The most popular method for measuring masked puretone
thresholds was first described by Hood in 1957 (Hood,
Rather than using the very conservative IA estimate of 40 dB 1960). The Hood method, also referred to as the plateau,
when using supra-aural earphones, in this case, we will use threshold shift, or shadowing procedure, is a psychoacoustic
the more accurate estimate of 60 dB. If the bone-conduction technique that relies on observations about the relation-
threshold in the right (i.e., nontest) ear is assumed to be ship between masker level in the nontest ear and mea-
5 dB HL (i.e., the unmasked bone-conduction threshold) sured threshold in the test ear. Hood originally described
and the unmasked air-conduction threshold in the left ear a masking procedure that was applicable for measurement
is 65 dB HL, then there is evidence that IA is at least 60 dB. of masked bone-conduction thresholds. However, it proves
If 140 dB EM is just sufficient to produce overmasking (i.e., equally effective for measurement of air-conduction
BCTest Ear + IA), then 135 dB EM is the maximum level of thresholds as well.
noise that can be used in the nontest ear that will not shift or The example presented in Figure 6.9 will help facilitate
change the true threshold in the test ear. It should be noted an understanding of the underlying concept of the threshold
that 135 dB EM is a level that significantly exceeds the out- shift procedure. Unmasked puretone air-conduction thresh-
put limits for standard audiometers. olds, obtained using supra-aural earphones, were measured
Generally, it is neither time efficient nor necessary to at 10 dB HL in the right ear and 60 dB HL in the left ear
calculate maximum masking level during puretone thresh- (Figure 6.9A). Contralateral masking will be required when
old audiometry, particularly when using psychoacoustic or testing the left ear because there is a difference between the
threshold shift masking procedures (which will be described test and nontest ears that equals or exceeds a conservative
shortly). In addition, the estimated maximum masking level estimate of IA (i.e., 40 dB). An initial masking level of 20 dB
is typically very conservative and not an accurate indication EM (i.e., ACNontest Ear + 10 dB) is now presented to the right
of the true maximum. In the above example, we calculated a ear, and puretone threshold is re-established. Recall that the
relatively accurate estimate by using a more accurate value of purpose of contralateral masking is to raise the threshold
IA (rather than the conservative value) and the actual bone- of the nontest ear sufficiently to eliminate its contribution
conduction threshold in the test ear (i.e., 80 dB HL). How- when measuring a response in the test ear. Assuming that
ever, the true bone-conduction threshold (obtained with overmasking is not occurring, then contralateral masking
appropriate contralateral masking) in the test ear is typically should have an effect only on the responsiveness of the non-
not known when maximum masking level is estimated dur- test ear.
ing both air- and bone-conduction threshold audiometry. There are two possible outcomes when puretone thresh-
Because only an unmasked bone-conduction threshold is old is re-established in the presence of contralateral masking:
available at the time that masking levels are determined, we (1) No measured puretone threshold shift (e.g., puretone
are required to use the unmasked threshold as the estimate threshold remains constant at 60 dB HL; Figure 6.9B) or (2) a
of bone-conduction hearing sensitivity in the test ear. Let us measured puretone threshold shift (e.g., puretone threshold
calculate again MMax using the unmasked bone-conduction shifts from 60 to 70 dB HL; Figure 6.9C). If contralateral
response as the estimate of bone-conduction threshold: masking in the nontest ear does not produce a masking
effect, then it is concluded that the unmasked puretone
M Max = 5 dB HL + 60 dB − 5 dB threshold represents a response from the test ear. Conversely,
= 60 dB EM if contralateral masking in the nontest ear does produce a
96 SECTION I • Basic Tests and Procedures
A C
FIGURE 6.9 Example illustrating the underlying concept of the plateau or threshold shift masking
procedure. See text for further discussion. (From Yacullo WS. (1996) Clinical Masking Procedures.
1st ed. Boston, MA: Allyn & Bacon, © 1996, p 69. Adapted by permission of Pearson Education, Inc.,
Upper Saddle River, NJ.)
masking effect, then it is concluded that the unmasked pur- ear. Masking noise is introduced at an initial masking level of
etone threshold represents a response from the nontest ear. 10 dB EM (i.e., ACNontest Ear + 10 dB), and puretone thresh-
The underlying concept of the Hood procedure is that the old is re-established. Threshold shifts to 50 dB HL. When
introduction of masking to the nontest ear will produce a the masker level is raised sequentially to 20 and 30 dB EM,
masking effect (i.e., a threshold shift) only if the nontest ear puretone threshold continues to shift by 10 dB. A shadowing
is contributing to the observed response. Decisions about effect has occurred because the masked puretone threshold
which ear is contributing to the measured threshold are “shadows” the threshold of the nontest or masked ear with
based on whether a threshold shift occurs when masking is each increment in EM level. Because a threshold shift occurs
introduced to the nontest ear. when masking level is raised, it is concluded that the mask-
Hood (1960) outlined two essential steps of the plateau ing noise and the tone are restricted to the nontest ear.
masking procedure: (1) Demonstration of the shadowing When the masker is raised from 30 to 100 dB EM, pur-
effect and (2) identification of the changeover point. The etone threshold no longer shifts and remains stable at 70 dB
hypothetical example presented in Figure 6.10 illustrates HL. A plateau has been reached. Because there is no addi-
basic concepts of the plateau masking procedure. Puretone tional masking effect (i.e., a threshold shift) when masker
testing using supra-aural earphones reveals unmasked level is increased, it is concluded that the nontest ear is no
air-conduction thresholds of 0 dB HL in the right ear and longer contributing to the observed response. Puretone
40 dB HL in the left ear (Figure 6.10A). Unmasked bone- threshold of the test ear (i.e., 70 dB HL) has been reached.
conduction threshold is 0 dB HL. Because there is a 40-dB Hood (1960) refers to the initial point on the masking func-
difference between ears, contralateral masking will be tion where puretone threshold remains stable with increas-
required when measuring air-conduction threshold in the ing masking level as the “changeover point.” In this example,
left ear. (Masked air- and bone-conduction thresholds in the the changeover point of 30 dB EM also corresponds to mini-
left ear are included for later discussion.) mum masking level, the minimum amount of noise required
The masking function presented in Figure 6.10B shows to establish the true threshold in the test ear. Masker levels
the relationship between measured puretone threshold (in that result in no threshold shift (i.e., the plateau) represent
dB HL) in the test ear and EM level (in dB EM) in the nontest adequate masking (i.e., 30 through 100 dB EM). Masker
CHAPTER 6 • Clinical Masking 97
A B
levels less than 30 dB EM represent undermasking. That is, is increased somewhat by using a masker increment of 5 dB.
there is insufficient masking to establish the true puretone It is somewhat arbitrary whether a 5- or 10-dB step size is
threshold in the test ear. used for increasing masker level. Either step size is accept-
When the masker level exceeds 100 dB EM (i.e., 110 and able. However, the smaller step size of 5 dB is strongly rec-
120 dB EM), however, a puretone threshold shift with each ommended whenever the masking plateau is narrow and
increment in masking level is again observed. Overmasking there is increased risk of overmasking (i.e., cases of bilateral
is now occurring. The masking noise has reached the test ear conductive hearing loss).
through cross hearing, and a masking effect (i.e., a thresh- Hood (1960) did not specify the number of masker
old shift) is observed in the test ear. Assuming that a masked increments needed to establish a masking plateau. Clini-
bone-conduction threshold is measured subsequently in the cally, it is neither time efficient nor necessary to measure the
left ear at 65 dB HL, then an estimate of maximum masking entire masking plateau. It is generally agreed that a mask-
level is 100 dB EM (BCTest Ear + IA − 5 dB: 65 dB HL + 40 dB − ing “plateau” has been established when masker level can be
5 dB). Whereas the plateau and overmasking portions of the increased over a range of at least 15 to 20 dB without shift-
masking function represent responses from the test ear, the ing puretone threshold (Kaplan et al., 1993; Martin, 1980;
undermasking or shadowing portion represents responses Sanders, 1991; Silman and Silverman, 1991).
from the nontest ear. It should be apparent from the masking The recommended clinical procedure (Yacullo, 1996,
function in Figure 6.10 that the width of the masking plateau 2004), based on the major components of Hood’s shadow-
is defined by the minimum and maximum masking levels. ing technique, is summarized as follows:
The clinical goal of the plateau procedure is to estab-
lish the HL at which puretone threshold remains unchanged 1. Masking noise is introduced to the nontest ear at
with increments in masking level. Two important variables the initial masking level. Puretone threshold is then
that relate to the plateau procedure are (1) the magnitude re-established.
of the masker increment and (2) the number of masker 2. Level of the tone or noise is increased subsequently by
increments needed to establish a masking plateau. Although 5 dB. If there is a response to the tone in the presence
Hood (1960) originally recommended that masker level be of the noise, the level of the noise is increased by 5 dB.
changed in increments of 10 dB, others have suggested that If there is no response to the tone in the presence of the
the level should be a 5-dB step size (Martin, 1980; Silman noise, the level of the tone is increased in 5-dB steps until
and Silverman, 1991). Martin (1980) suggests that accuracy a response is obtained.
98 SECTION I • Basic Tests and Procedures
3. A plateau has been reached when the level of the noise occurs, the tone is increased in 5-dB steps until audibility
can be increased over a range of 15 to 20 dB without is achieved. However, the HL of the tone may be increased
shifting the threshold of the tone. This corresponds to a inappropriately because of a decision-making process based
response to the tone at the same HL when the masker is on a single response. This may lead to imprecision when mea-
increased in three to four consecutive levels. suring the masked threshold. Therefore, it is recommended
4. Masked puretone threshold corresponds to the HL of the that masked puretone threshold be re-established using a
tone at which a masking plateau has been established. standardized threshold procedure (e.g., ASHA, 2005) in the
presence of the final level of masking noise that resulted in
If a 10-dB step size is used for increasing masking level, then a plateau. This sometimes leads to a 5-dB improvement in
the plateau corresponds to a range of 20 dB (i.e., a response the masked puretone threshold. However, the decision to re-
to the tone at the same HL when the masker is increased in establish masked puretone threshold at the end of the pla-
two consecutive levels). teau procedure will be influenced by time considerations.
The recommended procedure for establishing a masking Remember that the goal of the plateau procedure is
plateau does not require that puretone threshold be formally to establish the HL at which puretone threshold remains
established each time that the masking level is increased. unchanged with increments in masking level. Given this
This approach would significantly increase the time required goal, there are three major outcomes that can result when
to establish a masking plateau. Rather, the tone is presented measuring puretone threshold. These outcomes are illus-
once at the same HL as the previous response. If no response trated in the three examples presented in Figure 6.11. In
each example, the unmasked puretone thresholds at 2,000 Hz in the nontest ear and a conductive hearing loss in the test
are the same. Unmasked puretone air-conduction thresh- ear. The presence of significant hearing loss in the nontest
olds, obtained using 3A insert earphones, were measured ear requires higher initial masking levels; the presence of a
at 15 dB HL in the right ear and 75 dB HL in the left ear. conductive hearing loss in the test ear (i.e., normal bone-
Contralateral masking will be required when measuring conduction hearing sensitivity) decreases the maximum
air-conduction threshold in the left ear; an initial masking masking level. The consequence of a reduced or nonexistent
level of 25 dB EM is presented to the right ear, and puretone masking plateau is the inability to establish correct masked
threshold is re-established. thresholds in the test ear.
In the first outcome, the unmasked puretone threshold The classic example of a masking dilemma is demon-
of 75 dB HL remains unaffected with increasing masking strated with a bilateral, mild-to-moderate conductive hear-
level. The level of the noise was increased over a range of ing loss. The possibility for overmasking exists when mea-
20 dB without shifting the threshold of the tone. In this exam- suring masked air- and bone-conduction thresholds in both
ple, the initial masking level occurs at the masking plateau. ears. Naunton (1960) states that, in some cases of bilateral
Contralateral masking has confirmed that the unmasked conductive hearing loss, it is not possible to mask the non-
puretone threshold represents a response from the test ear. test ear without simultaneously producing a masking effect
In the second outcome, the initial masking level pro- in the test ear.
duces a puretone threshold shift. A masking plateau is One solution to the masking dilemma is the use of
reached, however, when masking level is increased from insert earphones (Coles and Priede, 1970; Hosford-Dunn
35 to 55 dB EM (i.e., a masking range of 20 dB). Because et al., 1986; Studebaker, 1962, 1964). Recall that the use
masked puretone threshold remains stable at 95 dB HL with of 3A insert earphones significantly increases IA for air-
increasing masking level, puretone threshold is recorded as conducted sound, particularly in the lower frequencies
95 dB HL. Contralateral masking has confirmed that the (Killion et al., 1985; Sklare and Denenberg, 1987). There are
unmasked puretone threshold represents a cross-hearing two advantages of using insert earphones in cases of bilat-
response from the nontest ear. eral conductive hearing loss. First, the need for masking
In the third outcome, the initial masking level again during measurement of air-conduction thresholds is often
produces a puretone threshold shift. However, puretone eliminated because of greater IA for air-conducted sound.
threshold continues to shift to the output limits of the Second, the use of insert earphones reduces the probabil-
audiometer with increasing masking level. A plateau is not ity of overmasking in cases where contralateral masking
obtained. Therefore, it is concluded that there is no measur- is required. The use of an air-conduction transducer with
able hearing in the left ear. This conclusion is correct assum- increased IA increases the range between the minimum and
ing that overmasking has not occurred. maximum masking levels, thereby increasing the width of
Turner (2004a, 2004b) has described a masking method the masking plateau and the range of permissible masking
that can replace the plateau procedure in some masking levels (Studebaker, 1962).
situations. A disadvantage of the plateau method is that it
can be time consuming. The “optimized” masking method CENTRAL MASKING
described by Turner, which is based on the principles of the
masking plateau, can reduce the number of masking levels The introduction of contralateral masking can produce a
required to reach the plateau and establish threshold. The small threshold shift in the test ear even when the mask-
method is optimized because it uses the maximum possible ing level is insufficient to produce overmasking. Wegel
masking levels without overmasking. This is accomplished and Lane (1924) referred to this phenomenon as central
through the use of higher initial masking levels and maxi- masking. It has been hypothesized that threshold shifts
mum masker increments. However, there are some masking in the presence of low levels of masking are mediated
situations where the optimized approach is not appropriate. through central nervous system processes (Lidén et al.,
The reader is referred to the two articles by Turner (2004a, 1959). Central masking has been reported to influence
2004b) for further discussion. thresholds measured during both puretone and speech
audiometry (Dirks and Malmquist, 1964; Lidén et al.,
1959; Martin, 1966; Martin and DiGiovanni, 1979; Martin
THE MASKING DILEMMA
et al., 1965; Studebaker, 1962). Although the threshold
There are clinical situations where minimum masking shift produced by central masking is generally consid-
levels can result in overmasking. Studebaker (1979) states ered to be approximately 5 dB (Konkle and Berry, 1983;
that a “masking dilemma” results when the width of the Martin, 1966), variable results have been reported across
masking plateau is very narrow or nonexistent. Remember subjects and studies. There is also some indication that
that the width of the masking plateau is defined by mini- central masking effects increase with increasing mask-
mum and maximum masking levels. Generally, a masking ing level (Dirks and Malmquist, 1964; Martin et al., 1965;
dilemma results whenever there is a significant hearing loss Studebaker, 1962).
100 SECTION I • Basic Tests and Procedures
FIGURE 6.12 An example illustrating audiogram interpretation using unmasked and masked
puretone thresholds.
CHAPTER 6 • Clinical Masking 101
FIGURE 6.13 An example illustrating the use of the threshold shift masking procedure for determin-
ing speech detection threshold.
CHAPTER 6 • Clinical Masking 103
of the speech or noise is increased by 5 dB until a masking frequency range. Presentation LevelTest Ear – IA, an estimate
plateau has been reached. Remember that it is acceptable to of the HL of the speech signal reaching the test ear, repre-
use a 10-dB masker increment when establishing a masking sents the minimum masking level required. The presence
plateau. In this particular case, the risk of overmasking is of air-bone gaps in the nontest (i.e., masked) ear, however,
essentially nonexistent because of the poor bone-conduction will reduce the effectiveness of the masker. Consequently,
hearing sensitivity in the test ear (and the use of an air- minimum masking level must be increased by the size of the
conduction transducer with increased IA for presenting the air-bone gap.
masking noise). Therefore, the use of a 10-dB masker incre- Lidén et al. (1959) recommended that the average
ment can be easily justified. Masked SDT is subsequently air-bone gap in the nontest ear, calculated using frequen-
measured at 90 dB HL (using 40 dB EM). cies of 500, 1,000, and 2,000 Hz, be accounted for when
Although the plateau masking procedure can be used determining the minimum masking level. Coles and
during assessment of masked SRT, it can prove very time Priede (1975) suggested a more conservative approach
consuming. Recall that only a single detection response to and recommended that the largest air-bone gap at any fre-
speech is required when measuring masked SDT before quency in the range from 250 to 4,000 Hz be considered.
making a decision about increasing the level of the speech Remember that speech is a broadband signal. Therefore,
or masker. The use of the plateau procedure for measuring bone-conduction hearing sensitivity across a range of fre-
masked SRT, however, requires that threshold be re-estab- quencies in the nontest ear must be considered. There is
lished (i.e., 50% correct recognition of spondaic words) at the assumption that the largest air-bone gap will have the
each masking level until a plateau is reached. The acoustic greatest effect on masking level. Following the conservative
method proves to be the method of choice when measuring recommendation of Coles and Priede (1975), it is recom-
masked SRT because of its greater time efficiency (Konkle mended that the largest air-bone gap across the frequency
and Berry, 1983; Studebaker, 1979) and will be discussed in range in the nontest ear be accounted for when determin-
the following section. ing minimum masking level.
Maximum masking level (MMax) for speech, originally
described by Lidén et al. (1959), can be defined using the
ACOUSTIC MASKING PROCEDURES following equation:
Recall that acoustic masking procedures are based on calcu-
lating the estimated acoustic levels of the test and masking M Max = Best BC Test Ear + IA − 5dB
stimuli in the two ears during a test condition and, on this
basis, selecting an appropriate masking level. A major dis- Best BCTest Ear represents the best bone-conduction threshold
advantage of the acoustic or formula approach is that the in the test ear in the frequency range from 250 to 4,000 Hz,
application requires knowledge about air-bone gaps in both and IA is equal to IA for speech. If Best BCTest Ear + IA rep-
test and nontest ears (Konkle and Berry, 1983; Studebaker, resents a level that will just produce overmasking in the test
1979). Knowledge about air-bone gaps in the nontest ear is ear, then a slightly lower masking level should be used clini-
required to calculate minimum masking level. Information cally. Consequently, a value of 5 dB is subtracted from the
about bone-conduction hearing sensitivity in the test ear calculated level.
is required to calculate maximum masking level. Assum- Lidén et al. (1959) originally recommended that the aver-
ing that complete puretone threshold data are available age puretone bone-conduction threshold in the test ear, again
before performing speech audiometry, however, formula calculated using frequencies of 500, 1,000, and 2,000 Hz,
approaches for calculating required masking levels prove should be accounted for when estimating maximum masking
very effective during measurement of both threshold and level. However, a more conservative approach includes con-
suprathreshold speech recognition. sideration of the best bone-conduction threshold in the test
The underlying concepts of minimum and maximum ear over a wider range of frequencies (i.e., 250 to 4,000 Hz).
masking levels for speech are similar to those offered earlier There is the assumption that the best bone-conduction
for puretone stimuli. Minimum masking level for speech threshold in the test ear is the most susceptible to the effects
(MMin), originally described by Lidén et al. (1959), can be of overmasking.
defined using the following equation: The optimal masking level during speech audiometry
is one that occurs above the minimum and below the maxi-
M Min = Presentation Level Test Ear − IA mum masking levels (Konkle and Berry, 1983; Lidén et al.,
+ Largest Air-Bone Gap Nontest Ear 1959; Studebaker, 1979). Minimum and maximum mask-
ing levels represent, respectively, the lower and upper lim-
Presentation LevelTest Ear represents the HL (dB HL) of the its of the masking plateau. Studebaker (1979) states that a
speech signal at the test ear, IA is equal to IA for speech, major goal of the acoustic or formula approach is to apply
and Largest Air-Bone GapNontest Ear represents the largest rules that will place the masking level at approximately the
air-bone gap in the nontest ear in the 250- to 4,000-Hz middle of the range of correct values (i.e., the middle of the
104 SECTION I • Basic Tests and Procedures
masking plateau). This concept was originally discussed by ately for air-bone gaps in the test and nontest ears. In cases
Luscher and König in 1955 (cited by Studebaker, 1979). where there are no air-bone gaps in either ear, the selected
Studebaker (1962) first described an equation for calcu- masking level is simply equal to the HL of the speech signal.
lating midmasking level during puretone bone-conduction To avoid the use of very high levels of contralateral mask-
audiometry. The basic concepts underlying the midplateau ing that can sometimes result, Studebaker indicates that it
procedure, however, can be easily applied during speech is permissible to reduce the masking level by 20 to 25 dB
audiometry. Yacullo (1999) states that a simple approach to below the presentation level of the speech signal. The reader
calculating the midmasking level (MMid) involves determin- is referred to Studebaker (1979) for a more comprehensive
ing the arithmetic mean of the minimum and maximum discussion.
masking levels: According to the results of a survey of audiometric
practices in the United States, many audiologists “base their
M Mid = (M Min + M Max )/2 masking level for word-recognition testing on the stimulus
level presented to the test ear and subtract a set amount,
For example, if MMin is equal to 40 dB EM and MMax is such as 20 dB” (Martin et al., 1998, p 100). Although selec-
equal to 80 dB EM, then MMid, the masking level that falls tion of a masking level that is equal to the presentation level
at midplateau, is 60 dB EM. When a masking level falls at at the test ear minus 20 dB may appear somewhat arbitrary,
the middle of the acceptable masking range (i.e., midmask- it can actually be supported by sound theoretical constructs.
ing level), then the risk of undermasking and overmasking Yacullo (1996, 1999) has described a simplified approach,
is minimized (Studebaker, 1962). It should be noted that based on the underlying concepts of both the midplateau
midplateau actually represents a range of values surround- and Studebaker acoustic procedures, that can be used when
ing the midmasking level. Consequently, there is some flex- selecting contralateral masking levels during speech audi-
ibility in using a somewhat higher or lower masking level. ometry. Although this approach was originally described for
Yacullo (1999) states that there are two major advan- use during assessment of suprathreshold speech recognition
tages of the midplateau masking procedure. First, IA is elim- (Yacullo, 1996), it also proves equally effective during mea-
inated as a source of error when determining an appropri- surement of SRT (Yacullo, 1999). Stated simply, EM level is
ate masking level. Masking levels are often determined using equal to the presentation level of the speech signal in dB HL
very conservative estimates of IA. However, IA has equal yet at the test ear minus 20 dB:
opposite effects on minimum and maximum masking lev-
els. Although the value of IA used for determining mini- dB EM = Presentation Level Test Ear − 20 dB
mum and maximum masking levels will influence the width
of the masking plateau, the midmasking level remains the Given two prerequisite conditions (which will be dis-
same. cussed shortly), the selected masking level will fall approxi-
Second, midmasking level can be determined for both mately at midplateau. Unfortunately, inappropriate use of
threshold and suprathreshold speech measures by using this simplified approach can result in undermasking or
the same formula approach (Konkle and Berry, 1983). The overmasking.
midplateau procedure avoids a potential problem dur- Jerger and associates (Jerger and Jerger, 1971; Jerger et al.,
ing measurement of suprathreshold speech recognition 1966) appear to be the first to report the use of a masking
that is related to calibration of EM level and percent cor- procedure that involved presenting contralateral masking
rect response criterion. Recall that EM level for speech is noise at a level 20 dB less than the presentation level of the
specified relative to the SRT (i.e., 50% correct recognition speech signal at the test ear. Specifically, it was reported that
of spondaic words) (ANSI/ASA, 2010). Suprathreshold “whenever the speech level to the test ear was sufficiently
speech recognition performance, however, can range from intense that the signal might conceivably cross over and be
0% to 100%. Konkle and Berry (1983) indicate that a major heard on the nontest ear, the latter was masked by white
advantage of the midplateau procedure is that the middle of noise at a level 20 dB less than the speech presentation level
the masking plateau (i.e., the optimal masking level) is not on the test ear” (Jerger and Jerger, 1971, p 574). It should be
influenced by different listener response criteria used dur- noted, however, that Jerger and associates used white noise
ing assessment of threshold and suprathreshold speech rec- as a contralateral masker rather than the typically used
ognition. The reader is referred to Konkle and Berry (1983) speech spectrum noise. In addition, the white noise was not
and Studebaker (1979) for more detailed discussion. calibrated in EM level for speech.
Studebaker (1979) has described a somewhat different More recently, Hannley (1986) and Gelfand (2009)
acoustic masking procedure for use during speech audiom- have discussed briefly the simplified approach to masking.
etry that is consistent with the goal of selecting a masking Gelfand indicates, however, that using an EM level equal to
level that occurs at midplateau. Specifically, the recom- the HL of the speech signal at the test ear minus 20 dB gen-
mended masking level is equal to the presentation level of erally proves most effective in cases of sensory/neural hear-
the speech signal in dB HL at the test ear, adjusted appropri- ing loss. In fact, the desired outcome may not occur when
CHAPTER 6 • Clinical Masking 105
there are significant air-bone gaps in the nontest ear (e.g., M Max = Best BC Test Ear + IA − 5 dB
conductive hearing loss).
= 45 dB HL + 40 dB − 5 dB
Yacullo (1999) states that the simplified masking pro-
cedure when used appropriately can significantly reduce the = 80 dB EM
calculations required for the determination of optimal (i.e.,
midmasking) masking level. Specifically, the method proves M Mid = (M Min + M Max )/2
effective given the following two conditions: (1) There are = (60 + 80)/2
no significant air-bone gaps (i.e., ≥15 dB) in either ear and = 70 dB EM
(2) speech is presented at a moderate SL (i.e., 30 to 40 dB
SL) relative to the measured or estimated SRT. If these two An EM level of 70 dB is appropriate for three reasons.
prerequisites are met, then the selected masking level should First, it occurs at midplateau. Second, it occurs at least
occur approximately at midplateau. 10 dB above the calculated minimum. Remember that a
Acoustic masking procedures are recommended when safety factor of at least 10 dB or greater should be added
assessing threshold and suprathreshold speech recogni- to the calculated minimum value to account for intersub-
tion. The following two examples help illustrate the use of ject variability with respect to masker effectiveness (Martin,
the midplateau masking procedure, as well as the simpli- 1974; Studebaker, 1979). Finally, it does not exceed the cal-
fied approach when applicable, for measurement of supra- culated maximum masking level.
threshold speech recognition and SRT. It should be noted that the width of the masking plateau
The example presented in Figure 6.14 illustrates the use is typically underestimated when a conservative estimate of
of the midplateau masking procedure during assessment of IA is used for determining the minimum and maximum
suprathreshold speech recognition. Puretone testing reveals masking levels. If IA is actually greater than the conservative
a mild, sensory/neural hearing loss of flat configuration in estimate of 40 dB, then the width of the masking plateau
the right ear. There is a moderate-to-severe, sensory/neural will increase. For example, if this patient actually exhibits IA
hearing loss of gradually sloping configuration in the left for speech of 55 dB (rather than the conservative estimate
ear. SRTs were measured at 35 dB HL in the right ear and of 40 dB), then the minimum level will be decreased and
55 dB HL in the left ear, findings that support the puretone the maximum level will be increased by the same amount
results. Suprathreshold speech recognition will be assessed (i.e., 15 dB). Although the width of the masking plateau
at 40 dB SL using Central Institute for the Deaf (CID) W-22 increases, the midmasking level remains the same. As stated
monosyllabic word lists. earlier, a major advantage of the midplateau method is
Let us first consider the situation where supra-aural that IA is eliminated as a source of error when selecting an
earphones are being used during audiometry. Contralateral appropriate masking level.
masking will be required only when measuring suprathreshold We now will take another look at the example presented
speech recognition in the left ear. Specifically, the presentation in Figure 6.14 and substitute 3A insert earphones for the
level of 95 dB HL (i.e., SRT of 55 dB HL + 40 dB SL) exceeds supra-aural arrangement. Contralateral masking will also
the best bone-conduction threshold of 30 dB HL in the nontest be required when assessing suprathreshold speech recog-
ear by a conservative estimate of IA for speech (i.e., 40 dB): nition in the left ear. The presentation level of 95 dB HL
(i.e., SRT of 55 dB HL + 40 dB SL) exceeds the best bone-
Presentation Level Test Ear − Best BC Nontest Ear ≥ IA conduction threshold of 30 dB HL in the nontest ear by a
conservative estimate of IA for speech (i.e., 60 dB). We will
95 dB HL − 30dB HL ≥ 40 dB again use the midplateau masking procedure to select an
appropriate level of contralateral masking. The calculations
65dB HL ≥ 40 dB are the same for both supra-aural and 3A insert earphones
with the exception that an IA value of 60 dB will be substi-
We will use the midplateau masking procedure to select tuted in our equations for minimum and maximum mask-
an appropriate contralateral masking level. Remember that ing levels. Masking levels for use with insert earphones are
the midplateau masking procedure involves a three-step summarized below:
process: Calculation of (1) minimum masking level (MMin),
(2) maximum masking level (MMax), and (3) midmasking M Min = 40 dB EM
level (MMid):
M Max = 100 dB EM
M Min = Presentation Level Test Ear − IA
M Mid = 70 dB EM
+ Largest Air-Bone Gap Nontest Ear
= 95 dB HL − 40 dB + 5 dB It should be apparent that an increase in IA has equal
= 60 dB EM yet opposite effects on the minimum and maximum
106 SECTION I • Basic Tests and Procedures
FIGURE 6.14 An example illustrating the use of the midplateau and simplified masking procedures
during assessment of suprathreshold speech recognition.
CHAPTER 6 • Clinical Masking 107
masking levels. Because IA is increased by 20 dB when using measured at approximately 0 dB HL in the right ear and
insert earphones, the width of the masking plateau is increased 70 dB HL in the left ear. Prior to measurement of speech
by 40 dB. The midmasking level, however, remains the same. thresholds, we can accurately predict whether contralateral
We now will take one final look at the example pre- masking will be required. Contralateral masking will be
sented in Figure 6.14 and consider the use of the simplified required only when measuring SRT in the left ear because
masking approach. Because the two prerequisite conditions the estimated speech threshold of 70 dB HL exceeds the best
are met, the simplified approach should result in the selec- bone-conduction threshold of 0 dB HL in the nontest ear
tion of an optimal masking level (i.e., midmasking level). by a conservative estimate of IA for speech (i.e., 60 dB). An
Recall that EM level is simply calculated as the presenta- unmasked SRT is subsequently measured in the left ear at
tion level of the speech signal in dB HL at the test ear minus 0 dB HL, a finding consistent with the puretone results.
20 dB. The same equation will be applicable when using any It has already been demonstrated that the simplified
earphone (i.e., 3A and supra-aural): masking procedure proves very effective during assessment
of suprathreshold speech recognition. However, it can also
dB EM = Presentation Level Test Ear − 20 dB be applied effectively during measurement of SRT. When
= 95 dB HL − 20 dB selecting an appropriate contralateral masking level when
= 75 dB measuring SRT, it is important to consider not only the HL
at which the speech threshold is finally established, but also
It should be noted that the masking level of 75 dB EM the highest presentation levels used throughout the thresh-
calculated using the simplified approach is in good agree- old procedure. Regardless of the SRT procedure used, spon-
ment (i.e., ±5 dB) with the value (i.e., 70 dB EM) deter- daic words are presented typically at both suprathreshold
mined using the midplateau procedure. Yacullo (1999) and threshold levels.
indicates that there are two major advantages to using the For example, ASHA (1988) recommends a descend-
simplified masking approach with 3A insert earphones, ing threshold technique for measuring SRT that is based
which are the result of a wider masking plateau. First, there on the earlier work of others (Hirsh et al., 1952; Hudgins
is greater flexibility in deviating somewhat from the cal- et al., 1947; Tillman and Olsen, 1973; Wilson et al., 1973).
culated midmasking level while still remaining within an The initial phase involves familiarizing the patient with
acceptable range of midplateau. It should be noted that the the spondaic words at a comfortable, suprathreshold HL.
midplateau actually represents a small range of values sur- (Familiarization with test words is strongly recommended
rounding the midmasking level. This range of acceptable regardless of the SRT procedure.) The preliminary phase
values essentially increases when using 3A insert earphones. involves setting the HL to 30 to 40 dB above the estimated
The use of the simplified masking approach can sometimes or predicted SRT before descending in 10-dB decrements
result in the selection of high masking levels, even though until two words are missed. In fact, an HL of 30 to 40 dB
overmasking is not occurring. Consequently, the audiolo- above the predicted SRT typically results in a comfortable
gist can justify subtracting a value of greater than 20 dB listening level for most patients during the familiariza-
(e.g., 25 or 30 dB) from the presentation level at the test ear tion phase. The test phase involves initially presenting test
when using insert earphones. In the example presented in words at HLs approximately 10 dB higher than the calcu-
Figure 6.14, an EM level equal to the presentation level in lated SRT. The calculation of threshold, based on a statisti-
the test ear minus 25 or 30 dB (e.g., 65 or 70 dB EM) would cal precedent, takes into account the patient’s responses at
still result in an acceptable masking level that falls within higher HLs.
the vicinity of midplateau, yet clearly exceeds the minimum Consider again the example presented in Figure 6.15. If
level by a sizeable amount. the ASHA-recommended procedure is used to measure SRT,
Second, there is greater flexibility in deviating slightly then the highest HLs employed (during the familiarization
from the recommended prerequisite conditions (i.e., no and preliminary phases) will be about 30 to 40 dB above the
air-bone gaps in either ear, use of moderate SLs) while still estimated SRT. In this example, we will use a moderate SL
remaining within an acceptable range of midplateau and of 30 dB above the estimated SRT (i.e., 70 dB HL + 30 dB
without significantly increasing the potential for overmasking. SL = 100 dB HL) during the familiarization and preliminary
Consequently, there is greater margin for error when select- phases.
ing an appropriate level of masking. The use of the simplified approach to selecting an
The example presented in Figure 6.15 illustrates the appropriate level of contralateral masking should prove
application of the midplateau and simplified masking pro- effective in this case because both prerequisite conditions
cedures during measurement of SRT. Audiometry was per- have been met. First, there are no significant air-bone gaps
formed using 3A insert earphones. Puretone testing reveals in either ear. Second, speech is presented at a moderate SL
normal hearing in the right ear. There is a severe sensory/ (i.e., 30 dB) when the highest HLs are used during the test
neural hearing loss of flat configuration in the left ear. Based procedure (i.e., familiarization and preliminary phases).
on the puretone findings, it is predicted that SRTs will be Assuming that 100 dB HL is the highest presentation level
108 SECTION I • Basic Tests and Procedures
FIGURE 6.15 An example illustrating the use of the midplateau and simplified masking procedures
during assessment of speech recognition threshold.
CHAPTER 6 • Clinical Masking 109
used during our test procedure, then EM level in the nontest ter suprathreshold word recognition tests at a specified SL
ear is calculated as follows: referenced to the SRT (Martin and Morris, 1989; Martin
et al., 1994, 1998), typically 30 or 40 dB (Martin and Mor-
dB EM = Presentation Level Test Ear − 20 dB ris, 1989; Martin et al., 1994). Finally, it can be applied
= 100 dB HL − 20 dB effectively during both threshold and suprathreshold mea-
= 80 dB sures of speech recognition.
Direct calculation of midmasking level is strongly
We can verify the appropriateness of the selected mask- recommended in cases where there is potential risk of
ing level by using the midplateau method: overmasking. Yacullo (1999) states that any factor that
increases minimum masking level or decreases maximum
M Min = Presentation Level Test Ear − IA masking level will reduce the width of the masking pla-
+ Largest Air-BoneGap Nontest Ear teau and increase the probability of overmasking. For
example, the presence of significant air-bone gaps in the
= 100 dB HL − 60 dB + 0 dB nontest ear and/or the use of higher SLs (i.e., ≥50 dB) will
= 40 dB EM increase minimum masking level. The presence of signifi-
cant air-bone gaps in the test ear will decrease maximum
M Max = Best BC Test Ear + IA − 5 dB masking level. In cases where the masking plateau is either
= 55 dB HL + 60 dB − 5 dB very narrow or nonexistent (e.g., unilateral or bilateral
conductive hearing loss), knowledge about minimum and
= 110 dB EM
maximum masking levels will allow the clinician to make
well-informed decisions about appropriate contralateral
M Mid = (M Min + M Max )/2 masking levels.
= (40 + 110)/2
= 75 dB EM
FOOD FOR THOUGHT
The masking level of 80 dB EM selected using the sim-
1. Discuss how IA influences decisions about the need
plified approach is in good agreement (i.e., ±5 dB) with
for contralateral masking during puretone and speech
the value determined using the midplateau approach (i.e.,
audiometry.
75 dB EM). Although spondees will be presented at lower
2. Discuss how the OE influences measured IA for air-
HLs when measuring the SRT, it is not necessary to decrease
conduction transducers (i.e., supra-aural and 3A insert
the original level of masking. First, the selected masking
earphones with deeply inserted foam eartips). How does
level is appropriate for the highest HLs used during all
the OE influence contralateral masking levels during
phases of threshold determination. Second, the selected
bone-conduction audiometry?
masking level does not exceed the maximum masking level
3. The plateau masking procedure originally was described
(i.e., overmasking will not occur).
by Hood as a method for contralateral masking during
It can be argued that the simplified approach (as well
puretone threshold audiometry. Discuss how the underly-
as the midplateau method) can result in the use of unnec-
ing principles of the masking plateau are applied to proce-
essarily high masking levels during measurement of SRT.
dures for contralateral masking during speech audiometry.
As was discussed earlier, the midplateau represents a range
of masking levels. The audiologist can justify subtracting a
value of greater than 20 dB (e.g., 25 or 30 dB) from the pre-
sentation level at the test ear, particularly when using insert KEY REFERENCES
earphones. In this example, a masking level of 70 or 75 dB A full list of references for this chapter can be
EM (rather than the original level of 80 dB EM) still falls found at http://thePoint.lww.com. Below are the key refer-
within an acceptable range of midplateau, while still occur- ences for this chapter.
ring significantly higher than the minimum.
Yacullo (1999) states that the simplified masking American National Standards Institute/Acoustical Society of
approach during speech audiometry has wide applicability. America (ANSI/ASA). (2010) American National Standard
Specification for Audiometers. ANSI/ASA 3.6–2010. New York:
First, it can be used with a large and diverse patient popu-
ANSI/ASA.
lation, including those with normal hearing and sensory/ American Speech-Language-Hearing Association. (1988) Deter-
neural hearing loss. Second, it can be used equally effec- mining threshold level for speech [Guidelines]. Available
tively when using either supra-aural or insert earphones. online at: www.asha.org/policy.
Third, the procedure can be used in clinical situations American Speech-Language-Hearing Association. (1990) Audio-
where moderate SLs are used. For example, the majority metric symbols [Guidelines]. Available online at: www.asha.
of audiologists in the United States continue to adminis- org/policy.
110 SECTION I • Basic Tests and Procedures
American Speech-Language-Hearing Association. (2005) Guide- Hirsh IJ, Davis H, Silverman SR, Reynolds EG, Eldert E, Benson
lines for manual pure-tone threshold audiometry [Guide- RW. (1952) Development of materials for speech audiometry.
lines]. Available online at: www.asha.org/policy. J Speech Hear Disord. 17, 321–337.
Beattie RC, Svihovec DV, Edgerton BJ. (1978) Comparison of Hodgson W, Tillman T. (1966) Reliability of bone conduction
speech detection and spondee thresholds for half- versus full- occlusion effects in normals. J Aud Res. 6, 141–151.
list intelligibility scores with MLV and taped presentations of Hood JD. (1960) The principles and practice of bone-conduction
NU-6. J Am Audiol Soc. 3, 267–272. audiometry. Laryngoscope. 70, 1211–1228.
Berger EH, Kerivan JE. (1983) Influence of physiological noise and Hosford-Dunn H, Kuklinski AL, Raggio M, Haggerty HS. (1986)
the occlusion effect on the measurement of real-ear attenua- Solving audiometric masking dilemmas with an insert masker.
tion at threshold. J Acoust Soc Am. 74, 81–94. Arch Otolaryngol Head Neck Surg. 112, 92–95.
Berrett MV. (1973) Some relations between interaural attenuation Hudgins CV, Hawkins JE Jr., Karlin JE, Stevens SS. (1947) The
and the occlusion effect. Unpublished doctoral dissertation. development of recorded auditory tests for measuring hearing
Iowa City, IA: University of Iowa. loss for speech. Laryngoscope. 57, 57–89.
Blackwell KL, Oyler RF, Seyfried DN. (1991) A clinical comparison Jerger J, Jerger S. (1971) Diagnostic significance of PB word func-
of Grason Stadler inserts earphones and TDH-50P standard tions. Arch Otolaryngol. 93, 573–580.
earphones. Ear Hear. 12, 361–362. Jerger J, Jerger S, Ainsworth J, Caram P. (1966) Recovery of audi-
Chaiklin JB. (1959) The relation among three selected auditory tory function after surgical removal of cerebellar tumor.
speech thresholds. J Speech Hear Res. 2, 237–243. J Speech Hear Disord. 31, 377–382.
Chaiklin JB. (1967) Interaural attenuation and cross-hearing in Kaplan H, Gladstone VS, Lloyd LL. (1993) Audiometric Inter-
air-conduction audiometry. J Aud Res. 7, 413–424. pretation. 2nd ed. Needham Heights, MA: Allyn & Bacon.
Coles RRA, Priede VM. (1970) On the misdiagnosis resulting from Killion MC. (1984) New insert earphones for audiometry. Hear
incorrect use of masking. J Laryngol Otol. 84, 41–63. Instrum. 35, 28, 46.
Coles RRA, Priede VM. (1975) Masking of the non-test ear in Killion MC, Villchur E. (1989) Comments on “Earphones in audi-
speech audiometry. J Laryngol Otol. 89, 217–226. ometry” [Zwislocki et al., J. Acoust. Soc. Am. 83, 1688–1689].
Dean MS, Martin FN. (2000) Insert earphone depth and the occlu- J Acoust Soc Am. 85, 1775–1778.
sion effect. Am J Audiol. 9, 131–134. Killion MC, Wilber LA, Gudmundsen GI. (1985) Insert earphones
Dirks DD. (1994) Bone-conduction threshold testing. In: Katz J, for more interaural attenuation. Hear Instrum. 36, 34, 36.
ed. Handbook of Clinical Audiology. 4th ed. Baltimore, MD: Konkle DF, Berry GA. (1983) Masking in speech audiometry. In:
Williams & Wilkins; pp 132–146. Konkle DF, Rintelmann WF, eds. Principles of Speech Audiom-
Dirks DD, Malmquist C. (1964) Changes in bone-conduction etry. Baltimore, MD: University Park Press; pp 285–319.
thresholds produced by masking in the non-test ear. J Speech Lidén G, Nilsson G, Anderson H. (1959) Masking in clinical audi-
Hear Res. 7, 271–278. ometry. Acta Otolaryngol. 50, 125–136.
Dirks DD, Swindeman JG. (1967) The variability of occluded and Littler TS, Knight JJ, Strange PH. (1952) Hearing by bone conduc-
unoccluded bone-conduction thresholds. J Speech Hear Res. tion and the use of bone-conduction hearing aids. Proc R Soc
10, 232–249. Med. 45, 783–790.
Dunn HK, White SD. (1940) Statistical measurements on conver- Martin FN. (1966) Speech audiometry and clinical masking. J Aud
sational speech. J Acoust Soc Am. 11, 278–288. Res. 6, 199–203.
E-A-R Auditory Systems. (1997) Instructions for the Use of E-A- Martin FN. (1967) A simplified method for clinical masking. J Aud
RTONE 3 A Insert Earphones. Revised ed. Indianapolis, IN: Res. 7, 59–62.
E-A-R Auditory Systems. Martin FN. (1974) Minimum effective masking levels in threshold
E-A-R Auditory Systems. (2000a) Instructions for the Use of E-A-RTONE audiometry. J Speech Hear Disord. 39, 280–285.
5 A Insert Earphones. Indianapolis, IN: E-A-R Auditory Systems. Martin FN. (1980) The masking plateau revisited. Ear Hear. 1,
E-A-R Auditory Systems. (2000b) Introducing the New E-A-RTONE® 112–116.
5 A Insert Earphone [Brochure]. Indianapolis, IN: E-A-R Auditory Martin FN, Armstrong TW, Champlin CA. (1994) A survey
Systems. of audiological practices in the United States in 1992. Am
Elpern BS, Naunton RF. (1963) The stability of the occlusion effect. J Audiol. 3, 20–26.
Arch Otolaryngol. 77, 376–382. Martin FN, Bailey HAT, Pappas JJ. (1965) The effect of central
Feldman AS. (1963) Maximum air-conduction hearing loss. masking on threshold for speech. J Aud Res. 5, 293–296.
J Speech Hear Disord. 6, 157–163. Martin FN, Blythe ME. (1977) On the cross hearing of spondaic
Fletcher H. (1940) Auditory patterns. Rev Mod Phys. 12, 47–65. words. J Aud Res. 17, 221–224.
Frank T, Vavrek MJ. (1992) Reference threshold levels for an ER-3 Martin FN, Butler EC, Burns P. (1974) Audiometric Bing test for
A insert earphone. J Am Acad Audiol. 3, 51–58. determination of minimum masking levels for bone conduc-
Gelfand SA. (2009) Essentials of Audiology. 3rd ed. New York: tion tests. J Speech Hear Disord. 39, 148–152.
Thieme Medical Publishers, Inc. Martin FN, Champlin CA, Chambers JA. (1998) Seventh survey of
Goldstein DP, Hayes CS. (1965) The occlusion effect in bone-con- audiometric practices in the United States. J Am Acad Audiol.
duction hearing. J Speech Hear Res. 8, 137–148. 9, 95–104.
Hannley M. (1986) Basic Principles of Auditory Assessment. Need- Martin FN, DiGiovanni D. (1979) Central masking effects on
ham Heights, MA: Allyn & Bacon. spondee threshold as a function of masker sensation level
Hawkins JE, Stevens SS. (1950) Masking of pure tones and of and masker sound pressure level. J Am Audiol Soc. 4, 141–
speech by white noise. J Acoust Soc Am. 22, 6–13. 146.
CHAPTER 6 • Clinical Masking 111
Martin FN, Morris LJ. (1989) Current audiologic practices in the Thurlow WR, Silverman SR, Davis H, Walsh TE. (1948) A statisti-
United States. Hear J. 42, 25–44. cal study of auditory tests in relation to the fenestration opera-
Naunton RF. (1960) A masking dilemma in bilateral conduction tion. Laryngoscope. 58, 43–66.
deafness. Arch Otolaryngol. 72, 753–757. Tillman TW, Olsen WO. (1973) Speech audiometry. In: Jerger J,
Olsen WO, Matkin ND. (1991) Speech audiometry. In: Rintelmann ed. Modern Developments in Audiology. 2nd ed. New York:
WR, ed. Hearing Assessment. Needham Heights, MA: Allyn & Academic Press; pp 37–74.
Bacon; pp 39–140. Tonndorf J. (1968) A new concept of bone conduction. Arch
Owens E, Schubert ED. (1977) Development of the California Otolaryngol. 87, 49–54.
Consonant Test. J Speech Hear Res. 20, 463–474. Tonndorf J. (1972) Bone conduction. In: Tobias JV, ed. Foundations
Sanders JW. (1972) Masking. In: Katz J, ed. Handbook of Clinical of Modern Auditory Theory. Vol II. New York: Academic Press;
Audiology. Baltimore, MD: Williams & Wilkins; pp 111–142. pp 197–237.
Sanders JW. (1991) Clinical masking. In: Rintelmann WF, ed. Hearing Turner RG. (2004a) Masking redux I: An optimized masking
Assessment. Needham Heights, MA: Allyn & Bacon; pp 141–178. method. J Am Acad Audiol. 15, 17–28.
Sanders JW, Rintelmann WF. (1964) Masking in audiometry. Turner RG. (2004b) Masking redux II: A recommended masking
Arch Otolaryngol. 80, 541–556. protocol. J Am Acad Audiol. 15, 29–46.
Schwartz DM, Surr R. (1979) Three experiments on the California Van Campen LE, Sammeth CA, Peek BF. (1990) Interaural attenua-
Consonant Test. J Speech Hear Disord. 64, 61–72. tion using Etymotic ER-3 A insert earphones in auditory brain
Silman S, Silverman CA. (1991) Auditory Diagnosis: Principles and stem response testing. Ear Hear. 11, 66–69.
Applications. San Diego, CA: Academic Press. Wegel RL, Lane GI. (1924) The auditory masking of one pure tone
Sklare DA, Denenberg LJ. (1987) Interaural attenuation for Tube- by another and its probable relation to the dynamics of the
phone insert earphones. Ear Hear. 8, 298–300. inner ear. Phys Rev. 23, 266–285.
Smith BL, Markides A. (1981) Interaural attenuation for pure Wilson R, Morgan D, Dirks D. (1973) A proposed SRT proce-
tones and speech. Br J Audiol. 15, 49–54. dure and its statistical precedent. J Speech Hear Disord. 38,
Snyder JM. (1973) Interaural attenuation characteristics in audi- 184–191.
ometry. Laryngoscope. 73, 1847–1855. Yacullo WS. (1996) Clinical Masking Procedures. Boston, MA: Allyn
Studebaker GA. (1962) On masking in bone-conduction testing. & Bacon.
J Speech Hear Res. 5, 215–227. Yacullo WS. (1999) Clinical masking in speech audiometry: A
Studebaker GA. (1964) Clinical masking of air- and bone- simplified approach. Am J Audiol. 8, 106–116.
conducted stimuli. J Speech Hear Disord. 29, 23–35. Yacullo WS. (2004) Clinical masking. In: Kent RD, ed. The MIT
Studebaker GA. (1967a) Clinical masking of the non-test ear. Encyclopedia of Communication Disorders. Cambridge, MA:
J Speech Hear Disord. 32, 360–371. MIT Press; pp 500–504.
Studebaker GA. (1967b) Intertest variability and the air-bone gap. Zwislocki J. (1953) Acoustic attenuation between the ears. J Acoust
J Speech Hear Disord. 32, 82–86. Soc Am. 25, 752–759.
Studebaker GA. (1979) Clinical masking. In: Rintelmann WF, ed. Zwislocki J, Kruger B, Miller JD, Niemoeller AR, Shaw EA, Stude-
Hearing Assessment. Baltimore, MD: University Park Press; baker G. (1988) Earphones in audiometry. J Acoust Soc Am. 83,
pp 51–100. 1688–1689.
C H A P T ER 7
Case History
Douglas L. Beck
113
114 SECTION I • Basic Tests and Procedures
For example, a patient presenting with a fluctuating little loud. The same thing happens with the car radio when
low-frequency sensory/neural hearing loss (SNHL) and we’re driving to the store. When she sets the volume, I just
tinnitus in the same ear, with aural fullness and occasional hear noise and can’t tell anything about what they’re saying.
vertigo, has a low index of suspicion for otosclerosis, but has When I was a boy, I could hear a pin drop from 40 paces.”
a reasonably high index of suspicion for Ménière’s disease.
“I understand. How long have you been playing the TV and
The high index of suspicion for Ménière’s disease would
radio louder than your wife likes it?”
lead us to ask probing questions to note whether the pre-
senting symptomatology is in agreement with a Ménière’s “Let’s see, I started working at the steel fabrication factory
diagnosis or would lead us in another direction, such as an 14 years ago, and my son was born 8 years ago . . . . so yeah,
acoustic neuroma. it’s been at least 8 or 10 years. When I let her set the vol-
The competent professional understands the probabili- ume, I can hear the voices, but I really can’t understand what
ties of certain things occurring and the related signs and they’re saying. That drives me nuts. I told her and I’m telling
symptoms of each. Although Ménière’s disease is a relatively you too, I ain’t gonna wear no hearing aids.”
rare disorder, occurring in less than 1% of the general popu-
Given the information presented in this scenario, one
lation, it is a common diagnosis for patients with the symp-
can make several, reasonable, assumptions. We could assume
toms described earlier. Of course, we do not make a medical
that Mr. Smith has a noise-induced SNHL, likely impacting
diagnosis of a disease. However, if the data come together
4,000 Hz, and because he cannot hear the consonant sounds
with what we know about the patient the audiologist might
(high frequencies), he cannot clearly understand the words spo-
include “Consistent with Ménière’s disease,” in the report.
ken to him. We might also assume that Mr. Smith is not going
Three scenarios follow to illustrate the interview
to wear hearing aids and that there is little we can do to assist.
technique.
However, there are other options and protocols to employ:
SCENARIO ONE “Mr. Smith, have you had a hearing test before?”
Review any/all assembled paperwork (chart, lab notes, test “Not since the Army, back some 20 years ago.”
results, history, etc.) before meeting the patient for the ini- “Do both ears seem about the same, or is one ear better than
tial consultation. Shake hands and greet the patient, their the other?”
spouse, significant other, family, and so on, and always
introduce yourself. This is an amazingly simple protocol, “The left ear is terrible—can’t hear thunder with that one.”
but it is often overlooked, and when it is overlooked, it sets “I see. Do you have any ear pain?”
an unprofessional tone for the rest of the encounter. I usu-
ally say, “Good morning. My name is Dr. Beck, I’m an audi- “None at all. My ears feel fine.”
ologist. Please come in Mr. Smith.” “Okay then. May I take a look?”
After exchanging greetings and after sitting down in the
“Sure, help yourself.”
office, inquire as to why the patient scheduled today’s visit.
At this point, the audiologist has a rather low index of
“Thanks for coming in today Mr. Smith. What brings you
suspicion for a tumor, such as an acoustic neuroma, because
to the office?”
they occur in about 0.00001% of the population, but a higher
Mr. Smith: “I would like a comprehensive audiometric eval- index of suspicion for more likely possibilities, including a
uation to confirm my bilateral sensory/neural noise-induced unilateral sudden sensory/neural loss that went undiagnosed
hearing loss that my otolaryngologist diagnosed last week. I (or maybe Mr. Smith works with his left ear toward a loud
am very interested in acquiring two digital hearing aids, and machine while wearing hearing protection only in the right
by the way, I am wealthy and do not have insurance. I pay ear, or perhaps he experienced head trauma on the left or an
cash, and money is no object. I want to hear everything as explosion near his left side during boot camp; there are lots of
best I can.” possibilities). The examination of the pinna, concha, ear canal,
and tympanic membranes is normal. The audiologist says,
Because this patient has already been seen and diag-
“Okay, your ears look fine,” and the interview continues to
nosed by the ear, nose, and throat (ENT) specialist, the index
determine which diagnosis has the highest index of suspicion.
of suspicion for some other disease process or a medical/
surgical issue is extremely low. “Mr. Smith, let me make sure I understand . . . the right ear is
the better ear and the left ear has been bad for a long time.
SCENARIO TWO Have you ever had the left ear checked?”
Mr. Smith: “Well doc, you know how it is. My wife always “Yes. I had the doctor look at it a year or two ago when it
complains I have the TV up too loud and it drives her outta went bad. He put me on antibiotics and that was the end of
the room. I like to be able to hear the darn thing so I keep it a it. It didn’t get better though, so I left it alone.
CHAPTER 7 • Case History 115
“Okay. What about drainage, anything coming out of so yeah, it’s been at least 8 or 10 years. When my wife sets
your ears?” the TV, it sounds like everyone is mumbling; I can hear the
voices, but I really can’t understand what they’re saying.
“No sir.”
That drives me nuts. I told her and I’m telling you too, I
“Any dizziness or spinning sensations?” ain’t gonna wear no hearing aids.”
Given the information presented above, one can
“Not any more. Well, maybe a little. When my left ear was
make several assumptions. We could assume Mr. Smith
going bad, I had some dizziness, but the doctor looked at it
has a noise-induced SNHL, impacting frequencies around
and put me on antibiotics, and the dizziness got better after
4,000 Hz, and because of the reduced amplitude and dis-
a while.”
tortion affecting mostly the high-frequency consonant
“So the dizziness started and the left ear went bad all sounds, he cannot clearly hear the words spoken to him.
about a year or two ago?” We can also be comfortable in thinking that Mr. Smith is
not going to wear hearing aids, which reduces what we can
“That’s right.”
do to assist him. However, there are other options and pro-
“Okay, very good. Are you on any medications?” tocols to explore.
“Just a cholesterol pill and a baby aspirin, that’s about it.” “Mr. Smith, have you had a hearing test before?”
“Okay, and one last thing I’d like to ask you before we do the “Not since the Army, back some 20 years ago.”
hearing test—do you have any ringing or buzzing noises in
your ears?” “Do both ears seem about the same, or is one ear better than
the other?”
“Yeah, the darn left ear can’t hear anything, but it sure makes
a racket. Kinda like a “shhhhh” noise going on in there. “They’re just about the same”
Keeps me up at night sometimes.” “I see. Any ear pain?”
The audiologist does a comprehensive audiomet- “None at all. My ears feel fine.”
ric evaluation and determines the following audiometric
profile: “That’s good. May I take a look?”
Right ear: Normal peripheral hearing. Tympanogram “Sure doc, knock yourself out.”
normal (type A), ipsilateral reflexes within normal lim-
its (WNL). Word recognition score (WRS) = 96%. Speech The pinna, concha, ear canal, and tympanic membranes
reception threshold (SRT) = 15 dB HL. are normal in appearance. The audiologist says, “Your ears
Left ear: Flat 85 dB sensory/neural (SN) loss. Tympano- look fine,” and the interview continues.
gram normal (type A), ipsilateral reflexes absent @105 dB “Okay, what about drainage? Is there anything coming out
stimulus level. WRS = 8%, SRT = SAT (speech awareness of your ears?”
threshold used because speech understanding was extremely
poor) = 80 dB HL. “No sir.”
The index of suspicion for a left retrocochlear disorder
“Any dizziness or spinning sensations”
is very high at this point. The case history supports this pos-
sibility, and the test results indicate a possible retrocochlear “Nope.”
diagnosis for the left ear.
“Very good. Are you taking any medications?”
The audiologist refers the patient to an otolaryngolo-
gist (preferably an otologist or neurotologist) based on the “Just a cholesterol pill and a baby aspirin, that’s about it.”
high index of suspicion for a retrocochlear hearing loss. The
“The last question I’d like to ask you before we do the
otologist meets with and interviews the patient and refers
hearing test is do you have any ringing or buzzing noises in
the patient for a magnetic resonance imaging (MRI) study
your ears?”
with contrast (gadolinium). A 3-cm vestibular schwannoma
(acoustic neuroma) is diagnosed. Mr. Smith is scheduled “Yeah . . . maybe a little when it’s really quiet, nothing that
for surgery 3 weeks later, and the tumor is removed via the really bothers me though.”
translabyrinthine approach.
The audiologist does a comprehensive audiometric eval-
uation and determines the following audiometric profile:
Right ear: Moderate high-frequency sensory/neural
SCENARIO THREE
hearing loss. Tympanogram normal (type A), ipsilateral
Mr. Smith: “Let’s see, I started working at this really noisy reflexes are within normal limits (WNL). WRS = 96%.
factory 14 years ago, and my son was born 8 years ago . . . SRT = 45 dB HL.
116 SECTION I • Basic Tests and Procedures
Left ear: Moderate high-frequency sensory/neural hear- background page are preferred and appreciated. Black print
ing loss. Tympanogram normal (type A), ipsilateral reflexes on a white background will be the easiest to read. Another
are WNL. WRS = 92%. SRT = 45 dB HL. important consideration is to use and/or design question-
naires that are easily assessed and tabulated, so the profes-
“Mr. Smith, I’d like to review the results of today’s tests with
sional can scan the page to find the “positive” results, which
you. Would you like to have your wife join us while I review
will need to be considered.
the results?”
In 2005, the Centers for Medicare and Medicaid Ser-
“Sure, that would be great. She’s in the waiting room.” vices (CMS) added a new benefit under Medicare Part B
that will likely increase the quantity of pencil and paper–
“Hi Mrs. Smith, please join us while I review the results of today’s
based hearing and balance screenings offered to patients.
examination. This way, the two of you will have the chance to
This benefit is “bundled” within the “Welcome to Medicare”
learn about the results, and I can address your questions.”
examination. The examination has seven screening sections
In this third scenario, the index of suspicion for a noise- for physicians, nurses, or nurse practitioners to employ
induced hearing loss is high, and there are no red flags and when addressing new patients. Importantly, the Medicare
no indications of a medical or surgical problem. In essence, rules state that the screening tests must be in the form of
the same patient, in three different scenarios, has three questions or questionnaires and that the selected screening
separate sets of circumstances, each of which are typically tests must be recognized by a national medical professional
revealed through an interview-based case history, which is organization.
more or less driven by the index of suspicion. In addition to a wealth of other tests, the American
Academy of Audiology (AAA) and the American Speech-
Questionnaires Language-Hearing Association (ASHA) have recommended
that the following questionnaire be used for this purpose:
Another very useful and efficient case history tool is the Hearing Handicap Inventory for the Elderly—Screening
health questionnaire. A well-designed questionnaire is Version (HHIE-S; Ventry and Weinstein, 1982). There is
highly focused, simple, takes just a few minutes to fill out, likely to be greater popularity for screening tests. Therefore,
and quickly directs the professional to the area(s) of great- audiologists should be familiar with the above-noted ques-
est concern. Questionnaires regarding hearing health care tionnaires and their format, scoring, and importance.
can be presented to patients verbally or written. Written
questionnaires are available in electronic and paper-based Subjective, Objective,
formats.
However, it is my personal preference to not have
Assessment, and Plan
patients fill in downloadable, lengthy questionnaires at Another way to gather useful case history information
home. It is terrifically advantageous for the audiologist to quickly is to use the standard subjective, objective, assess-
spend the time working through a well-designed question- ment, and plan (SOAP) format. The SOAP format is
naire with the patient, to establish rapport and trust and essentially a “medical model” case history–gathering tool.
to allow the patient to tell their story. We learn much more There are many variations on the SOAP format used by
about the patient and their situation when we put in the time clinics, medical schools, and, often, military healthcare
to ask questions and listen to the patient and then write chart facilities.
notes reflecting that conversation. Time spent asking ques- Critics believe the SOAP format is impersonal and does
tions and listening to and then questioning and clarifying not recognize the patient as a whole person. In addition,
their response is time well spent. the SOAP format tends to treat the person as if he or she
was the disease/disorder/problem, and it calls for the use of
VERBAL PRESENTATIONS jargon and related abbreviations. Although jargon is com-
monly used in health professions, it can vary from location
Remember, if you are giving a patient a verbal presentation to location, and it may be nearly impossible for many well-
of a hearing questionnaire, there is already a reasonable educated colleagues to interpret. In the following examples,
index of suspicion for hearing loss. Therefore, sit about 3 ft abbreviations will be used along with their explanations,
away from the patient in a well-lit room, face the patient, be which will immediately follow in parenthesis.
sure there is no background noise or visual distractions, and
maintain the patient’s full attention. SUBJECTIVE
The subjective section provides a brief subjective history,
PENCIL AND PAPER PRESENTATIONS often focusing on the CC as well as other clinical observa-
Keep in mind that, because the majority of patients seen by tions. The patient’s medical and audiology history would be
audiologists are over 55 years of age, large font, dark print, placed in this section. Other entries in this section would
and maximal contrast between the printed words and the be notes the patient/relative/friends offer regarding pain or
CHAPTER 7 • Case History 117
Diagnostic Audiology
119
120 SECTION I • Basic Tests and Procedures
TA B L E 8 . 2
test results to diagnose ANSD: “. . . the presence of a [cochlear recommend that everyone have tympanometry for the
microphonic] or reversing waves at the beginning of the reasons previously discussed). We think that a nonorganic
trace does NOT make a diagnosis of ANSD . . . without the component is more likely to be present during subjective
cross-check of middle-ear muscle reflexes (MEMR), OAEs, testing and may not be discovered until cross-checked with
and an ABR latency-intensity function” (p. 32). For further objective tests. For these reasons, we recommend objective
information about these tests, the reader is referred to the testing first. A suggested testing order is shown in Figure 8.1.
chapters that discuss electrophysiological tests in the text. Nevertheless, it should be noted that some audiologists
Table 8.2 summarizes many of the cross-check tests that are advocate giving puretone and speech tests first when the
used in audiology. patient may be more alert and can actively respond to these
tests and then relax during the objective tests.
Order of Tests Administered
Beyond the Test Battery
Although we acknowledge that there is considerable vari-
ability in test protocols across clinics, we recommend that Although the use of a test battery is important, it is also
testing begin with the objective tests unless contraindi- vital for the audiologist to remember the case history and
cated. At least two major advantages can be found for test- the patient complaints. In some ways, one may consider this
ing objective measures first. The first advantage is that the patient-reported information to be a cross-check of the test
audiologist will have a good idea of the type and degree of battery itself. The case studies presented below demonstrate
hearing loss before beginning the subjective tests. The sec- examples of diagnostic audiology in action.
ond advantage is the reduced risk for misdiagnosis of dis-
orders such as ANSD and vestibular schwannoma, as well
as failure to detect a patient who is malingering. One caveat
CASE STUDIES
needs to be discussed with conducting objective tests first. The importance of objective testing in conjunction with
With rising costs of health care we need to be cautious that subjective tests can be seen through the use of case exam-
we are doing tests that are necessary. It is possible that, if ples. The following cases are a range of examples that high-
the patient’s hearing is completely normal (with no listen- light the use and benefit of incorporating the cross-check
ing complaints) or the patient is profoundly deaf (with principle into your clinical practice. The cases will be pre-
previous documentation to support the initial diagnosis), sented with a brief history and test results. A discussion
tests such as OAEs and acoustic reflexes will be unlikely to of potential difficulties and challenges in interpreting the
add further information about the patient (but we would audiologic data is incorporated within each case. Although
122 SECTION I • Basic Tests and Procedures
Otoacoustic emissions
Case 1
CASE HISTORY
Air conduction
Mr. Ang Kim, age 36, is being seen today after he failed the
hearing screening at his company’s health fair. His medical
SRT history is generally unremarkable, though he reports that
he is just getting over a sinus infection and recently under-
Subjective went surgery for a slipped disc in his back. You have back-
tests Bone conduction to-back patients today and because there is nothing remark-
able in his history you decide to do a quick audiogram and
send him on his way. Results from otoscopy, puretone, and
WRS speech audiometry are shown in Table 8.3 and Figure 8.2.
With subjective information alone this audiogram
could indicate many things. For example, you may inac-
Speech in noise
curately diagnose Mr. Kim with a collapsed ear canal, an
FIGURE 8.1 A suggested comprehensive diagnostic
impacted cerumen plug, or a perforated tympanic mem-
test battery. brane without additional tests to cross check your findings.
0 Reliability Good
Results:
Right Left
20 PTA 28 dB HL 5 dB HL
SRT 25 dB HL 5 dB HL
HL HL
60
80
100
TA B L E 8 . 4 TA B LE 8 . 6
TABLE 8.5
Despite your busy schedule, you decide you need more and puretone and speech audiometry results are shown in
information to make an accurate diagnosis, so you perform Figure 8.3.
objective testing to cross check your subjective results. The If you decide not to proceed with further tests to cross
results from immittance testing and OAE testing are shown check your results, you might diagnose this patient with
in Tables 8.4–8.6. normal hearing in the right ear and a mild conductive hear-
With this information, you have several different tests ing loss in the left ear. You might then refer Mrs. Jones to
to confirm your finding of a conductive hearing loss. The an Ear Nose and Throat physician who would order more
Type B tympanogram in the right ear reveals normal ear tests.
canal volume but no mobility. The normal ear canal volume Instead, you decide to proceed and include additional
suggests that the TM is not perforated and there is no ceru- tests in your battery that would provide a cross-check. We
men plug. The pattern of the ARTs is consistent with a right will review those results next (see Tables 8.8–8.10).
conductive pathology. TEOAEs in the right ear are absent These results suggest that Mrs. Jones has normal hear-
which is expected with a conductive pathology. ing that contradicts your puretone findings. Normal results
The combination of the subjective and objective test on tympanometry, ARTs, and TEOAEs are not consistent
results correctly leads you to suspect otitis media with effu- with a mild conductive hearing loss. With this information
sion and would require a referral for Mr. Kim to a physician. you review the patient’s case history and puretone findings
In this case, you are able to make an appropriate referral again and realize that the apparent conductive hearing loss
based on the information you obtained from a test battery in the right ear is likely the result of a collapsing ear canal. It
incorporating both objective and subjective measures. is not uncommon for the pressure of the supra-aural head-
phones to cause the canal to collapse, particularly in older
patients for whom the cartilage supporting the ear canal is
Case 2 soft. To confirm this finding you decide to retest Mrs. Jones
with insert earphones. When you repeat your audiogram
CASE HISTORY using the insert earphones, you measure Mrs. Jones’s right-
Mrs. Edith Jones, age 77, is being seen today for a hearing ear air-conduction thresholds at 5 or 10 dB HL for all fre-
test. She does not perceive a listening difficulty but her hus- quencies tested. You are able to report to Mrs. Jones that her
band was recently fit with hearing aids and insisted she have hearing appears to be normal!
her hearing checked too. Her medical history is remarkable Both cases 1 and 2 highlight the importance of using
for high blood pressure and type 2 diabetes which are both objective test results in conjunction with subjective test
controlled by medication. results to avoid misdiagnosis. Both audiograms revealed
You conduct a basic audiometric evaluation on similar test results but very different actual diagnoses, which
Mrs. Jones. Results for otoscopy are displayed in Table 8.7 were only confirmed with the use of objective testing.
124 SECTION I • Basic Tests and Procedures
0 Reliability Good
Results:
Right Left
20 PTA 28 dB HL 5 dB HL
SRT 30 dB HL 10 dB HL
HL HL
60
80
100
TA B L E 8 .7 TA B LE 8 . 8
TA B L E 8 . 9
TABL E 8.10
TA B L E 8 .1 1 TA B LE 8 .1 2
TA B L E 8 .1 3
0 Reliability Good
Results:
Right Left
20 PTA 10 dB HL 90 dB HL
SRT 10 dB HL 75 dB HL
HL HL
60
80
100
TABL E 8.14
TA B L E 8 .1 5 TA B LE 8 .1 6
0 Reliability Good
Results:
Right Left
20 PTA 33 dB HL 35 dB HL
SRT 30 dB HL 35 dB HL
60
80
100
TA B L E 8 .1 7
TABL E 8.18
0 Reliability Good
Results:
Right Left
20 PTA 28 dB HL 18 dB HL
SRT 30 dB HL 20 dB HL
HL HL
10% @ 85 dB
60 HL
80
100
Ms. Jackson’s might result in delayed or inappropriate reha- ing. The patient’s asymmetry in high-frequency audiometric
bilitation recommendations. Unfortunately, ANSD can be thresholds and poor WRS in the right ear combined with
missed easily in clinical practice if the audiologist does not the objective results suggest a retrocochlear pathology and
perform a thorough test battery (Smart and Kelly, 2008). warrant an ENT referral for additional testing. The patient’s
report of unilateral tinnitus, the abnormal ART pattern, the
asymmetry in puretone thresholds, and the apparent rollover
Case 5 in the patient’s right-ear word recognition are all suggestive
CASE HISTORY of retrocochlear dysfunction. Taken in isolation, each might
be sufficient for you to make a referral for a medical evalu-
Mr. Don Warner, age 58, is being seen today with his pri- ation. However, having consistent results from several tests
mary complaint being a constant ringing in his right ear. He allows you to be more confident in your recommendation
notes that the ringing has been present off and on for over and provide the ENT with as much information as possible.
3 years but it has become more bothersome recently. When
asked about his hearing he admits that he has difficulty
understanding what others are saying in noisy places. He Case 6
denies aural fullness and dizziness. He plays tennis regularly
CASE HISTORY
and generally is in great health. Results from your testing are
shown in Tables 8.19–8.22 and Figure 8.6. Mr. José Gonzalez, age 24, was seen today for an audiologic
The order of test administration is important. Because evaluation. He was just hired as a clerk for a federal judge and
you performed immittance testing and TEOAEs first, you therefore has to undergo a rigorous physical examination,
knew that the patient’s tympanic membrane is mobile, that including a hearing test. Mr. Gonzalez denies any hearing dif-
the ART pattern is abnormal in the right ear, and that the ficulties, tinnitus, dizziness, or aural fullness. He reports that
outer hair cells of the right ear do not appear to be function- he is in great health and is currently training for a marathon.
ing normally. You were able to obtain this information before
the patient provided any information through subjective test-
TA B LE 8 . 20
TA B L E 8 .1 9
Tympanometry Results (226-Hz Probe
Tone) for Case 5
Otoscopy Findings for Case 5 Right Left
Right Ear Left Ear ECV 1.6 mL 1.8 mL
Clear ear canal; intact Clear ear canal; intact Compliance 0.7 mL 0.9 mL
tympanic membrane tympanic membrane Middle-ear pressure 0 daPa −10 daPa
CHAPTER 8 • Diagnostic Audiology 129
TA B L E 8 . 2 1
TABL E 8.2 2
0 Reliability ?
Results:
Right Left
20 PTA 10 dB HL NR
SRT 5 dB HL NR
HL
60
80
100
TABLE 8.24
TA B L E 8 . 2 5
TABL E 8.2 6
severe-to-profound hearing loss also have abnormalities of With respect to the audiologic test battery, disagree-
vision (American Academy of Pediatrics, 2007). ment among objective and subjective test measures which
cannot be resolved as tester, equipment, or patient error is
Referral for Medical Otolaryngologic indicative of need for medical referral. Abnormally poor
Evaluation speech scores relative to the audiogram, asymmetric hearing
loss, and reports of aural fullness and/or tinnitus are other
The most common referral made by audiologists is to a signs of possible serious ear disease which should be evalu-
medical doctor. Sending a patient to an otolaryngologist, ated by a physician.
primary care physician, or pediatrician is indicated if the
audiologic evaluation reveals evidence of an underlying Referral for Auditory Processing
medical condition. Symptoms may include ear pain, bleed-
ing or drainage from the ear (otorrhea), tympanometric
Evaluation
abnormality without known etiology, or physical abnormal- Disagreement between objective and subjective hearing tests
ity observed during otoscopy. Patients who report frequent may be reason to refer a patient for an evaluation of audi-
ear infections, fluctuating or sudden hearing loss, or bal- tory processing. Patients with apparently normal peripheral
ance disturbance should also be seen by a medical profes- auditory function may still have difficulty processing com-
sional (see Table 8.27). Newly identified hearing loss is also plex signals such as speech. These individuals often report
reason for referral. Although some audiologists undertake that they can hear well, but have difficulty understanding
cerumen management in their own practice, many others what others are saying, particularly in the presence of noise.
prefer to refer to an otolaryngologist or the patient’s pri- Tests of speech perception in noise such as the Bamford–
mary care physician for removal of impacted wax. Children Kowal–Bench Speech-in-Noise Test (BKB-SIN; Etymo-tic
who exhibit a previously undiagnosed hearing loss or who Research, 2005), Quick Speech-in-Noise Test (QuickSIN;
exhibit delays in speech or language development should be Etymo-tic Research, 2001), and Hearing in Noise Test (HINT;
seen by a pediatric otolaryngologist or developmental pedi- Nilsson et al., 1994) may help to confirm this difficulty. If
atrician prior to any audiologic m