You are on page 1of 12

ORIGINAL ARTICLES

Quality of dental care: development of standards

Howard Bailit, DMD, PhD o f evaluation m ust be reasonable in terms o f time


Meni Koslowsky, PhD and m oney. Fourth, the basic dentist-patient
relationship must be kept intact.
Joseph G rasso, MS, DDS, Farmington, Conn
With regard to the first factor, the A m erican
Stanley Holzman, DDS
D ental A ssociation and state associations have
Robert Levine, MS, DDS, West Hartford, Conn influenced, to som e degree, the evaluation pro­
Paula Valluzzo, BS cedures o f third parties. In general, how ever,
m ost insured programs function independently
Paula Atwood, BA, Farmington, Conn
o f organized dentistry. With the expected in­
crease in prepaid dental care, the dental profes­
sion might consider the formation o f founda­
Standards of dental care were developed by a
tions that contract with third parties to monitor
group of nine private general dental practitioners
the cost and quality o f dental care in a given state
and dental school faculty members. Five assessors
or area. A precedent for this type o f organiza­
were trained to use the standards that were then
tion has already been established in m edicine.1
tested for reliability, variability, validity, and prac­
A t the national level, dentists m ust have som e
ticality on a sample of 47 patients. The results
role in determining P SR O policy; this organiza­
showed high reliability for most items and moder­
tion will probably be responsible for evaluating
ate variability and validity. The time necessary to
the quality o f dental services purchased with fed­
train assessors and evaluate individual patients
eral and state funds.
was reasonable. The potential use of this quality
In terms o f objectivity, one o f the bases for an
assessment system is discussed in terms of contin­
evaluation system m ust be reliable criteria o f
uing education, peer review committees, and den­
tal program evaluation.
care; that is, similar judgm ents m ust be made by
tw o or more assessors w ho independently evalu­
ate the sam e patient. F or reliability to be
achieved, written criteria m ust be tested under
controlled conditions. Som e progress has been
R ecen t passage o f legislation mandating the de­ made in this area, notably the work o f Abram-
velopm ent o f professional standards review or­ ow itz,2 Friedm an,3 Schonfeld and co-w orkers,4
ganizations and the increasing involvem ent o f S oricelli,5 and C o n s,6 although much remains to
insurance com panies in monitoring the quality o f be done.
care make it apparent that providers o f dental The cost m ust also be reasonable. M ethods o f
care eventually w ill be involved in som e form o f evaluation m ust not rely solely on the use o f den­
organized review system . A major challenge fac­ tists, the m ost expensive form o f dental man­
ing organized dentistry is to ensure that the sy s­ power. Through the u se o f com puters and non­
tem is acceptable to both the dentists and the professional raters, the review system can min­
public. im ize the tim e practicing dentists must spend
M any factors might affect the profession’s away from direct patient care.
support o f a particular evaluation system . First, Finally, although the evaluation system may
the dental profession must have a significant role have som e effect on dentist-patient relationships,
in developing the system . Second, the m ethod the mutual respect and trust that exist betw een
o f evaluation m ust be objective. Third, the cost m ost dentists and patients must be maintained.

842 ■ JADA, Vol. 89, October 1974


T his brief review is a general background to som e lim itations. F or exam ple, if the patient is
the present paper on an experim ental quality not conscientious about oral hygiene, the best
review system for general dentistry, developed dentistry may deteriorate.
jointly by a group o f com m unity practitioners T h e second dim ension o f quality is the process
and dental school faculty, the latter serving o f care. Regardless o f the outcom e o f care, did
mainly as technical advisers. T his paper d e­ the dentist plan the proper treatment for the pa­
scribes the process o f developm ent and the test­ tient and w as the treatment adequately carried
ing o f the criteria. E xcept for exam ples, the ac­ out? Criteria must be available for com parison to
tual criteria are not presented. make judgm ents on the process o f care.
T h e structural dim ension o f care focu ses on
■ Fram ew ork fo r evaluating quality: F or a the setting in which care takes place. A sse ss­
com m on fram e o f reference, it is necessary to ment can be made o f such things as physical fa­
discuss briefly the concept o f quality and the cilities and equipm ent, qualifications o f pro­
various dim ensions o f quality that can be eval­ viders, and administration o f the p ractice.11 It
uated. A com m only accepted definition o f quality is assum ed that if the structure is adequate, then
is the one o f L ee and J on es7: “ G ood medical the process o f care is likely to be acceptable.
care is the kind o f medicine practiced and taught There is a relationship among the three dimen­
by the recognized leaders o f the m edical profes­ sions o f care; though one does not necessarily
sion at a given tim e or period o f social, cultural, cause the other, they are related “ as links in a
and professional developm ent in a com m unity or ch ain .” 10 Within a proper structure, dentists can
population group.” deliver adequate care that should result in healthy
Several aspects o f this definition need em ­ oral tissues. T he present project is concerned
phasis. First, it is clear that the definition o f qual­ with all three aspects o f quality and especially
ity depends on the professional leaders in a given the interrelationships am ong them; how ever,
com m unity, and although there will be general this paper is directed to the developm ent and
agreem ent on som e standards in all com m unities, testing o f standards for the “ process o f care.”
individual differences will exist. T h ese regional
differences should be incorporated into any na­
tional standards.
A second point is that standards are not con­ Methods
stant over tim e. Even within a single com m unity,
standards o f care will change as n ew biologic or
technical discoveries are made in the diagnosis ■ Development o f criteria and standards: Since
and treatm ent o f disease. T his is also true for the evaluation o f the process o f care depends on
changes in social and cultural values that often criteria developed by leaders in the dental com ­
define what society and the profession consider m unity, the first step was to select a standards
abnormal conditions requiring treatment. com m ittee. On the advice o f the Hartford (Conn)
W ith this definition in mind, there are several D ental S o ciety ’s execu tive com m ittee, a nine-
general conceptual m odels for the m easurem ent mem ber com m ittee was formed: one dentist was
o f the quality o f care, including th ose o f F ried­ a m em ber o f the C onnecticut State Board o f
m an,3 R oem er,8 and S chon feld.9 A ll have many D ental Exam iners, tw o w ere on the executive
points in com m on, but the m odel o f D onabedian10 com m ittee, four w ere active members o f the so­
has received w ide acceptance and will be used in ciety, and tw o w ere U niversity o f C onnecticut
this study. dental school faculty in the department o f gen­
Donabedian proposed that the quality o f care eral dentistry. T hey all considered them selves to
can be evaluated from essentially three different be general dental practitioners and had at least
but related aspects: the structure, the p rocess, seven years’ experience in practice. In addition,
and the outcom e o f care. T he outcom e o f receiv­ the com m unity m embers o f the standards com ­
ing dental service is usually m easured by the mittee w ere generally w ell known am ong their
D M F , periodontal, and other morbidity indexes. peers, and it was assum ed that criteria and stan­
T he outcom e o f care is often considered the m ost dards developed by this group would be accept­
important m easure o f quality, but sin ce many able to local dentists.
factors other than dental care affect the dental A t the first m eeting o f the com m ittee a deci­
health o f the patient, outcom e m easures have sion was made to try to reach unanimity when­

Bailit—others: QUALITY OF DENTAL CARE ■ 843


ever possible on criteria, but in areas o f contro­ dontic or crown and bridge treatm ent w ould be
versy a majority vote w ould be binding. A lso , it comparable. Since there is little evid en ce about
was realized that the com m ittee w as rather large the relationships in quality among different treat­
and a subcom m ittee w ould be needed to do m ost ment areas, limiting the criteria to four or five
o f the staff work. T he subcom m ittee consisted conditions did not seem appropriate. Rather,
o f one dental school faculty m em ber and tw o the com m ittee decided to prepare criteria and
com m unity practitioners. standards for the com m on conditions that gen­
c r i t e r i a o f c a r e . A n exten sive bibliography eral dental practitioners treat. T hen, after eval­
w as available on evaluation procedures and cri­ uation o f the standards clinically, it w ould be
teria used in other investigations. From a general possible to determine the feasibility o f a m ore
perspective, the work o f P eterson and co-w ork­ limited approach.
ers12 on general m edical practitioners w as par­ A third conceptual problem w as the division
ticularly helpful, as was a manual o f dental cri­ o f the process o f care into its com ponents. There
teria and standards developed by Friedm an.3 are m any w ays in which this could be d one, but a
For the m ost part, the actual content o f the cri­ logical method is by the sequence o f events in
teria w as based on the practical experience o f the treatm ent o f the average patient. T hus, the
the clinicians on the com m ittee; reference was first com ponent would be the quality o f the his­
also made to dental textbooks. A nother useful tory and examination; second, the diagnosis;
source, dealing with som e o f the conceptual third, the treatment plan; and fourth, the treat­
problem s that had to be resolved before criteria ment.
and standards w ere developed, w as a book by T he final issue w as the m ethods available to
D on ab ed ian .13 Four o f these conceptual prob­ collect information about patient care. T o som e
lem s d eserve com m ent. extent, the m ethods used w ould define the areas
First w as the question o f whether the criteria o f patient care that could be evaluated. Three
should be norm ative or empirical. Should the cri­ approaches to data collection w ere considered:
teria reflect what dentists should do in practice observation o f the dentist, record audit, and pa­
(normative) or what they actually do (empirical)? tient exam ination. O f th ese three m ethods, ob­
O bviously, these are not necessarily the sam e. servation o f the dentist w hile he w as treating a
T he com m ittee decided that criteria and stan­ patient was discarded as being too exp en sive and
dards should not be unrealistic. “ Ideal dentist­ probably unacceptable to m ost practitioners.
ry” or the “ best possible dentistry” are intan­ T he evaluation o f dental records including radio­
gibles that are probably not definable and that, graphs seem ed a m ore reasonable approach,
in any event, suggest a level o f care that is seldom since all dentists keep records and these records
a ch iev ed .11 T he m ore important issu e is w hether can be evaluated relatively inexpensively.
the care is adequate. T hus, the criteria are nor­ T h e other major source o f data w as the clin­
m ative in that they are oriented to what the av­ ical exam ination o f patients. F or the evaluation
erage dentist should do to provide adequate care. o f the technical quality o f m ost services related
T he second major issu e w as w hether to d evel­ to the teeth per se, patient exam ination should
op criteria for all aspects o f general dentistry or provide an acceptable method because teeth do
for m ore limited parts o f it. Since general dental not repair and a permanent record o f the treat­
practitioners deal with many different typ es o f ment rendered is left. This is not true for ser­
problem s, it w ould be im possible to develop cri­ vices involving the soft tissu es, which can repair
teria for every conceivable situation. without any permanent changes in size, shape,
O ne approach to this conceptual problem that or color.
has recently received attention is the “ tracer c o m p o n e n t s o f c a r e . With th ese general
m ethod .” 14 H ere, criteria are developed for cer­ guidelines in mind, the subcom m ittee started to
tain specific disease entities and providers are develop criteria and standards for each o f the
evaluated with respect to these conditions. It is four com ponents.
assum ed that the level o f care provided for the — H istory and examination: For sim plicity, it
tracer d iseases approxim ates the level o f care was decided to limit the criteria initially to the
for all diseases. exam ination o f a new patient seeking com pre­
In dentistry, the tracer method could mean h en sive care. F or these patients, the history and
evaluation o f only the quality o f amalgam restor­ exam ination should be m ost com plete.
ations, if it is assum ed that the quality o f endo­ F iv e major elem ents w ere identified as n eces­

844 ■ JADA, Vol. 89, October 1974


sary in an adequate history and examination: medical, extraoral tissues, preventive services,
the chief complaint and, if associated with a spe­ restorative services, intraoral soft tissues, perio­
cific problem, a description of the present illness; dontal services, occlusion, and sequence of
a personal history; the past medical history; the treatments (illustration).
past dental history; and the dental examination —Treatment: The assessment of the technical
(including use of diagnostic aides). For adequate quality o f treatment can be done with relative
care to be provided, some description of these objectivity where there is a permanent change
five sections should be found in the dental rec­ in hard tissues such as in restorative dentistry.
ords. The information does not have to appear H owever, the treatment o f tissues such as the
in the categories or sequence suggested. The im­ periodontium that can repair may be impossible
portant issue is whether the appropriate data to evaluate. In a period of several months after
exist somewhere in the record. treatment, the tissues can return to their previous
For each of the five areas, specific criteria state for reasons that may be independent o f the
were developed to assess the adequacy of the dentist’s therapy; the actual therapy cannot be
information (illustration). It became apparent directly evaluated, only the outcome o f care. As
that adequacy involved two aspects—the level a result o f these technical limitations, criteria for
o f detail and its accuracy. Although it is useful assessing periodontal care were not developed.
to determine if the information recorded was ac­ In addition to periodontal services, endodontics
curate, in this report only the level of detail in and oral surgery presented somewhat similar
the record is evaluated. problems so that their assessment was limited to
— Diagnosis: Evaluation of the diagnosis pre­ just a few criteria.
sented several practical problems because, in Most o f the information to assess the technical
many instances, dentists do not specifically list quality of treatment was based on the examina­
diagnoses. In a few situations such as soft tissue tion of patients. For a few areas such as endo­
lesions, specific diagnoses might be in the record, dontics and oral surgery, radiographs and treat­
whereas for most other conditions they are not. ment notes were the main source of data.
A lso, if the treatment plan is done well, it is rea­ A s previously implied, the treatment criteria
sonable to assume that the diagnoses were ade­ are divided in terms of the primary categories of
quate. Therefore, since it is almost impossible to services commonly provided in general dental
obtain direct evidence on diagnoses and since practice. These include restorations, endodon­
the quality o f the diagnoses is covered, to some tics, crowns and fixed bridges, partial dentures,
extent, in the treatment plan assessment, a de­ complete dentures, and oral surgery (illustra­
cision was made not to have a specific section on tion).
the diagnostic phase of care. With the subcommittee doing much of the de­
—Treatment plan: It is necessary to separate velopmental work, the meetings of the standards
the evaluation o f the plan of treatment from the committee proved to be productive. Each set of
actual treatment. The former includes the judg­ criteria was discussed in depth and, although
ments o f the dentist on what should be done for there was often controversy over particular is­
the patient, whereas the latter reflects the tech­ sues, it was seldom necessary to have a formal
nical quality of the treatment provided. This is vote. M ost decisions were reached by consen­
an important issue, since the dentist should not sus, and in six months, a first draft of the criteria
be rated for the same performance in two differ­ was completed.
ent parts of the evaluation.
Information on the treatment plan is elicited ■ Scoring system : Both quantitative and cate­
from the treatment progress notes and from the gorical (qualitative) scoring system s were de­
examination of the patient. The presence of a signed.
specific treatment plan in the record would be of q u a n t i t a t i v e s c o r e . Before the criteria
great value but, except for the more complex could be used in the evaluation of patients, it was
and difficult situations, most dentists probably necessary to develop a reliable scoring system
plan their treatment but do not record it. that was easily understood by the assessors. At
The criteria for the treatment plan assessment first, each criterion was scored on a three-point
are divided into ten categories primarily on the scale: 1, unsatisfactory; 2, adequate; and 3, su­
basis of the information obtained in the history perior. A value of 9 was assigned when no deci­
and examination. The categories are personal, sion could be made; this was usually because of

Bailit— others: QUALITY OF DENTAL CARE ■ 845


Examples of criteria used to assess history and examination, treatment plan, and treatment.

History and examination 1. Alleviation of acute condi­ resist light hand pressure in a
M e d ica l h is to ry : Foran adequate tions (pain, bleeding, or acute infec­ downward direction.
medical history, these data should tion) or the chief complaint. 2. Stability
be in the record: 2. Consultation with or referral For both the maxillary and man­
1. A general description of the to a physician for systemic evalua­ dibular dentures, there should be
patient’s general health, including tion. only slight movement in a plane hor­
past serious illnesses. 3. Placement of the patient on izontal with the ridge, when light
2. In addition to a general state­ antibiotics or other premedications. twisting pressure is placed on the
ment of the patient’s health, specific 4. Institution of primary pre­ denture by the hand.
references should be made to these vention prog rams such as oral hy­ 3. Vertical dimension
conditions: sensitivity to drugs and giene instruction, prophylaxes, The teeth should not come in
other allergies, rheumatic fever, plaque control, or fluoride treat­ contact when the patient talks.
bleeding problems, liverand kidney ments. There should not be excessive
disease, heart disease, diabetes, 5. Control of deep caries that free-way space with overclosure
and pregnancy status (if fem aieo f may cause pulpal exposure. (The when the teeth are in contact.
child-bearing age). sequence of items 6 through 10 is 4. Extension of the flanges
3. The names of the patient’s interchangeable.) The flanges of the denture
personal physicians (if any). 6. Extractions and soft tissue should b e to th e d e p th o fth e m u c o -
4. The date of the patient’s last surgery. buccal folds without displacement
visit to a physician for a physical 7. Treatm ent of teeth endodon- of tissue.
examination. tically. On the lingual aspect of the
5. The medications being 8. Treatm ent of the periodon­ mandibular denture, the flanges
taken by the patient (if any) and the tium. should make contact with the floor
reasons for taking them. 9. Treatm ent of orthodontic of the mouth at rest and should
problems. not dislodge the denture when the
Treatment plan 10. Adjustment of occlusion. tongue is extended to moisten the
S equence o f tre a tm e n t: Inm ost 11. Operative treatm ent of surface of the lower lip.
instances there is an orderly se­ teeth (restorations). 5. Occlusion
quence of priorities in planning a 12. Replacement of teeth pros- There should be bilateral con­
patient’s care. Although some pa­ thetically. tact of all molar teeth.
tients will not require certain treat­ 13. Recall examinations and There should be no movement
ments such as referral to a physi­ maintenance treatment. of denture bases when the teeth are
cian, treatm ent plans should follow in light occlusion.
the general sequence listed here. Treatment With repeated closure, the teeth
As a broad guideline, the patient’s C o m p le te d e n tu re s : The factors should meet without sliding.
chief complaint should be dealt considered in judging complete 6. Placement of posterior teeth
with on th efi rst visit. Of course, if dentures are retention, stability, The buccal cusps of the molars
the complaint concerns the need for vertical dimension, extension of should be placed over the alveolar
dentures or other treatments that flanges, occlusion, placement of ridge in the m andibular denture.
cannot be provided until extensive teeth over ridges, and appearance. 7. Appearance
dental or medical care is completed, 1. Retention The shade of the teeth should
the chief com plaint cannot be re­ Retention in the maxillary den­ blend with the patient’s remaining
sponded to at the beginning of treat­ ture should be sufficient to allow natu ral teeth (if any) in the opposite
ment. the patient to perform the normal arch.
This sequence of treatments is mouth functions of talking, eating, The labial position of the maxil­
recommended: and opening w ithoutdislodgingthe lary anterior teeth should provide
denture. adequate support to the lips.
Retention in the maxillary den­
ture should be strong enough to

846 ■ JADA, Vol. 89, October 1974


lack of information. Preliminary attempts to use five-point categorical (or qualitative) scale was
this system showed that assessors had difficulty devised to provide an overall, rather than an item
making reliable judgments between the second by item, measure o f quality. The assessors gave
and third classes, especially in evaluating the their general impression of the quality of care
treatment plan and treatment. Therefore, for provided after examining the patient and briefly
these latter two components o f care, the scale reviewing the record (treatment notes and radio­
was reduced by removing the third classifica­ graphs). Then, the quality of care was rated by
tion (superior). one of five general categories.
In the rating of the treatment plan and treat­ — Category 1: The dentist has met few of the
ment, a problem arose if the dentist had multiple standards commonly accepted for adequate care.
opportunities to meet specific criteria. For ex­ The treatment is clearly o f the low est quality,
ample, treatment-planning criteria were devel­ technically and judgmentally.
oped on the materials used to restore teeth. Since — Category 2: The dentist has some area
the dentist usually restored more than one tooth where his work is adequate, but overall the level
and often met the standards for some teeth but of care has not reached an acceptable level.
not others, the question arose as to whether the — Category 3: The dentist has achieved ade­
treatment plan in this area was satisfactory. quate care in all phases of the treatment plan and
Arbitrarily, a general rule was established. If treatment.
90% or more of the decisions were correct, then — Category 4: The dentist has done superior
a satisfactory score was assigned. By the same work in most of the major phases o f care. There
token, if less than 90% o f the decisions were cor­ are, however, one or two areas where the work
rect for that item, an unsatisfactory score was is adequate but not superior.
assigned. In practical terms this meant that if a — Category 5: The dentist has provided out­
dentist used the correct materials in nine of ten standing care in all phases o f treatment.
teeth restored for a given patient, a satisfactory The assessor who performed the overall eval­
score was assigned for that criterion. If only five uation was not involved in the detailed (quanti­
restorations were evaluated and one was not ade­ tative) evaluation o f that particular patient.
quate, an unsatisfactory score was assigned. In addition to its use as a validity check, a ma­
One other issue on scoring deserves mention. jor advantage of the categorical approach is the
Many components o f care involve more than one speed with which an assessment can be com­
criterion. A s an example, assessors were asked pleted; it requires about five minutes. Conse­
to evaluate the use of restorative materials. For quently, it is crucial to compare both systems
this item, four of five specific criteria related to and to judge whether One can be used as a substi­
the material’s biological effects on pulpal tissue, tute for the other.
strength, esthetics, and so on are available. The
issue then is scoring the use of restorative ma­ ■ Training o f assessor: Two types o f assessors
terials if three of the four standards are met but were trained. For the history and examination,
one is consistently unmet. It can be argued that two research assistants without any background
in this situation use of materials should receive in dentistry or other health disciplines were
an adequate rating. Y et, from a clinical perspec­ taught to rate the dental records and to abstract
tive, if one important aspect of material use is them for use by the dental assessors.
consistently not achieved, it is likely that this The rationale for using nondental personnel
aspect of the treatment plan is clinically inade­ for this task was that decisions were being made,
quate. A s an example, for anterior teeth a dentist not on the appropriateness of treatment, but
might use a synthetic filling material that has ex­ rather on the amount of detail in the record, and
cellent strength, matches the color of the natural nonprofessional record assessors could do this
tooth, but causes a permanent, adverse pulpal re­ job at less cost than could trained dentists.
action even with a cavity liner or base. The use of Three members o f the standards committee—
materials should be given a low score even two from the community and one from the dental
though most o f the criteria are met. Thus, an school—were trained as assessors to evaluate
“ adequate” score would be assigned to an item if the treatment plan and treatment. Although the
all criteria within that item were met 90% or more assessors were already familiar with the evalua­
of the time. tion system , they were asked to take a test to
c a t e g o r i c a l s c o r e . For the validation o f the examine their didactic knowledge o f the criteria.
quantitative evaluation system just described, a After the assessors had successfully completed

Bailit—others: QUALITY OF DENTAL CARE ■ 847


the test, patients were examined clinically, and sources for care. A lso, attitudinal questions
preliminary data were collected on the criteria were used to assess the patient’s willingness to
and scoring system. It soon became apparent practice good oral hygiene and to pay for need­
that som e o f the criteria were not practical, could ed services.
not be rated reliably, or for some other reason N ext, the patient was then seen by the dental
were not usable. The standards committee met assessor who evaluated the treatment plan and
several times to modify the criteria. treatment. H is sources o f information were the
The usual procedure during the training period abstracted treatment notes, the patient question­
was for two or three patients to be scheduled for naire, and his own clinical examination of the
an afternoon; each patient was rated indepen­ patient. After the first assessor had finished, the
dently by the three assessors. Then, the asses­ patient was seen independently by a second as­
sors met to discuss any differences among them. sessor who proceeded to do the same type of
This quickly led to the assessors’ developing evaluation. Finally, a third clinical assessor eval­
comparable interpretations o f the standards and uated the patient using the qualitative scoring
a better understanding of the entire process of system.
evaluation.
A fter two months o f training and after modifi­ ■ D ata analysis: The process variables were
cations were made in the criteria, a manual was examined in terms of their reliability, variability,
prepared outlining in detail the method of eval­ validity, and practicality. The ideal evaluation
uation. A t this time, two additional assessors system would be highly consistent and stable
were recruited, one a member of the standards (reliable), would highly discriminate between
committee, the other a community practitioner. patients (variable), would be meaningfully inter­
pretable (valid), and easily learned and adminis­
■ Examination o f patients: With both sets of tered (practical).
assessors trained, the criteria were evaluated for — Reliability: Two measures were used to de­
reliability. Approximately 50 patients were sche­ termine interrater reliability. The first was sim­
duled for examination. These participants were ply a descriptive indicator that compared the per­
active dental patients from three sources: the centage o f agreement between any two judges
private practices o f community dentists, the den­ across all items. The value of this measure can
tal school clinic, or the Veterans Administra­ range from 0 (perfect disagreement) to 100% (per­
tion hospital (these patients received their care fect agreement). A chance finding would be 25%
in the private practices o f local dentists). The pa­ in history and examination and 33% in treatment
tients selected were above the age o f 18, had com ­ plan and treatment. H ere, a chance result refers
pleted a course of treatment (were in a main­ to the number of ways two judges can agree, di­
tenance phase of care), and had been treated for vided by the sum o f possible agreements and dis­
more than one condition. In regard to the latter, agreements.
most patients had, as a minimum, received care Another indicator of reliability was the com­
requiring restorative and periodontal therapy. parison o f mean scores (on items 1,2, and 3 only)
A description of the evaluation process fol­ between judges for the same record or patient.
lows. First, before the clinical evaluation, the The t statistic was used to test the null hypothe­
research assistants rated the history and exam­ sis that means for the two judges were equal. A
ination based on the dentist’s records. They then significant value indicated lack of agreement be­
abstracted the record by writing a brief descrip­ tween judges on a particular patient or record.
tion o f the treatment given at each visit in se­ —Variability: Before any practical meaning
quence. This allowed the dental assessor to eval­ can be assigned to a criterion, variance must be
uate the plan o f treatment without wasting time present. An item that everyone agrees on and
trying to organize the material in the original rec­ has been assigned the same value (for example,
ord; it also prevented the dental assessor from 2) does not distinguish between dentists. A s the
knowing the name of the dentist who treated the evaluation of such an item does take some finite
patient. amount o f time, it reduces the efficiency of the
Immediately before the clinical assessment, instrument.
the patient was asked to complete a short ques­ The method used to measure variability was
tionnaire noting the history of dental treatments, the standard deviation. It was calculated for
any outstanding medical problems, and payment each item across all patients; items rated 9, no
848 ■ JADA, Vol. 89, October 1974
Table 1 ■ Percentage of patients assessed by category
judgment, were excluded. The range o f vari­ of evaluation: treatment planning and treatment.
ability scores with this data was 0 to a value
C a te g o ry o f e v a lu a tio n % p a tie n ts
slightly greater than 1. A value o f 0 meant that
T re a tm e n t plan
only one score had been assigned to the item M e d ic a l 48.9
E x tra o ra l 10.6
(that is, either 1, 2, or 3) whereas a value near 1 S o ft tis s u e 2.1
signified that an equal number o f Is, 2s, and 3s P re v e n tiv e
R e s to ra tiv e
84.0
89.4
had been assigned. P e rio d o n ta l 63.8
O c c lu s a l 10.6
—Validity: The validity of an instrument an­ P ro s th e tic s 66.0
swers the question, “ Is the instrument measur­ P e ria p ic a l 21.3
T re a tm e n t
ing what it is supposed to measure?” Three mea­ R e s to ra tiv e 89.4
C row n and b rid g e 53.2
sures of validity are of concern here: content or R e m o v a b le p a rtia l d e n tu re s 21.3
C o m p le te d e n tu re s 12.8
consensual, predictive, and concurrent. The first E n d o d o n tic s 30.0
refers to the appropriateness of the items in the O ral s u rg e ry 12.8

instrument. This criterion was met by having all


items reviewed and approved by the standards thetic services. H owever, few could be evalu­
committee. The second measure investigates the ated for intra- or extra-oral soft tissue lesions,
predictive ability of the items in regard to other occlusal problems, and oral surgery treatment.
variables such as outcome indexes. This assessment could not be made for two main
The only estimate o f validity statistically de­
reasons: either the patient never needed treat­
termined in this paper, concurrent validity, de­
ment (for example, periapical lesion), or if he did,
scribes the relationship between the quantita­
there was not enough information on the extent
tive and qualitative scores. This was accom­
o f the problem or how the dentist handled it (for
plished by calculating the product moment cor­
example, soft tissue lesion). Often, it was impos­
relation between the sum of the treatment plan
sible to distinguish between these two reasons,
and treatment scores (a quantitative measure)
but the end result was the same: no judgment
and the qualitative score assigned to each pa­
could be made.
tient.
— Practicality: Practicality refers to several
■ Reliability: First, so that the reliability o f the
different measures including the length of time
record audit of the history and examination could
required to learn the criteria and to evaluate a
be measured, the audits o f the research assist­
particular patient clinically. These considera­
ants were compared with those of a dentist.
tions are crucial for wide dissemination of the
When it was apparent that there was little differ­
system. ence between the non-dentist and dentist audi­
tors, ten records were chosen at random and
evaluated twice for detail by the two research
Results assistants. The average agreement between as­
sessors for all items was 95%. Similarly, the per­
centage agreement between assessors for each
■ D escription o f sample: The sample consisted item ranged from 80% to 100%. All the demo­
o f 32 men and 15 women, ranging in age from 20 graphic items such as sex, age, and occupation
to 65 with a mean age of 30.6. Since about half were scored identically by both judges. Those
the patients were recruited from the Veterans relating to charting o f caries, periodontal exam­
Administration dental clinic, the distribution o f ination, and oral hygiene evaluation were the
age and sex is clearly skewed to men aged 20 to most troublesome and received a lower reliability
25 years. Thus, the patient population cannot of 80%.
be considered representative o f the total popula­ These results were supported by the findings
tion o f patients in the region. of no significant differences in mean scores be­
Each patient additionally was classified ac­ tween judges for any o f the records. It is apparent
cording to areas of treatment planning and treat­ that for the assessment of history and examina­
ment that could be assessed. For example, how tion a high degree o f reliability was achieved.
many patients had restorative or endodontic The analysis of reliability for treatment plan
treatment that could be evaluated? From Table showed similar results. The percentage agree­
1 it is evident that most patients had received ment between assessors across all items ranged
restorative, periodontal, and some type o f pros- from 79% to 96% (Table 2). Although some pairs

Bailit—others QUALITY OF DENTAL CARE ■ 849


Table 2 ■ Reliability between assessors (percentage
that judges C and D , who did the least number of
agreement) for all items: treatment plan and treatment.
evaluations, had the highest proportion o f signifi­
% a g re e m e n t
cant differences compared with those o f their
No. T re a tm e n t plan T re a tm e n t
P a ir no. A s s e s s o rs p a tie n ts (N = 2 9 item s) (N = 5 0 ite m s ) partners. The lack of practice probably contrib­
1 A /B 4 86 94 uted to their unreliability. Even with this result,
2 A/C 4 85 96
3 A/D 9 85 90
close to 80% of the t tests were not significant;
4 A/E 6 86 94 this indicates that, in general, treatment assess­
5 B/C 2 83 94
6 B/D 2 83 92 ment for a particular patient was quite reliable.
7 B/E 1 96 94
8 C/D 4 81 92
9 C/E
D/E
2
13
79
92
95 ■ Variability: Since each patient was seen twice,
10 92
scores by each assessor were assigned to one of
two groups. This procedure yielded two stan­
dard deviations for each item across all patients.
Table 3 ■ Proportion of items classified by percentage These values were averaged to obtain a better
of agreement between assessors.* indication of the “ true” population standard
C om ponent of
% a g re e m e n t b etw e en a s s e s s o rs
-------------------------------------------------------------------------------
deviation.
e v a lu a tio n <80% 8 0 % -9 0 % 9 0 % -9 9 % 100% Although it is quite difficult to clearly define an
H is to ry & e x a m in a tio n acceptable value for the standard deviation, for
31 ite m s 0.10 0.25 0.65
T re a tm e n t plan practical purposes 0.25 was set as a lower limit.
29 item s 0.21 0.41 0.03
T re a tm e n t
0.35 Items with standard deviations below this cutoff,
50 ite m s 0 .0 6 0.28 0.48 0.18 representing about 25% o f the maximum score,
* Each c e ll re p re s e n ts th e p ro p o rtio n o f in d iv id u a l ite m s w ith in a would serve as poor discriminators and predic­
s p e c ifie d ra n g e o f a g re e m e n t.
tors. A s can be seen from Table 4, about a third
of the items on history and examination and
treatment had standard deviations below 0.25
o f dentists saw very few patients in common,
and more than half the items in the treatment
others such as D and E, who saw a total of 13
plan component failed to reach this arbitrary
patients together, had a 92% agreement score.
cutoff.
For the reliability between assessors for each
A close examination o f the items in each com­
item, only a fifth of the items had less than 90%
agreement; the lowest o f these was 59% for a spe­ ponent showed where the low standard devia­
tions were most prominent. For example, in the
cific criterion related to the control of periodon­
history and examination nearly all the demo­
tal inflammation. As is evidenced from Table 3,
graphic criteria had little or no variance. Simi­
more than a third of the items were agreed on at
larly, the medical items in the treatment plan
least 90% o f the time.
were very low on the standard deviation. Finally,
Finally, o f a total of 4 7 t tests, each determined
in treatment, the items relating to extractions
by a comparison of the two assessments for each
showed little, if any, variability.
patient, only four were significant (2.4 are ex­
pected by chance). For the treatment plan, then,
■ Validity: A s a measure o f concurrent validity
the reliability was high regardless o f the measure
used. the summed treatment plan and treatment scores
For the evaluation o f treatment, the percen­ were compared to the qualitative score for each
patient. A product moment correlation o f 0.30
tage agreement between assessors for all items
was found between the measures. This was sig­
ranged from 90% to 96% (Table 2). Similarly,
nificant at the 0.05 level. Although there is some
more than half of the items had agreement per­
centages higher than 90 and only three items
were agreed on less than 80% o f the time (Table
Table 4 ■ Percentage of items classified by standard
3). This occurred for criteria related to embra­ deviation.*
sure space in amalgams and the marginal ridge
S ta n d a rd d e v ia tio n
heights in crowns or fixed bridges. The agree­ C om ponent of No. item s
e v a lu a tio n in co m p o n e n t =£0.25 > 0 .2 5 < 0 .7 5 ^ 0 .7 5
ment level for these items was 79%.
H is to ry and
In the calculation of the 47 t tests, 11 showed e x a m in a tio n 31 32% 26% 42%
T re a tm e n t plan 29 58 31 11
significance beyond the 5% level. This is about T re a tm e n t 50 34 28
38
nine more than would have been expected by *Each c e ll re p re s e n ts th e p e rc e n ta g e o f ite m s w ith in a s p e c ifie d
chance. A close examination of the data showed s ta n d a rd d e v ia tio n range.

850 ■ JADA, Voi. 89, October 1974


indication that the two scores were tapping the cation. Objective evaluation o f the quality of
same underlying variable, 91% of the variance care provided in a practice by colleagues en­
is unaccounted for; the process standards con­ ables the individual dentist to receive feedback
tain unique variance unexplained by a simple on his clinical strengths and weaknesses. This
evaluation measure. assessment should be carried out in an atmos­
phere o f helpful concern rather than of sanctions
■ Practicality: The training time for assessors, and punishment—the usual connotations asso­
both professional and nonprofessional, averaged ciated with peer review. Even the dentist doing
about 20 hours. About 18 more hours were spent the assessment will find peer review a learning
by each clinical assessor in practicing the system experience. In fact, it is recommended that all
with patients. The time for examination of pa­ dentists should, at some time, serve as asses­
tients varied indirectly with the number o f pa­ sors.
tients seen. A t the beginning o f the evaluation The preliminary results o f testing the criteria
sessions, about 40 minutes were required to do for reliability, validity, and practicality were en­
the treatment plan and treatment, but by the end couraging. A ssessor reliability was relatively
o f the sessions, very few patients required more high for most items. For the few categories in
than 20 minutes of the examiner’s time. The qual­ which there were problems, it is our impression
itative measure required no more than five min­ that the criteria were not explicit and the asses­
utes at any time. sors were unsure of themselves. These criteria
These times are all based on the particular as­ have been modified by the standards committee,
sessors, recorders, and patients used in the pres­ and the reliability for these items should improve.
ent test run. However, these results are probably In terms of the variability scores, the standard
not unique and may be generalized to most set­ deviations were not as high as expected; this in­
tings. dicates that many items were always assigned
the same value by the assessors. T o some extent,
this may have led to artificially high reliability
values for some items.
Discussion Several reasons could account for the low var­
iability of some items. The standards may have
Perhaps the most important results of this study been set too low or high; however, a more im­
are the insights gained on the process of devel­ portant reason is probably the mix of patients
oping criteria and standards and not on the con­ evaluated. With a larger number o f patients cov­
tent of the criteria per se. Private dental practi­ ering a wider range o f clinical problems, greater
tioners, without special expertise or training, variance would, in all likelihood, result.
were able to formulate criteria of care within a Practically, the total time needed for the den­
six-month period. Some o f these same practi­ tist to evaluate one patient was less than 25 min­
tioners then used these criteria in the assess­ utes. O f equal significance, the time necessary to
ment o f patients. While technical advice was train a practicing dentist to be an assessor was
available from dental faculty with a background also reasonable. These results suggest that this
in tests and measurements, this type of advice system , or a modified version of it, could be in­
is probably obtainable in most communities; this stituted on a larger scale. It could serve as a form
suggests that many dental groups could develop o f continuing education, as a framework for pa­
standards. tient assessment by peer review committees,
A s noted in the introduction, there are many and as one level of quality assessment for den­
advantages to the development o f criteria at a tal care programs. With reference to the last
local as well as a national level. First o f all, there item, there must be different levels o f evalua­
is a greater probability that the criteria will be tion when large numbers of patients are involved.
acceptable to community dentists. A second ad­ Certainly, it is unrealistic to suppose that a sig­
vantage is the educational value o f having a group nificant percentage o f patients in a program
of practitioners who know and respect each other could be examined as part of a quality monitor­
discuss the elements of adequate care. Third, the ing system. The cost and inconvenience to both
development of standards may be the beginning providers and patients would be too great. In­
of a community peer review system that goes be­ stead, the initial or screening review will prob­
yond problems o f patient or third party com­ ably depend on information obtained from insur­
plaints, and becomes a form o f continuing edu­ ance claim forms or patient records. Then, on

Bailit—others: QUALITY OF DENTAL CARE ■ 851


the basis o f these results, selected dentists would rating o f the quality o f care on a scale of from one
be subject to a more detailed review. It is at this to five resulted in a low correlation (r=0.30) be­
point that the evaluation system proposed here tween the qualitative and quantitative scores.
would be o f value. In the five to ten minutes allowed for the qualita­
Mention was made previously of problems tive assessm ent, it is our impression that the ex­
associated with patient sampling. This whole aminers tended to focus on one or two items that
question needs considerable study before any did not meet acceptable standards of care and to
organized evaluation system can be effected. weigh these items heavily in the final score they
Currently, there is little knowledge of the num­ assigned. This suggests that it is necessary to
ber and types o f patients who must be examined have an orderly schedule to follow in the exam­
to obtain a stable estimate of quality in the aver­ ination o f patients.
age dental practice. Certainly, assessment of O f course, the value o f the qualitative versus
two or three patients from an active list of 1,500 the quantitative assessment systems cannot be
to 2,000 probably yields little information on the decided finally until the predictive validity of
quality o f the entire practice. This point should each method is evaluated—the relationship be­
be emphasized since some recent publications tween the process scores (qualitative and quan­
on the quality of dental care have suggested that titative) and scores based on indexes of patient
a low but significant percentage of dentists are health. Although unlikely, it is possible that the
delivering inadequate care; unfortunately these qualitative score correlates more closely with
results are based on samples o f only a few pa­ the number of D M F teeth or periodontal index
tients from each practice.15,16 Although this con­ than does the quantitative score. This issue is
clusion might be correct, it is premature to make currently under investigation.
such statements without further evaluation.
A related issue that has both theoretical and
practical significance is the relationship of qual­
ity among different treatment areas. For exam­ Summary
ple, if it is assumed that the average general den­
tist spends most o f his time providing restora­ This paper presents a description o f the process
tive, periodontal, and prosthetic services, is the of development and testing of clinical criteria
quality of care for each o f these treatment areas and standards for general dentistry. A standards
approximately the same within a particular prac­ committee made up of nine dental practitioners
tice? From a theoretical point of view , this prob­ established criteria for three components of care:
lem is o f interest because it leads to speculations the history and examination, treatment planning,
on the reasons for any quality differences found. and treatment. The standards establish what the
Are they a result of personal preferences of the average general dental practitioner should do to
dentist? D oes he enjoy one area but not the oth­ provide adequate care. Criteria could not be de­
er? Are they a factor of his education and train­ veloped for all services performed by dentists,
ing or, perhaps, the amount of time spent in pro­ so only the more common ones were considered.
viding specific services? Obviously, these are After a limited pretest of the criteria, five
not mutually exclusive categories and probably dentists were trained to use them in the evalua­
all reasons are relevant. tion o f patients. Two independent assessments
Practically, if the quality of care seen in one or of 47 dental patients were made in an effort to
two types of services is indicative o f quality in determine the reliability and variability o f speci­
the entire range of services provided by a den­ fic items and the practicality of the entire sys­
tist, then the evaluation process can be greatly tem. In addition, the validity o f the criteria
restricted and limited to the evaluation o f a few was evaluated by a comparison of the scores as­
patients of a specific type. H owever, if this is signed on the quantitative review system to those
not true, then it is important that enough patients obtained independently on the basis of a qualita­
o f different types be seen to obtain a valid es­ tive rating scheme.
timate of quality. Although these problems seem The results indicated a high degree of reliabil­
somewhat esoteric at this time, they may soon ity for most items. H owever, the variability was
assume major importance. often quite low; this may be explained partially
The attempt to validate the criteria with use by the limited number and diversity of patients
of an independent evaluation o f the patient and evaluated.
852 ■ JADA, Vol. 89, October 1974
The time needed to train assessors and to ex­ Dr. Koslowsky is assistant professor, departm ent of behavioral
sciences and com m unity health, and Dr. Grasso is assistant pro­
amine patients was within practical limits; thus, fessor, departm ent o f general dentistry, School o f Dental Med­
this system is feasible on a larger scale. With icine, University o f C onnecticut Health Center. Dr. H olzman and
further refinements in the review procedures, Dr. Levine are private practitione rs and members o f the Hartford
Dental Society. Ms. Valluzzo and Ms. A tw ood are research as­
the time and, therefore, the expense of evalua­ sistants in the departm ent of behavioral sciences and com m un­
tion can probably be reduced even more. ity health, U niversity o f C onnecticut Health Center.
The correlation coefficient between the quan­
1. Egdahl, R.H. Foundations fo r medical care. New Engl J Med
titative and qualitative scoring systems was low 288:491 March 8, 1973.
but statistically significant. This indicates that 2. Abram ow itz, J. Planning fo r the Indian Health Service. J
P ublic Health Dent 31:70 S pring 1971.
the simpler qualitative method cannot be sub­
3. Friedman, J.W. A guide fo r th e evaluation of dental care. Los
stituted for the quantitative approach. Angeles, School o f P ublic Health, University o f C alifornia, 1972.
The major point to be emphasized from this 4. S chonfeld, H.K., and others. Professional dental standards
study is that criteria and standards of dental care fo r the con tent of dental exam inations. JADA 77:870 O ct 1968.
5. S oricelli, D.A. M ethods o f adm inistrative control fo r the pro­
can be developed and tested by local dental m otion o f quality in dental programs. Am J P ublic Health 58:1723
groups. These standards may serve as the basis Sept 1968.
for any government-sponsored evaluation sys­ 6. Cons, N.C. Method fo r posttreatm ent evaluation of the qual­
ity of dental care. J P ublic Health Dent 3 1 :104 S pring 1971.
tem. 7. Lee, R.I., and Jones, L.W. The fundam entals o f good med­
ical care. P ublication of the C om m ittee on the Costs of Medical
A uthor’s note: G . Ryge and M. Snyder pub­ Care, no. 22. Chicago, C hicago University Press, 1933.
8. Roemer, M.l. Evaluation o f health service program s and lev­
lished an investigation (“ Evaluating the clinical els of measurement. HSMHA Health Rep 86:839 Sept 1971.
quality o f restorations” ) on the development and 9. Schonfeld, H.K., and others. The content o f good dental
testing o f process standards for assessing the care: m ethodology in a form ulatio n fo r clinical standards and
audits, and prelim inary find in gs. Am J P ublic Health 57:1137
technical quality o f restorations (JA D A 87:369 July 1967.
Aug 1973). 10. Donabedian, A. Evaluating the quality o f medical care.
M ilbank Mem Fund Q 44:166 July (Suppl) 1966.
11. Friedman, J.W. Study and appraisal guide fo r dental care
programs. Berkeley, School o f P ublic Health, D ivision o f P ublic
This w ork was supported in part by contract NIH-72-4207 from Health and Medical A dm inistration, University of C alifornia, May
the Division o f Dental Health, National Institutes o f Health, US 1963.
P ublic Health Service. 12. Peterson, O.L., and others. An analytical study o f North
C arolina general practice: 1953-1954. J Med E duc31:1 Dec 1956.
The au thors acknow ledge the assistance of Dr. Earle Yeamans 13. Donabedian, A. A guide to medical care adm inistration.
and Robert Villanova; the members o f the standards com m ittee: V olum e II: Medical care appraisal—qu ality and utilization. New
Dr. Nathan Dubin, Dr. Stanley Holzman, Dr. Robert Levine, Dr. York, The American P ublic Health Association, Inc., 1969.
Calvin Mass, Dr. S edrick Rawlins, Dr. Robert Villanova, and Dr. 14. Kessner, D.M.; Kalk, C.E.; and Singer, J. Assessing health
John Zazzaro; the Hartford Dental Society and the State Board qu ality—the case fo r tracers. New Engl J Med 288:189 Jan 25,
o f Dental Examiners; Dr. Michael Zazzaro, secretary o f the board; 1973.
and Ms. Joan Jannace and Ms. Sharon Zarcaro. 15. B eilin, L.E.; and Kavaler, F. P olicing p u blicly funded health
care fo r poor quality, overutilization, and fraud— the New Y ork
Dr. B ailit is professor and head of the departm ent o f behavioral C ity M edicaid experience. Am J P ublic Health 60:811 May 1970.
sciences and com m unity health, School o f Dental Medicine, 16. Denenberg, H.S. A shopper's guide to dentistry. Harrisburg,
University o f C onnecticut Health Center, Farm ington, Conn 06032. Pennsylvania Insurance Department, 1973.

Bailit—others: QUALITY OF DENTAL CARE ■ 853

You might also like