Professional Documents
Culture Documents
Icann 2013 V 3
Icann 2013 V 3
net/publication/258171153
CITATIONS READS
4 516
3 authors:
Michael Oakes
University of Wolverhampton
111 PUBLICATIONS 1,486 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Christo Panchev on 20 November 2014.
1 Introduction
There is a tremendous growth in the amount of data produced in the medical domain
[1] and many approaches, including statistical and neural approaches have been
proposed for medical data mining which produce information that helps in problem
solving and taking decisions [2,3,4].
The work presented here is based on the large data set of patient records from a major
British National Health Service (NHS) audiology centre in England containing
180,000 individual audiology records (from 23,000 patients). The decisions of
whether to prescribe an ITE or BTE hearing aid are typically made by audiology
technicians working in the out-patient clinics, on the basis of audiogram results and in
consultation with the patients. ITE hearing aids are not generally available on the
NHS in England, as they are more expensive than BTE hearing aids. However, both
types of aids are prescribed at the audiology centre providing the data. Usually the
choice is straightforward, but in some cases the technicians could benefit from a
second opinion (e.g one given by a decision support system) with an
explanation/justification of how that second opinion was arrived at.
2 Data Pre-processing
Figure 1 represents the directed acyclic graph for the ITE/BTE aid the Bayesian
network obtained from Weka v3.4, where nodes represent the.
In Table 2, the probability for gender=male, age<=60, and BTE aid, that is, P(gender
= 'male'/age ='<= 60', aid ='BTE' ) is calculated as (339 + 1)/(673 + 2) = 0.504,
where 339 is the number of instances of “gender=male, age<=60, and BTE aid”, 1 is
the initial count for “gender=male, age<=60, and BTE aid”, 673 is the total number of
instances with “age<=60 and BTE aid”, and 2 is the count of different values of
gender (that is, male and female). Using the same method the probabilities for the rest
of the variables are calculated.
Testing of these Bayesian network showed that overall there was 93.2% agreement
between the predictions of this model and the actual hearing aid chosen by the
audiologist (as given in the “type” field) as shown in Table 3. The agreement rate was
higher for patients fitted with ITE aids (97.1%) than for those fitted with BTE aids
(88.5%).
The second model that was deployed is based on a Multi Layer Perceptron. The
network had 21 input and 2 output neurons covering the data attributes (Table 4). The
network had 5 hidden neurons with hyperbolic tangent sigmoid activation function
and was trained using Levenberg-Marquardt backpropagation [5].
Table 5 presents the confusion matrix of the results from the neural network.
Although the neural network shows slightly higher overall performance (93.7%), the
results between the two models are qualitatively the same. As in the Bayesian model,
the highest agreement between the neural network and the medical expert is for the
ITE aids (98.2%). The highest misclassification of the models is for the case where
the model suggests an ITE aid whereas the human decision was to prescribe the BTE
one (11.7%). This is a partially expected result since, as mentioned earlier in the
paper, ITE aids are generally not available on NHS in England and doctors have the
tendency to bias their decisions toward the generally available BTE hearing aids.
Following the results presented above, the importance of each of the input
attributes was evaluated for their relative contribution to the correct ITE/BTE
decision. The network trained with the full set of input features were evaluated in
separate tests where one of the input attributes was set to 0. The relative importance
of an attribute was calculated as proportional to the neural network’s misclassification
error during tests with the data of that factor being ignored. The output error was
calculated over the entire dataset (i.e. training, validation and test data). When a
particular input factor is removed, a higher output error will indicate that this attribute
is more significant in the performance of the model, i.e. higher importance/effect on
ITE/BTE classification, whereas a lower error would indicate relatively lesser degree
of relevance.
0.5
0.45
0.4
0.35
Misclassification
0.3
0.25
0.2
0.15
0.1
0.05
Diagno…
0
Age
Mould
AC250
AC500
AC1000
AC2000
AC4000
AC8000
BC250
BC500
BC1000
BC2000
BC4000
Base line
Mask
Gender
Fig. 2. Relative attribute importance of the input attributes toward the ITE/BTE classification.
The base line is 6.0% misclassification error with all input attributes present.
The results presented in Figure 2 show that the Mould is the most significant factor
in determining the ITE/BTE aid. This is another expected result, since medically the
mould type is highly correlated to the hearing aid being used. Leaving the mould
aside, the other significant attributes which can be identified are the Age and Gender
causing increased misclassification errors of 12.6% and 11.5% respectively. On the
frequencies range, the most significant attributes are shown to be the bone conduction
frequencies BC1000 and BC2000 increasing the error to 11.2% and 10.7%
respectively. These results are similar to [6] where using logistic regression Age was
not found significant factor associated with ITE/BTE hearing aids but Gender,
BC1000 and BC2000 were found significant.
6 Conclusions
We have presented two machine learning models for the classification of ITE/BTE
hearing aids based on audiology data. Both models provide qualitatively similar
results with the Neural Network having slightly better classification rate, indicating
that they are both viable for implementation into a real decision support system for
hearing aid prescriptions. In addition, the disagreement rate between the models and
the audiology experts could provide a quantifiable measure as to what percentage of
patients could have benefited from prescribing the appropriate hearing aid based
purely on diagnostic data rather than considering the availability and costs of the
devices. Furthermore, the discovery of significant attributes (factors) and relationships
in audiology data for hearing aid classification will provide supplementary
information for audiology experts.
References