15-06-22-3rd-Molar-Pt-1-As-RE-Submitted - Kings College

1
DENTAL AGE ESTIMATION (DAE): Data Management for Tooth Development
Stages including the Third Molars. Appropriate Censoring of the final S stage of
development – stage H.
Graham J Robertsa,*, Fraser McDonalda, Manoharan Andiappanb, Victoria S Lucasa
Department of Orthodontics, King’s College London Dental Institute, Floor 25, Tower
Wing, Guy’s Campus, London, SE1 9RT, UK
a
b
Biostatistics and Research Methods Centre, Division of Patient and Population Health,
Denmark Hill Campus, King’s College London Dental Institute, Denmark Hill,
London, SE5 9RW, UK
*Corresponding author. Tel: +44 01732 463636
Email addresses: email: [email protected], [email protected],
[email protected], [email protected]
tel +44 020 7188 4432
2
Graphic Abstract
3
Abstract
Introduction.
The final stage of dental development of third molar is usually helpful to indicate whether or
not a subject is aged over 18 years. A complexity is that the final stage of development is
unlimited in its upper border. Investigators usually select an inappropriate upper age limit or
censor point for this tooth development stage.
Materials and Methods
The literature was searched for appropriate data sets for dental age estimation and those that
provide the count (n), the mean (x), and the standard deviation (sd) for each of the tooth
development stages. The Demirjian G and Demirjian H were used for this study. Upper and
lower limits of the stage G and Stage H data were calculated limiting the data to plus or
minus three standard deviations from the mean. The upper border of Stage H was limited by
appropriate censoring at the maximum value for Stage G.
Results
The maximum age at attainment from published data, for stage H, ranged from 22.60 years to
34.50 years. These data were explored to demonstrate how censoring provides an estimate
for the correct maximum age for the final stage of Stage H as 21.63 years for UK Caucasians.
Conclusion
This study shows that confining the data array of individual tooth developments stages to
± 3sd provides a reliable and logical way of censoring the data for tooth developments stages
with a Normal distribution of data. For stage H this is inappropriate as it is unbounded in its
upper limit. The use of a censored data array for stage H using Percentile values is
appropriate. This increases the reliability of using third molar stage H alone to determine
whether or not an individual is over 18 years old. For Stage H, individual ancestral groups
should be censored using the same technique.
1. Introduction
The concept of Dental Age Estimation (DAE) is not new. The assessment of tooth
eruption as a method of estimating age was introduced in the 19th Century1 and is still
occasionally used as a guide to a child's age. The availability of radiographs has enabled
visualization of the mineralization of developing teeth which are then classified into
4
identifiable stages from which dental maturity can be determined2. The close link between
dental maturity and chronological age (CA) justifies the use of dental age (DA) as a surrogate
for Chronological Age3. At the 10 year threshold, DA is accurate to within 3 weeks of CA4.
Despite several publications relating to the 18 year threshold, there is little known about the
reliability of estimates made at or near this age5,6,7. A systematic and highly informative
attempt has been made to identify and explore these difficulties in the series of papers from
the University of Leuven8.
The most common way to determine whether or not a subject is over 18 years is to calculate
the probability => 18.00 using the mean () and the standard deviation (sd) of the data array
for the final stage of tooth development. A typical outcome using this approach is that a
subject exhibiting stage H is 0.92 (92%) probability to be over 18 years9. The underlying
assumption is that the data for Stage H is Normally distributed. It has been suggested that an
approach using percentile data would, on theoretical grounds, lead to a conclusion more in
line with the distribution of the data10. The research team in Germany use both Normal
distribution data and some percentile data to display the results for stages of third molar
development11. The Swiss-German research team proposed that because subjects exhibiting
stage H may be below 18 years of age, it is inapproprite to use the presence of stage H alone
as an indication that the subject is over 18 years old12.
An important requirement of statistical analysis is that the data used are effectively
managed13. This is to ensure that inappropriate values are not included in the data array. This
might be because of a grossly incorrect result such as an age value when the data of birth and
data of assessment are reversed, and incorrect assessments that give unjustifiable outliers.
These problems are managed by so-called ‘cleaning’ of the data by visual inspection when
checking the data14.
Common practice in statistics is to use only the central 95% of the data array, the so called
95% Confidence Interval (CI)15. It is common to use this 95% confidence interval,
sometimes called the reference range, as the data array for a given variable to enable
clinicians to determine what is normal and non-normal. With regards to DAE, lawyers
question the appropriateness of this as it excludes 5% of the data or the equivalent of 1 of 20
subjects.
5
Given this concern the research presented here is aimed at encompassing as much of the data
as possible by using plus or minus three standard deviations ( ± 3sd ) as this incorporates
99.9974% of the sample data, and by inference a similar percentage of the population. This
is regarded as an age range for subjects of unknown age whose future will depend upon a fair
and robust assessment of the likelihood of being within or without this stated age range.
A further problem is that of Stage H, the final stage of development which is unbounded in its
upper limit. This raises the issue as to how different investigators have set the upper age limit
for the samples investigated. Publications range from 21 years16, 22 years17, 23 years18, 24
years19, 26 years20, 26.9 years22, and 33.9 years23.
Clearly this wide range of age limits for the upper boundary of subjects in studies on DAE,
spanning almost 13 years, raises the question as to what should be the upper boundary for
Stage H of third molars and how this boundary is identified.
Perusal of the literature reveals that investigators do not indicate how the data management
has been carried out to ensure that there are robust data sets using the 8 stage system2 for each
of the 250 plus data arrays for each Tooth Development Stage (TDS).
A further problem is that it is dificult to compare published data sets as the number of Tooth
Development Stages (TDS) used for assessment of dental development varies from 4
stages24, to 21 stages25. (Mahtab M. personal communication). The most frequently used
number of Tooth Development Stages used to estimate Age at Attainment (AaA) of each
stage is the 8 stages described by the Anglo-Canadian research team2. Because of it
widespread use, the 8 stage system of TDS has been selected for this review. It is important
to be aware that the present paper utilises only the anatomical descriptions of the 8 tooth
development stages but does not include the system of mathematical integration to estimate
dental maturity and from that dental age2. In addition the 1973 system of dental age
assessment was limited to teeth in the mandible and excluded third molars. This limitation
was superseded when an American research team applied the 8 stage system to a study of the
development of all four third molars5. This forms the basis for the system of age estimation
at the 18 year threshold currently in use in the United States of America26, and also by many
investigators in Europe including the DARLInG team at King’s College London. The Tooth
Development Stages defined in 1973 are referred to by the letters A, B, C, D, E, F, G, and H.
6
Detailed descriptions of each stage are available in the original publication2 and more
recently in a paper at the 16 year threshold27.
The presentation of data follows two formats:
1. Normal distribution summary data
2. Percentile data.
This final stage of tooth development is unbounded in its upper age limit. With regard to third
molars this means that an enquiry about estimating age using a radiograph showing a fully
developed root of a lower left third molar from a male Caucasian who is aged 25 years would
return the same age and probability estimate as an enquiry from someone aged 15 years.
The difficulty with the lack of a natural upper limit to the data array for the AaA of Stage H
is usually overcome by not using stage H for age estimation in children who are still
developing28. This exclusion of stage H is not possibe when estimating age at the 18 year
threshold as the third molar stage H is the principle dental biological marker that can be used
for age prediction for late adolescence and emerging adults. This was the genesis of the
seminal paper on third molars published by the research team in the USA5. In reviewing the
articles on this problem, it is clear that almost all the papers published do not indicate in
sufficient detail the approach used for the curtailment of the upper limit of the AaA for Stage
H. The problem has been touched upon in a paper from South Korea where the authors used
different age ranges to estimate the mean value of stage H29. For example, focusing on the
Lower Left Third Molar Stage H for males, the AaA for the LL8Hm is 21.6 years for the 14
to 24 year band, 22.1 years for the 14 to 25 year age band, and 22.5 years for the 12 to 26
year year age band29. This raises the issue of the amount by which the AaA for stage H is
inflated by using an upper age cut-off or censor point of 26 years when compared to 21 years.
This problem has been explored and led to the concept of ‘appropriate censoring’30. It is
clear from published papers that the issue of censoring the data for the final stage of tooth
development has been ignored or overlooked by the majority of investigators in the field of
DAE.
A further problem is the way in which the uncertaintly uncertainty of the point estimate is
provided. It is conventional to use the 95% confidence interval. The problem with this is
7
that it excludes 1 in 20 subjects, a figure that is not acceptable to lawyers. The use of a 100%
confidence interval expressed as a reference range has much to commend it as it places any
estimates of age beyond any reasoanble reasonable doubt. This is important for criminal
procedures but for civil procedures it is necessary only to proivde provide the central 50% of
the data31. This is usually presented as the Interquartile Width (IQW) (Alltman 1991)32.
The purpose of this paper is threefold:
1. To review data management procedures to determine the suitability of a ± 3sd
constraint to the lower and upper limits of the data array for each AaA for all the
tooth development stages.
2. To describe a simple system of censoring to provide a realistic age to the upper
boundary for stage H.
3. To review extant data sets where the 8 stage system of Demirjian2 has been used to
assess the impact of inappropriate censoring of Stage H on the estimation of the lower
and upper limits of AaA for the Lower Left Third Molar (encompassing 100% of the
data array), the estimation of the middle 50% i.e. the Inter Quartile Range, and the
implications of this for DAE at the 18 year threshold.
Materials and Methods
The data for this study are from previous publications thus ethical approaval is not required.
Ethical approval for the previous studies was obtained from the King’s College Hospital NHS
Trust Research and Devlopment Committee – Reference 06DS03, and The Eastman Dental
Hospital – Reference 03/E02.
The dental, medical and forensic literature was searched using both electronic databases and
handsearching of a publications derived form the electronic search.
Timespan of search
The search was limited to the period January 1973 to December 2014.
8
Inclusion Criteria
The papers included required that the TDS described by the Anglo-Canadian team was
used2. In addition, the results section provided the count (n-tds), arithmetic mean (-tds) and
the standard deviation (sd-tds).
The subjects providing the data were aged between 4 years and 26 years.
Exclusion Criteria
Publications that did not report the arithmetic mean, and the standard deviation for each AaA.
Electronic Searches
These comprised PubMed, EMBASE, and the Cochrane Collaboration using the terms
‘Demirjian’, Tooth Development Stages’, ‘Tooth Development’, ‘Dental Maturity’.
Hand Searches
Subsequent to this the reference lists in all the articles identified by the electronic searches
were used to identify further articles.
Reference Data Set used for comparison
The Dental Age Research London Information Group (DARLInG) Reference Data Set33 was
used to provide the full data for The Lower Left Third Molar Stage G for males (LL8Gm)
and also the uncensored and censored data for Stage H for males LL8Hm.
Censoring and the Normal distribution
The censoring process for stages A through to G for the DARLInG data was carried out by
identifying the lower limit of the data as minus three standard deviations and the upper limit
as plus three standard deviations (± 3sd). This was achieved by exporting the data from the
DARLInG Access database into Microsoft Excel. Pivot tables were used to isolate individual
data sets e.g. LL8Gm. Following this a simple count of all the data arrays that were within
± 3sd was made and the outcome expressed as a percentage of TDS that fell within the ± 3sd
range for each of the data arrays in the whole dentition.
9
The results presented are those only for stages G and H of the Lower Left Third Molar
(Figure 1) . This is to ensure clarity in presenting and interpreting the findings relating to
Censoring of data.
Censoring of Stage H
The distribution of data for the uncensored stage H was assessed and a proability probability
function plot created in additon to a Stacked (percentage) Bar Graph (see graphic abstract).
The censoring process for Stage H of third molars in the DARLInG datasets was achieved by
first curtailing the upper limit of Stage Data at the age when 100% of subjects in the
DARLInG data set first exhibited Stage H. This is defined as the age immediately above the
oldest subject with stage G30. The data remaining after this censoring of the upper limit of
stage H was then subjected to further scrutiny using the ± 3sd procedure described above.
The resulting data array, limited by ± 3sd was then used to produce the full array of Normal
distribution summary data and percentile data (see DARLInG RDS).
The data were then used to graph a Probability Distribution Histogram on to which is
superimposed a Normal curve (STATA issue 13.0)31. Presentation of the probability
distribution histograms were limited to the LL8Gm censored by ± 3sd and the LL8Hm
uncensored, and censored by the maximum age of LL8Gm (21.64 years). This is for the
purposes of clarity. The appearance and definitions of these two stages are shown (Figure 1.)
10
Figure 1.
Stage G:
a. The calcified region of the bifurcation has
developed further down from its semilunar
stage to give the roots a more definite and
distinct outline.
b. The apical end of the root canal is still
partially open (distal root of molars).
Stage H:
a. The apical end of the root canal is
completely closed (distal root of
molars).
b.
The periodontal membrane has a
uniform width around the root apex.
The summary data from publications that provided information on stage H only of lower left
third molars were then used to calculate percentile summary data and Normal distribution
equivalent viz Table 1.
Table 1.
Percentile data
Normal distribution data
minimum
≈ -3 sd
1st quartile
≈ 0.25 cumulative probability
median
≈ mean
3rd quartile
≈ 0.75 cumulative probability
maximum
≈ +3sd
Table 1. Percentile summary point estimates and Normal distribution summary equivalents.
11
These data were used to provide population estimates of for the 100% Confidence Interval
(the whole data array), the 50% Confidence Interval (The Interquartile Width [IQW])32 and
the Probability of a Subject with Stage H being under or over 18 years old using the data
from published papers and the DARLInG RDS33. The Percentile equivalent was also
estimated using the PERCENTRANK.INC Function in Microsft Excel34.
Results
The probability distribution histograms for LL8Gm is shown in Figure 2. This is offered as
it is typical of the Normal distribution for all the TDS other than stage H.
Place Figure 2 approximately here.
Figure 2. Probability Distribution Histogram of
the Lower Left Third Molar Stage G- male. This
shows the mean, standard deviation, and the
lower and upper limits defined by ± 3sd.
Figure 2. Probability Distribution Histogram of the Lower Left Third Molar Stage G – male.
This shows the mean (𝑥̅ ), standard deviation (sd), and lower and upper limits defined by
± 3sd.
Of the 256 Tooth Deveolopment Stages present in the UK Caucasian Reference Data Set, 32
were not assessed as there were no data available (due to extreme youth) and a further 32
were not included as these were Stage H which had not been censored. This left 192 TDS of
developing teeth. Of these, 186 (96.88%) gave a data array within the ± 3sd lower and upper
limits. The remaining 6 (3.13%) gave a data array only a small amount outside the ± 3sd
limits.
12
The data array for Stage H was censored using the maximum age for Stage G. This is the
point in a cross sectional sample (as should be all DAE Reference Data Sets), where all the
third molars present in the sample first exhibit Stage H. The distribution of this uncensored
data is non-normal and is slightly skewed to the left as is shown in Figure 3. This is owing
to the lack of depletion of Stage H numbers because each tooth, as the subject gets older,
does not move on to a further stage. Thus the upper part of the data array becomes stacked
with fully mature third molars at Stage H. In the uncensored Stage H there are AaA values
up to the age limit of the sample which is 26 years. This shows that in the DARLInG
Reference Data Set (RDS) there are considerable numbers of completed Stage H cases, (n-tds
= 161) inappropriately occuping the upper regions of the array of data (Figure 3).
The data for stage H censored at the maximum of stage G, 21.64 years, shows a much
narrower age range of data for the probability distribution histogram (Figure 4).
It is clear that these two distributions are markedly different with the LL8Hm (uncensored)
plot being very top heavy in the upper age bracket. A feature of this top heavy data is that the
upper border of the LL8Hm dataset is 26.88 years with a mean of 22.04 years. For the
censored stage H (Figure 4) the data is still top heavy. A Shapiro Wilk test for Normality
shows a non-Normal distribution for both the uncensored (p < 0.00030) and also the censored
stage H data (p < 0.00274 ). The histogram and Shapiro Wilk’s test showed a normal
distribution (p = 0.21) for LL8Gm.
13
Lower Left Third Molars Stage H - male
15
Uncensored data
Mean = 22..68 years
Median = 23.46 years
Figure 3. The probability distribution histogram
for LL8Hm [The Lower Left Third Molar Stage H,
10
male] Uncensored.
The mean value for this uncensored stage H is
22.68 years. the Shapiro Wilk test confirms that
5
this is non-Normal with a p value of 0.0003
0
[highly significantly different from Normal].
15
20
25
LL8Hm - uncensored (Years)
30
Lower Left Third Molars Stage H - male
Figure 4. The probability distribution histogram
15
Censored data
Mean = 19.73 years
Median = 20.17 years
for LL8Hm [The Lower Left Third Molar Stage H,
male] Censored at the maximum value of Stage G
10
= 21.64 years. The mean value for this censored
stage H is 19.73 years. the Shapiro Wilk test
confirms that this is non-Normal with a p value of
0
5
0.0027 [significantly different from Normal]
15
20
25
LL8Hm - censored (Years)
30
To illustrate the effect of censoring, the data for LL8Hm was progressively censored by 1
year decrements by reducing the Upper Age (Table 2. a to g).
Table 2.
Row
Upper Age
n- tds
sd-tds
Boundary
(Count)
(Years)
Minimum CPV = 0.25
(Years)
(Years)
Mean CPV = 0.75 Maximum
First
̅- tds)
(𝒙
Third
Quartile
(Years)
Quartile
(Years)
(Years)
(Years)
a
Uncensored
159
2.79
15.47
20.81
22.68
24.57
26.88
b
24.99
138
2.48
15.47
19.89
21.57
23.24
24.89
c
23.99
108
2.19
15.47
19.28
20.76
22.24
23.93
d
22.99
89
1.95
15.47
18.86
20.17
21.48
22.95
e
21.99
72
1.76
15.47
18.42
19.61
20.79
21.96
f
Censored 21.64
65
1.71
15.47
18.58
19.73
20.89
21.63
g
20.99
53
1.53
15.47
17.89
18.92
19.95
20.98
CPV = Cumulative Probability Value;
tds = tooth development stage
14
Table 2. This shows the Uncensored stage H data (row a) and the Censored Stage H data
(row f). The reduction (difference) in the mean value is 2.95 years. Rows b to e inclusive
show the descending mean value with the 1 year decrements to the censor point. The
0.25CPV and the 0.75 CPV are the Normal distribution equivalents to the 25th%-ile and the
75th%-ile. The mean is similar in value to the Median which is the 50th%-ile.
Row g shows the effect of over-censoring with a mean value too low at 18.92 years.
To explore these issues on the population estimates the data extracted from the 21 research
reports that contained the relevant information was used to estimate the values shown in
Table 3.
Put the Excel Table here.
Table 3. Calculated and extracted values for - 3sd, 0.25 Cumulative Probability Value (CPV),
the mean (𝑥̅ ), 0.75 cumulative Probability Value, + 3sd, the range, and the Inter Quartile
width. The 0.25 CPV and 0.75 CPV are included to approximate the 25th and 75th
percentiles. All the reports used provided the data for ancestry, gender, mean and standard
deviation. [Table 3 is an Attached Excel File]
The research reports that provided percentile data were used to enable a comparison with the
summary data values derived using Normal distribution statistics (Table 3). In addition for
the DARLInG data sets available, the PERCENTRANK.INC function of Microsoft Excel34
was used to estimate the probability that a subject would be >18 years. This was performed
only for the subjects for whom the full data array was available35,36, 33, 37 -39.
The data reveal large differences for both the Range (Highest minus Lowest) and the
Interquartile Width (25th%-ile to 75th%ile) (Table 3). [Table 3 is an Attached Excel File]
Discussion
The data extracted from the DARLIng RDS show that the data for the AaA for Stages A to G
are Normally distributed33. The example used for this PAPER is the LL8Gm (Figures 1 and
2). The use of ± 3sd to embrace the lower and upper limits was successful as over 96% of the
resulting data arrays were within the limits determined in this way. Although not published
here the remaining data arrays were all very close to being within these limits. This is
important as the results for the minimum and maximum derived in this way, representing
15
99.9973% of the data, were subsequently regarded as the 100% Confidence Interval for the
data array for a single TDS. This is particularly important for the third molar as the final stage
of development, stage H, is used to determine whether a subject is below or above the 18 year
threshold. This 100% CI is closely related to the concept of Reference Intervals32.. It
represents the range of data that encompasses all values within this range and are considered
to be normal. (For a detailed exposition of this concept see Altman 1991 Chapter 14).
It is proposed that for future publications the data management procedures for TDS are
explained by authors in sufficient detail to enable readers to assess the ‘fitness for purpose’ of
such an approach. Within the UK Caucasian RDS33 there were 192 data arrays for the TDS
present representing all the Tooth Morphology Types in the human dentition. The remaining
arrays, taking the total number up to 258, are for Stage H for which the ± 3sd approach is
inappropriate.
The use of ± 3sd satisfies the requirements of lawyers not to exclude 5% of the data (1 in 20).
The additional finding in this paper are the large differences in the data for stage H when
looking at the range of data. Values for the range as high as 15.72 years and 13.26 years
(Table 3) respectively for females and males raise concerns as to the appropriateness of the
data leading to these large and biologically untenable differences. It is clear that these
extreme values are in part the consequence of using the Minimum and Maximum values (±
3sd). In the context of DAE this broader CI is appropriate as it provides clients with a broad
and justifiable range within which age estimation may be provided with only a very small
risk of error at either end of the distribution (0.0013 [0.13%]) or approximately less than 1 in
1000.
Censoring of stage H is an important problem in relation to determining child or adult status
for asylum seekers. Asylum seekers who are assessed as children have the right to stay in the
UK with the associated benefits of domestic care, education, and financial support.
The first approach was to follow the procedure first reported in the UK30. In this study it was
considered that once the whole sample of subjects in the RDS had achieved the status of
Stage H then any further (older) subjects were contributing to the data array inappropriately
and inflating the mean and median value. The identification of this age or censor point is the
oldest age for Stage G. For the UK Reference Data Set, it is 21.64 years. This is clearly
shown in Figures 3 and 4 where the mean value for the uncensored data is 22.68 years and
the mean value for the censored data is 19.73 years, a difference of 2.95 years. This is clearly
16
a difference which has an impact on estimates of the probability that a subject is over 18
years using current methodology26. These probability estimates for the 18 year threshold are
0.9533 [95.33%] for the uncensored value and 0.9512 [95.12%] for the censored values
respectively. This is a very small difference and is probably the result of the heavily skewed
data for stage H for both the uncensored and censored data.
The difficulty with this approach is that the calculation of the probability values are
dependent on the data array for Stage H being Normally distributed. As is shown in Figures
3 and 4 it is clear that this is not the case. For this reason it is necessary to use the Percentile
Summary Data. There is no simple calculation that indicates that the probability of a subject
being over 18. Instead, the investigator has to use the PERCENTRRANK.INC function in
Excel34 to identify the value in the data array that is closest to the 18 year threshold. For the
LL8Hm this is estimated as p = 0.7593 [75.93%] that the subject is older than 18 years or p =
0.2407 [24.07%] that the subject is under 18 years old. Because access to the original data
array is required this is more taxing than using the NORM.DIST function of Excel which
requires only the summary data. On logical grounds it gives a result consistent with the nonNormal distribution of Stage H.
In this study the abstracted data presented in this paper show a very wide span for the time
over which Stage H extends in the Lower Left Third Molar in the 1st nations people of
Canada. The main purpose of this paper is to establish how this has come about. In the data
extracted and calculated for females it extends from 14.43 years to 32.97 years, a spread of
18.54 years47. This is over three times the spread when compared to similar estimates for the
UK Caucasian RDS33 for which females extends from 16.50 years to 24.90 years, an interval
of 8.4 years.
To understand the implications of this, it is helpful to appreciate that the UK Caucasian RDS
is systematically redacted to remove unsuitable data that may inappropriately influence the
outcome summary data. This was achieved by first ensuring that any negative values arising
from mistaken data entry are corrected. Second, extreme values that are likely to be
untenable in terms of clinical growth and development are also excluded. The most efficient
way to do this is to limit the data set to ‘plus or minus’ three standard deviations from the
mean value for each of the TDSs present in the data set. As can be seen (Figure 2), the
Normal distribution is clearly discernible from the ± 3sd censored data for LL8Gm is a
17
considerable contrast with the uncensored data for LL8Hm (Figure 3) which is severely left
skewed. The reason for this is that the data in the upper region of the data set are from
mature subjects where root maturity was completed some months or even years previously.
Appropriate censoring of LL8Hm eliminates the redundant data but the distribution remains
non-Normal (Figure 4).
The identification of the end point i.e. when all third molars have become mature is
straightforward. The age at which Stage G ceases to be present is the age when Stage H is
present in 100% of the population is determined by the oldest age at which stage G is present.
This is represented by the 99.7th % Cumulative Probability Value (CPV) for Stage G in the
worksheet of summary data (Table 3).
This issue of utilizing data from the 0.05th to 99.5th CPV is potentially difficult. As explained
earlier, the data are censored at these two extremes to incorporate the information from TDS
that are developing normally. The use of CPV expands the range of the data to the population
estimates usually cover a noticeably wider range of ages than the sample data expressed in
percentiles. In this sense the CPV values are a more reliable estimate than the percentiles for
the data span in the population. However, clinically, in normal patients, such extreme values
have not been observed by any of the authors in a combined ‘lifework’ of 112 years.
It is recommended by the DARLInG team that the population estimates used are based on the
mid 50% of data spread around the mean value as 0.25 CPV to 0.75 CPV as this is usually
broader than the interquartile width that is based on percentile data (25th %-ile to the 75th %ile). Further work is in progress to identify the appropriate range of values from which a
DAE is made by carrying out estimates of masked radiographs for whom the CA is known,
but not the assessors.
The censoring of data within the ± 3sd is sometimes criticized ‘… as it excludes potentially
important data … ’32. This is certainly a consideration as outliers will be discarded. In
clinical dental practice such outliers are regarded as ‘abnormal’. Therefore the decision to
exclude such extreme values is justified. Furthermore, as regards the RDS of UK Caucasians,
it is clear that for almost every data set, LL8Gf, LL8Gm etc. numbering up to 258 data sets is
within the upper and lower limits set by the ± 3sd33. That is to say the upper and lower limits
set by this calculation are, respectively lower and higher than the values retrieved from the
18
DARLInG data set. Thus the calculated limits of ± 3sd are more extreme than the minimum
and maximum values obtained in the DARLInG UK Caucasian Dataset. The concern about
excluding high or low values is a theoretical rather than a practical issue.
A remaining concern is the confusion often apparent between the burden of proof for criminal
cases and civil cases in the law courts. For criminal cases the legal dictum is ‘beyond
reasonable doubt’. In terms of probability this of the order of 0.92 (92%) certainty48. For
civil cases the burden of proof is greater than 50%. This is described as ‘the balance of
probability in the UK, and ‘the preponderance of the evidence’ in the USA50. This concept
needs to be considered within the framework of legal proceedings51.
It is to provide information on these different levels of proof between Criminal proceedings
and civil proceedings. An interesting corollary to this is that subjects accused of a serious
crime are subject to the ‘beyond reasonable doubt’ dictum, but if, subsequent to conviction
are claiming to be under 18 years of age, subject to the ‘balance of probability’ threshold
when the court determines if the convicted subject should be detained under the adult penal
system or the ‘youth court’ penal system. This is an example of criminal justice and civil
justice combining to ensure just and fair treatment of the convict subject.
A comment on other final stage systems and their utility is tempting but outside the scope of
this paper as these used systems of tooth development using 10, 12, or even 15 stages52
although a cursory examination of these papers reveals similar problems with censoring the
final stage of development of the third permanent molars.
It is also clear that the distribution of data for AaA for Stage H is not Normally distributed.
This is because the usual growth change of the data is that a tooth will transfer, for example,
from stage F to Stage G thus there is a natural ‘entry’ and ‘exit’ pattern which results in a
Normal distribution (Figure 2). However, even where the AaA data for LL8Hm censored is
plotted, the resulting graph still appears top heavy over the upper fifty percent (Figure 4).
This is because in normal circumstances the numbers in the upper half of the population
distribution reduce as the tooth transforms into the next dental stage. The consequence of this
is the increased number of Stage H in the upper part of the plot.
19
The main finding from this review is that the age span for Stage H is, for many papers, too
large to be realistic. This is because of a failure by investigators to scrutinize the AaA values
close to the upper boundary of Stage H. Failure to do this has resulted in inappropriate
censoring of the data. The consequence of this is that the mean age at attainment for Stage H
is usually too large. The implications of this are that subjects at the 18 year threshold are
assigned an age that is too high and therefore possibly unfair to the subject. This injustice
translates into a probability value that is too high. The details of this are in a paper currently
in preparation (Lucas VS personal communication)
The recommendations on using summary data based on percentiles is not new, this has been
used extensively by the team in Germany for stage H11, 47. However, no detail of the
implications of using the extreme values (Minimum and Maximum) has previously been
discussed. In the present review it is discussed in detail in relation to the Normal distribution
summary data and the Percentile summary data.
Finally, this raises the issue of the numerical status of estimates of the median (and mean) of
Stage H. An average value provided with 100% CI (for criminal cases) or 50% CI (for civil
cases) rests on an estimate for a single tooth in a single individual only. This is because a
subject with an estimated age of 19.64 years and with a 50% confidence interval from17.94
years to 20.67 years indicates that the individual subject falls within these values or could be
higher. This ambiguity means that once estimated these values are not proper numbers in the
sense that they can be added together to provide an averaged age for developing third molars.
Further work is in progress to test the reliability of age estimations using the lower third
molar in a blind study of assessments, and the impact of other age markers related to third
molars, is in progress.
Conclusions
1. The summary data for Tooth Development stages should be clearly limited by an
appropriate statistical method viz. ±3sd.
2. For the final stage of tooth development the use of ±3sd is inappropriate as the data
for this final stage of dental development does not display a Normal distribution.
3. The appropriate censoring of Stage for Lower Third Molars brings down the
estimated mean or median age to below 20 years.
20
4. The way in which data for the AaA for Stage H of third molars needs to be managed
in such a way that the interests of justice and the needs of asylum seekers are met
appropriately and equitably need to be reviewed and restated to take proper account of
appropriately censored data for the final stage of dental development.
Acknowledgements
We are grateful to Dr Eric Whaites, Clinical Director of the Dental Hospital, for granting
access to the dental radiological archive at Guy’s and St Thomas’ NHS Trust.
This article is based on a paper presented to the American Academy of Forensic Science at a
meeting held in Orlando, Florida, USA in February 2015.
References
1.
Saunders E. The teeth a test of age, considered with reference to the factory children:
Addressed to both Houses of Parliament, Westminster, London UK. 1837. London: H.
Renshaw.
2.
Demirjian A, Goldstein H, Tanner JM. A New System of Dental Age Assessment. Hum
Biol. 1973; 45(2): 211-27.
3.
Gustafson G. Age determination on teeth. J Amer Dent Ass. 1950; 41: 45-54.
4.
Roberts GJ, McDonald F, Lucas VS, Neill M. The weighted average method 'WAM' for
Dental Age Estimation: a simpler method for children at the 10 year threshold. J Forensic
Leg Med. 2014; 26: 56-60.
5.
Mincer HH, Harris EF, Berryman HE. The ABFO study of third molar development and
its use as an estimator of chronological age. J Forensic Sci. 1993; 38(2):379-90.
21
6.
Garamendi PM, Landa MJ, Ballasteros J, Solano MA. Reliability of methods applied to
assess age minority in living subjects around 18 years old. A survey on a Moroccan
origin population. Forensic Sci Int. 2005: 154: 3-12.
7.
Blankenship JA, Mincer HM, Anderson KM, Woods MA, Burton EL. Third molar
development in the estimation of chronological age in American Blacks as compared
with Whites. J Forensic Sci. 2007; 52(2): 428-33. doi 10.1111/j.1556-4029.2006.00356.x
8.
Thevissen P. 2013. Dental Age Estimation in Sub-Adults: striving for an optimal
approach. Acta Biomedica Lovaniensia 605. Leuven University Press.
ISBN 978 94 6165 087 0.
9.
Lewis JM, Senn DR. Dental Age Estimation. In – Manuel of Forensic Odontology.
American Society of Forensic Odontology. 2013. Chapter 8. New York. ISBN 978-14398-5133-3.
10.
Roberts GJ & Petrie A. Dental Age Assessment: A Practical Approach. 2011.
Chapter 11 in Digital Forensic Science. IGI Global USA. ISBN 978-1-60960-483-7.
11.
Olze A, Schmeling A, Taniguchi M, Maeda H, Niekerk P van, Wernecke KD,
Geserick G. Forensic age estimation in living subjects: the ethnic factor in wisdom
tooth mineralization. Int J Leg Med. 2004; 118: 170-3.
12.
Knell B, Ruhstaller P, Prieels F, Schmeling A. Dental age diagnostics by means of
radiographical evaluation of the growth stages of lower wisdom teeth.
Int J Leg Med. 2009; 123: 465-69.
13.
O'Brien PMS, Broughton Pipkin F. Introduction to research methodology.
2nd Edition 2007. Royal College of Obstetricians and Gynaecologists Press.
London. ISBN 978-1-904752-01-1.
14.
Gore SM & Altman DG. Statistics in Practice. 1982. British Medical Association.
London. ISBN 0 7279 0085 4
22
15.
Kirkwood BR, Sterne JAC. Essential Medical Statistics. 1988. Blackwell Publishing.
Oxford. ISBN 0-86542-871-9.
16.
Prieto JL, Barberia E, Ortega R, Magana C. Evaluation of chronological age based
on third molar development in the Spanish population.
Int J Leg Med. 2005; 119(6):349-54.
17.
Thevissen PW, Fieuws S, Willems G, et al. Human dental age estimation using third
molar developmental stages: accuracy of age predictions not using country specific
information. Forensic Sci Int. 2010; 201: 106-11.
18.
Altalie S, Thevissen P, Fieuws S, Willems G. Optimal age estimation practice in
United Arab Emirates children. J Forensic Sci. 2014; 59(2): 383-85.
19.
Thevissen PW, Fieuws S, Willems G. Third molar development: measurements versus
scores as age predictor. Arch Oral Biol. 2011; 56: 1035-40.
20.
Zandi M, Shokri A, Malekzadeh H, Amini P, Shafiey P. Evaluation of third molar
development and its relation to chronological age: a panoramic radiographic study
J Maxillofac Oral Surg. 2014. November 21. Epub. PMID 25409631.
22. Zeng DL, Wu ZL, Cui MY. Chronological age estimation of third molar mineralization
on Han in southern China. Int J Leg Med. 2010; 124(2): 119-23.
23.
Thevissen PW, Fieuws S, Willems G. Third molar development: evaluation of nine
tooth development registration techniques for age estimations.
J Forensic Sci. 2013; 58(2): 393-7.
24.
Gustafson G, Koch G. Age estimation up to 16 years of age based on dental
development. Proc Finn Dent Soc. 1974; 25: 297-306.
25.
Mahtab M 2011 – Personal communication
26.
Senn DR, Weems RA. 2013. Manual of Forensic Odontology. 5th Edition. CRC
23
United States of America. ISBN 978-1-4398-5133-3.
27.
Mitchell JC, Roberts GJ, Donaldson ANA, Lucas VS. Dental age assessment (DAA)
for British Caucasians at the 16 year threshold. Forensic Sci Int 2009; 189: 19 – 23.
28.
Moze K, Roberts GJ. Dental Age Assessment (DAA ) of Afro-Trinidadian children and
adolescents. Development of a reference data set and comparison with Caucasians
resident in London UK. J Forensic Leg Med. 2012; 19: 272-79.
29.
Lee SS, Byun YS, Park MJ, Choi JH, Yoon CL, Shin KJ. The chronology of second
and third molar development in Koreans and its application to forensic age estimation.
Int J Leg Med 2010; 124:659-65.
30.
Boonpitaksathit T, Hunt N, Roberts GJ, Petrie A, Lucas VS. 2011 Dental age
assessment of adolescents and emerging adults in UK Caucasians using censored data
for stage H of third molar roots. Eur J of Orthod. 2010; 33(5): 503 –8.
31.
STATA Statistical Software. Issue 13.0 www.stata.com Texas. USA.
32.
Liversidge HM, Marsden PH. Estimating age and likelihood of having attained 18
years of age using mandibular third molars. Br Dent J. 2010; 209: E13.
32.
Altman D. Practical statistics for medical research. 1st edn. London: Chapman & Hall,
1991. p51 et seq.
33.
DARLInG Reference Data Set www.dentalage.co.uk/+R/RDS-UK-Caucasian
34.
Jelen B. Microsoft Excel 2013. QUE Publishing. USA. ISBN 978-0-78974857-7.
35.
Alsaffar H, Camilleri S. Dental Age Assessment of Maltese subjects between the ages
of 4 and 14 years. MSc Dissertation. University of Malta and King’s College
London 2013.
24
36.
El-Shewahi W, Camilleri S. Dental Age Assessment of Maltese subjects between the
ages of 15 and 26 years. MSc Dissertation. University of Malta and King’s College
London 2013.
37.
Jayaraman J. Development and Validation of Reference Dataset for Dental Age
Assessment in a Southern Chinese Population. PhD Thesis. 2014 University
of Hong Kong.
38.
Karimi A. Development of a Reference Data Set for Kuwaiti children, adolescents and
emerging adults agage assessment ed 3 – 26 years MSc dissertation 2015. King’s
College London.
39.
Oiknine S. Dental age assessment (DAA): development of a Reference Data set for
Israeli children between 3 and 26 years old. MSc dissertation. 2013.
King’s College London.
40.
Prieto JL, Barberia E, Ortega R, Magana C. Evaluation of chronological age based on
third molar development in the Spanish population.
Int J Leg Med. 2005; 119(6): 349-54.
41.
Alshihri AM, Kruger E, Tennant M. Western Saudi adolescent age estimation utilising
third molar development. Eur J Dent. 2014; 8(3):296-301.
42.
Knell B, Ruhstaller P, Prieels F, Schmeling A. Dental age diagnostics by means of
radiographical evaluation of the growth stages of lower wisdom teeth.
Int J Leg Med. 2009; 123: 465-9.
43.
Bai Y, Mao J, Zhu S, Wei W. Third-molar development in relation to chronologic age
in young adults of central China. J Huazh Uni Sci Tech Med Sci. 2008; 28(4):487-90.
44.
Lee SS, Byun YS, Park MJ, Choi JH, Yoon CL, Shin KJ. The chronology of second and
third molar development in Koreans and its application to forensic age estimation.
Int J Leg Med. 2010; 124:659-65
25
45.
Sisman Y, Uysal T, Yagmur F, Ramoglu SI. Third molar development in relation to
chronological age of Turkish children and young adults.
Angle Orthod. 2007; 77(6): 1040-45.
46.
Zeng DL, Wu ZL, Cui MY. Chronological age estimation of third molar mineralisation
on Han in southern China. Int J Leg Med. 2010; 124(2):119-23.
47.
Olze A, Pynn BR, Kraul V, Schulz R, Heinecke A, Pfeiffer H, Schmeling A. Studies
on chronology of third molar mineralisation in First Nations people of Canada.
Int J Leg Med. 2010; 124(5): 433-7
48.
Aitken CGG. Statistics and the evaluation of evidence for forensic scientists.
Statistics in Practice. 1995. John Wiley & Sons. Chichester. ISBN 0-471-95532-9
50.
Kadane JB. Statistics in the Law. 2008. Oxford University Press. Oxford.
52.
Roberts GJ, McDonald F, Lucas VS
51. Vito GF, Letessa EJ. Statistical applications in criminal justice. 1989. Law and
Criminal Justice Series. Sage Publications. Newbury Park. ISBN 0-8039-2983-8
~~~~~~~~~~~||||||||||~~~~~~~~~~~~