
On the Dispersive versus Arrhenius Temperature Activation of NBTI Time
Evolution in Plasma Nitrided Gate Oxides: Measurements, Theory, and
Implications
D. Varghese, D. Saha, S. Mahapatra, K. Ahmed#1, F. Nouri#1, and M. Alam#2
Department of Electrical Engineering, IIT Bombay, Mumbai 400076, India (91-22-25720408 / [email protected])
#1 Applied Materials, Inc., Santa Clara, CA; #2 ECE Department, Purdue University, W. Lafayette, IN
Abstract
Negative Bias Temperature Instability (NBTI) is studied
in p-MOSFETs having Decoupled Plasma Nitrided (DPN)
gate oxides (EOT range of 12 Å through 22 Å). Threshold
voltage shift (∆VT) is shown to be primarily due to interface
trap generation (∆NIT); no significant hole trapping (∆NOT)
is observed. ∆VT follows power-law time (t)
dependence and Arrhenius temperature (T) activation.
Introduction
NBTI is a serious reliability concern for p-MOSFETs [1].
Oxynitrides (required for suppressing boron penetration and
gate leakage) show worse NBTI than control oxides and have
attracted much attention [2-5]. It is important to correctly
measure and extrapolate the t evolution of ∆VT to accurately
determine device lifetime. However, this extrapolation is
complicated since (i) the impact of delay time on measurement
is not fully understood, (ii) the origin of ∆VT (∆NIT, which predicts t^n
dependence, or ∆NOT, which predicts log (t) dependence) is still
unresolved, and (iii) the nature of temperature dependence
(dispersive vs. Arrhenius activated) remains controversial.
It is therefore necessary to resolve these inconsistencies for
correct estimation of device lifetime.
Highlights of this work
It is shown that: (i) in addition to t-delay, measurement
voltage (VG,meas) and T strongly influence n, (ii) t-delay=0
measurements show power law t dependence for various
EOT, N2 dose, stress-VG and T, (iii) recently reported large
NBTI dispersion is an artifact of t-delay, (iv) for t-delay=0,
short and long time degradation respectively are dispersive
and Arrhenius activated (their relation is explained), (v) no
∆NOT is observed; ∆NIT alone governs the overall ∆VT, which
can hence be explained by the R-D model [10-12], and (vi) measured
n along with activation energy (EA) of diffusion uniquely
identify neutral molecular H2 as the diffusing species.
Impact of measurement delay
Fig.1 shows ∆VT (t) (= −(∆Idlin/Idlin0) * VGT0) from on-the-fly [6] and normal (with delay) Idlin measurements. t-delay increases n, which is known to be due to ∆VT recovery
during the stress “stopped” phase [6,7,11]. For identical t-delay, n also increases at lower |VG,meas| (Fig.1, LHS) and
higher T (Fig.1, RHS). This is expected, as ∆VT recovery,
believed to be due to passivation of broken Si- bonds [11,12], is
larger at higher T and lower |VG,meas| [1,7,13]. Fig.2
(LHS) shows ∆NIT (t) measured using charge pumping (C-P). An increase in n with higher t-delay and higher T is clearly
seen. Fig.2 (RHS) shows ∆NIT (t) simulated using the R-D
model with and without t-delay for different T. For t-delay=0, n~1/6 (for all T) as expected for H2 diffusion [11].
For t-delay>0, n increases at higher T, consistent
with the measured trends. Fig.3 shows n (from Idlin and C-P) versus T for various t-delay. The increase in n at higher T can
be clearly seen, and is more pronounced for larger t-delay. Further, as the average VG,meas for C-P (~0, mean value
of pulse) is lower than |VG,meas| for Idlin, n from C-P is
much larger than that from Idlin for a given t-delay and T.
These results are consistent with increased recovery at
larger t-delay, less negative VG,meas and higher T. Since
delay contaminates all measurements and results in
uncertain values of n, only “on-the-fly” Idlin [6] should be
used to quantify and interpret NBTI defect generation, as
has been done below.
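As an illustration of the quantities used above, the following minimal sketch converts a series of Idlin readings to ∆VT with the relation quoted in the text and extracts the power-law exponent n from a straight-line fit in log-log coordinates. The function names, the current and overdrive values, and the data are hypothetical and synthetic; the code is not the authors' extraction procedure.

```python
import math

def delta_vt_from_idlin(idlin, idlin0, vgt0):
    """Threshold shift from linear drain current, using the relation in the
    text: dVT = -(dIdlin / Idlin0) * VGT0, with dIdlin = Idlin - Idlin0.
    Magnitudes are used for current and gate overdrive (an assumption)."""
    return [-(i - idlin0) / idlin0 * vgt0 for i in idlin]

def fit_power_law(times, values):
    """Least-squares line through (ln t, ln value).
    Returns (n, prefactor) for the model value = prefactor * t**n."""
    xs = [math.log(t) for t in times]
    ys = [math.log(v) for v in values]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    n = sxy / sxx
    return n, math.exp(my - n * mx)

# Synthetic illustration only (not measured data).
IDLIN0 = 1.0e-4   # A, hypothetical pre-stress current
VGT0 = 1.5        # V, hypothetical initial gate overdrive
times = [10.0 ** (k / 4) for k in range(13)]            # 1 s to 1000 s
true_dvt = [5.0e-3 * t ** (1.0 / 6.0) for t in times]   # assumed n = 1/6
idlin = [IDLIN0 * (1.0 - dv / VGT0) for dv in true_dvt]

dvt = delta_vt_from_idlin(idlin, IDLIN0, VGT0)
n_fit, prefactor = fit_power_law(times, dvt)
print(f"fitted n = {n_fit:.4f}, prefactor = {prefactor:.3e} V")
```

Because the fit is linear in ln t, any recovery that lowers the early-time points more than the late-time points raises the fitted slope, which is the delay artifact described above.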
NBTI time evolution
Fig.4 shows ∆VT (t) for various T, plotted in log-log and
lin-log scales. It is obvious that NBTI follows power-law
(not log) time dependence. When plotted in log-log scale, n
shows a slight increase with T for short t, indicating weak
dispersive diffusion [7,9]. However, the T-dependence of n
completely disappears for longer t, and n becomes (slightly
less than) 1/6 for all T. Fig.5 shows the T dependence of
∆VT (t) for various EOT, N2 dose, and stress-VG. Long time
degradation clearly shows a universal Arrhenius, power-law (n~1/6) behavior. Contrary to [8], the strong power-law
time dependence suggests ∆VT is governed by ∆NIT and not
∆NOT (at least for DPN oxynitrides in this EOT range).
Furthermore, strong dispersion reported in literature [7,9] is
most likely a consequence of measurement delay. Finally,
we believe historically measured n (~1/5 – 1/3) for NBTI
with uncertain t-delay can all be attributed to this universal
n~1/6 behavior (for t-delay=0), which is explained below.
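To show why the value of n matters for lifetime estimation, the sketch below extrapolates a power law ∆VT(t) = ∆VT_ref (t/t_ref)^n to a failure criterion for two exponents. The reference shift, reference time, and failure criterion are hypothetical numbers chosen for illustration, and the extrapolation is along the time axis only, at a fixed stress condition.

```python
def extrapolated_lifetime(t_ref, dvt_ref, dvt_fail, n):
    """Time at which dVT reaches dvt_fail, assuming
    dVT(t) = dvt_ref * (t / t_ref)**n holds for all t."""
    return t_ref * (dvt_fail / dvt_ref) ** (1.0 / n)

# Hypothetical inputs: 20 mV shift after 1000 s, 60 mV failure criterion.
T_REF = 1.0e3      # s
DVT_REF = 0.020    # V
DVT_FAIL = 0.060   # V

life_zero_delay = extrapolated_lifetime(T_REF, DVT_REF, DVT_FAIL, 1.0 / 6.0)
life_with_delay = extrapolated_lifetime(T_REF, DVT_REF, DVT_FAIL, 0.25)
print(f"n = 1/6 : {life_zero_delay:.3e} s")
print(f"n = 0.25: {life_with_delay:.3e} s")
```

With these inputs the two exponents give lifetimes that differ by a factor of nine (3^6 versus 3^4 times the reference time), so an exponent inflated by measurement delay changes the extrapolated lifetime substantially.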
Explanation using “standard” R-D model
The Arrhenius T activated (longer time), power-law time
evolution of ∆VT (~∆NIT) can be explained by R-D model
[1,10-12]:
$$\Delta N_{IT} = \left(\frac{k_F N_0}{2 k_R}\right)^{1/2} \left[ D_0 \exp(-E_A/kT)\, t \right]^{n},$$
where kF, kR are the Si-H dissociation and passivation rate
constants, N0 is total Si-H bond density, and D = D0.exp(-EA/kT) is the diffusivity of released H species. Fig.6 shows
that long t (varying T) data can be uniquely scaled along X-axis and Y-axis directions to universal relations, which
yield EA for D and ∆VT respectively [1]. Obtained EA (D)
proves neutral H2 as the diffusion species [14]. This is also
consistent with n~1/6 power-law t dependence as predicted
by R-D model for H2 diffusion (Fig.2) [11]. Moreover, the R-D
model suggests EA (∆VT) = EA (D) * n if EA (kF) ~ EA (kR)
[1], as can also be seen from Fig.6.
0-7803-9269-8/05/$20.00 (c) 2005 IEEE
Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on January 3, 2009 at 00:42 from IEEE Xplore. Restrictions apply.
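The relation EA (∆VT) = EA (D) * n can be checked numerically. The sketch below takes the two activation energies annotated in Fig.6 (0.58 eV for the X-axis scale factor, 0.085 eV for the Y-axis scale factor), computes the exponent they imply, and verifies that a Y-axis Arrhenius factor equals the X-axis factor raised to the power n. The function name and the choice of 27 C and 100 C as the two temperatures are assumptions made for illustration.

```python
import math

K_B = 8.617333262e-5  # Boltzmann constant in eV/K

def arrhenius_factor(ea_ev, t_kelvin, t_ref_kelvin):
    """Ratio exp(-EA/kT) / exp(-EA/kT_ref): the scale factor that maps
    data taken at t_kelvin onto the reference temperature."""
    return math.exp(-ea_ev / K_B * (1.0 / t_kelvin - 1.0 / t_ref_kelvin))

EA_D = 0.58     # eV, X-axis (time) scaling, from Fig.6
EA_VT = 0.085   # eV, Y-axis (dVT) scaling, from Fig.6

n_implied = EA_VT / EA_D   # exponent implied by EA(dVT) = EA(D) * n
print(f"implied n = {n_implied:.3f} (compare 1/6 = {1.0 / 6.0:.3f})")

T_REF = 27.0 + 273.15   # K
T_HOT = 100.0 + 273.15  # K
x_factor = arrhenius_factor(EA_D, T_HOT, T_REF)
y_factor = arrhenius_factor(EA_D * n_implied, T_HOT, T_REF)
print(f"X-axis factor = {x_factor:.1f}, Y-axis factor = {y_factor:.3f}")
```

The implied exponent comes out near 0.147, which sits between the long-time slope of 0.14 annotated in the figures and the ideal value of 1/6.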
Interface-trap generation dominated NBTI
Fig.7 shows the T dependence of ∆VT (t) for devices with the same
EOT but various N2 dose. ∆VT for both low and high N2
dose (and their difference) shows similar EA (~EA (D) * n).
This proves that the increase in NBTI due to increased plasma
N2 dose is due to an increase in ∆NIT. To show the universality
that ∆NIT dominates ∆VT, a large number of samples with
different EOT and N2 dose (including control oxides) were
stressed at various T and oxide electric field (EOX). ∆VT (t)
was obtained using on-the-fly Idlin. For a few cases ∆NIT (t)
was obtained (separately) from C-P on larger EOT samples.
The results are shown in Figs. 8 and 9 (control and plasma
N2 data are plotted separately for clarity; the separation does not signify
actual magnitude).
Fig.8 shows X-axis scaling factors of T dependent ∆VT (t)
and ∆NIT (t) data. Very similar T activation (EA ranges from
0.54eV through 0.62eV) of all data clearly proves that ∆VT
~ ∆NIT, and plasma N2 and control oxides have very similar
T dependent kinetics. Since EA (kF) ~ EA (kR), T dependence
of ∆NIT is primarily due to that of D [1,9]. Moreover, that
the diffusing species is always neutral molecular H2 is also
unequivocally established (consistent EA (D) and n values).
Fig.9 shows EOX dependence of ∆VT and ∆NIT data. Since the
diffusing species is neutral H2, EOX dependence is due to
that of (kF/kR)^½ and has the form A.EOX^½ exp (B.EOX) [1,9].
Once again, qualitatively similar slope “B” suggests ∆VT ~
∆NIT, and plasma N2 and control oxides have similar EOX
dependent kinetics. We believe N2 incorporation impacts
“A”, possibly by reducing the Si-H bond dissociation
energy [15]. In short, control oxides and DPN oxides are
governed by very similar reaction and diffusion mechanics.
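The slope "B" can be extracted by linearizing the field dependence. The sketch below assumes the form ∆VT = A.EOX^½ exp (B.EOX), which is how the text and the Fig.9 axis label appear to read, and fits ln(∆VT/EOX^½) against EOX. The data are synthetic, generated with B = 0.4 (the value annotated in Fig.9, with units of cm/MV assumed) and an arbitrary A; the function name is hypothetical.

```python
import math

def fit_field_dependence(eox, dvt):
    """Fit dVT = A * sqrt(EOX) * exp(B * EOX) by linear regression of
    ln(dVT / sqrt(EOX)) on EOX. Returns (A, B)."""
    xs = list(eox)
    ys = [math.log(v / math.sqrt(e)) for e, v in zip(eox, dvt)]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return math.exp(my - b * mx), b

# Synthetic illustration only (not measured data).
A_TRUE = 1.0e-3   # arbitrary prefactor
B_TRUE = 0.4      # cm/MV, assumed units
eox = [6.0, 7.0, 8.0, 9.0, 10.0]   # MV/cm, the range shown in Fig.9
dvt = [A_TRUE * math.sqrt(e) * math.exp(B_TRUE * e) for e in eox]

a_fit, b_fit = fit_field_dependence(eox, dvt)
print(f"A = {a_fit:.3e}, B = {b_fit:.3f}")
```

In this parameterization a change in N2 dose that only shifts the intercept ln A leaves the fitted slope B unchanged, which is the comparison the text draws between control and plasma N2 films.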
Dispersion in n
The transition from dispersive (short t) to Arrhenius (long
t) T activation (Fig.4) can be interpreted rigorously, and is
qualitatively explained using Fig.10. In the absence of deep
trapping sites, H2 would diffuse via shallow hopping sites
(standard diffusion) and show weak T dependence. ∆NIT
equals total released H2 density (NH) and dNIT/dt = dFH/dx
= D0 d²NH/dx² (FH: H2 flux).
In the presence of deep traps at a single energy level Ea =
EA, trapping and detrapping of H2 (to and from this level)
influences the T dependence of diffusion. If NHF, NHT are
free and trapped H2 density (NH = NHF + NHT), then NHF/N0
= [NHT/NT] exp (-EA/kT), if fast equilibrium is assumed. N0,
NT are the density of shallow hopping sites and deep traps.
Therefore, dFH/dx = D0 d²NHF/dx² = D1 d²NH/dx² (as only
free H2 can diffuse). Assuming (N0/NT) exp (-EA/kT) << 1,
and by rearranging terms, D1 = D0 (N0/NT) exp (-EA/kT).
Hence, Arrhenius T activation is obtained. In such a case,
NIT time evolution can be explained by standard R-D model
solution [1, 10-12] and the universal scaling scheme (Fig.6)
will hold good [1,9].
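The "rearranging terms" step for the single-level case can be written out explicitly. The short derivation below uses only the relations stated above; the shorthand r is introduced here for convenience and is not a symbol from the paper.

```latex
% Single trap level at E_A, fast equilibrium between free and trapped H2.
% r is shorthand introduced for this sketch.
\begin{align*}
  r &\equiv \frac{N_0}{N_T}\exp\!\left(-\frac{E_A}{kT}\right),
  \qquad N_{HF} = r\,N_{HT}, \qquad N_H = N_{HF} + N_{HT} \\
  N_{HF} &= \frac{r}{1+r}\,N_H \;\approx\; r\,N_H \quad (r \ll 1) \\
  \frac{dF_H}{dx} &= D_0\,\frac{d^2 N_{HF}}{dx^2}
     \;\approx\; D_0\, r\,\frac{d^2 N_H}{dx^2}
  \;\;\Longrightarrow\;\;
  D_1 = D_0\,\frac{N_0}{N_T}\exp\!\left(-\frac{E_A}{kT}\right)
\end{align*}
```

The approximation r << 1 means that nearly all released H2 is trapped at any instant, so the effective diffusivity is the free-state diffusivity weighted by the small free fraction.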
In the presence of trap energy distribution up to infinity,
trapping and detrapping of H2 to and from multiple energy
levels determine T dependence of diffusion. Emission time
constants of these levels are t = (1/ν0) exp (E/kT), ν0 being the
attempt to escape frequency. As t increases, more and more
deep traps emit H2 and as a result, the energy centroid of
trapped H2, EM (t), goes down in energy. Using exponential
trap distribution as g(E) = (NT/E0) exp (-E/E0) and assuming
fast equilibrium, NHF/N0 = [NHT/(E0.g(EM))] exp (-EM(t)/kT),
where EM (t) = kT ln (ν0t). Rearranging, NHF = NH [N0/NT]
exp [(kT/E0) ln (ν0t)]/(ν0t), and dFH/dx = D0 d²NHF/dx² = D2
d²NH/dx², where D2 = D0 (N0/NT) (1/ν0t)^[1 – kT/E0], which
is a signature of dispersive diffusion [7,16].
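The rearrangement leading to D2 can be followed term by term. The sketch below starts from the stated equilibrium relation and the exponential trap distribution, and assumes, as in the single-level case, that trapped H2 dominates so that NHT is approximately NH.

```latex
% Exponential trap distribution; demarcation energy E_M(t) = kT ln(nu_0 t).
% Assumes N_HT ~ N_H (most released H2 is trapped).
\begin{align*}
  E_0\,g(E_M) &= N_T \exp\!\left(-\frac{E_M}{E_0}\right) \\
  \frac{N_{HF}}{N_0} &= \frac{N_{HT}}{N_T}
     \exp\!\left(\frac{E_M}{E_0}\right)\exp\!\left(-\frac{E_M}{kT}\right) \\
  \exp\!\left(\frac{E_M}{E_0}\right) &= (\nu_0 t)^{kT/E_0},
  \qquad
  \exp\!\left(-\frac{E_M}{kT}\right) = \frac{1}{\nu_0 t} \\
  N_{HF} &\approx N_H\,\frac{N_0}{N_T}
     \left(\frac{1}{\nu_0 t}\right)^{1 - kT/E_0}
  \;\;\Longrightarrow\;\;
  D_2 = D_0\,\frac{N_0}{N_T}\left(\frac{1}{\nu_0 t}\right)^{1 - kT/E_0}
\end{align*}
```

The time exponent 1 - kT/E0 depends on T, which is why the power-law slope of ∆NIT becomes temperature dependent in the dispersive regime.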
For a real system with finite distribution of traps up to EA,
diffusion is dispersive till t=t0, as long as EM(t) reduces with
t. This transforms to Arrhenius activated diffusion for
t>t0 as steady state is attained [EM(t0)~EA]. The transition
time is t0 = (1/ν0) [exp (EA/kT)]^[1/(1-kT/E0)], and reduces at
higher T. For EA = 0.58eV, Fig.4 predicts ν0 ~ 10^12 s^-1 and
E0 = 80meV, consistent with published values. (Accurate
determination of t0 can be influenced by non-equilibrium H
to H2 conversion, which can also affect short t slope).
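The expressions for D1, D2 and t0 can be evaluated directly. The sketch below uses the parameter values quoted in the text (EA = 0.58 eV, ν0 = 10^12 s^-1, E0 = 80 meV) and checks two properties: that the dispersive and Arrhenius diffusivities coincide at t = t0, and that t0 shortens at higher T. The function names and the two temperatures are assumptions for illustration, and diffusivities are expressed in units of D0 (N0/NT), which cancels in t0.

```python
import math

K_B = 8.617333262e-5  # Boltzmann constant in eV/K
EA = 0.58             # eV, from the text
NU0 = 1.0e12          # 1/s, from the text (order of magnitude)
E0 = 0.080            # eV, from the text

def d1_normalized(t_kelvin):
    """Arrhenius diffusivity D1 / [D0 (N0/NT)] = exp(-EA/kT)."""
    return math.exp(-EA / (K_B * t_kelvin))

def d2_normalized(time_s, t_kelvin):
    """Dispersive diffusivity D2 / [D0 (N0/NT)] = (1/nu0 t)^(1 - kT/E0)."""
    return (1.0 / (NU0 * time_s)) ** (1.0 - K_B * t_kelvin / E0)

def transition_time(t_kelvin):
    """t0 = (1/nu0) [exp(EA/kT)]^(1/(1 - kT/E0)); valid only for kT < E0."""
    kt = K_B * t_kelvin
    return math.exp(EA / kt / (1.0 - kt / E0)) / NU0

for celsius in (27.0, 100.0):
    temp = celsius + 273.15
    t0 = transition_time(temp)
    print(f"T = {celsius:5.1f} C: t0 = {t0:.3g} s, "
          f"D2(t0) = {d2_normalized(t0, temp):.3e}, "
          f"D1 = {d1_normalized(temp):.3e}")
```

Since t0 is inversely proportional to ν0 and depends exponentially on E0, the absolute values printed here shift strongly with small changes in those parameters; only the trend with T and the continuity of D at t0 should be read from the sketch.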
Conclusion
NBTI time evolution is studied for plasma N2 oxynitride
p-MOSFETs. NBTI shows power-law time dependence for
a wide range of EOT, N2 dose, stress-VG and T. It is shown
that delay during measurements results in an inaccurate power-law slope, which depends on delay time, measurement VG
and T. For zero delay measurements, short-time data show
dispersive but long-time data show Arrhenius T activation.
It is clearly shown that strong long-time dispersion is an
artifact of measurement delay. A theory for the transition from
dispersive to Arrhenius T activation is provided. NBTI is
governed by interface trap generation, which follows the R-D
model. It is shown that DPN and control samples show very
similar T and EOX dependence of NBTI. The obtained n and EA
for diffusion suggest that molecular H2 diffusion governs the
long-time evolution of NBTI.
References
[1] S. Mahapatra et al., p.105, IEDM 2004
[2] Y. Mitani et al., p.509, IEDM 2002
[3] Y. Mitani et al., p.117, IEDM 2004
[4] V. Huard et al., p.40, IRPS 2004
[5] H. Aono et al., p.23, IRPS 2004
[6] S. Rangan et al., p.341, IEDM 2003
[7] B. Kaczer et al., p.381, IRPS 2005
[8] M. Dennis et al., p.109, IEDM 2004
[9] M. Alam et al., Micro. Reliability, v.45, p.71, 2005
[10] K. Jeppson et al., JAP, p. 2004, 1977
[11] S. Chakravarthi et al., p.273, IRPS 2004
[12] M. Alam, p.345, IEDM 2003
[13] S. Mahapatra et al., SSDM 2005
[14] M. L. Reed et al., JAP, p.5776, 1988
[15] S. S. Tan et al., p.70, SSDM 2003
[16] D. Monroe, Solid State Comm., p.435, 1986
[Figure 1: plots of ∆VT (V) versus stress time (s).]
Fig.1. ∆VT (t) from Idlin measurements without and with
delay. (LHS) Different measurement bias, (RHS) different
measurement T. Slope increases for higher delay time,
higher T and lower measurement VG.
[Figure 2: plots of ∆NIT versus stress time (s); (LHS) C-P measurement, (RHS) R-D simulation, H2 molecule.]
Fig.2. (LHS) T dependence of ∆NIT (t) measured using C-P for
different t-delay. Increase in slope at larger T is more pronounced
when delay is high. (RHS) Numerical solution of R-D model with
and without delay. T-dependent slope observed for non-zero delay.
[Figure 3: plot of time exponent versus temperature (°C).]
Fig.3. T dependence of power-law time exponent
from Idlin and C-P measurements. Higher dispersion
(increase in n with T) is seen for higher measurement
delay. C-P slopes are greater than Idlin slopes for
same T and t-delay.
[Figure 4: plots of ∆VT (V) versus stress time (s), in log-log and lin-log scales.]
Fig.4. T dependence of ∆VT (t) measured using on-the-fly Idlin
technique, showing power-law (not log) time dependence. Short
time (t<t0) data shows higher slope at higher T (dispersive). Long
time data shows identical slope for all T (Arrhenius activation).
[Figure 5: plots of ∆VT (V) versus stress time (s).]
Fig.5. T dependence of ∆VT (t) for various stress-VG, N2 dose and EOT, measured using on-the-fly Idlin technique. Non-dispersive power law behavior is universally observed.
[Figure 6: (LHS) plot of ∆VT (scaled, a.u.) versus time (s); (RHS) plot of scale factor (a.u.) versus 1/kT (eV^-1). Annotations: X-scale EA=0.58eV, Y-scale EA=0.085eV.]
Fig.6. (LHS) Universal scaling scheme: ∆VT (t) data for various T is scaled along X
and Y-axis directions to universal relations. (RHS) T activation of X and Y-axis scale
factors. Obtained EA from X-axis scaling suggests molecular H2 diffusion. EA (Y-scale) = EA (X-scale) * n as predicted by R-D model [1].
[Figure 7: plot of ∆VT (V) versus 1/kT (eV^-1).]
Fig.7. T activation of ∆VT data for various
N2 dose. Lower and higher N2 dose (and
their difference) show identical EA, similar
to that of control oxide.
[Figure 8: plot of X-axis scale factor (a.u.) versus 1/kT (eV^-1); legend entries given as Probe / EOT / Dose (cm^-2) / EOX (MV/cm).]
Fig.8. X-axis scaling (like Fig.6) factors for T dependent ∆VT
and ∆NIT data, measured on films having different EOT and N2
dose. Very similar T dependence is observed for all films. Line
is best fit to control oxide data, and can be used as eye-guide
for plasma oxynitride data.
[Figure 9: plot of ∆VT / [A.EOX^0.5] (a.u.) versus oxide field, EOX (MV/cm); legend entries given as Probe / EOT / Dose (x10^15 cm^-2) / T.]
Fig.9. EOX dependence of ∆VT and ∆NIT data, measured
on films having different EOT and N2 dose. Very similar
EOX dependence is observed for all films. Line is best fit
to control oxide data, and can be used as eye-guide for
plasma oxynitride data.
[Figure 10: schematic panels (a)-(d), each showing NH versus distance in oxide, the trap energy Ea, and ∆NIT (log scale) versus time (log scale) for T1 and T2. Panel labels: (a) EA=0, D = D0 = µ.kT/q; (b) Ea = EA (single level), D1 = D01 exp (-EA/kT); (c) 0 < Ea < ∞, D2 = D01 (1/ν0t)^[1 - kT/E0]; (d) 0 < Ea < EA, D = D2 (0 < t < t0), D = D1 (t > t0), with t = t0 (T1) and t = t0 (T2) marked.]
Fig.10. Schematic of H2 diffusion process: (a) without trapping, (b) with single energy trap at EA, (c) with distribution (shape factor E0)
of traps up to ∞, and (d) with finite distribution of traps up to EA. Resulting T dependence of ∆NIT: (a) weak (Einstein relationship), (b)
Arrhenius activated (c) dispersive, and (d) dispersive at short time (t< t0), non-dispersive Arrhenius-like at longer time (t>t0).