On the Time-Frequency Detection of Chirps

Applied and Computational Harmonic Analysis 6, 252–281 (1999)
Article ID acha.1998.0254, available online at http://www.idealibrary.com on
On the Time–Frequency Detection of Chirps1
Eric Chassande-Mottin and Patrick Flandrin
Laboratoire de Physique (URA 1325 CNRS), Ecole Normale Supérieure de Lyon,
46 allée d’Italie, 69364 Lyon Cedex 07, France
E-mail: [email protected]; [email protected]
Communicated by Stephane G. Mallat
Received October 8, 1997; revised March 12, 1998
The question of detecting a chirp in the time–frequency plane is addressed.
Strategies based on line integration are discussed with respect to optimality and
adequacy of a representation to a given chirp. Linear and power-law chirps are
considered in some detail and a possible application of affine distributions to the
detection of gravitational waves is proposed, together with an effective implementation by means of reassigned spectrograms. © 1999 Academic Press
1. INTRODUCTION
Roughly speaking, “chirp” signals (or “chirps,” for short) correspond to waveforms
whose expression can be written in the time domain as
x~t! 5 a~t!e iw~t!,
(1)
with a(t) some positive, low-pass, and smooth amplitude function whose evolution is
slow as compared to the oscillations of the phase w (t). Defined this way, chirps are
intended to serve as models for monocomponent signals modulated in both amplitude and
frequency, their “instantaneous” frequency being assumed to be related to the “local”
oscillations of the phase.
Chirps are ubiquitous in nature. They can be observed in animal communication (birds,
frogs, whales, etc.) and echolocation (bats), geophysics (whistling atmospherics), astrophysics (gravitational waves radiated by coalescing binaries), acoustics (propagation of
impulses in dispersive media), or biology (epileptic seizure activity in EEG data, uterine
contractions in EMG, etc.). They are also extensively used in manmade systems, such as
radar and sonar, or in the nondestructive evaluation of materials and seismic exploration.
1
This work has been supported in part by CNRS (GdR-PRC ISIS and GREX). It was presented in part at the
4th World Congress of the Bernoulli Society, Vienna (Austria), August 1996.
1063-5203/99 $30.00
Copyright © 1999 by Academic Press
All rights of reproduction in any form reserved.
252
TIME–FREQUENCY CHIRP DETECTION
253
Intuitively, chirps x(t) call for a time–frequency description in which a properly defined
joint representation r x (t, f ) should mainly exist—in the time–frequency plane—in a
narrow neighborhood of a characteristic time–frequency curve +, interpreted either as an
“instantaneous frequency” (frequency as a function of time) or—from a dual
perspective—as a “group delay” (time as a function of frequency). Assuming this holds,
it therefore becomes very natural to propose heuristic schemes aimed at chirp detection by
searching for such a time–frequency localization in a time–frequency distribution of an
observation r(t), e.g., by using as test statistics
L~r! 5
E
r r~t, f !
+
and comparing it to some threshold based on noise-only assumptions.
Moreover, in the case where the curve + also depends on some unknown vector of
parameters u, introducing the parameterized quantity
L~r; u ! 5
E
r r~t, f !
+~u !
and looking for its maximum over u should allow not only for the detection of x(t) but
also for the estimation of u, such a strategy being reminiscent of a generalized Radon or
Hough transform.
Beyond heuristic considerations, however, the rationale for using test statistics as above
must be questioned and justified. In this respect, and for a given chirp to detect, three main
questions have to be addressed:
1. Which time–frequency representation should be used to make sense of the idea of
time-frequency localization?
2. How can a heuristic strategy based on some path integration in the time–frequency
plane be made optimal in some precise statistical sense?
3. What can be gained from a time–frequency formulation of optimal chirp detection?
The two first questions are thoroughly addressed in this paper and, although the third
one will not be considered in detail, the results reported here will provide the necessary
hints for justifying the usefulness of a time–frequency reformulation of chirp detection,
especially in terms of versatility and robustness. Special emphasis will be placed on the
specific case of “power-law” chirps, because of their importance in the context of
gravitational wave detection, but before proceeding to the power-law case, we will first
introduce some definitions (Section 2) and begin the discussion with the more simple case
of linear chirps, for which results have been known for a long time (Section 4). This will
offer the guidelines for considering later desired generalizations to nonlinear situations
such as power-law chirps (Section 5). Finally, a specific example related to the detection
of gravitational waves expected to be radiated by coalescing binaries will be discussed and
illustrated (Section 6).
It must be emphasized that the idea of using a time–frequency strategy for detecting
254
CHASSANDE-MOTTIN AND FLANDRIN
chirps is an old story and that various examples of application of such an approach have
already been proposed in the literature, e.g., in [4, 5, 12, 13, 15, 25, 28] or, more recently,
in [20, 21]. Moreover, many results which are needed for addressing the time–frequency
detection problem have also been treated, per se, in the individual contexts of detection
theory or time–frequency analysis (surveys can be found, e.g., in [8, 9, 14]). Most of them,
however, will be recalled (or even restated) in the following, the main objective of the
paper being to put together various ingredients and to combine them in a coherent fashion.
2. CHIRPS
Because of their great importance, chirps deserve, of course, a more precise and more
rigorous definition than the one given above. A sophisticated mathematical treatment of
chirps can be found in [22], whereas a discussion on the possibility of interpreting
representations such as (1) in terms of instantaneous amplitude and frequency is given in
[29]. We will not enter here, however, into the subtleties of both approaches and, when
necessary, we will only make use of the following definitions:
DEFINITION 1. A signal x(t) is said to be a chirp if it admits a representation as in (1),
with a(t) and w (t) such that
U a~t!ȧ~t!ẇ~t!U ! 1
and
U ẇẅ~t!~t!U ! 1,
2
where “ ˙ ” and “ ¨ ” stand for the first and second derivatives, respectively.
The two conditions above aim at formalizing the idea of having fast oscillations under
a slowly varying envelope. The first condition guarantees that, over a (local) pseudoperiod T(t) 5 2 p / ẇ (t), the amplitude a(t) experiences almost no relative change,
whereas the second condition imposes that T(t) itself is slowly varying, thus giving
meaning to the notion of pseudo-period.
DEFINITION 2. A chirp x(t) is said to be analytic if it is such that Re{ x(t)} and
Im{ x(t)} form a Hilbert transform pair.
An equivalent characterization of analytic chirps amounts to saying that their spectrum
is nonzero for positive frequencies only.
DEFINITION 3. Given an analytic chirp x(t), the instantaneous amplitude a x (t) and
instantaneous frequency f x (t) of Re{ x(t)} are given, respectively, by a x (t) 5 ux(t)u and
1
fx ~t! 5
ẇ~t!.
2p
TIME–FREQUENCY CHIRP DETECTION
255
DEFINITION 4. Given an analytic chirp x(t), the spectral envelope B X ( f ) and group
delay t X ( f ) of Re{ x(t)} are given, respectively, by B X ( f ) 5 uX( f )u and tX ~ f !
1
Ċ~ f !, with X( f ) the Fourier transform of x(t) and C( f ) the phase of X( f ). 2
5 2
2p
Moreover, we will only consider strictly monotonic chirps, i.e., chirps such that the
instantaneous frequency f x (t) and the group delay t X ( f ) are invertible functions.
Different types of chirps can be considered, depending on the form of a(t) and/or w (t).
We will adopt the following conventions:
DEFINITION 5. A chirp is said to be a linear chirp if it admits the representation (1) with
w (t) a quadratic polynomial in t,
w ~t! 5 2 p
Sa
2
D
t2 1 bt 1 g ,
with a, b, and g [ R, and a Þ 0.
It should be noted that, by construction, a linear chirp x(t) defined this way has no
1
ẇ~t! 5 at 1 b does not,
reason to be analytic, with the consequence that the quantity
2p
in general, identify to the actual instantaneous frequency of the real-valued signal
Re{ x(t)}. The actual conditions under which a linear chirp is almost analytic can be made
precise in some cases, when an explicit model is given for the amplitude a(t). In
particular, in the important case of a Gaussian amplitude, we can easily prove that a linear
2
chirp with a Gaussian amplitude e 2 pd t becomes almost analytic (i.e., almost vanishes for
negative frequencies) in the narrow-band limit where (a2 1 d2)/db2 3 0. This follows
from a direct calculation according to which
uX~ f !u 5 Ce 2p~d/~a 1d !!~ f2b! .
2
2
2
We get from this result that the central frequency of a chirp with a Gaussian amplitude is
b, whereas its bandwidth is proportional to (d 1 a2/d)1/2, whence the narrow-band
condition.
The situation of quasi-analyticity of linear chirps contrasts with that of power-law
chirps, which are analytic by construction and whose definition is the following:
DEFINITION 6. A chirp is said to be a power-law chirp (of indices r { R and k # 0)
if its spectrum is nonzero for positive frequencies only and if it admits the frequency
representation
X r,k~ f ! 5 Cf 2~r11!e i Ck~ f !U~ f !,
(2)
2
Throughout the paper, we will adopt the convention of using a lower case symbol for representing a quantity
in the time domain and the corresponding upper case symbol for denoting its Fourier transform, in the frequency
domain.
256
CHASSANDE-MOTTIN AND FLANDRIN
with C k ( f ) 5 22 p (cf k 1 t 0 f 1 g ) if k , 0, C 0 ( f ) 5 22 p (c log f 1 t 0 f 1 g ), C,
c, t 0 , g { R and where U( z ) stands for the unit step function.
A power-law chirp therefore is characterized by a group delay of the form t X ( f ) 5 t 0
1 ckf k21 . Although the definition could be extended to positive k’s, we will restrict
ourselves in the following to the case k # 0, for which the group delay corresponds to
some generalized hyperbola in the time-frequency plane.
3. DETECTION
3.1. Optimum Detection
Signal detection is usually considered from the point of view of the binary hypothesis
testing problem (see, e.g., [37])
H 0: r~t! 5 n~t!
H 1: r~t! 5 n~t! 1 s~t!,
with 2T/ 2 # t # T/ 2, and where s(t) is the reference signal to detect (supposed to be
known and of finite energy over [2T/ 2, T/ 2]), n(t) is some additive noise, and r(t) is
the available observation upon which the decision has to be taken.
Given this framework, designing an “optimal” detector depends not only on the a priori
knowledge one may have on the signal and on the noise, but also on the choice of a
criterion for optimality. A relevant concept in such a search for optimality is that of the
“likelihood ratio test” (LRT), which essentially consists in evaluating the test statistics
l ~r! 5
p 1~r!
,
p 0~r!
where p 0 (r) and p 1 (r) stand for the conditional probability density functions of the
observation under H 0 and H 1 , respectively. Once the LRT is computed, the detection
itself amounts to comparing it with a threshold and to deciding that the expected signal is
indeed present when the threshold is exceeded.
We are interested here in the case where the expected signal expresses as
s~t! 5 x~t; u !e ig,
where u is a vector of unknown parameters that we may wish to estimate, and g some
unknown random phase, uniformly distributed over [0, 2p], that we would like to
eliminate. In such a situation, the notion of LRT must be extended to that of generalized
LRT (GLRT), defined as
1 * 20p p 1~ru g !d g
l̃ ~r; u ! 5
.
2p
p 0~r!
257
TIME–FREQUENCY CHIRP DETECTION
Given this modified test statistics, detection is still based on a comparison with a
threshold, and (maximum likelihood) estimation can be simultaneously achieved according to
û 5 arg max l̃ ~r; u !.
u
In order to get an explicit form for the GLRT, some further assumptions are necessary
about the statistics of the additive noise. For the sake of simplicity, n(t) will be assumed
to be zero-mean, Gaussian, and white, i.e., such that
E$n~t!n~s!% 5 N 0d ~t 2 s!
for any t and s in R, with E{ z } the expectation operator. Such an assumption allows for
simplifications in the writing of the GLRT, since we have [37]
H E
1
p 1~ru g !
5 exp 2
p 0~r!
N0
J
T/ 2
~ur~t! 2 x~t; u !e igu 2 2 ur~t!u 2!dt .
2T/ 2
After some manipulations, the GLRT follows as
l̃ ~r; u ! 5 e 2Ex~u !/ 2N0
1
2p
E H
2p
exp
0
J
1
~F~ u !e 2ig 1 F~ u !e ig! d g ,
N0
with E x ( u ) the signal energy and
F~ u ! 5
E
T/ 2
r~t!x~t; u !dt.
2T/ 2
Expressing the above quantity in polar form as F( u ) 5 uF( u )ue i w F( u ) and reorganizing
terms, we are led to
l̃ ~r; u ! 5 e 2Ex~u !/ 2N0
1
2p
E H
2p
0
exp
J
2uF~ u !u
cos~ w F~ u ! 2 g ! d g ,
N0
a result which can be rewritten as
l̃ ~r; u ! 5 e 2Ex~u !/ 2N0I 0
S
D
2uF~ u !u
,
N0
where I 0 ( z ) is the modified Bessel function of first kind [1].
The remarkable point is that, I 0 ( z ) being a monotonic increasing function, the test
statistics reduces to compare uF( u )u (or any monotonic increasing function of uF( u )u) with
258
CHASSANDE-MOTTIN AND FLANDRIN
a threshold. It follows therefore that the basic ingredient for the considered GLRT
detection may simply reduce to
L~r; u ! 5
UE
T/ 2
U
2
r~t!x~t; u !dt .
2T/ 2
Concerning estimation, however, some care must be taken since maximizing over u the
simplified statistics L(r; u ) in place of the exact one l̃ (r; u ) implicitly assumes that the
signal energy E x ( u ) does not depend upon u.
The strategy invoked here is exceedingly simple, since it only amounts to correlating
the observation with a replica of the expected waveform, but one should not forget that
this is so because of the many assumptions which have been made, especially that of
Gaussianity. As far as interpretation is concerned, one can note that the GLRT detector
admits a companion interpretation in terms of “matched filtering,” a concept based on the
idea of filtering the observation in such a way that the signal-to-noise ratio (i.e., the
contrast between the two competing hypotheses) is maximized at the output of the filter.
Because of the corrupting random phase, the optimum GLRT detector turns out to
coincide with a matched filter followed by an envelope detector, a structure referred to as
“quadrature matched filtering [37].”
Until now, the additive noise has been assumed to be white. In the more realistic
situation where the noise, while stationary and zero-mean, is colored, the same strategy
can still be applied, mutatis mutandis, under the assumption of Gaussianity, provided that
the observation must first be whitened. More precisely, and provided that the time support
of the signal to detect is entirely contained in the observation interval [2T/ 2, T/ 2],
Parseval’s relation guarantees that
UE
U 5 UE
2
T/ 2
r~t!x~t; u !dt
2T/ 2
1`
R~ f !X~ f; u !df
0
U
2
for analytic signals. Therefore, if we introduce the whitening operation
E
x~t! 3 x w~t! 5
1`
0
X~ f !
ÎG n~ f !
e i 2p ftdf,
and if we apply this to the observation, the problem of detecting a given signal in colored
noise is (at least formally) transformed into a new problem of detecting a prefiltered
version of the original signal in white noise, thus leading to the following strategy:
L w~r; u ! 5
UE
1`
0
U
2
R~ f !X~ f; u !
df .
G n~ f !
(3)
It is basically this quantity which has to be given an equivalent time–frequency
formulation.
TIME–FREQUENCY CHIRP DETECTION
259
3.2. Time–Frequency Detection
As shown by the structure of the GLRT, optimum detection relies on a correlation
measure—in terms of the squared inner product— between the observation and a reference. This correlation can be expressed equivalently in the time domain or in the
frequency domain, and this naturally suggests that a third equivalent approach should be
feasible, in which the inner product would be directly written in time and frequency (i.e.,
jointly) by means of a suitable time–frequency distribution which can be thought of as a
“signature” well adapted to nonstationary signals.
The idea, therefore, is to introduce such a time–frequency distribution r x (t, f ) (which
necessarily must be at least quadratic in x) so that we may have, for any signals x 1 (t) and
x 2 (t), a relation of the type
u^x 1, x 2& tu 2 5 u^X 1, X 2& f u 2 5 ^^ r x1, r x2&& tf,
where ^ z , z & t , ^ z , z & f , and ^^ z , z && tf stand for suitably chosen inner products in the
time domain, the frequency domain, and the time–frequency domain, respectively (explicit definitions of such inner products will be detailed in the following for giving a
precise meaning to the above equalities).
Such an equivalence obviously has no reason to hold for any quadratic time–frequency
distribution. For instance, it is not verified by the simplest distributions we may think of,
namely the spectrogram (squared short-time Fourier transform) and the scalogram
(squared wavelet transform). This can be checked by direct inspection, but this claim (and,
with it, the way to find suitable distributions which overcome the limitations of spectrograms and scalograms) can be justified in a more interesting manner by considering
general classes of distributions to which spectrograms and scalograms belong. We can
introduce, for instance, the following definition [9, 14]:
DEFINITION 7. The class of all quadratic time–frequency distributions which are
covariant with respect to shifts in time and frequency is referred to as Cohen’s class and
reads
C ~xw!~t, f ! 5
EE
1`
w ~ j , t ! A x~ j , t !e 2i 2p~j t1t f !d j d t ,
2`
with
A x~ j , t ! 5
E S DS D
1`
2`
x t1
t
t
x t 2 e i 2pj tdt
2
2
and where w(j, t) is some arbitrary parameterization function such that w(0, 0) 5 1.
With this definition, it is easy to establish the following result [23]:
PROPOSITION 1.
i.e., satisfies
A time–frequency distribution belonging to Cohen’s class is unitary,
260
CHASSANDE-MOTTIN AND FLANDRIN
UE
1`
U 5E E
2
x 1~t!x 2~t!dt
2`
1`
C x~w1 !~t, f !C x~w2 !~t, f !dtdf,
2`
if and only if the arbitrary parameterization function w(j, t) is of modulus unity.
The consequence of this result is that a spectrogram with window h cannot be unitary
since it is well known [9, 14] that it belongs to Cohen’s class with w ( j , t ) 5 A h ( j , t ),
a quantity which cannot be of modulus unity over the entire (j, t) plane. A similar result
can be established for scalograms, which are known to be members of the so-called affine
class [14]. It appears, therefore, that those distributions (spectrogram and scalograms) are
a priori not qualified for serving as the basis of an optimum time–frequency detector,
although they may have been put forward for this purpose and may have proven useful as
suboptimum detectors [2, 20, 21]. Other distributions can be found, however, which are
optimal with respect to detection. Whereas the interested reader is referred, e.g., to [13,
31] for a general discussion about optimum time–frequency detection, we will focus here
on the specific case of chirp detection.
4. DETECTING LINEAR CHIRPS
The optimum time–frequency detection of linear chirps was first considered in [25].
From a time–frequency perspective, it turns out that linear chirps are intimately related to
a special member of Cohen’s class, the so-called Wigner–Ville distribution, defined as
follows [9, 14]:
DEFINITION 8. The Wigner–Ville distribution of a signal x(t) is the special case of
Cohen’s class attached to the parameterization w(j, t) 5 1, and its (real-valued) expression reads explicitly
W x~t, f ! 5
E S DS D
1`
x t1
2`
t
t
x t 2 e 2i 2p f td t .
2
2
(4)
The reason that linear chirps and the Wigner–Ville distribution are closely linked
together is given by the following
PROPOSITION 2. When applied to the linear chirp of Definition 5, with a(t) 5 1, the
Wigner–Ville distribution (4) is perfectly localized and reads
S
W x~t, f ! 5 d f 2
D
1
ẇ ~t! .
2p
Since the parameterization function of the Wigner–Ville distribution is w(j, t) 5 1, it
is of course of modulus unity, thus guaranteeing unitarity. It follows, therefore, that we
have
261
TIME–FREQUENCY CHIRP DETECTION
UE
1`
U 5E E
2
x 1~t!x 2~t!dt
2`
1`
W x1~t, f !W x2~t, f !dtdf
2`
for any two signals x 1 (t) and x 2 (t). In the case where x 1 (t) 5 r(t)1[2T/ 2,T/ 2] (with 1I (t)
the indicator function of the interval I) and where x 2 (t) is the linear chirp x 2 (t) 5
a(t)e i( w (t)22 pg ) , we get (thanks to the support-preserving property of the Wigner–Ville
distribution and its compatibility with modulations [9, 14])
UE
T/ 2
2T/ 2
U 5E E
2
r~t!a~t!e i~w~t!22pg!dt
T/ 2
2T/ 2
5
1`
S
W r,a~t, f ! d f 2
2`
E SE S
1`
T/ 2
2T/ 2
2`
W a t,
D
1
ẇ ~t! dtdf
2p
D
D
1
ẇ ~t! 2 j W r~t, j !d j dt.
2p
It follows, therefore, that the GLRT optimum detection (in zero-mean white Gaussian
noise) of a linear chirp with unknown parameters u 5 (a, b) can be achieved by using as
test statistics the time–frequency-based quantity
L~r; a , b ! 5
E
T/ 2
r r~t, a t 1 b !dt,
2T/ 2
with
r r~t, f ! 5
E
1`
W a~t, f 2 j !W r~t, j !d j .
2`
Given the linear chirp model (5), the energy of the signal to detect does not depend upon
the unknown parameters a and b, thus allowing for their maximum likelihood estimation
via L(r; a , b ).
5. DETECTING POWER-LAW CHIRPS
In the case of power-law chirps, the Wigner–Ville distributions is no longer a good
candidate since, although unitary, it lacks the localization property which permits us to
come up with a solution in terms of a path integral. In the specific case of hyperbolic
chirps (i.e., k 5 0), a well-adapted solution has been proposed in [28] on the basis of a
variant of the Wigner–Ville distribution (the so-called Altes–Marinovic distribution),
obtained from it by means of a warping operation. We will not follow this approach here
because of two different shortcominings, namely the fact that the resulting strategy is not
shift invariant in time (which may be a problem if the time origin of the chirp is unknown
and has to be estimated) and also that the technique developed for the case k 5 0 cannot
be directly extended to arbitrary (negative) k’s.
The framework we rather propose to use is that of affine time–frequency distributions,
262
CHASSANDE-MOTTIN AND FLANDRIN
as developed by J. and P. Bertrand [6]. Those distributions form a whole class of
time–frequency distributions but, as compared to the previously introduced Cohen’s class,
their introduction is motivated by a covariance requirement with respect to each solvable
three-parameter extension of the affine group. The effective construction results in a
parameterized family of distributions for which we will adopt the following definition [6]:
DEFINITION 9.
is given by
The Bertrand distribution (of index k { R) of an analytic signal X( f )
2~r11!2q
P ~k!
X ~t, f ! 5 f
E
1`
m k~u! X~ f l k~u!!X~ f l k~2u!!e i 2p tf z k ~u!du,
(5)
2`
with
z k~u! 5 l k~u! 2 l k~2u!.
In this definition, r and q are free real-valued parameters and m k (u) is some arbitrary
function, whereas the explicit form of the parameterization function l k (u) is fixed by
S
l k~u! 5 k
e 2u 2 1
e 2ku 2 1
D
1/~k21!
if k Þ 0, 1, the special cases attached to k 5 0 and k 5 1 being defined by continuity
as
l 0~u! 5
u
1 2 e 2u
and
S
l 1~u! 5 exp 1 1
D
ue 2u
.
e 2u 2 1
One can note that a Bertrand distribution is real valued if we have the hermitian
symmetry condition m k (u) 5 m k (2u), a condition we will assume to be satisfied in the
following.
In order to derive the time–frequency formulation of the power-law chirp detector, we
will need a few results about Bertrand distributions. We will summarize this material in
the following Propositions 3 to 6, whose proofs can be found in [6, 7].
Localization—Whereas the Wigner–Ville distribution was naturally adapted to
linear chirps, the adequacy between Bertrand distributions and power-law chirps is shown
by the following proposition [6]:
PROPOSITION 3. When applied to the power-law chirp (2), the Bertrand distribution of
index k (5) is perfectly localized on the group delay curve tX( f ) 5 t0 1 ck f k21 and reads
TIME–FREQUENCY CHIRP DETECTION
263
P X~k!r,k~t, f ! 5 C 2f 2~q11!d ~t 2 t X~ f !!,
(6)
if and only if the arbitrary weighting function mk(u) is given by
m k~u! 5 ż k~u!~ l k~u! l k~2u!! r11.
This proves that, in terms of time–frequency localization, the structure of Bertrand
distributions is matched to that of power-law chirps, which constitutes the first ingredient
for time–frequency detection via path integration.
Filtering—When leaving the class of power law chirps, the Bertrand distribution is
no longer perfectly localized. For example, when a signal is filtered, the following
proposition evidences that its Bertrand distribution filtered accordingly. More precisely
[7]:
PROPOSITION 4. When applied to a product signal X( f ) 5 M( f )Y( f ), the Bertrand
distribution P(k)
X (t, f ) reads
q11
P ~k!
X ~t, f ! 5 f
E
1`
~k!
PM
~t 2 u , f ! P ~k!
Y ~ u , f !d u ,
r
2`
where
M r~ f ! 5 f 2~r11!M~ f !.
Unitarity—The third ingredient we will need is unitarity, for which—introducing
suitable inner products on the half-line of positive frequencies and on the associated
time–frequency half-plane—we have the following result [6]:
PROPOSITION 5.
UE
1`
0
A Bertrand distribution is unitary, i.e., satisfies
U 5E E
2
X~ f !Y~ f !f 2 r11df
1`
2`
1`
~k!
2q
P ~k!
X ~t, f ! P Y ~t, f ! f dtdf
0
for any two signals X( f ) and Y( f ), if and only if the arbitrary weighting function mk(u)
is given by
m k~u! 5 ż 1/k 2~u!~ l k~u! l k~2u!! r11.
Extended unitarity—As a corollary to Propositions 3 and 5, the requirements of
localization and unitarity can be simultaneously fulfilled only if ż k (u) 5 1, an equation
whose only solution in k is k 5 0 (this can readily be established by noting that we have
the relation l k (u) 5 e u l k (2u), for any k [6]). Unless we want to consider only the case
of hyperbolic chirps, it seems, therefore, that the two properties of localization and
unitarity cannot be combined directly, so as to mimic what had been done previously in
264
CHASSANDE-MOTTIN AND FLANDRIN
the case of linear chirps. A way out, however, is possible, which relies on the following
proposition [6]:
PROPOSITION 6. Given a localized Bertrand distribution P(k)
X (t, f ) with k , 0, there
(k)
exists an auxiliary distribution P̃X (t, f ) characterized by
m̃ k~u! 5 ~ l k~u! l k~2u!! r11
and such that
UE
1`
U 5E E
2
X~ f !Y~ f !f
2 r11
df
1`
2`
0
1`
~k!
2q
P̃ ~k!
X ~t, f ! P Y ~t, f ! f dtdf
0
for any two signals X( f ) and Y( f ).
This offers some additional freedom in the manipulation of Bertrand distributions, by
softening the strict constraint of unitarity attached to one given distribution via the
introduction of a pair of distributions and a duality relation between them. In the specific
case k 5 21, this duality identifies to a situation considered by Unterberger (see [36]),
who coined the terms “active” and “passive” for distinguishing the corresponding distributions. As a generalization, we will therefore adopt the following definiton:
DEFINITION 10. The auxiliary distribution P̃ (k)
X (t, f ) is called the passive distribution
associated with P (k)
X (t, f ), the latter being referred to as active.
Although, by construction, the passive form of a distribution lacks the localization
property of its associated active form, the former can be explicitly related to the latter, as
shown by the following proposition:
PROPOSITION 7. The passive form P̃(k)
X (t, f ) of a localized Bertrand distribution is
related to the corresponding active form P(k)
X (t, f ) by
P̃ ~k!
X ~t, f ! 5 f
E
1`
G k~ f~t 2 u !! P ~k!
X ~ u , f !d u ,
2`
where
G k~s! 5
E
1`
e i 2pszk ~u!du.
2`
Proof. Starting from the definition of the passive distribution, based on the weighting
function m̃ k (u), we can write
265
TIME–FREQUENCY CHIRP DETECTION
2~r11!2q
P̃ ~k!
X ~t, f ! 5 f
E
1`
m̃ k~u! X~ f l k~u!!X~ f l k~2u!!e i 2p t f zk ~u!du
2`
5 f 2~r11!2q
E
1`
2`
5
E
1`
f 2~r11!2q
2`
ż k~u!
m̃ k~u!
@ m ~u! X~ f l k~u!!X~ f l k~2u!!#e i 2p t f zk ~u!du
m k~u! k
@ m k~u! X~ f l k~u!!X~ f l k~2u!!#e i 2p t f zk ~u!du,
so as to make appear explicitly the weighting function m k (u) of the associated active
distribution. Using then the fact that, for k # 0, the function z k (u) 5 l k (u) 2 l k (2u)
is one-to-one from R to R, one can make the change of variable u 5 z 21
k (v) in order to
express P̃ (k)
X (t, f ) as an ordinary Fourier transform. We get, therefore,
P̃ ~k!
X ~t, f ! 5
E
1`
2`
5
E
1`
2`
f 2~r11!2q
F mż zz
~
21
k
~
21
k
k
21
k
ż k~ z ~v!!
k
S
~v!!
~v!!
G
21
i 2p t fv
dv
X~ f l k~ z 21
k ~v!!!X~ f l k~2z k ~v!!! e
D
s
G k~s! P ~k!
X t 2 , f ds 5 f
f
E
1`
G k~ f~t 2 u !! P ~k!
X ~ u , f !d u ,
2`
with
G k~s! 5
E
2`
5
E
1`
2`
5
E
1
1`
1`
21
k
ż k~ z ~v!!
e i 2psvdv
d 21
~ z ~v!!e i 2psvdv
dv k
e i 2pszk ~u!du,
2`
whence the result follows.
Given an active distribution, its passive counterpart appears, therefore, as a filtered
version of it in time, the impulse response G k of the filter being frequency dependent (the
equivalent “width,” in time, of G k varies as the inverse of frequency). In the general case
(arbitrary k), no closed form expression exists for G k . Let us note, however, that in the
case where k 5 21 ( l 21 (u) 5 e u/ 2 , Unterberger distribution [6]), we get explicitly
G 21~s! 5
E
1`
2`
1
Î1 1 v 2/4
e i 2psvdv 5 4K 0~4 p usu!,
where K 0 ( z ) is the modified Bessel function of second kind [1], in accordance with the
results given in [36].
266
CHASSANDE-MOTTIN AND FLANDRIN
All the different results derived so far can now be combined together and lead to the
following central result:
PROPOSITION 8. Given the detection problem where the signal x(t; u0) to detect is the
power-law chirp (2) of group delay tX( f ) 5 t0 1 c0kf k21 with unknown parameters u0 5
(t0, c0), and where the additive noise n(t) is Gaussian, zero-mean, and stationary with
power spectral density Gn( f ), the optimum test statistics admits the equivalent time–
frequency formulation
L w~r; t, c! 5
E
1`
r R~t 1 ckf k21, f !df,
(7)
0
with
r R~t, f ! 5 C 2f 2q
E
1`
~k!
P̃ ~k!
A ~t 2 s, f ! P R ~s, f !ds
(8)
2`
and
A~ f ! 5
f 2~3r12!
U~ f !.
G n~ f !
(9)
Proof. Let us first assume that u 0 5 (t 0 , c 0 ) is known. In this case, starting from the
results of Propositions 3 (localization) and 6 (extended unitarity), we readily obtain [6],
for any signal Z( f )
UE
1`
0
U 5E E
2
Z~ f !X r,k~ f !f 2r11df
1`
2`
5 C2
1`
~k!
2q
P̃ ~k!
Z ~t, f ! P Xr,k~t, f ! f dtdf
0
E
1`
q21
P̃ ~k!
df.
Z ~t X~ f !, f ! f
0
It follows, therefore, that the left-hand side of the above equation exactly identifies with
the test statistics (3) if Z( f ) 5 R( f ) f 2(2r11) /G n ( f ) and X( f; u 0 ) 5 X r,k ( f ). As a
consequence, the right-hand side of the same equation provides an alternate time–
frequency formulation for the power-law chirp detection problem and, making use of the
results established in Proposition 4 (filtering), we end up with the claimed result, with u
5 u0, i.e., t 5 t 0 and c 5 c 0 . In the actual case where the parameter vector u0 is unknown,
the same statistics must be used by replacing u0 with a set of test values u 5 (t, c).
Detection is then achieved when
max L w~r; t, c! . h ,
~t,c!
TIME–FREQUENCY CHIRP DETECTION
267
where h is some prescribed threshold, while estimation of u0 can be carried out according
to
û 0 5 ~t̂ 0, ĉ 0! 5 arg max L w~r; t, c!.
~t,c!
According to this result, power-law chirps embedded in Gaussian noise can be optimally detected via a path integration strategy in the time–frequency plane, and their
parameters can be estimated by means of a generalized Radon or Hough transform applied
to a well-defined distribution. A potential application of this result will now be discussed.
6. THE EXAMPLE OF GRAVITATIONAL WAVES
Whereas the existence of gravitational waves was predicted long ago by general
relativity theory, no direct experimental evidence has yet been obtained. Detecting
gravitational waves on earth is very challenging because of the extremely tiny effects they
induce on physical systems, and it is only in the very recent past that projects at the front
end of current technologies may have been launched. In both VIRGO and LIGO
projects—the main projects, which are still under construction—the detector is basically
a giant laser interferometer, expected to convert the impinging of a gravitational wave into
a detectable motion of interference fringes. Because gravitational waves are so weak, and
although the interferometers have arms more than 3 km long, detectability requires a
relative sensitivity of the order of 10222. Given the noise limitations of the detectors, this,
however, should be possible in a frequency “window” between a few tenths and a few
hundreds of Hertz.
As far as detection itself is concerned (from a signal processing point of view), a key
question is to get some a priori information about possible structures for the expected
waveforms. In fact, a wide variety of situations can be considered [32], each corresponding to different types of signals, more or less well characterized. Nevertheless, it is
somewhat universally accepted that the most promising source of detectable gravitational
waves should be produced by the coalescence of very massive binary systems, the only
situation we will consider here.
6.1. A Model for Coalescing Binaries
A “coalescing binary” is a system of two very massive astrophysical objects (e.g.,
neutron stars or black holes), rotating around each other. In the process of this rotation,
some gravitational energy is radiated in the form of gravitational waves, with the
consequence that the objects become closer and closer, thus speeding up the rotation, up
to coalescence. It intuitively follows that coalescing binaries should naturally give rise to
gravitational waves which behave as chirps.
In a first (Newtonian) approximation, an explicit form can be given for the expected
waveform. Up to some unknown phase, it can be expressed as the real part of the
complex-valued signal [30, 34]
b
x~t; t 0, d! 5 A~t 0 2 t! 2ae 2i2pd~t02t! U~t 0 2 t!,
(10)
268
CHASSANDE-MOTTIN AND FLANDRIN
with a 5 41 and b 5 58. In this expression, t 0 is the coalescence time, and d and A are
constants which mainly depend on the individual masses of the objects and, of course, of
other geometrical quantities such as the distance of the binary from earth or the relative
orientation between the wavefronts and the detector. More precisely, given two objects of
individual masses m 1 and m 2 , one can introduce the “total mass” M 5 m 1 1 m 2 and the
21
“reduced mass” m such that m 21 5 m 21
1 1 m 2 . Using these two quantities, one can then
3/5 2/5
define [34] the “chirp mass” } 5 m M and, following [30], we have
d 5 160 3 3 3/8} 25/8
< 241} 25/8
J
J ,
with } J 5 }/M J and where M J stands for the solar mass. For an optimal relative
orientation between the detector and the binary, we have furthermore [26]
S D
} 5/4
} 5/4
4
J
J
221
221
< 3.37 3 10
,
A 5 3/4 1.92 3 10
3
r
r
where r is the earth– binary distance, expressed in Mpc.
According to Definition 1, the waveform (10) can thus be interpreted as a chirp if the
amplitude a(t) 5 (t 0 2 t) 2 a and the phase w (t) 5 2 p d(t 0 2 t) b are such that
U a~t!ȧ~t!ẇ~t!U 5 2padb ~t 2 t!
0
2b
!1
and
U ẇẅ~t!~t!U 5 ub2p2db1u ~t 2 t!
0
2
2b
! 1.
As mentioned in [11], these two conditions lead to one single condition according to
which the model (10) can be given a chirp interpretation over the time interval characterized by
t0 2 t @ tc 5
S
max$ a , u b 2 1u%
2pdb
D
1/b
.
(11)
In the specific case of gravitational waves, it follows from the values of the different
constants that we have
t c 5 3 3 ~1600 p ! 28/5} J < 3.6 3 10 26} J.
Assuming that the chirp interpretation is valid, the waveform (10) has (approximately)
for instantaneous frequency
f x~t! 5
5d
~t 2 t! 23/8
8 0
TIME–FREQUENCY CHIRP DETECTION
269
FIG. 1. Qualitative validity of the chirp interpretation for gravitational waves. Gravitational waves radiated
by coalescing binaries can be considered as chirps as long as their maximum frequency in much smaller than a
critical frequency which depends on the masses m 1 and m 2 of the binary. This diagram plots (solid lines) this
critical frequency when m 1 varies between 1M J and 10M J , and when m 2 5 km 1 , with 1 # k # 10. The dotted
line (plotted here arbitrarily at 500 Hz) stands for the high-frequency cutoff of the detector, which allows one
to get a rough bound for the validity of the approximation.
and the condition (11) defines, in turn, a frequency interval characterized by
21
4
f ! f c 5 f x~t c! 5 100 3 ~1600 p ! 3/5} 21
J < 1.66 3 10 } J .
(12)
Figure 1 illustrates the validity of this condition when m 1 varies between 1M J and
10M J , and when m 2 5 km 1 , with 1 # k # 10. It follows from this diagram that, in the
case where the high-frequency cutoff of the detector is supposed to be about 500 Hz, the
chirp interpretation can be considered as valid for a wide range of scenarios which are
likely to be observed.
Considering (10) as a chirp, its frequency spectrum can be obtained by means of a
stationary phase approximation, leading to the following result:
PROPOSITION 9. In the domain where it can be considered as a chirp, the waveform (10)
corresponds approximately to a power-law chirp in the sense of Definition 6, with an
envelope index r 5 (a 2 b/2)/(b 2 1), a phase index k 5 b/(b 2 1), a phase shift g 5
p/4, a chirp rate
270
CHASSANDE-MOTTIN AND FLANDRIN
c52
b21
~d b ! 21/~b21!,
b
(13)
and an amplitude
C5
A
Îu b 2 1u
~d b ! ~a21/ 2!/~b21!.
(14)
The relative error in this approximation is frequency dependent and is bounded by
Q~ f ! 5
S
~ b 2 2!~ b 2 1/ 2!
a2
5
1a1
4 b21
6
DS D
d
b
1/~b21!
f 2b/~b21!.
(15)
Proof. Computing the Fourier spectrum of (10) amounts to evaluating the integral
X~ f; t 0, d! 5 e 2i 2p f t0
E
1`
a~t!e ic~t!dt,
(16)
0
with a(t) 5 At 2 a and c (t) 5 22 p (dt b 2 ft).
It is clear that c (t) has no stationary point when f , 0 and, following the result of
Appendix A, we can first conclude that (10) is almost analytic, as long as the chirp
condition (11) is satisfied.
For positive frequencies, c (t) has one and only one nondegenerate stationary point,
namely
ts 5
SD
f
d
1/~b21!
,
with the condition c̈(t s ) . 0.
When specializing to the model (10) the general result (23), it turns out that (16) exactly
coincides with a power-law chirp in the sense of Definition 6, with the constants given in
(13) and (14). For each frequency, the stationary phase evaluation of the spectrum
amounts to considering the signal contribution at t 5 t s , and thus the remainder (24) at
this point. After a careful removal of all indeterminancies associated with the evaluation
of Q(t s ), we finally get the frequency-dependent result given in (15), which allows one to
bound the frequency domain over which the stationary phase approximation can be
considered as valid, given a maximum relative error.
Two remarks can be made at this point. First, whereas “chirp conditions” of Definition
1 are usually advocated for validating stationary phase approximations (see, e.g., [11, 30]
or [10]), the validity of the approximation is in fact controlled by (24) and this happens
to be much more intricate. Second, if we apply the above result (15) to the case of
gravitational waves, we get that, for the relative error in the stationary phase approximation to be at most x percent, frequency must be bounded by
TIME–FREQUENCY CHIRP DETECTION
271
FIG. 2. Stationary phase approximation for the spectrum of a gravitational wave. The waveform, expected
to be radiated by a coalescing binary composed of two objects of 1 M J and 10M J at a distance of 200 Mpc,
is plotted in (a), with the corresponding instantaneous frequency in (b). The energy spectral density is given in
(c) (solid line), along with the stationary phase approximation (dashed line). The validity of this approximation
is controlled by the frequency-dependent relative error plotted in (d).
f # 7.18 3 10 4x 3/5} 21
J .
(17)
in agreement with the qualitative chirp condition given in (12). Therefore, both the
heuristic and the exact criteria turn out to be of the same nature, but the results of
Proposition 9 allow for a quantitative control of the approximation.
Figure 2 presents a typical example of a waveform and illustrates the effectiveness of
the stationary phase approximation.
6.2. A Simplified Time-Frequency Detector
Strictly speaking, the optimum time–frequency detector (7) requires one to compute a
filtered version (in time) of the Bertrand distribution of the observation. This involves,
unfortunately, a very heavy computational burden and, in order to end up with a feasible
solution, it is mandatory to consider simpler, yet accurate, time–frequency descriptions
instead of the exact function r R (t, f ) given in (8). Whereas such a simplification may not
be possible in the general case, it turns out that it can be effectively achieved in the specific
case of gravitational waves, thanks to the specific values of the physical parameters which
are involved.
In fact, if we come back to (8), we can write in an equivalent way
E
1`
2`
r R~t, f !e 2i 2p j tdt 5 C 2f 2r112qh~u!
E
1`
2`
2i 2p j t
P ~k!
dt
R ~t, f !e
(18)
272
CHASSANDE-MOTTIN AND FLANDRIN
with
h~u! 5
~ l k~u! l k~2u!! r11
ż k~u!
A~ f l k~u!! A~ f l k~2u!!
(19)
and
u 5 z 21
k
S jD
f
.
Due to low-frequency (seismic noise) and high-frequency (photon noise) limitations,
the effective observation bandwidth is necessarily restricted to some bandpass frequency
interval f 2 # f # f 1 (with typical values that we can choose to be f 2 ' 50 Hz and f 1
' 500 Hz). This has for consequence that the Fourier spectrum
E
1`
2i 2p j t
P ~k!
dt 5 f 2r112q~ l k~u! l k~2u!! r11R~ f l k~u!!R~ f l k~2u!!
R ~t, f !e
2`
is nonzero only in the range
uuu # u 1 5 log
f1
,
f2
and it follows that the u-dependent prefactor h(u) of the Fourier transform of P (k)
R (t, f )
can be ignored in (18) as long as it is almost equal to 1 for uuu # u 1 .
Within the above-mentioned frequency band, we can consider (see, e.g., [20, 21]) that
the power spectrum density G n ( f ) of the observation noise n(t) has essentially a
continuous background which behaves as G n ( f ) 5 s 2 f 2 e , with e ' 1.3 Assuming
therefore that
A~ f ! 5 s 22f e2~3r12!
for f 2 # f # f 1 , we get from (19) that
h~u! 5 s 24
~ l k~u! l k~2u!! e2~2r11!
ż k~u!
for uuu # u 1 . In the case of coalescing binaries (k 5 2 35 , r 5 61 ) and “1/f ” noise (e 5
1), this reduces to
s 4h~u! 5
3
~ l 25/3~u! l 25/3~2u!! 21/3
ż 25/3~u!
,
(20)
Let us notice that this is a first approximation and that, in the case of actual detectors, it will certainly have
to be refined on the basis of more realistic noise models.
TIME–FREQUENCY CHIRP DETECTION
273
FIG. 3. When the frequency band of the detector is limited, the time–frequency function to be used can be
well approximated by a Bertrand distribution provided that the function s 4 h(u), defined in (20), acts as a
convolution unit in the space of u-limited functions. The validity of this approximation is illustrated here by
plotting in the upper diagram s 4 h(u) (solid line) and the indicator function of the u-interval associated with the
frequency range 50 – 450 Hz (dotted line), and by comparing in the lower diagram their Fourier transforms.
a quantity which, in the considered space of u-limited functions, can be considered as a
convolution unit, as illustrated in Fig. 3.
When the above approximation is valid, it leads therefore to a detector as in (7), but
with the simplification
C 2 q12e2~4r13! ~k!
f
P R ~t, f !.
s4
r R~t, f ! <
Given this simplified structure, the final problem reduces to finding some accurate and
easy-to-compute approximation to the Bertrand distribution P (k)
R (t, f ). Since the key
feature of this distribution is to satisfy the perfect localization (6) on “matched” chirps, the
solution that we propose is to replace it with a reassigned spectrogram [3, 5] Š hX (t, f )
which, when applied to the same power-law chirps, is known to behave approximately as
Š Xh r,k~t, f ! < C 2f 22~r11!d ~t 2 t X~ f !!.
(21)
The principle of reassignment is briefly recalled in Appendix B and the effectiveness of
this approximation is illustrated in Fig. 4.
Comparing (6) and (21), we are led to choosing q 5 2r 1 1, with the final form of the
approximated optimum detector given by
L w~r; t, c! <
C2
s4
E
1`
0
Š hR~t 1 ckf k21, f ! f 2~e2~r11!!df.
274
CHASSANDE-MOTTIN AND FLANDRIN
FIG. 4. Time–frequency distributions for gravitational waves. Given a gravitational wave radiated by a
coalescing binary, a “matched” time–frequency distribution is expected to be as localized as possible along the
instantaneous frequency curve. The “ideal” representation (a) is compared with a number of candidate distributions (coalescence time is fixed to t 0 5 0). From a theoretical point of view, it is known that the desired
localization is guaranteed by using a matched Bertrand distribution (k 5 2 35 ): this is illustrated in (b), where
the algorithm described in [17] has been used. An effective and easy-to-compute approximation is given by the
reassigned spectrogram (c). These two situations contrast with the more classical solutions provided by the
Wigner–Ville distribution (d), the spectrogram (e), and the scalogram (f).
In the specific case of coalescing binaries, we can prefer to parameterize the signal to
detect by means of its coalescence time t and reduced chirp mass }J. Together with the
right constants, we finally get (up to an amplitude factor)
L w~r; t, } J! }
E
Š hR~ t , f ! f 22/3,
+~t,}J!
with
28/3
+~t, } J! 5 $~ t , f !ut 2 t 5 3 3 100 8/3} 25/3
%.
J f
6.3. An Illustration
In order to illustrate the effectiveness of the proposed approach, we present in Fig. 5
two different examples based upon one of the typical situations discussed in [20, 21]. Both
TIME–FREQUENCY CHIRP DETECTION
275
FIG. 5. Detection of a gravitational wave. This figure illustrates the efficiency of an optimum time–
frequency-based detection for a gravitational wave with coalescence time t 5 0 of a binary composed of two
objects of 1M J and 10M J at a distance of 200 Mpc in case (a) and 1 Gpc in case (b). Since the distance between
the binary and Earth changes the only signal amplitude, the signal-to-noise ratio is the only parameter that has
been modified between these two examples. Each plot compares the squared envelope of the output of the
matched filter (dashed– dotted line) with a time–frequency strategy based on a line integration over either a
classical spectrogram (dashed line) or its reassigned version (solid line). In order to make appear more clearly
what is gained in terms of contrast, the maximum of each of these curves has been arbitrarily normalized to unity.
examples will assume that the binary consists of two objects of 1M J and 10M J
(coalescence time set to t 5 0). The binary is located at a distance of 200 Mpc from Earth
in the first example, and 1 Gpc in the second one. The simulation was run by corrupting
the data with Gaussian additive noise, with e 5 1 and s 2 5 0.7 3 10242/Hz over a
frequency range of 50 –500 Hz. The proposed strategy, based on the reassigned spectrogram, does not reach the ideal performance predicted by the matched filter theory, because
of the limited accuracy of the different approximations which have been involved in its
derivation (in particular, the band-limited nature of the signal implies that the Bertrand
distribution cannot be perfectly localized along the group delay curve). However, the
figure shows that this strategy clearly allows for the detection of the chirp and that it also
overperforms a crude path integration based on a standard spectrogram.
In the examples of Fig. 5, the chirp mass }J was implicitly assumed to be known,
which is by no means the case in practice. Assuming that }J is unknown, a refined
strategy amounts to applying the previous one in parallel by performing as many line
integrations as is necessary for sampling values of }J over some expected range. Figures
6 and 7 exhibit the application of this strategy on the reassigned spectrogram and standard
spectrogram, respectively. This joint detection– estimation problem allows also for an
estimate of }J to be obtained. It should be noted that, when scanning test values for }J,
the reference signal energy is modified. The output of each detector therefore must be
divided by a factor proportional to the squared amplitude of the reference signal (which
varies as }5/3
J ) in order to compare coherent results.
276
CHASSANDE-MOTTIN AND FLANDRIN
FIG. 6. Joint detection– estimation for gravitational waves based on the reassigned spectrogram. In the case
where the chirp mass parameter }J is unknown, different line integrations (similar to those of Fig. 5, but over
a number of different time–frequency curves) have to be performed, here on the reassigned spectrogram. This
results in a surface whose maximum allows for the detection of the gravitational wave (when it exceeds some
prescribed threshold) and for the estimation of both the time of coalescence and the chirp mass (actual values
are indicated with dashed lines).
7. CONCLUSION
The purpose of this paper was to combine elements from optimum detection theory and
time–frequency analysis so as to provide a coherent framework for intuitive strategies
aimed at detecting chirplike signals by some line integration in the time–frequency plane.
The example of gravitational waves (expected to be radiated by coalescing binaries) is
particularly important in this respect, and the possibility of their time–frequency detection
was discussed in some detail. Once the conditions for such an alternative quasi-optimum
strategy have been established, the question now is to discuss further what can be really
gained from such a new approach, especially in terms of versatility and robustness.
APPENDIX A: THE STATIONARY PHASE METHOD
Let I be an integral of the form
I5
E
a~t!e ic~t!dt,
(22)
V
where both a(t) . 0 and c (t) are C 1 , whereas supp{a(t)} is restricted to some interval
V of the real line over which a(t) is integrable.
TIME–FREQUENCY CHIRP DETECTION
277
FIG. 7. Joint detection– estimation for gravitational waves based on the spectrogram. The surface shown
here is the response of the detector– estimator based on path integrations of the standard spectrogram. This
should be compared with the one in Fig. 6. The smoothness of the detection peak and the poor contrast between
its maximum and the noise level make the detection procedure more difficult. Dashed lines indicated actual
values of the time of coalescence and the chirp mass.
Assuming that the amplitude a(t) is slowly varying as compared to the oscillations of
the phase c (t), (22) corresponds to an oscillatory integral. Using standard arguments, we
can therefore heuristically consider that, when integrated, positive and negative contributions of fast oscillations tend to cancel each other, with the consequence that the main
contribution to (22) only comes from the vicinity of those points where oscillations are
significantly slowed down, i.e., where the derivative of the phase is zero. This is the
essence of the stationary phase principle.
In the specific case of the model (22), classical results from stationary phase theory (as
presented, e.g., in [33]) cannot be directly applied, because phase oscillations are not
controlled by a multiplicative parameter becoming arbitrarily large. Nevertheless, if we
assume that c (t) has one and only one nondegenerate stationary point t s (i.e., that ċ (t s )
5 0 and c̈(t s ) Þ 0), we can (following, e.g., [19, 35]) make the change of variables
u2 5
c ~t! 2 c ~t s!
c̈ ~t s!/ 2
so as to rewrite (22) in the form
I 5 e ic~ts!
E
V9
g~u!e ibu du,
2
278
CHASSANDE-MOTTIN AND FLANDRIN
with g(u) 5 a( x(u))(du/dx) 21 and b 5 c̈ (t s )/ 2. Following [19], we get for such an
expression a decomposition of the type I 5 I a 1 R, where the first term corresponds to
the stationary phase approximation
Ia 5
Î
2p
u c̈ ~t s!u
a~t s!e ic~ts!e i~sgn c̈~ts!!p/4,
(23)
whereas the remainder R is such that
Q5
U IRU # Q
m
5
a
5 supu{V9 ug̈u
.
4 u b ug~t s!
The stationary phase approximation is therefore valid if we have Q m ! 1. An explicit
evaluation of this quantity leads to Qm 5 supt{V Q~t!, with
Q~t! 5 5 Î2u c̈ ~t s!u
U a~ta ! cċ U U aä ċc 1 32 aȧċ S 1 2 cċc̈D
1/ 2
2
s
2
S S ċc̈ D
1 3c
2
2
2
cc 3 c̈
2 3
2 ċ 2
ċ
DU
,
(24)
where c denotes the function c(t) 2 c(ts) for short.
The above closed form expression for the remainder allows one both to make explicit
the conditions under which (23) is a valid approximation for (22) and to bound the
corresponding error.
In the case where there is no stationary point, it is intuitive that the integral (22) should
tend to zero as oscillations become faster. To make this point more precise, let us assume
that ċ (t) Þ 0 for any t { V, so that we can rewrite (22) as
I5
E
V
a~t!
i ċ ~t!e ic~t!dt.
i ċ ~t!
An integration by parts leads to
I5
a~t! ic~t!
e
i ċ ~t!
U
1i
­V
E F
a~t!
V
G
ȧ~t!
c̈ ~t! ic~t!
e dt,
2 2
a~t! ċ ~t! ċ ~t!
from which it follows that
uIu #
U ċa~t!~t!U 1 S I a~t!ȧ~t!ċ~t!I 1 I ċc̈~t!~t!I D E ua~t!udt.
2
­V
`
`
V
Assuming that the amplitude may be regularized so as to make negligible the first term,
we get that
TIME–FREQUENCY CHIRP DETECTION
I
uIu
ȧ~t!
#
iai 1
a~t! ċ ~t!
279
I 1 I ċc̈~t!~t!I .
2
`
`
This result can be interpreted the following way: as compared to the situation where the
oscillations of the phase would be infinitely slowed down, the absolute value of the
oscillatory integral (22) with no stationary point is bounded by a quantity whose decay to
zero is controlled by chirplike conditions.
In the case we are here mainly interested in (i.e., the evaluation of a Fourier transform),
we have c (t) 5 w (t) 2 2 p ft, where w (t) is the phase of a chirp. Since we have ẇ (t) .
0 for any t { V, we can conclude that the frequency domain for which no stationary point
exists is the half-line of negative frequencies. Since we have futhermore c̈(t) 5 ẅ (t) and
ċ (t) $ ẇ (t) when f , 0, we get that
I a~t!ȧ~t!ċ~t!I # I a~t!ȧ~t!ẇ~t!I
`
`
and
I ċc̈~t!~t!I # I ẇẅ~t!~t!I .
2
`
2
`
This proves that the chirp conditions of Definition 1 are sufficient to guarantee the
quasi-analyticity of the exponential model (1), in the sense that spectral contributions at
negative frequencies are negligible.
APPENDIX B: THE REASSIGNMENT METHOD
The spectrogram S hX (t, f ) 5 uF hX (t, f )u 2 , i.e., the squared modulus of the short-time
i2 p ( j 2f )t
Fourier transform F hX (t, f ) 5 * 1`
d j , can also be viewed as
2` X( j ) H*( j 2 f )e
a smoothed version of the Wigner–Ville distribution of the signal
S hX~t, f ! 5
EE
1`
W x~s, j !W h~s 2 t, j 2 f !dsd j .
2`
The purpose of reassignment [24] is to refocus the spectrogram on the time–frequency
energy content given by the Wigner–Ville distribution. It basically consists in moving the
values of the spectrogram from their initial computation point to a time–frequency
location (t̂(t, f ), f̂(t, f )) given by a local center of mass computed over the Wigner–Ville
distribution of the signal
Š hX~t, f ! 5
EE
1`
2`
S hX~s, j ! d ~t 2 t̂~s, j !, f 2 f̂~s, j !!dsd j .
280
CHASSANDE-MOTTIN AND FLANDRIN
This results in a squeezing of each signal component along its associated group delay
and/or instantaneous frequency path. For example, the reassigned spectrogram is perfectly
localized on linear chirps with a constant envelope,
H Sa
x~t! 5 exp i2 p
2
t2 1 bt 1 g
DJ
f Š hX~t, f ! 5 d ~ f 2 ~ a t 1 b !!.
In the case of nonlinear chirps, a similar squeezing effect will be obtained provided that
the instantaneous frequency or the group delay of the signal is almost linear locally, i.e.,
in a time–frequency domain whose effective support is defined by the time and frequency
widths of the analysis window. One can mention that reassignment has been proved not
to be restricted to the spectrogram and its application has been extended [3] to all
smoothed distributions within Cohen and affine classes.
From a computational viewpoint, reassignment is supported by efficient algorithms. In
the case of the spectrogram, we get explicitly [3]
t̂~t, f ! 5 t 1 Re $F uX~t, f !/F hX~t, f !%
f̂~t, f ! 5 f 2 Im $F vX~t, f !/F hX~t, f !%,
(25)
where u(t) 5 th(t) and v(t) 5 dh/dt. Therefore, the resulting algorithm combines, in a
proper way, three STFTs of the signal based on three distinct windows (and even two only
if the window is a Gaussian).
REFERENCES
1. M. Abramowitz and I. Stegun, “Handbook of Mathematical Functions with Formulas, Graphs and Mathematical Tables,” Dover, New York, 1972.
2. R. A. Altes, Detection, estimation and classification with spectrograms, J. Acoust. Soc. Amer. 67 (1980),
1232–1246.
3. F. Auger and P. Flandrin, Improving the readability of time-frequency and time-scale representations by
reassignment methods, IEEE Trans. Signal Process. SP-43 (1995), 1068 –1089.
4. S. Barbarossa, Analysis of multicomponent LFM signals by a combined Wigner–Hough transform, IEEE
Trans. Signal Process. SP-43 (1995).
5. S. Barbarossa and O. Lemoine, Analysis of nonlinear FM signals by pattern recognition of their timefrequency representation, IEEE Signal Process. Lett. SPL-3 (1996), 112–115.
6. J. Bertrand and P. Bertrand, A class of affine Wigner distributions with extended covariance properties, J.
Math. Phys. 33 (1992), 2515–2527.
7. J. Bertrand and P. Bertrand, Some practical aspects of the affine time-frequency distributions, in “Proc. 13th
Conference GRETSI,” pp. 25–28, Juan-Les-Pins, France, 1991.
8. G. Boudreaux-Bartels, Mixed time-frequency signal transformations, in “The Transforms and Applications
Handbook” (A. D. Poularikas, Ed.), CRC Press, Boca Raton, FL, 1996.
9. L. Cohen, “Time-Frequency Analysis,” Prentice Hall, Englewood Cliffs, NJ, 1995.
10. N. Delprat, B. Escudié, Ph. Guillemain, R. Kronland-Martinet, Ph. Tchamitchian, and B. Torrésani,
Asymptotic wavelet and Gabor Analysis: Extraction of instantaneous frequencies, IEEE Trans. Inform.
Theory 38 (1992), 644 – 664.
TIME–FREQUENCY CHIRP DETECTION
281
11. L. S. Finn and D. F. Chernoff, Observing binary inspiral in gravitational radiation: One interferometer, Phys.
Rev. D 47 (1993), 2198 –2219.
12. P. Flandrin, On detection-estimation procedures in the time-frequency plane, in “Proc. IEEE Int. Conf. on
Acoust., Speech and Signal Proc.,” pp. 2331–2334, Tokyo, 1986.
13. P. Flandrin, A time-frequency formulation of optimum detection, IEEE Trans. Acoust. Speech Signal
Process. ASSP-36 (1988), 1377–1384.
14. P. Flandrin, “Temps-Fréquence,” Hermès, Paris, 1993.
15. P. Flandrin and P. Gonçalvès, From wavelets to time-scale energy distributions, in “Recent Advances in
Wavelet Analysis” (L. L. Schumaker and G. Webb, Eds.), pp. 309 –334, Academic Press, Boston, 1994.
16. P. Flandrin and P. Gonçalvès, Geometry of affine time-frequency distributions, Appl. Comput. Harmon.
Anal. 3 (1996), 10 –39.
17. P. Gonçalvès and R. G. Baraniuk, A pseudo-Bertrand distribution for time-scale analysis, IEEE Signal
Process. Lett. 3 (1996), 82– 84.
18. I. S. Gradshteyn and I. M. Ryzhik, “Tables of Integrals, Series and Products,” Academic Press, New York, 1980.
19. J. Harthong, Études sur la Mécanique Quantique, Astérisque (1984), 7–26.
20. J.-M. Innocent and B. Torrésani, A multiresolution strategy for detecting gravitational waves generated by
binary coalescence, Internal Report CPT-96/P.3379, CPT-CNRS, Marseille, 1996.
21. J.-M. Innocent and B. Torrésani, Wavelets and binary coalescences detection, Appl. Comput. Harmon. Anal.
4 (1997), 113–116.
22. S. Jaffard and Y. Meyer, Wavelet methods for pointwise regularity and local oscillations of functions, Mem.
Amer. Math. Soc. 123 (1996), 587.
23. A. J. E. M. Janssen, On the locus and spread of pseudo-density functions in the time-frequency plane, Philips
J. Res. 37 (1982), 79 –110.
24. K. Kodera, C. De Villedary, and R. Gendrin, A new method for the numerical analysis of nonstationary
signals, Phys. Earth Plan. Int. 12 (1976), 142–150.
25. S. M. Kay and G. F. Boudreaux-Bartels, On the optimality of the Wigner distribution for detection, in “Proc.
IEEE Int. Conf. on Acoust., Speech and Signal Proc.,” pp. 1017–1020, Tampa, 1985.
26. S. D. Mohanty and S. V. Dhurandhar, Hierarchical search strategy for the detection of gravitational waves
from coalescing binaries, Phys. Rev. D 54 (1996), 7108 –7128.
27. F. Oberhettinger, “Fourier Transforms of Distributions and Their Inverses—A Collection of Tables,”
Academic Press, New York, 1973.
28. A. Papandreou, S. M. Kay, and G. F. Boudreaux-Bartels, The use of hyperbolic time-frequency representations for optimum detection and parameter estimation of hyperbolic chirps, in “Proc. IEEE Int. Symp. on
Time-Frequency and Time-Scale Analysis,” pp. 369 –372, Philadelphia, 1994.
29. B. Picinbono, On instantaneous amplitude and phase of signals, IEEE Trans. Signal Process. SP-45 (1997),
552–560.
30. B. S. Sathyaprakash and S. V. Dhurandhar, Choice of filters for the detection of gravitational waves from
coalescing binaries, Phys. Rev. D 44 (1991), 3819 –3834.
31. A. M. Sayeed and D. L. Jones, Optimal detection using bilinear time-frequency and time-scale representations, IEEE Trans. Signal Process. SP-43 (1995), 2872–2883.
32. B. F. Schutz, Gravitational wave sources and their detectability, Classical Quantum Gravity 6 (1989),
1761–1780.
33. E. M. Stein, “Harmonic Analysis,” Princeton Univ. Press, Princeton, 1993.
34. K. S. Thorne, Gravitational radiation, in “300 Years of Gravitation” (S. W. Hawking and W. Israel, Eds.),
pp. 330 – 458, Cambridge Univ. Press, Cambridge, 1987.
35. B. Torrésani, “Analyse Continue par Ondelettes,” InterÉditions/CNRS Éditions, Paris, 1995.
36. A. Unterberger, The calculus of pseudo-differential operators of Fuchs type, Comm. Partial Differential
Equations 9 (1984), 1179 –1236.
37. A. D. Whalen, “Detection of Signals in Noise,” Academic Press, San Diego, 1971.