Extracting the Modal Parameters of the Horizontal and Vertical

Extracting the Modal Parameters of the Horizontal and
Vertical Transverse Polarisations of a Guitar String
Bertrand Scherrer, Philippe Depalle
To cite this version:
Bertrand Scherrer, Philippe Depalle. Extracting the Modal Parameters of the Horizontal and
Vertical Transverse Polarisations of a Guitar String. 9 pages. 2009. <hal-00392757v1>
HAL Id: hal-00392757
https://hal.archives-ouvertes.fr/hal-00392757v1
Submitted on 4 Jul 2009 (v1), last revised 4 Jan 2009 (v2)
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Extracting the Modal Parameters of the
Horizontal and Vertical Transverse Polarisations
of a Guitar String
Bertrand Scherrer, Philippe Depalle,
Sound Processing and Control Laboratory (SPCL),
CIRMMT – McGill University, Montréal, Canada
[email protected]
June 8, 2009
Abstract
This study presents first results for the audio-based identification of the type
of stroke used on the classical guitar. The link between the type of stroke and
the modal parameters of the horizontal and vertical transverse polarisations of the
string is revealed through physical considerations. We then describe in detail the
specialized sound analysis framework we developed and explain the reasons for
certain analysis strategies and parameter choices. Finally we present some first
results on the extraction of the modal parameters of the transverse polarisations
and outline our future research directions.
1
Introduction
Musicians can often infer how an instrument was played simply by listening to a
recording. This implies that the information about the instrumental gesture [1] is embedded in the sound signal structure and could be extracted armed only with two microphones and a computer performing signal analysis. In this paper, we present some
first results on the identification of the type of stroke on the classical guitar. The core of
our approach is to use prior knowledge on the acoustics of the guitar to devise a sound
analysis strategy.
Such physically-informed methods have been published for the retrieval of the
plucking position along a guitar string. Most recently, [2] used the fact that the effect of the plucking position can be modeled by a first order feedforward comb filter
whose delay is proportional to the relative plucking position [3]. Using a weighted
least square algorithm it was then possible to estimate the feedfoward delay that would
best match the observed partials amplitude pattern.
The present study is concerned with the extraction of another instrumental gesture:
the type of stroke. Indeed, performers resort to different strokes according to the sound
they want to obtain. Typically, the rest stroke is used to produce a “strong, full sound"
whereas the free stroke is used when a “softer" sound is sought for [4]. The rest stroke,
is characterized by the fact that, after the pluck, the finger tends to rest on the next
string. The free stroke, is just the contrary: at the end of the pluck, the plucking finger
1
clears the next string. In more quantitative terms, the difference between these strokes
actually resides in the angle with which the string is left to vibrate at the end of the
interaction between the plucking finger and the string: the angle of release (θr ).
In Section 2, we present the physical considerations that relate the angle of release
to the modal parameters of the transverse polarisations of the string. Then, in Section 3,
we describe the analysis framework we developed to actually extract polarisation modal
parameters from recordings. Section 4 reveals the first results of modal parameter
identification for different angles of release based on sound recordings. Finally, Section
5 recapitulates our observations and outlines future research directions.
2
From rest/free stroke to modal parameters of string
polarisations
Fig. 1 illustrates how the rest and free strokes can be characterized by their angle of
release. Fig. 1 (a) features the trajectories of a point along the string during the fingerstring interaction for both types of stroke as measured in [5]. The circled dots marked
lf ree and lrest represent the points at which the string is released for the free and rest
stroke respectively. We can see that the different kinds of plucks result in differences
in angle of release θr,rest and θr,f ree . Let us now draw the link between the angle of
release and the modal parameters of the transverse polarisations of the string.
(a) Trajectory of a string before release
for rest and free stroke (from [5]).
(b) Coordinate system.
Figure 1: Comparing rest and free strokes.
Understanding what makes the difference between the sound of the rest and free
strokes requires the understanding of how the vibrational forces of the string are transformed into body vibrations through the bridge [6]. This knowledge can be accessed
via the mechanical admittance matrix of the bridge. If we only consider the horizontal
(along ~y ) and vertical (along ~z) transverse polarisations of the string, this matrix will
characterize, as a function of frequency, how much the bridge moves along ~y and ~z as
a result of the forces exerted by the two polarisations. The presence of cross terms in
the matrix indicates the presence of coupling between the horizontal and vertical polarisations. Adapting a reasoning presented by [7], one finds that the angle of release of
the string will modify the coupling between resonances of the horizontal and vertical
polarisations.
In other words, according to the type of pluck, the modal parameters of the resonances of the horizontal and vertical polarisations will be different. In the next section,
2
we present the analysis framework we use to extract these modal parameters from a
recording.
3
Description of the Analysis Framework
The different parts of our analysis framework are detailed in the following and Fig. 2
gives an overview of the whole processing chain.
Figure 2: Polarisation identification process.
3.1
Preliminary Analysis Steps
The first step in the analysis is to center and cut the signal (Fig. 2, ).
1 Typically, we
want to analyze the sound after the onset transient while staying close enough to it so
that both polarisations are still present.
Then, we estimate the fundamental and the higher partials (F0 , F1 , . . . , Fm ) from
scc , the centered and cut signal (Fig. 2, ).
We use Audiosculpt [8] to perform a
2
standard additive analysis (STFT, peak picking, etc.) combined with a partial tracking
focused on the harmonics of the fundamental frequency.
Finally, based on different frequency-based criteria (Fig. 2, ),
3 we decide on which
partials to perform further analysis. The frequencies of the selected partials are noted
Fn in the rest of the paper. In practice, we discard the partials that are too close from
the main body modes (frequencies under 250 Hz). This is due to the fact that we only
want to focus on string resonances and not a mix of body and string resonances. We
also decide to restrict our analysis to partials that are of strong enough amplitude. In
other words, we will tend to focus on partials close to the maxima of the “plucking
filter" [2]. Fig. 3, illustrates the effect of the plucking position on the amplitude of the
partials. The tone was produced using the rest stroke, 10 cm from the bridge, on the
open E4 (330Hz) string, of length 65cm.
3.2
Fine Analysis of the Partials
We use a “high resolution” (HR) spectral analysis method (the well-known ESPRIT
method [9] ) to perform a finer analysis of the signal around each partial frequency
Fn . For that purpose, we need to pre-process scc (Fig. 2, ).
4 First we bandpass filter
scc using a complex, linear phase, filter of central frequency ν0 = Fn and bandwidth
δν. The filter is obtained using the window method (a Blackman window is used as
in [10]). Note that the resulting signal sn is now complex and only contains positive
frequencies. Also, since the complexity of ESPRIT increases tremendously when it is
3
Guitar Pluck Spectrum
80
Plucking Filter
Magnitude of Guitar Pluck
60
40
Partial 10
A (dB) →
20
0
−20
−40
−60
−80
−100
0
500
1000
1500
2000
2500
f (Hz) →
3000
3500
4000
4500
5000
Figure 3: Influence of the plucking position on the amplitude of the partials, for the
open string E4, of length 65cm, plucked at 10 cm from the bridge, using a rest stroke.
fed with more than 1000 samples at 44.1kHz [11] we need to down-sample the signal.
So, after the bandpass filter, the signal is modulated to DC and down-sampled by a
factor M = 1/(2δν) [11].
Next, the frequencies, damping factors, amplitudes and phases of K components
are estimated (Fig. 2, ).
5 In order to obtain reliable results, we need to choose appropriate parameters for the ESPRIT method. There is an optimal value for the “pencil
parameter” p [12] but we have to carefully choose the model order K. Theoretically,
an order of two would be enough since we are looking for only two transverse polarisations in a complex signal. As usual with parametric methods, however, it is good to
slightly overestimate the order of the model. Note that, if we increase the number of
components of the model, we also increase the number of components to choose from
when identifying the horizontal and vertical polarisations. In practice we use K = 6.
3.3 Component Selection
For a given sn , we are only interested in two out of the K components estimated at
the previous stage. This means we need to perform some post-processing on the data
(Fig. 2, ).
6 This step is detailed further, for a given sn , in Fig. 4. The poles estimated
by the HR analysis are represented as circles in the z-plane. The radius of a circle
corresponds to the amplitude of the pole it represents. Note that the damping factor
δmax and frequency Fn are normalized by the sampling rate Fs .
ˆ
A first sorting is done by discarding the poles ẑi = e(δ̂i +j2πfi ) that are obviously
irrelevant: too damped (δ̂i < δmax ) or diverging (δ̂i > 0) and too far from the partial
frequency Fn (|fˆi − Fn | > ∆f ). From these poles, we only keep the poles with high
enough amplitude (within -20 dB of the component of highest amplitude).
Next, we have to identify the poles corresponding to the horizontal and vertical
polarisations from the poles that passed the first sorting steps (within the grey area in
Fig. 4). Here, we use two criteria derived from the physics of the guitar: |δv | > |δh |
and fh < fv . The first criterion simply takes into account the fact that the top plate
of the instrument is more efficient at absorbing energy in the vertical (~z in Fig. 1 (b))
direction than in the horizontal direction (~y in Fig. 1 (b)). Hence the damping of the
vertical polarisation (δv ) will be greater than that of the horizontal polarisation (δh ).
The second criterion is based on observations by Woodhouse in [13]: unlike the vertical
polarisation, the horizontal polarisation is not completely stopped at the frets, bridge
4
Figure 4: Sorting Poles from HR analysis around Fn .
saddle, and nut. This leads to a slight lateral displacement of the string in the horizontal
plane, and thus an increased vibrating length.
The pair of poles that meets these criteria is then identified as the horizontal, vertical
polarisation pair estimate. In Fig. 4 the pair of poles selected is highlighted with a
thicker border. The horizontal polarisation is highlighted in lighter grey. In case there
are more than one pair satisfying the physics-based criteria, we chose to take the one
with greatest energy.
4
4.1
Tests and Results
Sound Data Base
To test our method, we asked a classically trained guitarist to play around 20 “identical" plucks for four different angles of release. We included free and rest strokes as
well as some more exaggerated plucks: strictly parallel and strictly perpendicular to
the soundboard. The strings that were not played were damped to avoid sympathetic
vibrations. All the tones were performed by plucking the string with the middle finger,
with an angle of attack of 45 degrees, at a constant plucking position of 10 cm from
the bridge. The guitar used was an Alhambra Model 1C (entry-level study guitar). The
recording was performed in a controlled acoustic environment (hemi-anechoic room)
and the microphones were positioned 50 cm away from the guitar. The performer was
instructed to keep the guitar in such a way that the microphones would aim at the junction between the tone hole and the fret board with a 45 degree angle. The analog to
digital conversion was performed using an RME Micstasy at 44.1kHz, 16 bits.
4.2
Results of the Estimation of the Modal Parameters of the Transverse Polarisations
Fig. 5, and 6 display the estimated vertical (inverted triangles) and horizontal (squares)
polarisations of partial 10 in a frequency-damping plane for the E4 (330Hz) and B3
(248Hz) strings respectively. The four different types of strokes (parallel, perpendicular, rest and free) are differenciated by their colour. For a given sound the transverse
polarisations are linked by a dotted line. Out of the 25 partials we extracted on each of
5
the 20 occurences of a given stroke, we only included the results obtained for partial
10.
For both strings, and especially for the E4 string, we can observe that there are
some distinct groups corresponding to angles of release despite a few outliers in each
category. Fig. 7 and 8 display the average location in the frequency-damping plane
of both polarisations as well as the standard deviation in each dimension (rectangle in
dotted lines), for the four angle of release categories, for both strings.
A first observation is that there is a difference in variance between the rest and
free strokes: the free strokes tend to have a bigger variance. This is maybe due to
the fact that the performer has less constraints on the exact displacement of his finger
when pulling the string up than when he is pushing the string down stopping on the
next string. Also there seems to be a trend in the ordering of the types of strokes in
frequency: perpendicular, parallel, rest stroke and free stroke. The exact reasons for
this are not quite clear yet but we believe that the answer could be found with further
physical study of the coupling between the two transverse polarisation via the bridge.
Partial 10 at 3310.24 Hz
0
δ (s−1)→
−5
−10
V polar., Para. Stroke
H polar., Para. Stroke
V polar., Perp. Stroke
H polar., Perp. Stroke
V polar., Rest Stroke
H polar., Rest Stroke
V polar., Free Stroke
H polar., Free Stroke
−15
3302
3307
f (Hz) →
3312
3317
Figure 5: Horizontal and vertical polarisation for partial 10 of string 1 (E4: 330Hz),
in the frequency-damping plane. The portion of sound used for the analysis spanned
from 0.15s to 0.55s after the onset transient.
5
Conclusion
We have presented some first results for the audio-based identification of the type of
stroke on the classical guitar. We showed the direct realtionship between the type
of stroke and the angle of release of the string. We then proceeded to draw the link
between this angle and the modal parameters of the transverse polarisations of the
string. We outlined the analysis framework we developed to estimate these modal
parameters and gave the first results of our analyses. At least for two strings out of
the three investigated, some well chosen partials exhibit a grouping of horizontal and
vertical polarizations according to the type of pluck.
The future directions for our research include some improvements to the analysis
framework. We are thinking of implemeting an automatic detection of the end of the
transient. Also, we are planning on thoroughly evaluating the effect of certain analysis
parameters to improve the reliablity of the pole estimation produced by ESPRIT. At a
more theoretical level, we are also working on formulating an explicit correspondance
6
Partial 10 at 2479.50 Hz
0
−1
δ (s )→
−5
−10
−15
2471
V polar., Para. Stroke
H polar., Para. Stroke
V polar., Perp. Stroke
H polar., Perp. Stroke
V polar., Rest Stroke
H polar., Rest Stroke
V polar., Free Stroke
H polar., Free Stroke
2476
f (Hz) →
2481
2486
Figure 6: Horizontal and vertical polarisation for partial 10 of string 2 (B3: 248Hz),
in the frequency-damping plane. The portion of sound used for the analysis spanned
from 0.15s to 0.55s after the onset transient.
Partial 10 at 3310.24 Hz
0
δ (s−1)→
−5
−10
−15
3302
mean V polar., Para. Stroke
mean H polar., Para. Stroke
mean V polar., Perp. Stroke
mean H polar., Perp. Stroke
mean V polar., Rest Stroke
mean H polar., Rest Stroke
mean V polar., Free Stroke
mean H polar., Free Stroke
3307
f (Hz) →
3312
3317
Figure 7: Mean location of the horizontal and vertical polarisation for partial 5 of
string 1 (E4: 330Hz), in the frequency-damping plane. The portion of sound used for
the analysis spanned from 0.15s to 0.55s after the onset transient.
between the modal parameters of the transverse polarisations and the angle of release,
based on an acoustical model of the coupling between the transverse polarisations.
This final step currently constitutes the missing link between the type of stroke and the
modal parameters of the transverse polarisations of the string.
6
Acknowledgments
Thanks go to Federico O’Reilly Regueiro for producing the sounds used in this work.
The authors would also like to acknowledge the support provided by Center for Interdisciplinary Research in Music and Media Technology (CIRMMT) and NSERC.
7
Partial 10 at 2479.50 Hz
0
δ (s−1)→
−5
−10
−15
2471
mean V polar., Para. Stroke
mean H polar., Para. Stroke
mean V polar., Perp. Stroke
mean H polar., Perp. Stroke
mean V polar., Rest Stroke
mean H polar., Rest Stroke
mean V polar., Free Stroke
mean H polar., Free Stroke
2476
f (Hz) →
2481
2486
Figure 8: Mean location of the horizontal and vertical polarisation for partial 10 of
string 2 (B3: 248Hz), in the frequency-damping plane. The portion of sound used for
the analysis spanned from 0.15s to 0.55s after the onset transient.
References
[1] C. Cadoz, “Instrumental gesture and musical composition,” in Proc. of the Int.
Computer Music Conf., 1988.
[2] C. Traube and P. Depalle, “Deriving the plucking point along a guitar string from
a least-square estimation of a comb filter delay,” in Proc. of IEEE CCECE, May
2003, pp. 2001–3.
[3] J. O. Smith, “Physical modeling using digital waveguides,” Computer Music
Journal, vol. 16, pp. 74–91, 1992.
[4] J. Taylor, Tone Production on the Classical Guitar, Musical New Services Ltd.,
1978.
[5] M. Pavlidou, A Physical Model of the String-Finger Interaction on the Classical
Guitar, Ph.D. thesis, University of Wales, Cardiff, 1997.
[6] C. Lambourg and A. Chaigne, “Measurement and modeling of the admittance
matrix at the bridge in guitars,” in Proc. of the Stockholm Musical Acoustics
Conf., 1993, pp. 448–53.
[7] C. E. Gough, “The theory of string resonances on musical instruments,” Acustica,
vol. 49, no. 2, pp. 124–41, October 1981.
[8] N. Bogaards, A. Roebel, and X. Rodet, “Sound analysis and processing with
Audiosculpt 2,” in Proc. Int. Computer Music Conf., 2004.
[9] R. Roy, A. Paulraj, and T. Kailath, “ESPRIT – a subspace rotation approach to
estimation of parameters of cisoids in noise,” IEEE Trans. on Acoustics, Speech,
Signal Processing, vol. 34, no. 5, pp. 1340–2, October 1986.
[10] B. David, Caractérisations acoustiques de structures vibrantes par mise en atmosphère raréfiée, Ph.D. thesis, Université Paris VI, 1999.
8
[11] J. Laroche, “The use of the matrix pencil method for the spectrum analysis of
musical signals,” Journal of the Acoustical Society of America, vol. 4, pp. 1958–
65, 1994.
[12] Y. Hua and T. Sarkar, “Matrix pencil method for estimating parameters of exponentially damped/undamped sinusoids in noise.,” IEEE Trans. on Acoustics,
Speech, Signal Processing, vol. 38, pp. 814–24, 1990.
[13] J. Woodhouse, “Plucked guitar transients: Comparison of measurements and
synthesis,” Acta Acustica, vol. 90, pp. 945–65, 2004.
9