9th EUROPEAN MUSIC ANALYSIS CONFERENCE — EUROMAC 9 Erica Bisesi Royal Institute of Technology – Stockholm, Sweden & Comenius University – Bratislava, Slovakia [email protected] , [email protected] Measuring and Modelling Perceived Distance Among Collections in Post-Tonal Music: Music theory Meets Music Psychology ABSTRACT Can we perceive similarity among chords or keys, and which computational models best account for it? How distance profiles are related to other structural aspects (e.g. grouping, melody, rhythm) in different pieces of music? Background Models for distance between pitch-class sets include psychoacoustical models between chord–roots (Parncutt 1988; 1993), chord Tonnetz (Cohn 1998; Tymoczko 2011), interval proximity (Rahn 1979–1980; Morris 1980; Lewin 1987), and voice leading (Straus 2003; 2005; 2014). Krumhansl (1998) tested and supported three different neo-Riemannian models of triadic distance against psychological data. Roger and Callender (2006) explored correlations between perceived distance between triads and factors like tuning environment, direction of motion, and relationship of moving voices. Milne and Holland (2016) provided a psychoacoustical explanation for perceived triadic distance based on a spectral pitch-class representation. Aims and repertoire studied Our repertoire includes three post-tonal pieces belonging to different stylistic conventions: Webern Canon for Voice and Clarinet Op. 16 No. 2, Bartok Mikrokosmos No. 84 (Merriment), and Scriabin Poem Op. 69 No. 1. In this paper, we discuss preliminary results for the first of these pieces, the Webern Canon for Voice and Clarinet Op. 16 No. 2. This piece belongs to a series of vocal works composed by Anton Webern in the 1910s and 1920s, and exemplifies how his seemingly austere, constructivist methods can be combined with expressive achievements. It ‘also emphasize(s) Webern’s penchant for canonical structures — indeed, the entire collection is rife with echoed contours of lines in counterpoint and complementary motivic shapes as melodies unfolding in mirror images, like the angular peaks of a mountain range reflected in inversion on the surface of a lake’ (Jeremy Grinshaw). We compare five contrasting models of the distance between collections in different post-tonal pieces and test them against perceptual data. The models include an adaptation of the Parncutt (1988) psychoacoustical model of distance between chord-roots, three models of distance based on interval similarity (ASYM, MEMBn, REL: Morris 1980; Rahn 1979–1980; Lewin 1987), and atonal voice-leading (uniformity, balance, smoothness: Straus 2003). Methods 1) Music Analysis. In a first stage, the profiles of distance between sets are computed by means of algorithms (Ariza 2002; Bisesi et al. in preparation). To this purpose, the piece has been preliminarily segmented into chunks, each chunk containing one pair of sets to be compared. Given the specific symmetry in the music structure, i.e. the form of canon, we segmented the piece at the level of single bars and compared, for each bar, the upper with the lower voice. By ignoring the first and the last bars (which contains only one pair), we extracted in total 11 pairs. Example 1 provides an example of segmentation for the first three bars, where sets have been named according to the Set Theory. The complete list of 11 pairs, as resulting from this preliminary analysis, is listed in Table 1. 0236 0156 I2 0236 I2 I2 0156 0126 Ex. 1. Webern Canon for Voice and Clarinet Op. 16 No. 2, bars 1-3: segmentation and set analysis. Table 1. Pairwise comparison between sets in the Webern Canon for Voice and Clarinet Op. 16 No. 2. Sets Pair Bar 1 2 3 4 5 6 7 8 9 10 11 2 3 4 5 6 7 8 9 10 11 12 Forte number 4-12A / 4-8 4-8 / 4-5B 4-5A / 4-z15B 4-z15A / 4-13B 4-13A / 3-2A 3-2B / 4-4B 4-4A / 4-7 4-7 / 4-4B 4-4A / 4-4B 4-4A / 4-11B 4-11A / 3-3B Set Classes 0236 / 0156 0156 / 0126 0126 / 0146 0146 / 0136 0136 / 013 013 / 0125 0125 / 0145 0145 / 0125 0125 / 0125 0125 / 0135 0135 / 014 Figure 1 reports the values of the distance between each pair of sets listed in Table 1, as calculated for each of the models considered in this study. The algorithms being used are: 9th EUROPEAN MUSIC ANALYSIS CONFERENCE — EUROMAC 9 • Parncutt (1988): Pearson’s correlation coefficients between the pitch-salience profiles of Set X and Y in each pair XY, as calculated over the 12 pitch classes. • Morris (1980): a similarity index SYM (X,Y), comparing the different elements of vectors X = [x1, x2, …, xN] and Y = [y1, y2, … ,yN]. • Rahn (1979-1980): embedding ATMEMBn (S, X, Y), as Trial No. 9 was used as a benchmark. In total, each participant rated 33 stimuli. Results Measured and predicted similarities between sets are shown respectively in Figure 1 and 2. defined by: 12 ATMEMB( A , B) = å MEMB (X , A, B) n 2 n=2 #A +2 #B -(# A + #B+ 2) , where MEMBn (S, X, Y) is a count of all subsets S of a specified size n embedded mutually in Sets X and Y, and the symbol # stands for cardinality. • Lewin (1987): a measure of the extent of relatedness based on a probabilistic approach, REL (X,Y). • Straus (2003): the minimum distance, defined as dmin = min (U min , Bmin ), where Umin is the minimum uniformity (measured by the minimum extent to which the voices move by the same or nearly-the-same interval), and Bmin is the minimum balance (the minimum extent to which the voices can be understood to flip symmetrically around some common axis) in the voice/leading space mapping Set X onto Set Y. As far as the Parncutt model is concerned, the algorithm has been implemented in a customized Matlab code. For the other models, calculations have been performed by means of Athena CL-1.4.9. 2) Music Perception. In a second stage, set distances have been innovatively explored from the viewpoint of music perception. The main purpose of this pilot study was to both investigate the best arrangement for the trials, and to test the experimental procedure. In this regard, since the beginning we were faced with two main question marks. First, what are the best realizations for presenting the stimuli — i.e. should they be chords or arpeggios, and, in the latter case, in which pitch order? Second, are the stimuli to be presented with exactly the same pitches as in the notation, with the advantage of being respectful of the musical score but the disadvantage of introducing a high degree of inhomogeneity in the stimuli in terms of the pitch classes that are involved? To answer the first question, each stimulus was presented to the participants in three different realizations: chords, ascending arpeggios, and notated arpeggios (with subsequent pitches following the same order as in the score). Half participants received the three blocks in this order: ascending arpeggios, chords, notated arpeggios; the other half of the participants received the notated arpeggios first, then the chords, and finally the ascending arpeggios. Inside each block, the stimuli were randomized. To answer the second question, we chose to preserve the notated pitch position, in order to discriminate whether absolute pitch position might affect ratings for set similarity. Besides this, all the pairs were normalized to have the first note as a C4 (pc = 0). For each pair of sets listed in Table 1, we asked 15 musicians (age = 46.2 ± 31.26%, 9 males and 6 females) to rate similarity between the two sets in each pair on a rating scale from 1 to 3. Fig. 1. Predicted similarities between sets in Webern Canon Op. 16 No. 2, according to the five models considered in this study. Fig. 2. Mean ratings for similarity between sets in Webern Canon Op. 16 No. 2. The three blocks correspond respectively to ascending arpeggios, chords, and notated arpeggios. The vertical bars denote a 0.95 confidence interval. 1) Comparison Between Models. Although featuring different scaling, the five models considered in this study are fairly consistent in terms of tendencies and overall predictions for macroformal structure. In particular, we observe that all of the models predict a high level segmentation of the piece into two main blocks — respectively from bar 2 to bars 6-7 and from bars 6-7 to bar 12. The two blocks exhibit a similar profile — a monotonic increasing / decreasing function with peaks respectively at bar 5 (sets 4-z15A and 4-13B) and bar 10 (set 4-4A and 4-4B). This segmentation might be regarded as an evidence of internal symmetry. (Regarding the meaning of peaks in the context of each model, see the reference literature). Interestingly, this internal symmetry is confirmed by perceptual data, though peak positions do match in only one case: the set pair 4-z15A / 4-13B of bar 5 in block 1. Other two pairs, 4-4A / 4-4B in bar 10 and block 3, and 4-5A / 4-z15B in bar 4 and blocks 2 and 3 feature consistency between model predictions and data at a lower level. The reason because the sets in 9th EUROPEAN MUSIC ANALYSIS CONFERENCE — EUROMAC 9 pair 4-4A / 4-7 of bar 7 were rated as more similar as predicted might be identified with higher physical closeness between the two sets in this trial. We observe that the main difference between the Parncutt model and the other four models is that the former focuses on absolute pitches, while the latter are applied to interval vectors. We also observe that consistency between all of the models is higher when the Parncutt model is applied to absolute correlations between pitch-salience profiles, i.e. when only the magnitude of correlation is accounted for, regardless of its direction. Reasoning in terms of similarity between sets, this suggests that — according with the models — the more relevant information is carried by the interval class content of comparing sets, rather than by the physical absolute pitch positions in each set of a pair. Finally, we notice that the Rahn and the Lewin models are very similar to each other, while the Morris model looks closer to the Straus (2003) minimum distance profile. 2) Data Analysis: Main Effects. We performed 2-way repeated-measures ANOVA on factors Block and Stimulus, where blocks correspond to the three different realizations in which the stimuli were presented to the participants. Results are reported in Table 2. Only the factor Stimulus is significant at the 95% confidence level (p < 0.001), indicating that participants did distinguish between the trials — i.e. distances between different sets are significantly perceivable — but not across the blocks. Mean ratings for similarity between sets are reported in Figure 2. Note the higher similarity between average ratings in blocks 1 and 2 (ascending arpeggios vs. chords; r = .85) and 2 and 3 (chords vs. notated arpeggios; r = .79), than in blocks 1 and 3 (ascending vs. notated arpeggios; r = .67). Table 2. Two-way repeated-measures ANOVA for the pilot study discussed in the text. 2P is the estimation of the effect size. Effect Block Stimulus Block x Stimulus F .54 9.18 1.35 P .54 <.001*** .15 2P .04 .40 .09 3) Interrater Reliability. The upper part of Table 3 shows results of analysis of interrater reliability as an estimation of internal consistency associated to the scores. Both single blocks and different combinations of blocks are considered. Note that reliability is pretty high, with the exception of block 3 (notated arpeggios). In the lower part of the table, we present average and maximum Pearson’s correlations between all pairs of raters and across all the musical examples. Both ratings belonging to single blocks and average ratings between two or more block are considered. Table 3. Upper panel: Interrater reliability on the measure of similarity between set pairs, inside single and combined blocks. AA stands for ‘ascending arpeggios’, NA for ‘notated arpeggios’, and C for ‘chords’. Lower panel: Average and maximum Pearson’s correlation r between all pairs of raters and across all of the music examples. Both ratings belonging to single blocks and inter-block-averaged ratings are considered. AA NA C AA-NA AA-C Cronbach’s Alpha .81 .53 .88 .82 .92 .87 .92 NA-C AA-NA-C Pairwise Correlation r average .24 .07 .33 .22 .38 .26 .20 AA NA C <AA, NA> <AA, C> <NA, C > <AA, NA, C> maximum .80 .79 .87 .86 .84 .84 .90 Correlations between participants are lower for notated arpeggios or blocks including notated arpeggios. This indicates that perception of similarity between sets is affected by a systematic effect of pitch order, which in turn depends on physical absolute pitch positions — in opposition to what asserted above about the models. If correlations are, on average, not very high, then this is because some pairs of raters are negatively correlated. However, in a considerable number of cases, raters do correlate quite highly to each others. 4) Preliminary Comparison Between Models and Ratings. At the stage of a preliminary investigation, any quantitative reasoning about agreement/disagreement between models and ratings is premature, as we are still exploring which are the best realizations for the trials in terms of chords and/or arpeggios, and in terms of pitch classes. For this reason, all the results reported since now have to be considered as preliminary, as a different realization in the trials might lead to different conclusions. In order to quantify the effect of pitch transposition on set similarity, we measured the statistical significance of the difference between measured and predicted values for similarity in the benchmark Trial n. 9 (4-4A / 4-4B), according to the three summarizing models of Figure 1. According to all of the models, this trial exhibits the highest expected similarity. Results for the combined block set are shown in Table 4. As we can see, for all of the three models we should reject the null hypothesis that data do correspond to model predictions. We performed the t test on also separate blocks, and found that the only block where there is a significant difference between the two samples at the level of p < .001 is block 1 (ascending arpeggios) (t = 1.3). This confirms that the absolute pitch positions do affect ratings for similarity between set pairs significantly more than the structure of interval vectors, as well as that there is a systematic effect of pitch order, when sets are arranged in arpeggios. Table 4. Statistical ssignificance of the difference between measured and predicted values for set similarity in Trial n. 9. f are the degrees of freedom, and p is an estimator of the significance of the results. Student’s t Test Model ABS (correlation pcSAL) av (SYM, ATMEMB, REL) minDIST T 6.43 12.20 17.40 f 44 44 44 All Blocks tcr (95%) 2.015 2.015 2.015 p <.001*** <.001*** <.001*** 9th EUROPEAN MUSIC ANALYSIS CONFERENCE — EUROMAC 9 Implications The main results of this pilot study can be summarized as follows: (i) computational analyses by means of five different models of similarity between sets consistently predict a symmetrical structure in the macroform of the Webern Canon Op. 16 No. 2, which is associated to relatedness between pairs of sets belonging to each bar; main peaks corresponding to pairs sets 4-z15A / 4-13B and (unsurprisingly) 4-4A / 4-4B; (ii) this internal symmetry is confirmed by perceptual ratings for similarity, though peak positions do not totally match between models and data; (iii) mismatch between predicted and observed peak positions is likely a consequence of inhomogeneity between trials in terms of inversion, register and pitch commonality; (iv) the pitch-time structure of trials (i.e., chords, ascending arpeggios, or notated arpeggios) does not contribute significantly to the overall variance. On the base of these preliminary results, we planned to perform a more rigorous study, differing from the pilot test in two main aspects: first, in order to minimize the systematic effects related to absolute pitch position, both sets in each pair will be normalized to C4; second, although without any pretence of generalization, this approach will be applied on a larger sample of pairs — in the specific case the ones outlined by set analysis of Bartok Mikrokosmos No. 84 (Merriment) and Scriabin Poem Op. 69 No. 1. A comparison between model predictions and measurements by means of multiple regression analysis will conclude the study. In a parallel study involving the same pieces, we are looking at the correlations between distance profiles and other structural aspects, in particular melody and rhythm. Results of both studies will be presented at the conference. Our approach links together music theory/analysis and perception using methods of statistical and computer sciences. We are challenging these four disciplines to work more closely together and take each other’s ideas and methods more seriously. Keywords music modelling, music perception, post-tonal music, structure, distance REFERENCES Ariza, Christopher, 2002. AthenaCL. http://www.athenacl. org. Bisesi, Erica, Friberg, Anders, and Parncutt, Richard, in preparation. ‘A Computational Model of Accent Salience in Tonal Music’. Cohn, Richard, 1998. ‘Introduction to Neo-Riemannian Theory: A Survey and a Historical Perspective’, Journal of Music Theory 42/2: 167–180. Grinshaw, Jeremy http://www.allmusic.com/composition/canons-5-on-latin-texts-fo r-voice-clarinet-bass-clarinet-op-16-mc0002356179 Krumhansl, Carol L., 1998. ‘Perceived Triad Distance: Evidence Supporting the Psychological Reality of Neo-Riemannian Transformations, Journal of Music Theory, 42/2: 265–281. Lewin, David, 1988. Generalizaed Musical Intervals and Transformations. Oxford and New York: Oxford University Press. Milne, Andrew J., and Holland, Simon, 2016. ‘Empirically Testing Tonnetz, Voice-Leading, and Spectral Models of Perceived Harmonic Distance’, Journal of Mathematics and Music, 10/1: 59–85. Morris, Robert, 1987. ‘A Similarity Index for Pitch-Class Sets’, Perspective of New Music 18/1–2: 445–460. Parncutt, Richard, 1988. ‘Revision of Terhardt’s Psychoacoustical Model of the Root(s) of a Musical Chord’, Music Perception 6: 65–94. Rahn, John, 1979–1980. ‘Relating Sets’, Journal of Music Theory 49/1: 45–108. Rogers, Nancy, and Callender, Clifton, 2006. ‘Judgments of Distance Between Trichords’, in Mario Baroni, Anna Rita Addessi, Roberto Caterina, and Marco Costa (eds.), Proceedings of the 9th International Conference on Music Perception and Cognition. Bologna, Italy: Bononia University Press, 1686–1691. Straus, N. Joseph, 2003. ‘Uniformity, Balance, and Smoothness in Atonal Voice Leading’, Music Theory Spectrum 25/2: 305–352. Straus, N. Joseph, 2005. ‘Voice Leading in Set-Class Space’, Journal of Music Theory, 49/1: 45–108. Straus, N. Joseph, 2014. ‘Total Voice Leading’, Music Theory Online, 20/2. Tymoczko, Dmitri, 2011. A Geometry of Music. Harmony and Counterpoint in the Extended Common Practice. New York: Oxford University Press.
© Copyright 2026 Paperzz