Eigenspectral Analysis of Hermitian Adjacency Matrices for the

Journal of Mathematical Sociology, 29:265–294, 2005
Copyright # Taylor & Francis Inc.
ISSN: 0022-250X print/1545-5874 online
DOI: 10.1080/00222500590957473
Eigenspectral Analysis of Hermitian Adjacency
Matrices for the Analysis of Group Substructures
Bettina Hoser
Andreas Geyer-Schulz
Information Services and Electronic Markets,
Universität Karlsruhe (TH), Germany
In this paper we propose the use of the eigensystem of complex adjacency matrices
to analyze the structure of asymmetric directed weighted communication.
The use of complex Hermitian adjacency matrices allows to store more data relevant to asymmetric communication, and extends the interpretation of the resulting eigensystem beyond the principal eigenpair. This is based on the fact, that the
adjacency matrix is transformed into a linear self-adjoint operator in Hilbert
space.
Subgroups of members, or nodes of a communication network can be characterised
by the eigensubspaces of the complex Hermitian adjacency matrix. Their relative
‘traffic-level’ is represented by the eigenvalue of the subspace, and their members
are represented by the eigenvector components. Since eigenvectors belonging to
distinct eigenvalues are orthogonal the subgroups can be viewed as independent
with respect to the communication behavior of the relevant members of each
subgroup.
As an example for this kind of analysis the EIES data set is used. The substructures and communication patterns within this data set are described.
Keywords: Hermitian matrix, eigensystem, perturbation, social network analysis,
rank prestige index
1. INTRODUCTION
Today’s trend towards embedded, wearable, and networked devices
leads to an increasing stream of communication traffic data, which
Address correspondence to Bettina Hoser, Institut fur Informationswirtschaft
und-management, Abteilung für Informationsdienste und elektronische Märkte,
Universität Karlsruhe (TH), Kaiserstrasse 12, D-76128 Karlsruhe, Germany. E-mail:
[email protected]
265
266
B. Hoser and A. Geyer-Schulz
can be used to reveal social network structures. Wellman, for example,
envisions the rise of network societies that require new tools for
enhancing social capital (Wellman, 2001).
Eigenvector centrality measurements have become a standard
procedure in the analysis of group structures. Mostly symmetric
(dichotomized) data have been used. Bonacich and Lloyd (Bonacich
and Lloyd, 2001) present an introduction of the use of eigenvector-like
measurements of centrality for asymmetric data. The analysis of directed, valued, asymmetric relationships within a social network poses
some difficulties. In this paper we will propose a method based on
the status (rank prestige) index method described in Wasserman
and Faust (Wasserman, 1994, p. 205–219) but adapted to complex
adjacency matrices. As an example, we discuss the use of this method
in the analysis of part of the full EIES data set as presented in Wasserman and Faust (Wasserman, 1994) as well as an EIES subset as presented by Freeman (Freeman, 1997).
The following constructs from social network analysis are commonly used in the analysis of group structure (Wasserman, 1994):
. Centrality: A member in a newsgroup or a subgroup is viewed as
central, whenever he or she has a high out degree (¼ number of outgoing connections with a high number of different co-members or
other subgroups).
. Prestige: A member in a newsgroup or a subgroup is viewed as prestigious, whenever he or she has a high in degree (¼ number of
inbound connections with a high number of different co-members
or other subgroups).
. Rank prestige or status: Centrality and prestige will be viewed with
respect to the rank of the member within the group. This is the
construct of interest for the rest of this paper.
Wasserman and Faust (Wasserman 1994, p. 205–219) describe to a
great extent the history of the status (or as they define it: rank prestige) index. This index is based on the idea, that the rank of a group
member depends on the rank of the members to whom he or she is
connected. Stated in mathematical terms this yields the eigenvalue
equation (for an eigenvalue equal to 1). The components of the principal eigenvector are the rank prestige indices of each group member.
The highest ranked member, the one with the largest rank prestige
index, is the one who is either chosen by a few but high ranking comembers, or by many relatively low ranked co-members. Until now,
this index has only been calculated for real matrices, based on directed and possibly weighted graphs, which can give rise to the problem
Eigenspectral Analysis of Hermitian Adjacency Matrices
267
of complex eigenvalues (which have, in this context, no interpretation
as of now). As a second problem, this index can only use the principal
eigenvector for the interpretation of the group’s rank structure. This
is due to the fact that the eigenvector system does not construct a
vector space that lends itself easily to interpretation in the field of
social network analysis.
For asymmetric relations, Bonacich and Lloyd (Bonacich and Lloyd,
2001) show that for some networks the standard eigenvector measures
for network centrality do not lead to meaningful results. As a solution
they suggest the notion of a-centrality, which combines an individual’s
own inherent status with a weight a for the relative importance of the
perceived status.
There have been different approaches to the analysis of asymmetric
communication behavior. Freeman (Freeman, 1997) proposed to use
the possibility to split any asymmetric square matrix into its symmetric and skew-symmetric part, perform a singular value decomposition of the skew-symmetric matrix, and showed, that the result
could be interpreted as a ranking of dominance.
Recent research by Tyler et al. (Tyler et al., 2003) showed that
they could identify subgroups in asymmetric email networks by analyzing betweenness centrality in the form of inter-community edges
with a large betweenness value. These edges are then removed until
the graph decomposes into separate communities, thus re-organizing
the graph structure. Barnett and Rice (Barnett and Rice, 1985)
showed, that asymmetrical data in its raw form might have more
information than matrices that have been transformed to avoid negative eigenvalues.
Chino (Chino, 1998) proposed the use of Hermitian matrices and
Hilbert space theory in the analysis of preference data over time in
psychology. He could show that the sign of the eigenvalues yields
information about the orientation of behavior within a given subspace,
and that it can then be shown how people move (behave) towards each
over time (or move apart).
We propose here the use of complex adjacency matrices (Hoser
and Geyer-Schulz, 2003) and Hilbert space theory to be able to keep
the information concerning the communication asymmetry and at
the same time to get more information concerning the structure
of the newsgroup as well as the communication behavior within the
subgroups of the newsgroup.
The remainder of this paper is organized as follows: In section 2 we
introduce the properties of Hilbert space and complex Hermitian
matrices. Also, the characteristics of their eigensystem and their
behavior under perturbation are discussed. We then show in section
268
B. Hoser and A. Geyer-Schulz
3 how we construct a complex-valued sociomatrix that is then
transformed into a Hermitian matrix. The structure and characteristics of the resulting eigenspace is discussed. We further introduce
some relevant constructs from social network analysis needed for
the interpretation of the resulting eigensystem. In section 4 we
present two examples with synthetic data and two applications to real
data. Section 5 presents our conclusions.
2. HILBERT SPACE AND HERMITIAN MATRICES
In this section, we introduce the notation and basic facts about complex numbers, Hilbert space, eigensystems and the perturbation of
eigensystems.
The complex number z can be represented in algebraic form or
equivalently in exponential form:
z ¼ a þ ib ¼ jzjei/
ð1Þ
with the real part of z being denoted as ReðzÞ
¼ a, the
ffi imaginary part
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
as ImðzÞ ¼ b, the absolute value as jzj ¼ a2 þ b2 , and the phase
2
as 0 / ¼ arccos ReðzÞ
jzj p, with i the imaginary unit (i ¼ 1).
z ¼ a ib denotes the complex conjugate of z. The following four equations are of special interest in this paper:
z1 z2 ¼ jz1 jjz2 jeið/1 þ/2 Þ
ð2Þ
z þ z ¼ 2ReðzÞ
z¼z
zz ¼ jzj2
if and only if
ð3Þ
z2R
ð4Þ
ð5Þ
The notation will be as follows: unless otherwise stated all numbers
are complex (2 C). Column vectors will be written in bold face x, with
the components xj ; j ¼ 1; . . . ; n. The vector space will be denoted by
V ¼ Cn . Matrices will be denoted as capital letters A with akl representing the entry in the k-th row and the l-th column. Greek letters
will denote eigenvalues with kk representing the k-th eigenvalue.
The complex conjugate transpose of a vector x will be denoted as x .
The transpose of a vector x will be denoted as xt .
The outer product of two vectors x and y is defined as:
0
1
x1 y1 . . . x1 yn
xy ¼ @ . . . . . . . . . A
ð6Þ
xn y1 . . . xn yn
Eigenspectral Analysis of Hermitian Adjacency Matrices
269
The cross product of two vectors x; y 2 R2 is defined as:
x y ¼ x1 y2 y1 x2
ð7Þ
The inner product of x; y 2 Cn is a semilinear form on a given vector
space V and will be represented as
hxjyi ¼ x y ¼
n
X
ð8Þ
xk yk
k¼1
The norm will be denoted k x k and will be defined as follows:
pffiffiffiffiffiffiffiffiffiffiffiffi
hxjxi ¼ k x k
ð9Þ
For vector space V where the norm is defined as in Equation (9) the
following rules hold:
hxjxi 0 withhxjxi ¼ 0 if and only if x ¼ 0
a2C
haxjyi ¼ ahxjyi; hxjayi ¼ ahxjyi
hx þ yjzi ¼ hxjzi þ hyjyi
hxjyi ¼ hyjxi
ð10Þ
ð11Þ
ð12Þ
ð13Þ
Hilbert space is a complete normed inner product space as defined
by Equations (8)–(9). Completeness is given because the Cauchy condition holds.
The following inequalities will also be helpful:
. Cauchy-Schwarz inequality:
jhxjyij pffiffiffiffiffiffiffiffiffiffiffiffipffiffiffiffiffiffiffiffiffiffiffi
hxjxi hyjyi
ð14Þ
(with equality if x ¼ ay; a 2 C)
. Bessel inequality: for any vector h and an orthonormal basis xk
n
X
jhhjxk ij2 k h k2
ð15Þ
j¼1
. For any vector h and a complete orthonormal basis xk the Bessel
inequality becomes Parseval’s equality:
n
X
k¼1
jhhjxk ij2 ¼k h k2
ð16Þ
270
B. Hoser and A. Geyer-Schulz
. Triangle inequality:
k x þ y kk x k þ k y k
ð17Þ
(with equality if x ¼ ay as in Eq. (14) and in addition a 2 R).
In Hilbert space the Equations (14) and (17) become equalities and
the Parseval equality Eq. (16) holds.
The adjoint space X of X is the set of all semilinear forms (Equation
(11)) (e.g., inner product) on vector space X (Kato, 1995, p. 11). A selfadjoint linear operator is called Hermitian. A matrix H is called
Hermitian, if and only if
H ¼ H
ð18Þ
with H representing the conjugate complex transpose of H. This
means that the matrix entries can be written as hlk ¼ hkl . Hermitian
matrices are also normal:
HH ¼ H H
ð19Þ
Hx ¼ kx
ð20Þ
The eigenvalue equation
of a complex Hermitian matrix H can be represented due to its complete orthonormal eigenvector system in the spectral (Fourier) representation:
H¼
n
X
kk P k ;
Pk ¼ xk xk
ð21Þ
k¼1
Pn
with Pk as orthogonal projectors. Note, that
k¼1 Pk ¼ I; Pk ¼ Pk ;
2
Pk ¼ Pk . The set of all eigenvalues is called spectrum.
The eigenvector components are interpreted as a generalized rank
prestige index of each subgroup member. In addition, each member
has for each subgroup structure=eigenvector in the spectral representation a different rank prestige index. This index depends on his
relation to the respective anchor of the subgroup.
An orthonormal basis can be chosen such that:
hxk jxl i ¼ dkl
(follows from (18))
ð22Þ
with
dkl ¼ 0
1
if k 6¼ l;
if k ¼ l
ð23Þ
Eigenspectral Analysis of Hermitian Adjacency Matrices
271
This also holds true for arbitrary rotation:
hei/k xk jei/l xl i ¼ ei/k ei/l hxk jxl i
ð24Þ
¼e
ið/l /k Þ
hxk jxl i
ð25Þ
¼e
ið/l /k Þ
dkl
ð26Þ
Hermitian matrices have full rank, and therefore all eigenvalues are
simple and real:
kk 2 R
8k
ð27Þ
because hHxjxi ¼ hkxjxi ¼ khxjxi and hxjHxi ¼ hxjkxi ¼ khxjxi and
H ¼ H imply hHxjxi ¼ hxjHxi and thus k ¼ k which means k 2 R
(see Meyer, 2000, p. 548; Kato, 1995, p. 53).
Since all eigenvalues of a Hermitian matrix are real Equation (27),
the interpretation of the eigenvalues does not pose the difficulty of
interpreting complex eigenvalues of non-symmetric real matrices.
Since the matrix H has full rank and is diagonalizable,
1 ¼ algebraic multiplicity ¼ geometric multiplicity which means that
every one of the n simple eigenvalues has but one eigenvector (see
Meyer, 2000, p. 510).
For a complex Hermitian matrix with trðHÞ ¼ 0 some eigenvalues
have to be negative due to the fact that
trðHÞ ¼
n
X
k¼1
hkk ¼
n
X
kk ¼ 0
ð28Þ
k¼1
As a special case consider a matrix B of order n þ m with
0nn A
B¼
A
0mm
ð29Þ
with A representing a n by m matrix. In the special case of n ¼ 1 and
m ¼ k this matrix represents a directed, weighted star graph. The two
non-zero eigenvalues of that system are described as follows:
rðBÞ ¼ fþk1 ; k2 g
ð30Þ
(see Meyer, 2000, p. 555) with jk1 j ¼ jk2 j.
The interpretation of negative eigenvalues poses a challenge, but
Richards and Seary (Richards, 2000, section F), Chino (Chino, 1998),
and Barnett and Rice (Barnett and Rice, 1985) offer some suggestions
for the interpretation of negative eigenvalues within the social network analysis framework.
272
B. Hoser and A. Geyer-Schulz
Richards and Seary, who use real matrices, point out that while
positive eigenvalues give information about the ‘‘clustering of interconnected nodes,’’ negative eigenvalues emphasize the ‘‘partitioning
of the network into sets of nodes that have similar patterns of connections with nodes in other sets, but few connections amongst themselves’’ (Richards, 2000, section F).
Chino uses Hilbert space theory to explain asymmetric behavior
(Chino, 1998). He constructs a complex Hermitian matrix out of a real
valued sociomatrix by using a well-known representation for square
matrices (Bronstein et al., 2000, p. 263) S ¼ Ss þ Sas which splits S
in a symmetric part Ss and an skew-symmetric part Sas . Chino starts
with a real square asymmetric similarity matrix S. He decomposes
this matrix by S ¼ 12 ðS þ St Þ þ 12 ðS St Þ ¼ Ss þ Sas where St is the
transpose of S. Chino constructs the Hermitian matrix H as
H ¼ Ss þ iSas . He uses the resulting eigenspace to explain behavior
in groups over time. The sign of the eigenvalues in the complex
eigensubspace under consideration defines the positive direction of
movement between members of the group. Formally, Chinos
pffiffi matrix
HC and the matrix H proposed here are related by HC ¼ 22 H as we
show in section 3. The difference is in the better interpretation of
the proposed method, since the orthogonal projectors can easily be
back rotated and thus yield insight into the filtered adjacency
matrices.
By virtue of Equation (16) it follows that because of
n
X
k H k2 ¼
k2k
ð31Þ
k¼1
2
the sum of all eigenvalues definesP
the total variation
in H
Pn r contained
P
n
2
(see Chino, 1998, p. 58). k H k ¼k P i¼1 ki Pi k¼ h i¼1 ki Pi j ni¼1 ki Pi i ¼
hk1 P1 jk1 P1 i þ þ hkn Pn jkn Pn i ¼ ni¼1 k2i because hPi jPj i ¼ dij .
The sign of the eigenvalues helps to identify heterogeneity in the
communication patterns. Another approach to this view was taken
by Barnett and Rice (Barnett and Rice, 1985, p. 293) who defined
the warp to explain heterogeneity. This is in the case of matrices with
trðAÞ ¼ 0 not adaptable. But still the sign gives information about the
behavior of a subgroup towards the rest of the group and the behavior
of the subgroup within itself.
The eigenvalues sorted by their absolute value jk1 j jkn j help
to identify the dominant substructures for interpretation.
For social network analysis, we can identify the most active group
member, because for jkmax j the largest absolute eigenvector component
jxmax;m j belongs to the most active group member. Since there can be
an eigenvalue with the same absolute value but different sign (see
Eigenspectral Analysis of Hermitian Adjacency Matrices
273
Equation (30)), there could also be a second eigenvector with the same
subgroup members but with a phase shift of p.
Next we assume an unperturbed Hermitian matrix A with eigenvalues a1 ; . . . ; an , with a1 an and a Hermitian perturbation matrix
E with eigenvalues E1 ; . . . ; En , with E1 En . For the eigenvalues
k1 ; . . . ; kn , with k1 kn of the resulting Hermitian matrix
H ¼ A þ E a Weyl type perturbation bound is given by
aj þ E1 kj aj þ En
ð32Þ
See (Horn and Johnson, 1990, p. 181–182), (Kato, 1995, p. 61) and
(Meyer, 2000, p. 551–552). This bound can help to analyze mixed patterns of communication.
3. THE CONSTRUCTION OF A COMPLEX ADJACENCY
MATRIX
In the following we consider communication patterns which can
be modeled as a directed, and weighted graph G ¼ fN; Eg with N
denoting the nodes or members and E denoting the edges, links or
communications between different members. Self references are
excluded.
Consider, for example, an email network. Network members
represent nodes in the graph, the links are built from the unique
message IDs and the ‘‘To’’ and ‘‘From’’ fields, whenever two authors
are linked by a message. The direction of the link is given by the
sequence of the message IDs, the weight by the number of messages
sent in this direction.
In the literature, several methods of constructing real adjacency
matrices from such graphs are being used. For example, some authors
consider only one direction in the construction process, others consider
the difference and dichotomize according, e.g., to frequency, . . . In the
process of constructing a real adjacency matrix from such a graph
information is lost.
Therefore, we propose the following construction rules for a complex
adjacency matrix H from a graph G as defined above:
1. We construct a square complex adjacency matrix A with n members
by
akl ¼ m þ ip
ð33Þ
with m the number of outbound messages from node k to node l,
and p the number of inbound messages from node l to node k and
i representing the imaginary unit. As can be seen akl ¼ ialk .
274
B. Hoser and A. Geyer-Schulz
ip
2. We rotate A by multiplying A with e 4 (Eq.2) in order to obtain a
Hermitian matrix H:
p
H ¼ A ei4
ð34Þ
Proof:
akl ¼ rei/
alk ¼ iakl ¼ irei/
akk ¼ 0 because of the exclusion of self references
hkl ¼ akl eiw ¼ rei/ eiw ¼ reið/þwÞ
p
hlk ¼ alk eiw ¼ irei/ eiw ¼ ei2 reiðw/Þ
For Hermitian matrices hkl ¼ hlk must hold, therefore,
p
reið/þwÞ ¼ reið2þw/Þ .
(g) This holds, if / þ w ¼ p2 w þ /.
(h) Solving for w leads to 2w ¼ p2. Thus w ¼ p4 :
qed
(a)
(b)
(c)
(d)
(e)
(f)
3. Under this similarity transformation the coordinate independent
characteristics of the original communication patterns is kept
(Meyer, 2000, p. 256), no information is lost.
Table 1 shows, how the following four qualitatively different types
of communication behavior remain visible after the rotation:
1. No self reference.
2. More outbound than inbound traffic ðk ! l > l ! kÞ leads to a
negative sign of the imaginary part of hkl .
3. More inbound than outbound traffic ðl ! k > k ! lÞ leads to a positive sign of the imaginary part of hkl .
4. Outbound equals inbound traffic ðk ! l ¼ l ! kÞ.
Note that the matrix is invariant under rotation.
This construction of H is related to ChinoÇs construction HC by
pffiffi
HC ¼ 22 H. To show this we set HC ¼ 12 ðB þ Bt Þ þ i 12 ðB Bt Þ and
p
H ¼ Aei2 , and we compute HC ¼ 12 ðB þ Bt Þ þ i 12 ðB Bt Þ ¼ 12 ðBð1 þ iÞ
pffiffi
pffiffi
pffiffi
p
p
p
þBt ð1 iÞÞ ¼ 22 ðBei2 þ Bt ei2 Þ ¼ 22 ðB iBt Þei2 ¼ 22 H.
TABLE 1 Communication Behavior Representation
Communication behavior
no self reference
k!l>l!k
l!k>k!l
k!l¼l!k
akl ¼ m þ ip
hkl ¼ mr þ ipr
akk ¼ 0
m>p
m<p
m¼p
hkk ¼ 0
pr < 0
pr > 0
pr ¼ 0, mr > 0
Eigenspectral Analysis of Hermitian Adjacency Matrices
275
Because of the rotation invariance of a complete orthonormal eigenvector system, we can improve the visibility of special eigenvector components by applying a rotation which makes these components real.
For example, Mathematica (Wolfram Research, 1999) automatically
rotates the eigenvector so that the eigenvector component with the
highest absolute value becomes real positive. This element is regarded
in this paper as the anchor or most influential member of a subgroup.
4. EIGENSPECTRAL ANALYSIS OF NETWORK DATA
To show how the eigensystem of a complex Hermitian adjacency
matrix reflects a communication structure, we present two different
synthetic structures and as an example of real data we take the EIES
data set (Wasserman, 1994) as well as a subset thereof as described by
Freeman (Freeman, 1997, p. 11). The eigensystems have been calculated using the function Eigensystem of Mathematica (Wolfram
Research, 1999).
4.1 Eigenspectral Analysis of a Star Graph
As a basic construct, consider a directed and weighted star graph with
5 members as in Figure 1.
The complex adjacency matrix A1 belonging to the graph in Figure 1
is:
0
1
0
1 þ 2i 2 þ 3i 2 þ i 1 þ i
B2 þ i
0
0
0
0 C
B
C
C
A1 ¼ B
3
þ
2i
0
0
0
0
ð35Þ
B
C
@ 1 þ 2i
0
0
0
0 A
1þi
0
0
0
0
FIGURE 1 Star graph with 5 vertices corresponding to Eq. (35).
276
B. Hoser and A. Geyer-Schulz
TABLE 2 Eigensystem for H1 with z ¼ a þ ib
k
5.0
5.0
0
0
0
x
0.71
0:3 þ 0:1i
0:5 þ 0:1i
0:3 0:1i
0:2
0.71
0:30 0:10i
0:50 0:1i
0:30 þ 0:10i
0.20
0
0:27 þ 0:09i
0:138 þ 0:028i
0:083 0:028i
0.94
0
0:36 þ 0:27i
0:19 þ 0:11i
0.86
0:083 þ 0:028i
0
0:72
0:64 0:08i
0:18 þ 0:13i
0:134 þ 0:045i
which after rotation becomes the Hermitian matrix H1 in Eq. (36):
0
1
0
2:1 þ 0:7i 3:5 þ 0:7i 2:1 0:7i 1:4
B 2:1 0:7i
0
0
0
0 C
B
C
C
H1 ¼ B
3:5
0:7i
0
0
0
0
ð36Þ
B
C
@ 2:1 þ 0:7i
0
0
0
0 A
1:4
0
0
0
0
The corresponding eigensystem is given in two different representations to make certain aspects more visible. As can be seen in both
Tables 2 and 3, there are two eigenvalues of the same absolute value
but with different sign. As was shown in Equation (30) this is indeed
the characteristic of the adjacency matrix of a star graph. As can also
be seen the eigenvectors belonging to the two eigenvalues are the
same in absolute values but differ by p in phase. Note, that the member with ID 1 is the center of the star graph and is indicated as such
by the real eigenvector component that has no shift in phase between
eigenvector one and two. This member can thus be described as the
most central and prestigious member of the group. It can also be
seen, that the member with ID 3 is the one that has the most contact
with the anchor within the group because he has the second-highest
absolute eigenvector component. It can also be seen that the
TABLE 3 Eigensystem for H1 with z ¼ jzjei/ðzÞ
5:0
0
5:0
0
0
k
x
jzj
/ðzÞ
jzj
/ðzÞ
jzj
/ðzÞ
jzj
/ðzÞ
jzj
/ðzÞ
0.71
0.32
0.51
0.32
0.20
0
2.82
2.94
2.82
p
0.71
0.32
0.51
0.32
0.20
0
0.32
0.20
0.32
0
0
0.28
0.14
0.087
0.94
undef
2.82
2.94
2.82
0
0
0.45
0.22
0.86
0.087
undef
2.50
2.62
0
2.82
0
0.72
0.64
0.22
0.14
undef
0
3.02
0.64
0.32
Eigenspectral Analysis of Hermitian Adjacency Matrices
277
members 2 and 4 are of same relevance (same absolute value of the
eigenvector component) but show opposite direction in communication (complex conjugate imaginary part), indicating that while
member 2 writes more to member 1, member 4 receives more from
member 1.
4.2 Eigenspectral Analysis of a Perturbed Star Graph
Next, let us now consider a perturbed star graph as presented
in Figure 2. The matrix A2 represents the graph shown in
Figure 2. The perturbation is given by an inbound and outbound
connection between members with ID 2 and 3 of weight 2 in both
directions.
0
1
0
1 þ 2i 2 þ 3i 2 þ i 1 þ i
B2 þ i
0
2 þ 2i
0
0 C
B
C
C
A2 ¼ B
3
þ
2i
2
þ
2i
0
0
0
ð37Þ
B
C
@ 1 þ 2i
0
0
0
0 A
1þi
0
0
0
0
which after rotation becomes the Hermitian matrix H2 in Equation
(38):
1
0
0
2:1 þ 0:7i 3:5 þ 0:7i 2:1 0:7i 1:4
B 2:1 0:7i
0
2:8
0
0 C
C
B
B
ð38Þ
H2 ¼ B 3:5 0:7i
2:8
0
0
0 C
C
@ 2:1 þ 0:7i
0
0
0
0 A
1:4
0
0
0
0
FIGURE 2 Perturbed star graph with 5 vertices as in Eq. (37).
278
B. Hoser and A. Geyer-Schulz
TABLE 4 Eigensystem for H2 with z ¼ a þ ib
4:5
k
6.2
x
0.61
0:46 0:13i
0:56 0:13i
0:21 þ 0:07i
0.14
2:5
0.70
0:26 0:01i
0:027 þ 0:068i
0.78
0:57 þ 0:07i 0:50 0:06i
0:33 0:11i
0:22 þ 0:08i
0:22
0:15 þ 0: 103 i
0.79
0
0:23 0:08i
0:34 þ 0:21i
0:25 þ 0:17i
0.70
0:42 0:14i
0
0
0
0:51 0:17i
0.85
The eigensystem belonging to matrix H2 has the form shown in
Tables 4 and 5.
As predicted by Equation (32) the eigenvalues of matrix H1 are now
disturbed. The consequence of the perturbation can be seen when we
view H2 as the sum of the unperturbed matrix H1 and the perturbation matrix P2 of the form
1
0
0
0
0
0 0
B0
0
2 þ 2i 0 0 C
C
B
B
ð39Þ
P2 ¼ B 0 2 þ 2i
0
0 0C
C
@0
0
0
0 0A
0
0
0
0 0
The eigenvalues of matrix P2 after rotation are k1 ¼ 2:8; k2 ¼ 2:8.
Thus the largest eigenvalue k1 ¼ 6:2 of the perturbed matrix H2
should be inside the Weyl-type bounds given by 5:0 þ 2:8 ¼ 7:8 6:2 2:2 ¼ 5:0 2:8 and the smallest eigenvalue k2 ¼ 4:5 should be inside
the bound given by 5 þ 2:8 ¼ 2:2 4:5 5 2:8 ¼ 7:8. This is
the case as seen in Table 4. Note, that due to the perturbation the symmetry of the eigenvalues in Table 2 is destroyed. In addition, the perturbation leads to a break in symmetry between the eigenvectors. The
eigenvector components affected by the perturbation do no longer
show the regular phase shift of p anymore. For members with ID 2
TABLE 5 Eigensystem for H2 with z ¼ ðjzj; /ðzÞÞ
4:5
6.2
2:5
0
0.79
k
x
jzj
/ðzÞ
jzj
/ðzÞ
jzj
/ðzÞ
jzj
/ðzÞ
jzj
/ðzÞ
0.61
0.48
0.57
0.22
0.14
0
0.27
0.22
0.32
0
0.70
0.073
0.58
0.35
0.22
0
1.19
3.02
2.82
p
0.27
0.78
0.50
0.23
0.15
3.10
0
3.03
0.37
0.04
0.25
0.40
0.31
0.70
0.44
0.32
2.60
2.54
0
0.32
0
0
0
0.53
0.85
undef
undef
undef
2.82
0
Eigenspectral Analysis of Hermitian Adjacency Matrices
279
and 3 this is visible in Table 5. However, a perturbed star graph can
still be recognized by the the existence of an anchor, which can be
identified by the eigenvector component with the highest absolute
value in two eigenvectors at the same ID position. In addition the difference in subgroup behavior can be seen. The subspace corresponding
to k1 shows a more equal distribution of absolute values then the subspace corresponding to k2 . This second subspace strengthens the star
like character of the communication.
4.3 Eigenspectral Analysis of Freeman’s EIES Data Set
To show the advantages of the method, we present the analysis of the
EIES data set as given in Wasserman and Faust (Wasserman, 1994,
p. 747). We have deleted the diagonal entries, since self reference is
not discussed.
When this data set is transformed as described in section 3, the distribution of the spectrum is given in Figure 3. For a better visualization of the symmetry inherent in the spectrum we have rearranged
the eigenvalues such that in Figure 4 the positive and negative eigenvalues kj sorted by jkj j.
From Figure 5 it is clear to see that the cumulative variance
PK 2
kk
r2c;K ¼ Pk¼1
with K < N covered by the first 5 eigenvalues is with
N
2
k¼1
kk
98% already close to 1. For this reason we will only consider the five
corresponding eigenspaces in the following analysis.
FIGURE 3 Eigenspectrum of the complete EIES data set sorted by value.
280
B. Hoser and A. Geyer-Schulz
FIGURE 4 Eigenspectrum of the complete EIES data set sorted by absolute
value and sign.
The symmetry of the spectrum suggests a star like pattern with perturbation as described in section 4.2. This is indeed the case as can be
seen in the distribution of the absolute value of the eigenvector components in Table 6. In Table 6 the eigenvectors corresponding to the first
5 eigenvalues and the 10 highest ranked members are given. As can be
seen from this table, the first ranked members of subspaces 1 and 2
FIGURE 5 Cumulative covered variance r2c;K by eigenvalues kk .
281
ID
k1
1.0
ID
k2
0.56
ID
k3
0.31
ID
k4
0.23
ID
k5
0.17
jx5l j
/5l
jx4l j
/4l
jx3l j
/3l
jx2l j
/2l
jx1l j
/1l
1
0.56
0
1
0.74
0
8
0.69
0
29
0.60
0
8
0.49
0
29
0.41
0.13
29
0.44
2.86
32
0.50
3.12
24
0.43
2.92
32
0.39
0.07
8
0.35
0.04
2
0.39
3.00
30
0.34
2.95
2
0.43
2.89
30
0.37
0.23
2
0.33
0.17
8
0.21
2.33
11
0.29
2.96
31
0.29
0.40
29
0.33
3.10
32
0.28
0.015
32
0.11
2.15
29
0.14
2.22
15
0.20
3.02
24
0.30
2.95
31
0.24
0.24
11
0.11
2.78
1
0.12
1.65
32
0.17
2.74
22
0.20
2.61
11
0.21
0.03
15
0.07
2.78
2
0.11
1.30
12
0.15
2.89
11
0.20
0.11
TABLE 6 Ranking of Authors by ID in the First Five Subspaces of the Full Data Set
24
0.16
0.09
18
0.06
3.13
9
0.10
0.45
11
0.13
2.71
2
0.19
2.82
30
0.10
0.23
26
0.06
2.74
31
0.09
0.24
14
0.13
2.82
15
0.19
3.06
27
0.09
0.24
31
0.06
1.10
17
0.06
3.09
10
0.11
2.65
14
0.14
2.73
282
B. Hoser and A. Geyer-Schulz
FIGURE 6 Distribution of jx1l j.
are the same as well as for subspaces 3 and 5, suggesting centers of
star-like patterns.
Figures 6 and 7 show the distribution of the absolute value of the
eigenvector components in the first two subspaces. The distribution
in the first subspace is more similar to a uniform distribution than
the second. This implies a connected communication pattern in the
FIGURE 7 Distribution of jx2l j.
Eigenspectral Analysis of Hermitian Adjacency Matrices
283
FIGURE 8 Distribution of /ðx1l Þ.
first subspace. The rapid decrease in the second distribution on the
other hand indicates a strong star-like pattern (around author with
ID 1) with the relevant authors with ID 29 and 2.
The distribution of the phase in the two subspaces is given in
Figures 8 and 9. As can be seen the distribution in Figure 8 is such
that the phase only varies between 0 / p4. This suggests that the
FIGURE 9 Distribution of /ðx2l Þ.
284
B. Hoser and A. Geyer-Schulz
communication between members in this pattern is balanced with
respect to direction with a little more weight on outbound communication with respect to the central member. Figure 9, on the other
hand, supports the star-like communication pattern because of the
high incidence of phase between þ 34 p and 34 p, which enhances
the relevance of the members of the star. It can also be seen that
while Wellman (ID 29) writes more often to the center, members
with IDs 2 and 8 receive more messages from the center. This is also
suggested by the spectrum and the absolute value distribution in
Figure 7. This second subspace serves as a corrective pattern for
the first subspace to show the star like structure around author
with ID 1.
In the first two subspaces we find that the group is first of all
connected and that the author with ID 1 is the major source and
focus of communication. In addition, it can be seen that one subgroup
(IDs 1, 29, 8 and 2) around the author with ID 1 is is very active and
more connected within itself than with the rest of the group. The
direction within this group is such that while the author with ID
29 is more outbound oriented, authors 2 and 8 are more inbound
oriented with respect to author 1. Members 8 and 32 show a high
similarity with respect to communication direction with the center,
as can be seen from their respective phase being close to 0 (or p).
These two (ID 8 and 32) play the major role in the third and fifth
subspaces.
Let us now consider the third and fifth subspace, since they share
the same most relevant member, namely the author with ID 8.
The distribution of the absolute values of the eigenvector components
is given in Figures 10 and 11. Figure 10 suggests a star like pattern
with one strong center around author 8 and four authors (32, 30, 29
and 24) who are relevant in that pattern. Figure 11 on the other hand
points towards a well connected pattern. This is similar to the subspaces 2 and 1.
The phase distributions given in Figures 12 and 13 support this
view. In Figure 13 it can be seen that the phase varies mainly between
0 / p4 and 3p
4 / p, which implies a balanced pattern regarding direction. Figure 12 on the other hand, shows a distribution almost
across the entire range.
In subspace 4 Wellman (ID 29) is the most relevant member.
Mullins (ID 24) and White (ID 2) show a strong similarity in pattern
behavior, which is obvious because of the same absolute value and
the same phase in comparison with the central member. The star-like
behavior can again be deduced from the distribution of the absolute
values in Figure 14 and the phase distribution in Figure 15.
Eigenspectral Analysis of Hermitian Adjacency Matrices
285
FIGURE 10 Distribution of jx3l j.
To summarize, the analysis of the EIES data set yields the following
information:
. The strongest pattern (eigenvalue k1 ) is that of an interconnected
group, with a lot of traffic centered on member with ID 1 (Freeman).
He communicates much with IDs 29, 8 and 2. This pattern states
FIGURE 11 Distribution of jx5l j.
286
B. Hoser and A. Geyer-Schulz
FIGURE 12 Distribution of /ðx3l Þ.
that a strong subgroup is connected to the complete group. This
strong pattern that covers about 60% of the total variance is most
probably because Freeman was the main communicator due to his
role as administrator in that exchange.
. The second-strongest pattern (eigenvalue k2 ) shows the behavior
within the subgroup. The subgroup is centered around ID 1 with
FIGURE 13 Distribution of /ðx5l Þ.
Eigenspectral Analysis of Hermitian Adjacency Matrices
287
FIGURE 14 Distribution of jx4l j.
IDs 29 and 8 being of next highest relevance in that subgroup. This
subgroup has a star-like communication pattern. The direction of
communication within that subgroup is also given.
. The third subspace reveals the pattern and subgroup hidden within
the main buzz of communication in the first subspaces. This pattern
now states that Bernard (ID 8) is the center of a star with members
FIGURE 15 Distribution of /ðx4l Þ.
288
B. Hoser and A. Geyer-Schulz
32, 30 and 11. Since Bernard is also part of the star around
Freeman, it may well be concluded that he serves as a link between
those subgroups.
. The fifth subspace shows that the subgroup around Bernard is again
well connected with the rest of the group.
. The fourth subspace shows that Wellman (ID 29) is the center of the
star with Mullins (ID 24) and White (ID 2) as strong members. This
again suggests the idea that Wellman is the link between the subgroup around Freeman and the people connected to himself in his
subgroup. The direction of communication is from Wellman to his
subgroup members.
The members with IDs 1 (Freeman), 29 (Wellman), 8 (Bernard),
2 (White) and 24 (Mullins) also play a major role in the subset analysis
of the next section.
4.4 Eigenspectral Analysis of a Subset of Freeman’s
EIES Data Set
As a smaller data example, consider the subset of EIES data given in
(Freeman, 1997, p.11). The difference here in comparison to the full
data set is that the one-to-all communication has been eliminated
(Freeman, 2004). According to (Freeman 2004), we give for each member of the subset his ID in the subset and the complete set to link the
two data sets: Freeman (ID 1,1), White (ID 2,2), Alba (ID 3,4), Bernard
(ID 4,8), Doreian (ID 5,11), Mullins (ID 6,24), and, last but not least,
Wellman (ID 7,29). In the rest of this section only names will be used
when referring to members of the subset. In Table 7 the outbound
numbers of messages between the member of the group are given.
TABLE 7 Number of Message Exchanged between Members in the
EIES Sub Set
Author
ID
Freeman
1
White
2
Alba
4
Bernard
8
Doreian
11
Mullins
24
Wellman
29
Freeman
White
Alba
Bernard
Doreian
Mullins
Wellman
1
2
4
8
11
24
29
0
84
16
127
57
23
118
115
0
10
22
9
4
24
17
4
0
17
4
3
5
93
5
15
0
57
9
35
53
5
3
57
0
8
15
33
0
3
12
8
0
45
84
15
4
34
10
33
0
Eigenspectral Analysis of Hermitian Adjacency Matrices
289
The resulting complex adjacency matrix F is given in Equation (40):
0
0
B 84 þ 115i
B
B
B 16 þ 17i
B
B
F ¼ B 127 þ 93i
B
B 57 þ 53i
B
B
@ 23 þ 33i
118 þ 84i
115 þ 84i 17 þ 16i
93 þ 127i 53 þ 57i
4 þ 10i
5 þ 22i
10 þ 4i
0
15 þ 17i
3 þ 4i
3 þ 3i
22 þ 5i
17 þ 15i
0
57 þ 57i
12 þ 9i
9 þ 5i
4 þ 3i
57 þ 57i
0
8 þ 8i
4
3 þ 3i
9 þ 12i
8 þ 8i
0
24 þ 15i
5 þ 4i
35 þ 34i
15 þ 10i
45 þ 33i
0
5 þ 9i
33 þ 23i 84 þ 118i
4i
1
15 þ 24i C
C
C
4 þ 5i C
C
C
34 þ 35i C
C
10 þ 15i C
C
C
33 þ 45i A
0
ð40Þ
The eigensystem of matrix F after rotation is given in Tables 8 and 9.
The results suggest the following interpretation:
. Freeman is the center of a very active star. About 93% of the total
communication level variance (Equation 31) is explained by this
structure. This corresponds to the results in section 4.3.
To see this we identify ID 1 as the anchor of eigenvectors x1 and x2
in Table 9 and compute the explained variance of this substructure
P
with k21 þ k22 = 7i¼1 k2i approximately 93%. Main corresponding partners are White, Bernard and Wellman, which also corresponds to
the previous results in Table 6. Only the ranking differs slightly.
To check for a star pattern we decomposed the matrix F into
the unperturbed ‘‘Freeman-star’’-matrix G, with g1l ¼ f1l and
gk1 ¼ fk1 for all k; l and gk;l ¼ 0 for all others, and a perturbation
matrix D with d1l ¼ 0; 8l and dk1 ¼ 0; ; 8k and dkl ¼ fkl ; 8k ¼
f2; . . . ; 7g; l ¼ f2; . . . ; 7g. We showed, that for the eigenvalues of
the F ¼ G þ D the Weyl-type bounds hold, which is an indication
that we really have correctly identified a star-shaped substructure.
. The first eigenvalue k1 ¼ 340 suggests, that this represents the communication behavior that Freeman and his subgroup show with
respect to the whole group. It points towards a strong connected
group which is again connected to the rest. Whereas the sign of
the second eigenvalue k2 ¼ 230 seems to point towards the fact
that the subgroup forms a star graph. These results are suggested
by the distribution of the absolute value of the eigenvector component. In addition from the phase information it can be seen that
the the direction of communication is different between White on
the one hand and Wellman and Bernard on the other in regard to
Freeman. While Bernard and Wellman are more outbound oriented,
White is more inbound oriented.
. In the eigenvector x3 the anchor is Bernard. His corresponding
partners are Doreian and Wellman. Bernard is at the center of a
link between Wellman and Doreian with almost equal amounts of
290
230
0.76
0.39 0.05i
0.011 0.012i
0.38 þ 0.06i
0.079 0.011i
0.013 0.032i
0.33 þ 0.07i
340
0.62
0.33 þ 0.06i
0.098 0.008i
0.45 0.07i
0.29 0.02i
0.17 þ 0.01i
0.40 0.06i
kk
xkl
p
55
0.073 0.021i
0.55 þ 0.07i
0.015 þ 0.004i
0.20 þ 0.03i
0.39 0.02i
0.55
0.53 þ 0.13i
90
0.056 0.005i
0.018 0.057i
0.121 þ 0.003i
0.65
0.55 þ 0.02i
0.22 þ 0.001i
0.45 þ 0.03i
TABLE 8 Eigensystem for Fei4 with z ¼ a þ ib
0.027 0.039i
0.12 0.16i
0.16 þ 0.03i
0.35 þ 0.15i
0.47 þ 0.25i
0.56
0.42 0.11i
25
0.15 0.001i
0.69
0.118 0.037i
0.18 þ 0.1i
0.32 þ 0.21i
0.51 0.11i
0.16 0.04i
13
0.059 0.026i
0.106 0.059i
0.97
0.001 þ 0.015i
0.14 þ 0.02i
0.15 þ 0.05i
0.002 þ 0.014i
2.9
291
p
/ðzÞ
0
0.18
0.09
0.15
0.07
0.05
0.15
jzj
0.62
0.34
0.098
0.45
0.29
0.17
0.41
kk
xkl
340
0.76
0.39
0.016
0.38
0.080
0.034
0.34
jzj
0
3.00
2.32
3.00
3.00
1.97
2.93
/ðzÞ
230
0.056
0.06
0.121
0.65
0.56
0.22
0.45
jzj
jzj
0.076
0.44
0.016
0.21
0.39
0.55
0.55
/ðzÞ
0.09
1.88
3.12
0
3.10
0.02
3.07
90
TABLE 9 Eigensystem for Fei4 with z ¼ ðjzj; /ðzÞÞ
/ðzÞ
2.86
0.15
0.29
2.99
0.04
0
2.91
55
0.047
0.20
0.16
0.38
0.53
0.56
0.44
jzj
25
0.96
0.92
2.93
2.75
2.64
0
0.25
/ðzÞ
0.15
0.69
0.12
0.20
0.38
0.52
0.16
jzj
/ðzÞ
0.02
0
0.30
2.64
2.56
2.94
2.89
13
0.064
0.12
0.97
0.015
0.14
0.16
0.014
jzj
/ðzÞ
2.73
2.63
0
1.67
3.03
0.34
1.73
2.9
292
B. Hoser and A. Geyer-Schulz
communication between Bernard and his two partners. They cover
about 4%.
This result differs slightly from the previous results, since authors
with ID 32 and 30 are not in the subset data. Apart from that
Doreian and Wellman also are strong partners in the full data set.
. In eigenvector x4 and x5 Mullins is the center of a star. His star
though explains only about 2% of the total communication level.
In addition he communicates mostly with Wellman, which can be
seen in the fact that the absolute value of the corresponding eigenvector components are very close together.
To summarize, these examples show:
1. The eigenvalues of F show the perturbation behavior of a matrix as
in Equation (29).
2. The eigenvalues react to the perturbation in that they split asymmetrically according to Equation (32).
3. The eigenvalues can be used to show how much of the total variation is covered by one subgroup Equation (31).
4. The eigenvectors belonging to two distinct, but coordinated eigenvalues show the same subgroup members, maybe slightly in different order of relevance depending on the kind of perturbation. See
Table 9 columns x4 and x5 for the ‘‘Mullins-star.’’
5. The largest absolute value eigenvector component relates to the
most relevant subgroup member. See last paragraph in section 3.
6. Since for dominant star graph-like structures the eigenvalues come
in pairs of the same absolute value but different sign, which means
their subspaces are rotated by p, the eigenvector components also
show this rotation and if the matrix is perturbed the components
will also have a perturbation of their phase.
5. CONCLUSIONS
We showed that the proposed eigensystem analysis of Hermitian
adjacency matrices is an improvement in comparison to the existing
analysis tools based on the rank prestige index as defined in
Wasserman and Faust (Wasserman, 1994, p. 205-219) in that the proposed method describes the complete substructure of a group by virtue
of the use of all eigenpairs based on their relevance as defined by their
explanatory strength in terms of variance and in that no arbitrary
assumptions about the symmetrization of real adjacency matrices
are used. It also seems to support the notion that not only group
structure but also core=periphery questions might be solved with this
Eigenspectral Analysis of Hermitian Adjacency Matrices
293
method. The additional phase information might further help to define
in which direction the communication flows.
We presume, that the behavior of the eigenvalues and their
respective eigenvector components is comparable to the behavior of
a perturbed matrix of the form H ¼ A þ E, even if we do not know
the exact form of the unperturbed matrix. We tentatively decompose
the matrix in its subspaces according to the identified dominant substructure from the spectral decomposition of the perturbed matrix
and check if the Weyl-type bounds still hold for this decomposition.
If this is true, we take this as an indication that we correctly identified a substructure. We do not yet know whether this can be done
repeatedly.
The analysis of the data sets showed that the proposed method
offers consistent interpretation of a well-known data set. In addition,
if offers more information regarding direction of communication with
groups and between subgroups. To investigate this further we plan
to design and analyze communication networks in such a way as to
gain a deeper understanding of the possibilities. One promising communication network is a market, where the trading behavior can be
modeled in such a way that it resembles an asymmetric communication network. We have set up two types of markets, namely election
markets (http://psm.em.uni-karlsruhe.de/psm/) and technology forecast markets (http://tfm.em.uni-karlsruhe.de/psm/) and we plan to
analyze the data resulting from these markets with the method proposed in this market. Further application might be e-mail traffic
within an organization. Thus there is a high potential in the method
to gain further insight into the communication behavior and its
relevance in respect to group structure.
REFERENCES
Barnett, G. A. & Rice, R. E. (1985). Longitudinal Non-Euclidean Networks: Applying
Galileo. Social Networks, 7, 287–322.
Bonacich, P. & Lloyd, P. (2001). Eigenvector-like measurement of centrality for asymmetric relations. Social Networks, 23, 191–201.
Bronstein, I. N., Semendjajew, K. A., & Musiol, G. (2000). Taschenbuch der Mathematik,
Harry Deutsch: Frankfurt am Main.
Chino, N. (1998). Hilbert space theory in psychology, Bulletin of the Faculty of Letters
No.28 Japan: Aichi Gakuin University.
Freeman, L. (2004). Personal communication. E-mail: (27 Oct 2004).
Freeman, L. C. (1997). Uncovering organizational hierarchies. Computational & Mathematical Organization Theory, 3(1), 5–18.
Hoser, B. & Geyer-Schulz, A. (2003). Use of Hermitian matrices in the analysis of newsgroups. Verhandlungen der Deutschen Physikalischen Gesellschaft: AKF-Frühjahrstagung Dresden 2003, 38(5), 525.
294
B. Hoser and A. Geyer-Schulz
Kato, T. (1995). Perturbation Theory for Linear Operators, New York: Springer, 2nd
edition.
Meyer, C. D. (2000). Matrix Analysis and Applied Linear Algebra, Philadelphia: Society
for Industrial and Applied Mathematics.
Richards, W. & Seary, A. (2000). Eigen analysis of networks. Journal of Social Structure,
1(2), 1–19.
Tyler, J. R., Wilkinson, D. M., & Huberman, B. A. (2003). E-mail as Spectroscopy:
Automated Discovery of Community Structure within Organizations. e-print condmat=0303264.
Wasserman, S. & Faust, K. (1994). Social Network Analysis, Methods and Applications,
Cambridge: Cambridge University Press.
Wellman, B. (2001). Computer Networks As Social Networks. Science, 293, 2031–2034.
Wolfram Research, I., editor (1999). Mathematica, Illinois: Wolfram Research, Inc.,
Champaign, Version 4 edition.