Titel des Artikels

Scott, John. (2000) “Social Network analysis”
Course: Inter organisational relationships
Author: Michiel Westra ([email protected]
Abstract: This article provides an introduction to the theory and practice of network analysis in the social sciences. It explains the general framework, the basic concepts, technical measures and it reviews the available computer programs. Definitions of points,
lines and paths are given and their use in clarifying such measures as density, fragmentation and centralization. The author identifies the various cliques, components and circles
into which networks are formed, and outlines an approach to the study of socially structured positions. Finally, the use of multi dimensional methods for investigating social
networks is discussed.
CHAPTER 3:
Social network analysis emerged as a set of methods for the analysis of social
HANDLING RE-
structures, methods that specifically allow an investigation of the relational as-
LATIONAL DA-
pects of these structures. All social research data, once collected, must be held in
TA
some kind of data matrix (Galtung 1967), a framework in which the raw or coded
data can be organised in a more or less efficient way. The logical structure of the
data matrix is always that of a table.
Variables
Sex
Age
Income etc...
CASE-BYCases: 1
AFFILIATION
MATRIX
2
attributes
3
4
…
figure 3.1 a data matrix for variable analysis
This data matrix (case by variable) cannot be used for relational data. This kind of
data must be seen in terms of a case-by-affiliation matrix in which affiliations are
organisations, events etc, in which the cases (agents) are involved. This is called a
two-mode or rectangular, because the rows and columns refer to different sets of
data.
Figure 3.2. show this kind of matrix. Three people are involved (labelled 1,2,3)
and three events (labelled A,B,C). Where a specific individual participates in a
particular event, there’s a ‘1’ in the corresponding cell of the matrix. Non1/10
participation is shown by a ‘0’. The sociogram next to the matrix can be read as
saying that each person meets the other two at a particular event.
CASE-BY-CASE
It may be quitte impossible, using conventional manual methods of drawing, to
MATRIX
construct a sociogram for a large network. Alternative ways of recording the connections are made. The solution that has been most widely adopted has been to
construct a case-by-case matrix in which each agent is listed twice – once in the
rows and once in the columns. In this case, both the rows and the columns will
represent the cases, and the individual cells will show whether or not pairs of
individuals are related through common affiliation. The presence or absence of
connections between pairs of agents is represented by ‘1’ or ‘0’ in the appropriate
cells of the matrix. This matrix shows the actual relations or ties among the
agents.
AFFILIATION-
This matrix shows affiliations in both rows and its columns, with the individual
BY-
cells showing whether particular pairs of affiliations are linked through common
AFFILIATION
agents. This matrix is extremely important in network analysis and can often
MATRIX
throw light on important aspects of the social structure that are not apparent from
the case-by-case matrix.
INCIDENCE
The rectangular matrix (case-by-affiliation) is generally termed an incidence ma-
AND ADJACEN-
trix, while the square matrices (case-by-case, or affiliation-by-affiliation) are
CY MATRIX
termed adjacency matrices.
(See also appendix, figure 3.3)
Most techniques of network analysis involve the direct manipulation of adjacency
matrices, and so involve a prior conversion of the original incidence matrix into its
two constituent adjacency matrices. It is critically important therefore that researchers understand the for of their data (incidence or adjacency) and the assumptions that underpin particular procedures of network analysis.
For example, if a researcher collects two-mode data on cases and their affiliations,
then, it will generally be most appropriate to organize this info into an incidence
matrix form which the adjacency matrices used in network analysis can later be
derived. However in some situations it will be possible for a researcher to collect
relational data in a direct case-by-case form, for example friendship choices within
a small group. In this situation, the information can be immediately organized in
an adjacency matrix. (there are no corresponding incidence matrix and no complementary adjacency matrix of affiliations. All agents have merely a single affiliation in common – the fact of having chosen one another as friends).
2/10
The choice of which set of agents to treat as the cases for the purpose of network
analysis will depend simply on which is seen as being the most significant in terms
of the research design. (For example, if the organizations are assumed to be of
the greatest importance, then a sample of organizations will be selected for study
and the only people who will figure in the analysis will be those who happen to be
members of these organizations). It makes no difference which of the two (organizations or members) are regarded as cases.
The distinction between cases and affiliations may generally be regarded as a
purely conventional feature of research designs for network analysis.
SOME CONVEN- A further aspect of this convention is to place the cases on the rows of the inciTIONS
dence matrix and the affiliations on its columns. It is also conventional to refer to
the rows before the columns when describing the contents of any particular cell,
and to use the letter ‘a’ to refer to the actual value contained in that cell. a(i,j) is
the general form for the content which refers to the value contained in the cell
corresponding to the intersection of row ‘i’ with column ‘j’.
ADJACENCY
The diagonal cells with this matrix are different from all other cells in the matrix.
MATRIX
In a square matrix, the diagonal cells shows the relations between any particular
case and itself. Therefore, these cells contain no values and should be ignored in
the analysis. Appendix 3.5 also shows that the adjacency matrix are symmetrical
around their diagonals; the top half of each matrix is an identical, mirror image of
its bottom half. Therefore, many analytical procedures in network analysis require
only the bottom half of this matrix and not the full matrix.
LEVEL OF
One of the most important considerations in variable analysis is the level of meas-
MEASUREMENT
urement that is appropriate for a variable. (nominal, ordinal, ratio, interval). Appendix 3.6 uses two dimensions to classify the four levels of measurement in
relational data;
1. Numeration (binary or valued)
2. Directionality (undirected or directed).
For example, the simplest type of relational data (type 1) is that which is both
undirected and binary.
STORAGE OF
If relational data is properly stored, they can be managed and manipulated more
RELATIONAL
efficiently. The need to use computers for network analysis, then, means that it is
DATA
important to consider how the logical structure of the data matrix can be translated into a computer file. For example the research on interlocking directorships
3/10
invloves (1) generating a list of directors in the target compagnies (2) sorting this
into alphabetical order, (3) and then identifying those names that appear two or
more times. (using a word processor to create a data file..)
THE SELECTION The selection problems concern the boundedness of social relations and the posOF RELATIONAL sibility of drawing relational data from samples. Also, social relations are social
DATA
constructs, produced on the basis of the definitions of the situation made
by
group members. Therefore, ‘ close friendship’ for example, may mean different
things to different people. This issue is important, as researchers often have unrealistic views about the boundaries of relational systems (Laumann et al., 1989)
Researchers are involved in a process of conceptual elaboration and model building, not a simple process of collecting pre-formed data.
Two general approaches to this task have been identified: (1) positional approach
and (2) the relation approach.
POSITIONAL
In this approach, the researcher samples from among the occupants of particular
APPROACH
formally defined positions or group memberships. First, the positions or groups
that are of interest are identified, and then their occupants or members are sampled.
RELATIONAL
This approach can be used where there are no relevant positions , where there is
APPROACH
no comprehensive listing available, or where the knowledge of the agents themselves is crucial in determining the boundaries of the population. (for example,
informants are asked to nominate powerful members of the community and these
nominations are combined into a target population). The researcher must have
good reasons to believe that the informants will have a good knowledge of the
target population and are able to report this accurately.
A particular variant of this approach is the ‘snowballing technique’ in which a small
numbers of informants are studied and each is asked to nominate others for
study. In this method, the social relation itself is used as a chain of connection for
building the group.
SAMPLING
There are different sample problems, especially when focusing on large scale social systems. For example, in the case of a fairly small village with a population of
5,000 people, the adjacency matrix would contain 25,000,000 cells, which is beyond the capacity of most available computers and software. It might be assumed, therefore, that sampling from large populations would provide a similar
workable solution for social network analysis. (appendix 3.8 gives a schematic
account of the ideal sampling process in social) But, a representative sample of
agents, does not, in itself, give a useful sample of relations (Alba, 1982:44).
4/10
There seem to be three different responses to these sampling problems;
1. to abandon any attempt to measure the global properties of social networks and to restrict attention to personal ego-centric networks;
2. to use a form of snowballing (which is not a random sample);
3. a way moving on to some of the more qualitative features of social networks (Burt, 1983). Burt, in particular, is concerned with the identification
of ‘positions’ or structural locations such as roles. (obtaining info from
each respondent about their social attributes and the attributes of those
to whom they are connected with. Then grouping of agents into sets with
commonly combinations of attributes, and then arrange these into a setby-set square matrix that shows the frequencies of relations between
members of the various categories.
CHAPTER 4:
If the sociogram is one way of representing relational matrix data (see previous
POINTS, LINES
chapter), the language of graph theory is another, and more general, way of do-
AND DENSITY
ing this. It is a starting point for many of the most fundamental ideas of social
network analysis.
Data in matrix form can be read by the programs, and suitable graph theoretical
concepts can be explored without the researcher needing to know anything about
the mechanism of the theory or of the matrix algebra.
SOCIOGRAMS
Graph theory concerns sets of elements and the relations among these; elements
AND GRAPH
being termed as points and the relations lines.Thus a matrix describing the rela-
THEORY
tions among a group a people can be converted into a graph of points connected
by lines.
The graphs of graph theory express the qualitative patterns of connection among
points. In a graph, it is the pattern of connections that is important, and not the
actual positioning of the points on the page. This theory does involve concepts of
length and location, for example, but these do not correspond to these concepts
of physical length and location which we are most familiar with.
There are different graphs, depending on the element which is focused on;
1. directed graph = if the relations are directed from one agent to another.
2. valued graph (example: appendix 3.5)= if the intensity of the relation is
an important consideration and can be represented by a numerical value.
A matrix for a directed graph will not usually be symmetrical, as relations will not
normally be reciprocated.
IMPORTANT
Two points that are connected by a line are said to be a adjacent to one anoth-
TERMS
er. It means that two agents represented by the points are directly related or
5/10
connected with one another.
Those points to which a particular point is adjacent are termed its neighbourhood, and the total number of other points in its neighbourhood is termed its
degree. Thus, the degree of points is a numerical measure of the size of its
neighbourhood.
A sequence of lines in a graph is called a walk, and a walk in which each point
and each line are district is called a path.
The length of the path is the number of lines it contains. The distance between two points is the length of the shortest path that connects them.
The indegree of a point is the total number of other points that have lines directed
towards it; and its outdegree is the total number of points to which it direct
lines.
The more points that are connected to one another, the more dense will the
graph be. Density depends upon two other parameters of network structure: (1)
inclusiveness – the total number of points minus the number of isolated pointsand (2) the sum of the degree of its points.
Density of a graph is defined as the number of lines in a graph, expressed as a
proportion of the maximum possible number of lines.
The formula for this is:
L : n (n-1) / 2
The simplest and most straightforward way to measure the density of a large
network from a sample data would be to estimate it from the mean degree of
cases included in the sample. The degree sum – the sum of the degrees of all
points in the graph – is equal to the estimated mean degree multiplied by the
total number of case in the population. As the maximum possible number of lines
can always be calculated directly from the total number of points (it is always
equal to n (n-1) / 2 in an undirected graph), the density of the graph can be estimated by calculating:
(n x mean degree) /2 : n (n-1) / 2
When reduced, the formula = (n x mean degree) / n (n-1).
6/10
APPENDIX
7/10
8/10
9/10
10/10