國立雲林科技大學
National Yunlin University of Science and Technology
Finding a Team of Experts in
Social Networks
Theodoros Lappas, Kun Liu, and Evimaria Terzi
KDD, 2009
Reported by Wen-Chung Liao, 2009/12/22
1
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
Outlines
2
Motivation
Objective
Preliminary
Problems
Algorithms
Experiments
Conclusions
Comments
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
Motivation
The success of a project depends not only on
the expertise of the people who are involved,
but also on how effectively they collaborate,
communicate and work together as a team.
Figure 1: Network of connections
between individuals in
{a, b, c, d, e}.
3
Xa={algorithms},
Xb={web programming},
Xc={software engineering, distributed systems},
Xd={software engineering}
Xe={software engineering, distributed systems,
web programming}.
T={algorithms, software engineering, distributed
systems, web programming}.
X’ = {a, b, c} or X” = {a, e}
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
Objectives
4
Given a task T that requires a set of skills, our
goal is to find a set of individuals X’ X, such
that every required skill in T is exhibited by at
least one individual in X’.
Additionally, the members of team X’ should
define a subgraph in G with low
communication cost.
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
Preliminaries
X = {1, . . ., n}: n individuals
A = {a1, . . . , am}: a universe
of m skills
Xi A
T: a task, a subset of skills
required to perform a job.
T A.
5
S(a): the support set of the
skill a, the set of individuals in
X that has the skill a.
S(a) = {i | i X and a Xi}.
G(X,E): an undirected and
weighted graph
d(i, i’)
Path(i, i’)
d(i,X’) = mini’ X’ d(i, i’)
Path(i, X’)
G[X’]: the subgraph of G that
contains only the nodes in X’.
2
3
4
1
7
6
8
5
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
Problems
Problem [Team Formation]:
Given the set of n individuals X = {1, . . . , n}, a graph G(X,E),
and task T, find X’ X, so that C (X’, T) = T, and the
communication cost Cc(X’) is minimized.
Diameter (R): Cc-R(X’)
Minimum Spanning Tree (Mst): Cc-Mst (X’),
Proposition 1. The Diameter-TF problem is NP-complete.
Proposition 2. The Mst-TF problem is NP-complete.
6
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
Algorithms
S(a1)
3
2
4
1
7
S(a0)
={1,7}
S(a2)
6
O(|S(arare)| × n)
Proposition 3. Cc-R(X’) ≦2 Cc-R(X*)
7
8
S(a3)
5
R1=max{1, 2, 2}=2
R7=max{1, 0, 1}=1
X’= {7}∪{2, 8}
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
O(|T| × |X|)
O(|X0| × |E|)
2
3 v*
4
1
7
v
8
X’
6
8
5
Intelligent Database Systems Lab
X0
N.Y.U.S.T.
I. M.
O(k × |E|)
2
Y2
3
4
1
7
8
Y1
v
X’
9
6
5
Y3
v*
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
Experiments
• GreedyDiameter (GreedyMST)
•
•
•
•
•
•
•
•
10
DBLP dataset: papers in DB, DM, AI and T conferences.
Xdblp: authors that have at least three papers.
5508 individuals.
Xi: terms that appear in at least two titles of papers that
author i has co-authored.
1792 distinct skills.
Authors i, i’ are connected in Gdblp (Xdblp,E) if they appear
as co-authors in at least two papers.
A task T(t, s) is generated: (1) select S from {DB, DM,
AI,T} with |S| = s. (2) randomly pick t required skills.
For every (s, t), generate 100 random tasks, t=2, 4,… , 20
and s = 1,…,4.
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
Experiments
11
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
Conclusion
12
Address the problem of forming a team of
skilled individuals to perform a given task,
while minimizing the communication cost
among the members of the team.
Prove that the Team Formation problem is NPHard.
Propose appropriate approximation algorithms.
Intelligent Database Systems Lab
N.Y.U.S.T.
I. M.
Comments
13
Advantage
Shortage
Applications
─ Team formation
Intelligent Database Systems Lab
© Copyright 2026 Paperzz