GraphX - Fossies

Preprocessing
Compute
Post Proc.
<</ />>
</>
XML
Raw
Data
ETL
Initial
Graph
Analyz
Slice
Compute
e
Subgraph PageRank Top
Users
Repeat
GraphX
Raw Wikipedia
Hyperlinks
PageRank
Top 20 Pages
<</ />>
</>
HDFS
XML
Spark Preprocess
HDFS
Compute
Naïve Spark
Spark Post.
1492
Giraph + Spark
605
GraphX
342
GraphLab + Spark
375
0
200
400
600
800 1000 1200 1400 1600
Total Runtime (in Seconds)
Property Graph
Advisor
3
Id
Property (V)
3
(rxin, student)
7
(jgonzal, postdoc)
5
(franklin, professor)
2
(istoica, professor)
5
rxin
stu.
franklin
, prof.
jgonzal
,
pst.doc
.
Colleague
Collab.
7
Vertex Table
2
istoica
prof.
Edge Table
SrcId
DstId
Property (E)
3
7
Collaborator
5
3
Advisor
2
5
Colleague
5
7
PI
Data-Parallel
Table
Property Graph
Row
Row
Result
Row
Row
Graph-Parallel
Hyperlinks
Raw
Wikipedia
PageRank
Title PR
Text
Table
Title Body
<</ />>
</>
Top 20 Pages
Term-Doc
Graph
Topic Model
(LDA)
Word Topics
XML
Word Topic
Discussion
Table
User Disc.
Editor Graph
Community
Detection
User
Community
Community
Topic
User Com.
Topic Com.
Vertex
Table
(RDD)
Property Graph
B
C
A
Part. 1
D
2D Vertex Cut Heuristic
A
F
D
E
Routing
Table
(RDD)
Part. 2
Edge Table
(RDD)
A
B
A
C
A
A 1 2
B
B 1
B
C
C
C 1
C
D
A
E
A
F
E
D
E
F
D
D
1 2
E
E 2
F
F 2
Edge Cut
Vertex Cut