Preprocessing Compute Post Proc. <</ />> </> XML Raw Data ETL Initial Graph Analyz Slice Compute e Subgraph PageRank Top Users Repeat GraphX Raw Wikipedia Hyperlinks PageRank Top 20 Pages <</ />> </> HDFS XML Spark Preprocess HDFS Compute Naïve Spark Spark Post. 1492 Giraph + Spark 605 GraphX 342 GraphLab + Spark 375 0 200 400 600 800 1000 1200 1400 1600 Total Runtime (in Seconds) Property Graph Advisor 3 Id Property (V) 3 (rxin, student) 7 (jgonzal, postdoc) 5 (franklin, professor) 2 (istoica, professor) 5 rxin stu. franklin , prof. jgonzal , pst.doc . Colleague Collab. 7 Vertex Table 2 istoica prof. Edge Table SrcId DstId Property (E) 3 7 Collaborator 5 3 Advisor 2 5 Colleague 5 7 PI Data-Parallel Table Property Graph Row Row Result Row Row Graph-Parallel Hyperlinks Raw Wikipedia PageRank Title PR Text Table Title Body <</ />> </> Top 20 Pages Term-Doc Graph Topic Model (LDA) Word Topics XML Word Topic Discussion Table User Disc. Editor Graph Community Detection User Community Community Topic User Com. Topic Com. Vertex Table (RDD) Property Graph B C A Part. 1 D 2D Vertex Cut Heuristic A F D E Routing Table (RDD) Part. 2 Edge Table (RDD) A B A C A A 1 2 B B 1 B C C C 1 C D A E A F E D E F D D 1 2 E E 2 F F 2 Edge Cut Vertex Cut
© Copyright 2026 Paperzz