Proceedings of the 3rd Annual ACM Web Science Conference, 2013

 Proceedings of the 3rd Annual ACM Web Science Conference, 2013 Paris, France WebSci ‘13 Conference Co-­‐Chairs: Hugh Davis, Harry Halpin, Alex Pentland Program Chairs: Mark Bernstein, Lada Adamic, Harith Alani, Alexandre Monnin, Richard Rogers The Association for Computing Machinery 2 Penn Plaza, Suite 701New York New
York 10121-0701
ACM COPYRIGHT NOTICE. Copyright © 2012 by the Association for Computing
Machinery, Inc. Permission to make digital or hard copies of part or all of this work
forpersonal or classroom use is granted without fee provided that copies are not
made or distributed for profit or commercial advantage and that copies bear this
notice and thefull citation on the first page. Copyrights for components of this work
owned by others than ACM must be honored. Abstracting with credit is permitted.
To copy otherwise, to republish, to post on servers, or to redistribute to lists,
requires prior specific permission and/or a fee. Request permissions from
Publications Dept., ACM, Inc., fax +1 (212) 869-0481, or [email protected].
For other copying of articles that carry a code at the bottom of the first or last page,
copying is permitted provided that the per-copy fee indicated in the code is paid
through the Copyright Clearance Center, 222 Rosewood Drive, Danvers, MA 01923, +1978-750-8400, +1-978-750-4470 (fax).
Notice to Past Authors of ACM-Published Articles
ACM intends to create a complete electronic archive of all articles and/or other material
previously published by ACM. If you have written a work that was previously published
by ACM in any journal or conference proceedings prior to 1978, or any SIG Newsletter
at any time, and you do NOT want this work to appear in the ACM Digital Library,
please inform [email protected], stating the title of the work, the author(s), and
where and when published.
ACM ISBN: 978-1-4503-1889-1
Extended abstracts in these proceedings are not covered under the above ACM
copyright. Authors retain copyrights to content included as extended abstracts.
WebSci2013
Program Committee
Program Committee
Lada Adamic
Harith Alani
Ioannis Anagnostopoulos
Lora Aroyo
Bruno Bachimont
Alain Barrat
Nancy Baym
Stéphane B. Bazan
Mark Bernstein
Jamie Blustein
Michael Bywater
Dominique Cardon
Les Carr
Carlos Alberto Alejandro Castillo
Ciro Cattuto
Pablo Cesar
John Henry Clippinger
Kate Crawford
Brian Croxall
Hugh Davis
David Deroure
Stefan Dietze
Alan Dix
Graeme Earl
Jim Fallows
Miriam Fernandez
Fabien Gandon
Aldo Gangemi
Serge Garlatti
Carole Goble
Dave Gray
Susan Halford
Harry Halpin
Conor Hayes
Clare Hooper
Yuk Hui
Nicolas Jullien
Marcel Karnstedt
David Kolb
Jerome Kunegis
George P. Landow
Christophe Lejeune
Pierre Livet
Cathy Marshall
Stacey Mason
J. Nathan Matias
Microsoft
Université Saint-Joseph de Beyrouth
Orange
CWI
University of Southampton
World Wide Web Consortium
Bates College
Aix-Marseille University
Microsoft
1
WebSci2013
Yelena Mejova
Alexandre Monnin
Yann Moulier-Boutang
Frank Nack
Wolfgang Nejdl
Kieron O’Hara
Gilles Phillips
Sophie Pène
Daniele Quercia
Jill Walker Rettberg
Richard Rogers
Daniel Romero
Inbal Ronen
Matthew Rowe
Daniel Schwabe
Wendy Seltzer
Judith Simon
Eddie Soulier
Steffen Staab
Thanassis Tiropanis
Susana Pajares Tosca
Johann Ugander
Michalis Vafopoulos
Wouter Van Atteveldt
Tommaso Venturini
Mark Veyrat
Mark Weal
Ingmar Weber
Matthew Weber
Su White
Marcus Wigan
2
Program Committee
Centre Pompidou Research and Innovation Institute
University of Amsterdam
ens
Oxford Systematics
WebSci2013
Table of Contents
Table of Contents
Toward a Next Generation of Network Models for the Web . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Hans Akkermans and Rena Bakhshi
1
Traditional media seen from social media . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
Jisun An, Daniele Quercia, Meeyoung Cha, Krishna Gummadi and Jon Crowcroft
Why individuals seek diverse opinions (or why they don’t) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Jisun An, Daniele Quercia and Jon Crowcroft
The Web Science Curriculum at work: The Digital Economy Master Program at
USJ-Beirut. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Stéphane Bazan and Michalis Vafopoulos
From networked publics to issue publics: Reconsidering the public/private distinction in
web science . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
Andreas Birkbak
Filling the Gaps Among DBpedia Multilingual Chapters for Question Answering . . . . . . . . . 33
Julien Cojan, Elena Cabrio and Fabien Gandon
Assessing the Educational Linked Data Landscap. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
Mathieu D’Aquin, Alessandro Adamou and Stefan Dietze
Social Media as a Measurement Tool of Depression in Populations . . . . . . . . . . . . . . . . . . . . . . . . 47
Munmun De Choudhury, Scott Counts and Eric Horvitz
Identifying Research Talent Using Web-Centric Databases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Anca Dumitrache, Paul Groth and Peter van Den Besselaar
A comparison between online and offline prayer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
Fabian Eikelboom, Paul Groth, Victor de Boer and Laura Hollink
On Measuring the Impact of Hyperlinks on Reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
Gemma Fitzsimmons, Mark Weal and Denis Drieghe
Theres no such thing as raw data. Exploring the socio-technical life of a government
dataset. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
Mark Frank and Tim Davies
AltOA: A Framework for Dissemination Through Disintermediation . . . . . . . . . . . . . . . . . . . . . . 79
Richard Fyson, Simon Coles and Leslie Carr
R-energy for Evaluating Robustness of Dynamic Networks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
Ming Gao, Ee-Peng Lim and David Lo
An Investigation into Correlations between Financial Sentiment and Prices in Financial
Markets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Paul Gaskell, Thanassis Tiropanis and Frank McGroarty
The Performativity of Data: Re-conceptualizing the Web of Data . . . . . . . . . . . . . . . . . . . . . . . . . 91
Marie Joan Kristine Gloria, Dominic Difranzo, Marco Fernando Navarro and Jim
Hendler
1
WebSci2013
Table of Contents
Producing a Unified Graph Representation from Multiple Social Network Views . . . . . . . . . . 100
Derek Greene and Padraig Cunningham
Voice-based Web access in rural Africa . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Nana Baah Gyan, Victor de Boer, Anna Bon, Chris van Aart, Stephane Boyera,
Hans Akkermans, Mary Allen, Aman Grewal and Max Froumentin
Petition growth and success rates on the UK No. 10 Downing Street Website . . . . . . . . . . . . . 114
Scott Hale, Helen Margetts and Taha Yasseri
Does the Web Extend the Mind? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121
Harry Halpin
Semantic Tagging on Historical Maps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130
Bernhard Haslhofer, Werner Robitza, Carl Lagoze and Francois Guimbretiere
Towards A Redefinition of Time in Information Networks? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140
Sebastien Heymann and Benedicte Le Grand
Web Science and the Two (Hundred) Cultures: Representation of Disciplines Publishing
in Web Science . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144
Clare J. Hooper, Georgeta Bordea and Paul Buitelaar
Sentiment and Topic Analysis on Social Media: A Multi-Task Multi-Label Classification
Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154
Shu Huang, Wei Peng, Jingxuan Li and Dongwon Lee
Sprint Methods for Web Archive Research . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164
Hugo C. Huurdeman, Anat Ben-David and Thaer Sammar
Who Wants To Get Fired? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173
Ricardo Kawase, Bernardo Pereira Nunes, Eelco Herder, Wolfgang Nejdl and Marco
Antonio Casanova
Detecting Cyberbullying: Query Terms and Techniques. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177
April Kontostathis, Kelly Reynolds, Andy Garron and Lynne Edwards
Preferential Attachment in Online Networks: Measurement and Explanations . . . . . . . . . . . . . 187
Jérôme Kunegis, Marcel Blattner and Christine Moser
Can simple social copying heuristics explain tag popularity in a collaborative tagging
system? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197
Jared Lorince and Peter Todd
Simultaneously Detecting Fake Reviews and Review Spammers using Factor Graph Model 207
Yuqing Lu, Lei Zhang, Yudong Xiao and Yangguang Li
Experiences Surveying the Crowd: Reflections on Methods, Participation, and Reliability . 216
Catherine C. Marshall and Frank M. Shipman
Toward Google Borders . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226
Antoine Mazières and Samuel Huron
The Rise and the Fall of a Citizen Reporter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 230
Panagiotis Metaxas and Eni Mustafaraj
2
WebSci2013
Table of Contents
An Empirical Analysis of Characteristics of Useful Comments in Social Media . . . . . . . . . . . . 231
Elaheh Momeni and Gerhard Sageder
Mechanical Turk as an Ontology Engineer? Using Microtasks as a Component of an
Ontology-Engineering Workflow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235
Natasha F. Noy, Jonathan Mortensen, Paul Alexander and Mark Musen
Aemoo: exploring knowledge on the Web . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 245
Andrea Giovanni Nuzzolese, Valentina Presutti, Aldo Gangemi, Alberto Musetti and
Paolo Ciancarini
Uncovering the Wider Structure of Extreme Right Communities Spanning Popular
Online Networks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249
Derek O’Callaghan, Derek Greene, Maura Conway, Joe Carthy and Padraig
Cunningham
Challenges and Opportunities of Local Journalism: A Case Study of the 2012 Korean
General Election . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 259
Souneil Park, Minsam Ko, Jaeung Lee, Aram Choi and Junehwa Song
Rethinking Measurements Of Social Media Use By Charities: A Mixed Methods Approach269
Christopher Phethean, Thanassis Tiropanis and Lisa Harris
Mining User Behaviors: A study of check-in patterns in Location Based Social Networks . . 270
Daniel Preotiuc-Pietro and Trevor Cohn
Dont Worry, Be Happy: The Geography of Happiness on Facebook . . . . . . . . . . . . . . . . . . . . . . . 280
Daniele Quercia
Collabmap: Crowdsourcing Maps for Emergency Planning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 290
Sarvapali Ramchurn, Trung Dong Huynh, Matteo Venanzi and Bing Shi
Modeling Movements in Oil, Gold, Forex and Market Indices using Search Volume Index
and Twitter Sentiments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 300
Tushar Rao and Saket Srivastava
Studying Facebook via Data Extraction: The Netvizz Application . . . . . . . . . . . . . . . . . . . . . . . . 310
Bernhard Rieder
Debanalizing Twitter: The Transformation of an Object of Study . . . . . . . . . . . . . . . . . . . . . . . . 320
Richard Rogers
Designing the W3C Open Annotation Data Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 330
Robert Sanderson, Paolo Ciccarese and Herbert Van de Sompel
The Utility of Social and Topical Factors in Anticipating Repliers in Twitter Conversations340
Johannes Schantl, Claudia Wagner, Rene Kaiser and Markus Strohmaier
Are User-contributed Reviews Community Property? Exploring the Beliefs and
Practices of Reviewers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 350
Frank Shipman and Catherine Marshall
Why dont we trust health websites that help us help each other? . . . . . . . . . . . . . . . . . . . . . . . . . 360
Elizabeth Sillence, Claire Hardy and Pam Briggs
3
WebSci2013
Table of Contents
Location Tracking via Social Networking Sites . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 369
Lisa Thomas, Pamela Briggs and Linda Little
BlueFinder: Recommending Wikipedia Links Using DBpedia Properties . . . . . . . . . . . . . . . . . . 370
Diego Torres, Hala Skaf-Molli, Pascal Molli and Alicia Diaz
Automatically Extracting Frames from Media Content using Syntacting Analysis . . . . . . . . . 380
Wouter Van Atteveldt, Tamir Sheafer and Shaul Shenhav
From Information Delivery to Interpretation Support: Evaluating Cultural Heritage
Access on the Web . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 388
Chiel Van Den Akker, Marieke Van Erp, Lora Aroyo, Ardjan van Nuland, Lourens
Van Der Meij, Susan Legêne and Guus Schreiber
Considering People with Disabilities as Überusers for Eliciting Generalisable Coping
Strategies on the Web . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 398
Markel Vigo and Simon Harper
Content-Based Similarity Measures of Weblog Authors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 402
Christopher Wienberg, Melissa Roemmele and Andrew Gordon
Why Forums? An Empirical Analysis into the Facilitating Factors of Carding Forums . . . . 410
Michael Yip, Nigel Shadbolt and Craig Webber
Unpicking the Privacy Paradox: Can Structuration Theory Help to Explain
Location-Based Privacy Decisions? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 420
Aristea-Maria Zafeiropoulou, David Millard, Craig Webber and Kieron O’Hara
4