From Carina Wyborn

What language are we speaking?
The CODATA/ICSU initiative on
Data Standards for Science
Lesley Wyborn
National Computational Infrastructure ANU
© National Computational
Infrastructure 2017
Transdisciplinary Infrastructure at NCI: Information Viewpoint is weak
[Figure: layered view of the NCI NERDIP infrastructure]
Users: Climate and Weather Science Lab; National Map; Open Data Access; eReefs; Eco Science Cloud; Marine Science Cloud; Virtual Geophysics Laboratory; EarthServer
NCI NERDIP Data Services: OGC Web Feature Service; OGC Web Map Service; OGC Web Coverage Service; OGC Web Processing Service; OGC Web Coverage Processing Service; CS/W; OpenSearch; OPeNDAP
Technology: GeoNetwork Catalogue; GSKY; GeoServer; Rasdaman; NCI Index Database; THREDDS
10 PB NCI NERDIP Earth Systems, Environmental and Solid Earth Data Collections: Landsat; MODIS; Himawari; CMIP 5; Numerical Weather Prediction; Geophysics; Hazards; Models; Bathymetry; Elevation; GPS
Driver: transdisciplinary research (including Future Earth)
• Researchers across the science disciplines, the humanities and the social sciences, together with those beyond academia, need to work together to create integrated data platforms that interoperate horizontally across discipline boundaries and give access to data to a diversity of users, from high-end researchers to undergraduates and the general public.
• This requires a transdisciplinary approach, starting at the conception of any data collection campaign.
• That is, data must be BORN CONNECTED.
(From Carina Wyborn)
CODATA Task Group on Scientific Data Standards
[email protected]
F.A.I.R Data is a Prerequisite for Transdisciplinary Science
Transdisciplinary science requires interoperable vocabularies
[Figure, adapted from Leo Obrst 2007-2008]
Vertical axis: increasing metadata, context and knowledge representation, from weak semantics to strong semantics
Horizontal axis: increasing reasoning capability (recovery, discovery, intelligence, questioning/answering, smart behaviours)
Interoperability bands, from strong to weak: semantic; structural; syntactic
Representations, from strong to weak semantics: axiology; higher order logic; logical theory; 2nd order logic; conceptual model; first order logic; OWL; UML; RDF; thesaurus; ER model; taxonomy; list; RDBMS, XML; controlled vocabulary
Task Group: Co-ordinating Data Standards amongst Scientific Unions
• September 2016: CODATA Task Group approved, led by Marshall Ma and Lesley Wyborn
• The proposal was simple: map what information standards, in particular vocabularies and ontologies, are being developed or endorsed by the science unions
• More on: http://www.codata.org/taskgroups/coordinating-datastandards
Moving up from a CODATA Task Group to an ICSU-CODATA Commission
• Hot off the press: we are moving up from a CODATA Task Group to form the ICSU-CODATA Commission on Data Standards for Science
• The Inter-Union Workshop on 21st Century Scientific and Technical Data, 19-21 June, Paris, will involve selected Science Unions and Data Organisations. Its purpose will be to agree on or discuss:
– A maturity index (5 star) rating for vocabularies;
– The key elements of a roadmap of priorities for the Commission;
– The potential for collaboration between the groups represented and the Commission; and
– How broader coordination of effort within the scientific community might be achieved.
• A major conference is planned for Autumn 2017, which will extend to all ICSU unions and ISSC international disciplinary associations.
Most important goal of the Task Group/Commission: a ‘5 star’ rating for vocabularies
1 star: Someone’s text list (brain fart?) on the web
2 stars: Machine-readable lists
3 stars: Non-proprietary lists of words with simple definitions
4 stars: Concept-based, community definitions, RDF, governed
5 stars: Concept-based, RDF, linked, endorsed, multilingual
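A minimal sketch of how such a rating could be computed, assuming the five lines above correspond to one through five stars in order and that each level builds on the ones below it. Neither assumption is stated on the slide, and the `Vocabulary` fields and the `star_rating` function are hypothetical names chosen for illustration, not part of any CODATA specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Vocabulary:
    """Hypothetical description of a published vocabulary.

    Every field name is an assumption made for this sketch.
    """
    on_web: bool = False
    machine_readable: bool = False
    non_proprietary: bool = False
    has_definitions: bool = False
    concept_based: bool = False
    community_definitions: bool = False
    rdf: bool = False
    governed: bool = False
    linked: bool = False
    endorsed: bool = False
    multilingual: bool = False


def star_rating(v: Vocabulary) -> int:
    """Return 0 to 5 stars.

    Assumes a level only counts if every lower level is also met.
    """
    levels = [
        # 1 star: a list that is at least on the web
        v.on_web,
        # 2 stars: machine-readable
        v.machine_readable,
        # 3 stars: non-proprietary, with simple definitions
        v.non_proprietary and v.has_definitions,
        # 4 stars: concept-based, community definitions, RDF, governed
        v.concept_based and v.community_definitions and v.rdf and v.governed,
        # 5 stars: linked, endorsed, multilingual
        v.linked and v.endorsed and v.multilingual,
    ]
    stars = 0
    for met in levels:
        if not met:
            break  # stop at the first unmet level
        stars += 1
    return stars


if __name__ == "__main__":
    word_list = Vocabulary(on_web=True, machine_readable=True)
    print(star_rating(word_list))
```

Requiring the lower levels first is a design choice: under it, an RDF vocabulary that is not published in a machine-readable, non-proprietary form would still rate low, however rich its model.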
Kick-off meeting in Paris on 19-21 June 2017: attendees (including Simon Cox)
Science Unions: ICA, IUBS, IUCr, IUGG, IUGS, IUPAC, IUFRO, IUSS (?), IUFoST (?), IUTOX (?), IUPHAR (?)
Standards Organisations & others: OGC, W3C, RDA, ANDS, European Commission, FAO/IGAD, ZBW, NIST, DDI, Dublin Core (?), Darwin Core (?)
Science/Data Organisations: ESIP, ODIP, DataOne, GBIF, ELIXIR, Belmont Forum, Biosharing, IPGD, ANDS (?), GEO (?), GPSDD (?)
Umbrellas: ICSU, CODATA
Attendees from this Symposium: Simon Cox, Lesley Wyborn, Ingo Simonis, Peter Fox (?)