TY - GEN
T1 - Building the Seshat ontology for a global history databank
AU - Brennan, Rob
AU - Feeney, Kevin
AU - Mendel-Gleason, Gavin
AU - Bozic, Bojan
AU - Turchin, Peter
AU - Whitehouse, Harvey
AU - Francois, Pieter
AU - Currie, Thomas E.
AU - Grohmann, Stephanie
N1 - Publisher Copyright: © Springer International Publishing Switzerland 2016.
PY - 2016
Y1 - 2016
AB - This paper describes OWL ontology re-engineering from the wiki-based social science codebook (thesaurus) developed by the Seshat: Global History Databank. The ontology describes human history as a set of over 1500 time series variables and supports variable uncertainty, temporal scoping, annotations and bibliographic references. The ontology was developed to transition from traditional social science data collection and storage techniques to an RDF-based approach. RDF supports automated generation of high usability data entry and validation tools, data quality management, incorporation of facts from the web of data and management of the data curation lifecycle. This ontology re-engineering exercise identified several pitfalls in modelling social science codebooks with semantic web technologies; provided insights into the practical application of OWL to complex, real-world modelling challenges; and has enabled the construction of new, RDF-based tools to support the large-scale Seshat data curation effort. The Seshat ontology is an exemplar of a set of ontology design patterns for modelling uncertainty or temporal bounds in standard RDF. Thus the paper provides guidance for deploying RDF in the social sciences. Within Seshat, OWL-based data quality management will assure the data is suitable for statistical analysis. Publication of Seshat as high-quality, linked open data will enable other researchers to build on it.
KW - Cliodynamics
KW - Ontology design patterns
KW - Ontology engineering
UR - https://www.scopus.com/pages/publications/84978969323
U2 - 10.1007/978-3-319-34129-3_42
DO - 10.1007/978-3-319-34129-3_42
M3 - Conference contribution
AN - SCOPUS:84978969323
SN - 9783319341286
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 693
EP - 708
BT - The Semantic Web
A2 - d’Aquin, Mathieu
A2 - Sack, Harald
A2 - Lange, Christoph
A2 - Ponzetto, Simone Paolo
A2 - Blomqvist, Eva
A2 - Ghidini, Chiara
PB - Springer Verlag
T2 - 13th International Conference on Semantic Web, ESWC 2016
Y2 - 29 May 2016 through 2 June 2016
ER -