The Web Science Trust

Quality, Trust, and Utility of Scientific Data on the Web: Towards a Joint Model

Gamble, Matthew and Goble, Carole (2011) Quality, Trust, and Utility of Scientific Data on the Web: Towards a Joint Model. pp. 1-8. In: Proceedings of the ACM WebSci'11, June 14-17 2011, Koblenz, Germany.

[img]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
279Kb
[img]
Preview
Image (PNG)
30Kb

Official URL: http://www.websci11.org/fileadmin/websci/Papers/177_paper.pdf

Abstract

In science, quality is paramount. As scientists increasingly look to the Web to share and discover scientic data, there is a growing need to support the scientist in assessing the quality of that data. However, quality is an ambiguous and overloaded term. In order to support the scientic user in discovering useful data we have systematically examined the nature of \quality" by exploiting three, prevalent properties of scientic data sets: (1) that data quality is commonly de- ned objectively; (2) the provenance and lineage in its production has a well understood role; and (3) \tness-for-use" is a denition of utility rather than quality or trust, where the quality and trust-worthiness of the data and the entities that produced that data inform its utility. Our study is presented in two stages. First we review existing information quality dimensions and detail an assessment-oriented classi cation. We introduce denitions for quality, trust and utility in terms of the entities required in their assessment; producer, provider, consumer, process, artifact and quality standard. Next we detail a novel and experimental approach to assessment by modelling the causal relationships between quality, trust, and utility dimensions through the construction of decision networks informed by provenance graphs. To ground and motivate our discussion throughout we draw on the European Bioinformatics Institute's Gene Ontology Annotations database. We present an initial demonstration of our approach with an example for ranking results from the Gene Ontology Annotation database using an emerging objective quality measure, the Gene Ontology Annotation Quality score.

Item Type:Conference or Workshop Item (Paper)
Web Science Comments:WebSci Conference 2011
Subjects:WS5 Psychology > WS53 Human Information Processing
Web Science Events > Web Science 2011
ID Code:443
Deposited By: Lisa Sugiura
Deposited On:07 Jun 2011 16:52
Last Modified:25 Oct 2011 17:12

Repository Staff Only: item control page

EPrints Logo
Web Science Repository is powered by EPrints 3 which is developed by the School of Electronics and Computer Science at the University of Southampton. More information and software credits.