Abstract

Storing data in RDF format helps in simpler data interchange among different researchers compared to present approaches. There has been tremendous increase in the applications that use RDF data. The nature of RDF data is such that it tends to increase explosively. This makes it necessary to consider the time for retrieval and scalability of data while selecting a suitable RDF data store for developing applications. The research concentrates on comparing BigOWLIM. Bigdata, 4store and Virtuoso RDF stores on basis of their scalability and performance of storing and retrieving cancer proteomics and mass spectrometry data using SPARQL queries. In this research the author compares RDF data stores on a single machine as baseline and extends 4store and BigOWLIM data stores on a cluster for comparison. The author uncovers that Virtuoso has the best performance on data consisting of less than 250,000 triples whereas 4store has better scalability and performance for the larger data.

Keywords

RDF, Database, Semantic Web, Ontology, Data Stores

Date of this Version

7-11-2011

Department

Computer and Information Technology

Department Head

Jeffrey Brewer

Month of Graduation

August

Year of Graduation

2011

Degree

Master of Science

Head of Graduate Program

Jeffrey Brewer

Advisor 1 or Chair of Committee

John Springer

Committee Member 1

John Springer

Committee Member 2

Kari Clase

Committee Member 3

Raymond Hansen

Share

COinS