Date of Award


Degree Type


Degree Name

Master of Science (MS)


Technology Leadership and Innovation

First Advisor

Sabine Brunswicker

Committee Chair

Sabine Brunswicker

Committee Member 1

Kathryne A. Newton

Committee Member 2

John A. Springer


Contests have become an important instrument for fostering the development of novel open data mashups, or open data innovations for short. The literature calls for new methods of measuring the similarity of open data mashups in order to identify code cloning and the creative reuse of application components. However, theoretically grounded computational methods for measuring the similarity of solutions submitted to open data contests are lacking. This study explores the similarity measurement of data-based mashups in the context of an open data innovation contest. Three dimensions of mashup similarity are defined: code similarity, functional feature similarity, and visualized feature similarity. The contest submissions, including the source code, the running projects, and the descriptive documents, are collected as the research data for this study. The data analysis is based on the design and development of computational approaches for measuring technological and functional similarity. The findings of this study will help in better understanding the similarity of solutions in an open data innovation contest. The study contributes to theoretical and practical approaches to similarity measurement, especially in the field of mashup development.
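
To make the first of the three dimensions, code similarity, more concrete, the following is a minimal sketch of one common way to compare two source files: Jaccard similarity over token k-grams (shingles). The abstract does not state which measure the thesis uses, so this technique, the tokenizer, the shingle size `k`, and all function names (`tokens`, `shingles`, `jaccard`, `code_similarity`) are assumptions made for illustration only.

```python
import re


def tokens(source: str) -> list[str]:
    """Split source text into identifiers, integer literals, and single
    punctuation characters. Whitespace is ignored. (Hypothetical tokenizer,
    not language-aware.)"""
    return re.findall(r"[A-Za-z_]\w*|\d+|[^\s\w]", source)


def shingles(source: str, k: int = 3) -> set[tuple[str, ...]]:
    """Return the set of overlapping k-token windows in the source."""
    toks = tokens(source)
    if len(toks) < k:
        # Too short for a full window: treat the whole token list as one shingle.
        return {tuple(toks)} if toks else set()
    return {tuple(toks[i:i + k]) for i in range(len(toks) - k + 1)}


def jaccard(a: set, b: set) -> float:
    """Jaccard index |A ∩ B| / |A ∪ B|; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def code_similarity(src_a: str, src_b: str, k: int = 3) -> float:
    """Similarity in [0, 1] between two source strings (assumed measure)."""
    return jaccard(shingles(src_a, k), shingles(src_b, k))


if __name__ == "__main__":
    original = "total = price * quantity + tax"
    cloned = "total = price * quantity + shipping"
    print(code_similarity(original, cloned))
```

Because the comparison works on token windows rather than raw characters, it is insensitive to whitespace and formatting changes, but it is still affected by renamed identifiers. A measure intended to detect cloning under renaming would need to normalize identifiers first.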