A framework for the statistical analysis of mass spectrometry imaging experiments

Kyle Bemis, Purdue University


Mass spectrometry (MS) imaging is a powerful investigation technique for a wide range of biological applications such as molecular histology of tissue, whole body sections, and bacterial films , and biomedical applications such as cancer diagnosis. MS imaging visualizes the spatial distribution of molecular ions in a sample by repeatedly collecting mass spectra across its surface, resulting in complex, high-dimensional imaging datasets. Two of the primary goals of statistical analysis of MS imaging experiments are classification (for supervised experiments), i.e. assigning pixels to pre-defined classes based on their spectral profiles, and segmentation (for unsupervised experiments), i.e. assigning pixels to newly discovered segments with relatively homogenous and distinct spectral profiles. To accomplish these goals, this research provides both statistical methods and statistical computing tools. First, we propose a novel spatial shrunken centroids framework for performing classification and segmentation of MS imaging experiments with feature selection. Spatial shrunken centroids combines spatial smoothing with statistical regularization in a model-based framework appropriate for both supervised and unsupervised settings. Second, we provide Cardinal, a free and open-source R package for processing, visualization, and statistical analysis of MS imaging experiments. Cardinal is the first R package designed specifically for MS imaging, and the first software for MS imaging that focuses specifically on experiments and statistical analysis. In addition to providing tools for statistical analysis, it also provides infrastructure to enable other statisticians to more easily develop new methods for MS imaging experiments. Lastly, to enable scalability of Cardinal to larger-than-memory datasets, we provide matter, a free and open-source R package for statistical computing with structured datasets-on-disk, such as MS imaging data files. Together, spatial shrunken centroids, Cardinal, and matter aim to allow scalable statistical analysis for high-resolution, high-throughput MS imaging experiments.




Chun, Purdue University.

Subject Area

Statistics|Bioinformatics|Computer science

Off-Campus Purdue Users:
To access this dissertation, please log in to our
proxy server