Open Access Dissertations

New Covariance-Based Feature Extraction Methods for Classification and Prediction of High-Dimensional Data

Mopelola Adediwura Sofolahan, Purdue University

Date of Award

Fall 2013

Degree Type

Dissertation

Degree Name

Doctor of Philosophy (PhD)

Department

Electrical and Computer Engineering

First Advisor

Okan K. Ersoy

Committee Chair

Okan K. Ersoy

Committee Member 1

Arif Ghafoor

Committee Member 2

Cordelia M. Brown

Committee Member 3

Michael D. Zoltowski

Abstract

When analyzing high-dimensional data sets, it is often necessary to apply feature extraction methods that capture the discriminating information needed for classification and prediction. This information can typically be represented in a lower-dimensional feature space, and a widely used approach for doing so is principal component analysis (PCA). PCA efficiently compresses information into fewer dimensions; however, studies indicate that it is not optimal for feature extraction, especially in classification problems. Furthermore, for high-dimensional data with limited observations, as is typically the case with remote sensing data and nonstationary data such as financial data, covariance matrix estimation becomes unreliable, which adversely affects the representation of the data in the PCA domain. In this thesis, we first introduce a new feature extraction method called summed component analysis (SCA), which uses the structure of the eigenvectors of the common covariance matrix to generate new features as sums of certain original features. Second, we present a variation of SCA, known as class summed component analysis (CSCA). CSCA takes advantage of the relative ease of computing the class covariance matrices and uses them to determine data transformations. Because the new features are simple sums of the original features, the new representation of the data retains a conceptual meaning, which is appealing for man-machine interfaces. We evaluate these methods on data sets with varying sample sizes and on financial time series, and show improved classification and prediction accuracies.
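
As a rough illustration of the small-sample problem described in the abstract, the following sketch (not taken from the dissertation) runs standard PCA via an eigendecomposition of the sample covariance matrix on synthetic data with fewer observations than features. The function name `pca_from_covariance`, the sizes n = 10 and p = 50, and the random data are assumptions chosen for illustration; SCA and CSCA themselves are not implemented here.

```python
import numpy as np


def pca_from_covariance(X, k):
    """Standard PCA: project centered rows of X onto the top-k eigenvectors
    of the sample covariance matrix. Returns (scores, sorted eigenvalues)."""
    Xc = X - X.mean(axis=0)                  # center each feature
    cov = np.cov(Xc, rowvar=False)           # p x p sample covariance
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigh returns ascending order
    order = np.argsort(eigvals)[::-1]        # reorder to descending variance
    eigvals = eigvals[order]
    eigvecs = eigvecs[:, order]
    return Xc @ eigvecs[:, :k], eigvals


# Hypothetical setting: far fewer observations (n) than features (p).
rng = np.random.default_rng(0)
n, p = 10, 50
X = rng.normal(size=(n, p))

scores, eigvals = pca_from_covariance(X, k=3)
cov_rank = np.linalg.matrix_rank(np.cov(X, rowvar=False))

print("scores shape:", scores.shape)
print("covariance is", p, "x", p, "with numerical rank", cov_rank)
```

With n observations, the sample covariance matrix has rank at most n - 1, so most of its p eigen-directions carry no estimated variance. This is one concrete way in which the covariance estimate, and any PCA representation built on it, becomes unreliable when observations are limited.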

Recommended Citation

Sofolahan, Mopelola Adediwura, "New Covariance-Based Feature Extraction Methods for Classification and Prediction of High-Dimensional Data" (2013). Open Access Dissertations. 57.
https://docs.lib.purdue.edu/open_access_dissertations/57

Included in

Electrical and Electronics Commons, Finance and Financial Management Commons
