Abstract

Studies on drug design datasets are continuing to grow. These datasets are usually known as hard modeled, having a large number of features and a small number of samples. The most common problems in the drug design area are of regression type. Committee machines (ensembles) have become popular in machine learning because of their high performance. In this study, dynamics of ensembles on regression related drug design problems are investigated on a big dataset collection. The study tries to determine the most successful ensemble algorithm, the base algorithm-ensemble pair having the best / worst results, the best successful single algorithm, and the similarities of algorithms according to their performances. We also discuss whether ensembles always generate better results than single algorithms.

Date of this Version

9-2009

Download

Included in

Electrical and Computer Engineering Commons

COinS

Department of Electrical and Computer Engineering Technical Reports

Evaluation of Regression Ensembles on Drug Design Datasets

Abstract

Date of this Version

Included in

Search

Links

Links for Authors

Browse

Department of Electrical and Computer Engineering Technical Reports

Evaluation of Regression Ensembles on Drug Design Datasets

Authors

Abstract

Date of this Version

Included in

Share

Search

Links

Links for Authors

Browse