Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 2 of 2

Full-Text Articles in Physical Sciences and Mathematics

Investigations Of Variable Importance Measures Within Random Forests, Andrew C. Merrill May 2009

Investigations Of Variable Importance Measures Within Random Forests, Andrew C. Merrill

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Random Forests (RF) (Breiman 2001; Breiman and Cutler 2004) is a completely nonparametric statistical learning procedure that may be used for regression analysis and. A feature of RF that is drawing a lot of attention is the novel algorithm that is used to evaluate the relative importance of the predictor/explanatory variables. Other machine learning algorithms for regression and classification, such as support vector machines and artificial neural networks (Hastie et al. 2009), exhibit high predictive accuracy but provide little insight into predictive power of individual variables. In contrast, the permutation algorithm of RF has already established a track record for …


Comparison Of Random Forests And Cforest: Variable Importance Measures And Prediction Accuracies, Rong Xia Jan 2009

Comparison Of Random Forests And Cforest: Variable Importance Measures And Prediction Accuracies, Rong Xia

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Random forests are ensembles of trees that give accurate predictions for regression, classification and clustering problems. The CART tree, the base learn er employed by random forests, has been criticized because of bias in the selection of splitting variables. The performance of random forests is suspect due to this criticism. A new implementation of random forests, Cforest, which is claimed to outperform random forests in both predictive power and variable importance measures , was developed based on Ctree, an implementation of conditional inference trees.

We address the underlying mechanism of random forests and Cforest in this report. Comparison of random …