UMBC logo
UMBC High Performance Computing Facility
Please note that this page is under construction. We are documenting the 240-node cluster maya that will be available after Summer 2014. Currently, the 84-node cluster tara still operates independently, until it becomes part of maya at the end of Summer 2014. Please see the 2013 Resources Pages under the Resources tab for tara information.
Stein's Method in High Dimensional Classification and Applications
Do-Hwan Park and Junyong Park, Department of Mathematics and Statistics

It is often the case, that high-dimensional data consists of only a few informative components. Standard statistical modeling and estimation in such a situation, is prone to inaccuracies due to overfitting, unless regularization methods are practiced. In the context of classification, we propose a class of regularization methods through shrinkage estimators. The shrinkage is based on variable selection coupled with SCAD shrinkage by using Stein's unbiased estimator of the risk, and we derive an estimator for the optimal shrinkage method. We demonstrate and examine our method on simulated data and three real data sets and compare it to Independence Rule and Feature Annealed Independence Rule.