Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

Culminating Projects in Applied Statistics

Theses/Dissertations

Articles 1 - 5 of 5

Full-Text Articles in Physical Sciences and Mathematics

Meta-Analysis Of Lapatinib Plus Capecitabine Versus Capecitabine In The Treatment Of Her2 Positive Breast Cancer, Lynda Smith Dec 2015

Meta-Analysis Of Lapatinib Plus Capecitabine Versus Capecitabine In The Treatment Of Her2 Positive Breast Cancer, Lynda Smith

Culminating Projects in Applied Statistics

BACKGROUND:

Breast cancer is the most common type of cancer in women despite advances in research and detection methods. Approximately 25 to 30 percent of newly diagnosed cases of breast cancer will overexpress HER2, human epidermal growth factor receptor 2, and are at a greater risk for disease progression and poorer clinical outcomes. The traditional treatment is associated with irreversible cardiac dysfunction. An alternative treatment involving lapatinib plus capecitabine has been reported in some randomized controlled clinical trials comparing treatment outcomes. To quantify the effectiveness of lapatinib plus capecitabine combination therapy versus capecitabine monotherapy in treating metastatic breast cancer, a …


High Dimensional Model Selection And Validation: A Comparison Study, Zhengyi Li May 2015

High Dimensional Model Selection And Validation: A Comparison Study, Zhengyi Li

Culminating Projects in Applied Statistics

Model selection is a challenging issue in high dimensional statistical analysis, and many approaches have been proposed in recent years. In this thesis, we compare the performance of three penalized logistic regression approaches (Ridge, Lasso, and Elastic Net) and three information criteria (AIC, BIC, and EBIC) on binary response variable in high dimensional situation through extensive simulation study. The models are built and selected on the training datasets, and their performance are evaluated through AUC on the validation datasets. We also display the comparison results on two real datasets (Arcene Data and University Retention Data). The performance differences among those …


Optimal Matching Distances Between Categorical Sequences: Distortion And Inferences By Permutation, Juan P. Zuluaga Dec 2013

Optimal Matching Distances Between Categorical Sequences: Distortion And Inferences By Permutation, Juan P. Zuluaga

Culminating Projects in Applied Statistics

Sequence data (an ordered set of categorical states) is a very common type of data in Social Sciences, Genetics and Computational Linguistics.

For exploration and inference of sets of sequences, having a measure of dissimilarities among sequences would allow the data to be analyzed by techniques like clustering, multimensional scaling analysis and distance-based regression analysis. Sequences can be placed in a map where similar sequences are close together, and dissimilar ones will be far apart. Such patterns of dispersion and concentration could be related to other covariates. For example, do the employment trajectories of men and women tend to form …


Forecasting Emergency Department Volumes Using Time Series And Other Techniques, Uchechukwu A. Nwoke Aug 2013

Forecasting Emergency Department Volumes Using Time Series And Other Techniques, Uchechukwu A. Nwoke

Culminating Projects in Applied Statistics

The aim of this research is to forecast patient volumes in the Emergency Department of a regional hospital in Minnesota, which eventually will aid in addressing the issue of registered nurse staffing fluctuation, more specifically, productivity and capacity planning in the ED. Several methods are applied to forecast arrival patient volume, and cumulative patient volume to evaluate each model’s performance. The methods considered are linear regression, time series models and dynamic latent factor method. Long term forecast for as long as six months ahead is the goal here due union regulations that only allows for significant changes in registered nurse …


A Study Of The Effects Of Using Computer Spreadsheets On Student Development Of Algebraic Thinking To Compare Function Growth, Beth Stone Mar 2013

A Study Of The Effects Of Using Computer Spreadsheets On Student Development Of Algebraic Thinking To Compare Function Growth, Beth Stone

Culminating Projects in Applied Statistics

Problem:

The purpose of this research was to explore the effects of incorporating computer spreadsheets on the development of algebraic thinking as it pertains to growth patterns of functions. The study also examined students' ability to transfer skills learned on a computer spreadsheet to problems using pencil and paper, as well the effect of incorporating computer spreadsheets on student motivation, engagement, and communication. Specific questions addressed in the study include: How does investigating an algebra problem about growth patterns by entering numbers, creating formulas, and noticing patterns on a computer spreadsheet affect students' ability to learn the concepts of linear …