Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 15 of 15

Full-Text Articles in Physical Sciences and Mathematics

Assessing Extant Methods For Generating G-Optimal Designs And A Novel Methodology To Compute The G-Score Of A Candidate Design, Hyrum John Hansen May 2024

Assessing Extant Methods For Generating G-Optimal Designs And A Novel Methodology To Compute The G-Score Of A Candidate Design, Hyrum John Hansen

All Graduate Theses and Dissertations, Fall 2023 to Present

Experimental designs are used by scientists to allocate treatments such that statistical inference is appropriate. Most traditional experimental designs have mathematical properties that make them desirable under certain conditions. Optimal experimental designs are those where the researcher can exercise total control over the treatment levels to maximize a chosen mathematical property. As is common in literature, the experimental design is represented as a matrix where each column represents a variable, and each row represents a trial. We define a function that takes as input the design matrix and outputs its score. We then algorithmically adjust each entry until a design …


Rfviz: An Interactive Visualization Package For Random Forests In R, Christopher Beckett Dec 2018

Rfviz: An Interactive Visualization Package For Random Forests In R, Christopher Beckett

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Random forests are very popular tools for predictive analysis and data science. They work for both classification (where there is a categorical response variable) and regression (where the response is continuous). Random forests provide proximities, and both local and global measures of variable importance. However, these quantities require special tools to be effectively used to interpret the forest. Rfviz is a sophisticated interactive visualization package and toolkit in R, specially designed for interpreting the results of a random forest in a user-friendly way. Rfviz uses a recently developed R package (loon) from the Comprehensive R Archive Network (CRAN) to create …


Comparing Performance Of Gene Set Test Methods Using Biologically Relevant Simulated Data, Richard M. Lambert Dec 2018

Comparing Performance Of Gene Set Test Methods Using Biologically Relevant Simulated Data, Richard M. Lambert

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Today we know that there are many genetically driven diseases and health conditions. These problems often manifest only when a set of genes are either active or inactive. Recent technology allows us to measure the activity level of genes in cells, which we call gene expression. It is of great interest to society to be able to statistically compare the gene expression of a large number of genes between two or more groups. For example, we may want to compare the gene expression of a group of cancer patients with a group of non-cancer patients to better understand the genetic …


Implementing The Use Of Personal Activity Data In An Introductory Statistics Course, Lacy Christensen Aug 2018

Implementing The Use Of Personal Activity Data In An Introductory Statistics Course, Lacy Christensen

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Integrating real data into a classroom is one of the recommendations in the Guidelines for Assessment and Instruction in Statistics Education (GAISE) college report which lays out guidelines for an introductory statistics course (Committee, GAISE College Report ASA Revision, 2016). In order to assess the effect of using real data in a classroom, the students received physical activity trackers to wear during an undergraduate introductory statistics course taught in the summer. This tracker, a Fitbit, enabled students to monitor and record their steps, calories, and active time throughout the class. Collecting personal activity data (PAD) creates a large database which …


Mindset, Attitudes, And Success In Statistics, Matthew Isaac May 2018

Mindset, Attitudes, And Success In Statistics, Matthew Isaac

Undergraduate Honors Capstone Projects

Students in many disciplines are required to take an introductory statistics course while pursuing a college education. Despite the utility of statistical methods in future research and career pursuits, many students have negative views of statistics. We are interested in how students' mindsets and attitudes towards statistics impact their performance in an undergraduate statistics course. We administered a survey to students in several undergraduate statistics courses at Utah State University. This survey included questions addressing mathematics experience, attitudes towards statistics, mindset, and course performance. We observed that the majority of students indicated the presence of a growth mindset and positive …


Imputation For Random Forests, Joshua Young Aug 2017

Imputation For Random Forests, Joshua Young

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

This project introduces two new methods for imputation of missing data in random forests. The new methods are compared against other frequently used imputation methods, including those used in the randomForest package in R. To test the effectiveness of these methods, missing data are imputed into datasets that contain two missing data mechanisms including missing at random and missing completely at random. After imputation, random forests are run on the data and accuracies for the predictions are obtained. Speed is an important aspect in computing; the speeds for all the tested methods are also compared.

One of the new methods …


A Comparison Of Five Statistical Methods For Predicting Stream Temperature Across Stream Networks, Maike F. Holthuijzen Aug 2017

A Comparison Of Five Statistical Methods For Predicting Stream Temperature Across Stream Networks, Maike F. Holthuijzen

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

The health of freshwater aquatic systems, particularly stream networks, is mainly influenced by water temperature, which controls biological processes and influences species distributions and aquatic biodiversity. Thermal regimes of rivers are likely to change in the future, due to climate change and other anthropogenic impacts, and our ability to predict stream temperatures will be critical in understanding distribution shifts of aquatic biota. Spatial statistical network models take into account spatial relationships but have drawbacks, including high computation times and data pre-processing requirements. Machine learning techniques and generalized additive models (GAM) are promising alternatives to the SSN model. Two machine learning …


Statistical Methods For Assessing Individual Oocyte Viability Through Gene Expression Profiles, Michael O. Bishop May 2017

Statistical Methods For Assessing Individual Oocyte Viability Through Gene Expression Profiles, Michael O. Bishop

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Abstract

Statistical Methods for Assessing Individual Oocyte Viability Through Gene Expression Profiles

By

Michael O. Bishop

Utah State University, 2017

Major Professor: Dr. John R. Stevens

Department: Mathematics and Statistics

Oocytes are the precursor cells to the female gamete, or egg. While reproduction may vary from species to species, within humans and most domesticated animals, the oocyte maturation process is fairly similar. As an oocyte matures, there are various processes that take place, all of which have an effect on the viability of the individual oocyte. Barring outside damage that may come to the oocyte, one of the primary reasons …


Collecting, Analyzing And Interpreting Bivariate Data From Leaky Buckets: A Project-Based Learning Unit, Florence Funmilayo Obielodan May 2011

Collecting, Analyzing And Interpreting Bivariate Data From Leaky Buckets: A Project-Based Learning Unit, Florence Funmilayo Obielodan

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Despite the significance and the emphasis placed on mathematics as a subject and field of study, achieving the right attitude to improve students‟ understanding and performance is still a challenge. Previous studies have shown that the problem cuts across nations around the world, both developing countries and developed alike. Teachers and educators of the subject have responsibilities to continuously develop innovative pedagogical approaches that will enhance students‟ interests and performance. Teaching approaches that emphasize real life applications of the subject have become imperative. It is believed that this will stimulate learners‟ interest in the subject as they will be able …


Approaches To Promote Active, Conceptual Learning In A Pedagogically Hybrid Introductory Statistics Course, Brittany L. Allred Jan 2009

Approaches To Promote Active, Conceptual Learning In A Pedagogically Hybrid Introductory Statistics Course, Brittany L. Allred

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Statistics education is an active area of research where discovery-based learning is becoming more prominent. This project reviews how the USEI and GAISE recommendations can be implemented in the statistics classroom. Further, the project describes the creation of a library of classroom materials for an introductory statistics course. The results are also discussed from implementing various library materials, along with the student response to hands-on learning techniques.


Small Sample Methods For The Analysis Of Clustered Binary Data, Lawrence J. Cook May 2008

Small Sample Methods For The Analysis Of Clustered Binary Data, Lawrence J. Cook

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

There are several solutions for analysis of clustered binary data. However, the two most common tools in use today, generalized estimating equations and random effects or mixed models, rely heavily on asymptotic theory. However, in many situations, such as small or sparse samples, asymptotic assumptions may not be met. For this reason we explore the utility of the quadratic exponential model and conditional analysis to estimate the effect size of a trend parameter in small sample and sparse data settings. Further we explore the computational efficiency of two methods for conducting conditional analysis, the network algorithm and Markov chain Monte …


Shamat: A Matrix Manipulation Program, Shahriyar Dadkhah May 1987

Shamat: A Matrix Manipulation Program, Shahriyar Dadkhah

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

This report is both a users guide and a programmers manual for running and modifying the program SHAMAT, an interactive matrix calculator. The program is written in Turbo Pascal version 3.0 for MS-DOS computers. This software enables the user to type in matrix equations for solving statistical problems such as multiple regression, analysis of variance, etc. All matrix operations necessary for linear models analysis are included in this program. Since each operation uses a separate subroutine, program enhancement, modification and updating is demonstrated to be easy.


The Prior Distribution In Bayesian Statistics, Kai-Tang Chen May 1979

The Prior Distribution In Bayesian Statistics, Kai-Tang Chen

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

A major problem associated with Bayesian estimation is selecting the prior distribution. The more recent literature on the selection of the prior is reviewed. Very little of a general nature on the selection of the prior is formed in the literature except for non-informative priors. This class of priors is seen to have limited usefulness. A method of selecting an informative prior is generalized in this thesis to include estimation of several parameters using a multivariate prior distribution. The concepts required for quantifying prior information is based on intuitive principles. In this way, it can be understood and controlled by …


Computer Programs Supporting The Teaching Of Statistics, Chien-Hwa Liu Jan 1973

Computer Programs Supporting The Teaching Of Statistics, Chien-Hwa Liu

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

During the past few years there has been an increasing interest in developing computer packages to enhance the teaching of elementary statistics. The conventional ways of teaching statistics have used such devices as desk calculators, tables of functions, short-cut calculating formulas and electronic calculators, etc. to manipulate the involved computations. Electronic computers, in the past decade, have been broadly used in universities and colleges in many ways. It is only natural to extend the use of computers to the teaching function. Remote terminals can now be installed in any classroom and bring the computer to the students.


Simulation Of Mathematical Models In Genetic Analysis, Dinesh Govindal Patel May 1964

Simulation Of Mathematical Models In Genetic Analysis, Dinesh Govindal Patel

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

In recent years a new field of statistics has become of importance in many branches of experimental science. This is the Monte Carlo Method, so called because it is based on simulation of stochastic processes. By stochastic process, it is meant some possible physical process in the real world that has some random or stochastic element in its structure. This is the subject which may appropriately be called the dynamic part of statistics or the statistics of "change," in contrast with the static statistical problems which have so far been the more systematically studied. Many obvious examples of such processes …