Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Statistics

2018

Articles 1 - 30 of 63

Full-Text Articles in Physical Sciences and Mathematics

Re-Describing Surface Roughness, Vincent Wagner Dec 2018

Essential Studies UNDergraduate Showcase

The purpose of this project is to explore a non-traditional method of identifying and describing variance in data. The original goal was to provide a more useful description of surface roughness for use in calculating pressure loss due to pipe friction in the oil and gas industry. This approach uses simple trigonometric calculations to capture more information about the point-to-point variance of a given data set, as well as information related to the ratio of measured length to total contact length. This method utilizes steps similar to the bootstrap method in statistics; however, rather than sampling a data …


Rfviz: An Interactive Visualization Package For Random Forests In R, Christopher Beckett Dec 2018

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Random forests are very popular tools for predictive analysis and data science. They work for both classification (where there is a categorical response variable) and regression (where the response is continuous). Random forests provide proximities, and both local and global measures of variable importance. However, these quantities require special tools to be effectively used to interpret the forest. Rfviz is a sophisticated interactive visualization package and toolkit in R, specially designed for interpreting the results of a random forest in a user-friendly way. Rfviz uses a recently developed R package (loon) from the Comprehensive R Archive Network (CRAN) to create …


Comparing Performance Of Gene Set Test Methods Using Biologically Relevant Simulated Data, Richard M. Lambert Dec 2018

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Today we know that there are many genetically driven diseases and health conditions. These problems often manifest only when a set of genes are either active or inactive. Recent technology allows us to measure the activity level of genes in cells, which we call gene expression. It is of great interest to society to be able to statistically compare the gene expression of a large number of genes between two or more groups. For example, we may want to compare the gene expression of a group of cancer patients with a group of non-cancer patients to better understand the genetic …


The GAISE College Report: The American Statistical Association Meets Sound Pedagogy In Central Virginia, Beverly Wood Nov 2018

Beverly Wood

Research in undergraduate statistics education often centers on the introductory course required for a large percentage of college students. While acknowledging the diverse setting, audience, and purpose of introductory courses, existing research assumes that courses offered by different disciplines share the same goals and teaching practices. The purpose of this study is to examine the objectives for student outcomes and pedagogical delivery of introductory statistics courses in various academic departments to provide explicit evidence for this assumption. The American Statistical Association’s Guidelines for Assessment and Instruction in Statistics Education (GAISE) are meant to apply to all introductory courses. The College …


Guidelines For Assessment And Instruction In Statistics Education (GAISE) College Report 2016, Robert Carver, Michelle Everson, John Gabrosek, Nicholas Horton, Robin Lock, Megan Mocko, Allan Rossman, Ginger Holmes Rowell, Paul Velleman, Jeffrey Witmer, Beverly Wood Nov 2018

Beverly Wood

In 2005 the American Statistical Association (ASA) endorsed the Guidelines for Assessment and Instruction in Statistics Education (GAISE) College Report. This report has had a profound impact on the teaching of introductory statistics in two- and four-year institutions, and the six recommendations put forward in the report have stood the test of time. Much has happened within the statistics education community and beyond in the intervening 10 years, making it critical to re-evaluate and update this important report. For readers who are unfamiliar with the original GAISE College Report or who are new to the statistics education community, the full …


Using Machine Learning To Accurately Predict Ambient Soundscapes From Limited Data Sets, Katrina Lynn Pedersen Oct 2018

Theses and Dissertations

The ability to accurately characterize the soundscape, or combination of sounds, of diverse geographic areas has many practical implications. Interested parties include the United States military and the National Park Service, but applications also exist in areas such as public health, ecology, community and social justice noise analyses, and real estate. I use an ensemble of machine learning models to predict ambient sound levels throughout the contiguous United States. My data set consists of 607 training sites, where various acoustic metrics, such as overall daytime L50 levels and one-third octave frequency band levels, have been obtained. I have data for …


Statistics (ABAC), April Abbott, Gary Dicks, Jan Gregus, Buddhi Pantha, Melanie Partlow, Lori Pearman, Amanda Urquhart, Eunkyung You Oct 2018

Mathematics Grants Collections

This Grants Collection for Statistics was created under a Round Ten ALG Textbook Transformation Grant.

Affordable Learning Georgia Grants Collections are intended to provide faculty with the frameworks to quickly implement or revise the same materials as a Textbook Transformation Grants team, along with the aims and lessons learned from project teams during the implementation process.

Documents are in .pdf format, with a separate .docx (Word) version available for download. Each collection contains the following materials:

  • Linked Syllabus
  • Initial Proposal
  • Final Report


Elementary Statistics (GHC), Camille Pace, Katie Bridges, Laura Ralston, Elizabeth Clark, Brent Griffin, Kamisha Decoudreaux, Zac Johnston, Vincent Manatsa Oct 2018

Mathematics Grants Collections

This Grants Collection for Elementary Statistics was created under a Round Eleven ALG Textbook Transformation Grant.

Affordable Learning Georgia Grants Collections are intended to provide faculty with the frameworks to quickly implement or revise the same materials as a Textbook Transformation Grants team, along with the aims and lessons learned from project teams during the implementation process.

Documents are in .pdf format, with a separate .docx (Word) version available for download. Each collection contains the following materials:

  • Linked Syllabus
  • Initial Proposal
  • Final Report


Seeing And Understanding Data, Beverly Wood, Charlotte Bolch Oct 2018

Statistics and Probability

No abstract provided.


Statistical Design Of Experiment Techniques In Manufacturing, Caroline M. Kerfonta Oct 2018

Senior Theses

There are many statistical techniques used to design experiments. These techniques are used in many different fields. This thesis will focus on the use of the three most common techniques used to design statistical experiments in manufacturing.

The three techniques that will be investigated are completely randomized design, randomized block design, and factorial design. These techniques will be compared, contrasted, and explained. Research examples will be presented along with sample R code for each technique. These examples will be accompanied by analysis of the techniques as well as an overview of the uses and history of experiments in manufacturing.


Minimizing The Perceived Financial Burden Due To Cancer, Hassan Azhar, Zoheb Allam, Gino Varghese, Daniel W. Engels, Sajiny John Aug 2018

SMU Data Science Review

In this paper, we present a regression model that predicts perceived financial burden that a cancer patient experiences in the treatment and management of the disease. Cancer patients do not fully understand the burden associated with the cost of cancer, and their lack of understanding can increase the difficulties associated with living with the disease, in particular coping with the cost. The relationship between demographic characteristics and financial burden was examined in order to better understand the characteristics of a cancer patient and their burden, while all subsets regression was used to determine the best predictors of financial burden. Age, …


Secondary Data Analysis Project, Jonathan M. Gallimore Aug 2018

SF 420 PR - Gallimore - Fall 2018

This activity is designed to give students an opportunity to apply what they have learned in statistics to a real dataset.

This activity will help students apply what they have learned in statistics to real world data and answer their own research questions. Students will also practice reporting their results in a paper using APA format.


Deep Machine Learning For Mechanical Performance And Failure Prediction, Elijah Reber, Nickolas D. Winovich, Guang Lin Aug 2018

The Summer Undergraduate Research Fellowship (SURF) Symposium

Deep learning has provided opportunities for advancement in many fields. One such opportunity is being able to accurately predict real world events. Ensuring proper motor function and being able to predict energy output is a valuable asset for owners of wind turbines. In this paper, we look at how effective a deep neural network is at predicting the failure or energy output of a wind turbine. A data set was obtained that contained sensor data from 17 wind turbines over 13 months, measuring numerous variables, such as spindle speed and blade position and whether or not the wind turbine experienced …


Bayesian Analytical Approaches For Metabolomics : A Novel Method For Molecular Structure-Informed Metabolite Interaction Modeling, A Novel Diagnostic Model For Differentiating Myocardial Infarction Type, And Approaches For Compound Identification Given Mass Spectrometry Data., Patrick J. Trainor Aug 2018

Electronic Theses and Dissertations

Metabolomics, the study of small molecules in biological systems, has enjoyed great success in enabling researchers to examine disease-associated metabolic dysregulation and has been utilized for the discovery of biomarkers of disease and phenotypic states. In spite of recent technological advances in the analytical platforms utilized in metabolomics and the proliferation of tools for the analysis of metabolomics data, significant challenges in metabolomics data analyses remain. In this dissertation, we present three of these challenges and Bayesian methodological solutions for each. In the first part we develop a new methodology to serve as a basis for making higher order inferences in metabolomics, …


Implementing The Use Of Personal Activity Data In An Introductory Statistics Course, Lacy Christensen Aug 2018

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Integrating real data into a classroom is one of the recommendations in the Guidelines for Assessment and Instruction in Statistics Education (GAISE) college report which lays out guidelines for an introductory statistics course (Committee, GAISE College Report ASA Revision, 2016). In order to assess the effect of using real data in a classroom, the students received physical activity trackers to wear during an undergraduate introductory statistics course taught in the summer. This tracker, a Fitbit, enabled students to monitor and record their steps, calories, and active time throughout the class. Collecting personal activity data (PAD) creates a large database which …


Calculus Of The Impossible: Review Of The Improbability Principle (2014) By David Hand And The Logic Of Miracles (2018) By László Mérő, Samuel L. Tunstall Jul 2018

Numeracy

David J. Hand. 2014. The Improbability Principle: Why Coincidences, Miracles, and Rare Events Happen Every Day (New York, NY: Scientific American/Farrar, Straus and Giroux) 288 pp. ISBN: 978-0374175344.

László Mérő. 2018. The Logic of Miracles: Making Sense of Rare, Really Rare, and Impossibly Rare Events (New Haven, CT: Yale University Press) 288 pp. ISBN: 978-0300224153.

David Hand and László Mérő both grapple with the occurrence of seemingly impossible events in these two popular science books. In this comparative review, I describe the two books, and explain why I prefer Hand's treatment of the impossible.


Development Of A Tool To Assess Students’ Conceptual Understanding In Introductory Statistics, Nathan L. Tintle, Jill Vander Stoep Jul 2018

Faculty Work Comprehensive List

Few tools exist to assess students’ conceptual understanding in post-secondary, introductory statistics courses. The CAOS test is widely considered to be the gold standard, but was first published in 2007 and does not necessarily reflect some of the changes in student learning at the secondary level. Furthermore, it may not be sensitive enough to measure student conceptual understanding in modern post-secondary statistics courses (e.g., simulation-based inference). In this paper we will describe the process of developing a new instrument which uses some CAOS items, as well as additional new items to improve validity and reliability. We will share the validity …


On The Detection Of Statistical Heterogeneity In Rain Measurements, A. R. Jameson, Michael L. Larsen, A. Kostinski Jul 2018

Department of Physics Publications

The application of the Wiener–Khintchine theorem for translating a readily measured correlation function into the variance spectrum, important for scale analyses and for scaling transformations of data, requires that the data be wide-sense homogeneous (stationary), that is, that the first and second moments of the probability distribution of the variable are the same at all times (stationarity) or at all locations (homogeneity) over the entire observed domain. This work provides a heuristic method independent of statistical models for evaluating whether a set of data in rain is wide-sense stationary (WSS). The alternative, statistical heterogeneity, requires 1) that there be no …
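For reference, the wide-sense stationarity condition and the Wiener–Khintchine relation named in this abstract are commonly written as follows. The symbols X(t), μ, R(τ), and S(f) are notation assumed here for illustration and are not taken from the paper.

```latex
\begin{align}
  \mathbb{E}[X(t)] &= \mu \quad \text{for all } t, \\
  \operatorname{Cov}\bigl(X(t),\, X(t+\tau)\bigr) &= R(\tau) \quad \text{for all } t, \\
  S(f) &= \int_{-\infty}^{\infty} R(\tau)\, e^{-2\pi i f \tau}\, d\tau .
\end{align}
```

The first two lines say that the mean and the covariance at lag τ do not depend on where in the record they are evaluated; the third converts that lag-only function into a spectrum, which is only meaningful when the first two hold.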


Hierarchical Bayesian Data Fusion Using Autoencoders, Yevgeniy Vladimirovich Reznichenko Jul 2018

Master's Theses (2009 -)

In this thesis, a novel method for tracker fusion is proposed and evaluated for vision-based tracking. This work combines three distinct popular techniques into a recursive Bayesian estimation algorithm. First, semi-supervised learning approaches are used to partition data and to train a deep neural network that is capable of capturing normal visual tracking operation and is able to detect anomalous data. We compare various methods by examining their respective receiver operating characteristic (ROC) curves, which represent the trade-off between specificity and sensitivity for various detection threshold levels. Next, we incorporate the trained neural networks into an existing data …
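To illustrate how an ROC curve summarizes a detector across thresholds, here is a minimal sketch in plain Python. It is not the thesis's code: the function names `roc_points` and `auc`, and the convention that a higher score means "more anomalous", are assumptions made for this example.

```python
def roc_points(scores, labels):
    """Return (false positive rate, true positive rate) pairs, one per threshold.

    scores: detector outputs; higher means more anomalous (assumed convention).
    labels: 1 for anomalous, 0 for normal.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one positive and one negative example")
    points = [(0.0, 0.0)]
    # Sweep the detection threshold from the highest score downwards.
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
        # Sensitivity is tp / positives; specificity is 1 - fp / negatives.
        points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under the ROC curve, computed with the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area
```

Each point corresponds to one threshold; comparing methods then amounts to comparing the curves, or a summary such as the area under them.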


Association Tests For Genetic Effect And Its Interaction With Environmental Factors, Zhengyang Zhou Jul 2018

Statistical Science Theses and Dissertations

My research is in the area of statistical genetics, and it contains three projects: (1) Differentiating the Cochran-Armitage (CA) trend test and Pearson’s chi-square test: location and dispersion; (2) Decomposing Pearson’s chi-square test: a linear regression and its departure from linearity; (3) Testing nonlinear gene-environment (GxE) interaction through varying coefficient and linear mixed models.

(1) In genetic case-control association studies, a standard practice is to perform the CA trend test with 1 degree-of-freedom (df) under the assumption of an additive model. However, when the true genetic model is recessive or near recessive, it is outperformed by Pearson’s chi-square test with …
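The two tests compared in this abstract can be sketched for a 2×3 case-control genotype table as follows. This is an illustrative implementation, not the author's code: the function names and the additive weights (0, 1, 2) are assumptions, and the trend statistic uses the standard form with N (not N − 1) in the numerator.

```python
def pearson_chi_square(cases, controls):
    """Pearson chi-square statistic for a 2 x k table of genotype counts."""
    total_cases, total_controls = sum(cases), sum(controls)
    n = total_cases + total_controls
    stat = 0.0
    for r, s in zip(cases, controls):
        column_total = r + s
        if column_total == 0:
            continue  # an empty genotype column contributes nothing
        expected_case = column_total * total_cases / n
        expected_control = column_total * total_controls / n
        stat += (r - expected_case) ** 2 / expected_case
        stat += (s - expected_control) ** 2 / expected_control
    return stat


def ca_trend_chi_square(cases, controls, weights=(0, 1, 2)):
    """Cochran-Armitage trend statistic (1 df) with the given genotype weights."""
    total_cases, total_controls = sum(cases), sum(controls)
    n = total_cases + total_controls
    columns = [r + s for r, s in zip(cases, controls)]
    weighted_cases = sum(t * r for t, r in zip(weights, cases))
    weighted_columns = sum(t * c for t, c in zip(weights, columns))
    weighted_sq_columns = sum(t * t * c for t, c in zip(weights, columns))
    numerator = n * (n * weighted_cases - total_cases * weighted_columns) ** 2
    denominator = total_cases * total_controls * (
        n * weighted_sq_columns - weighted_columns ** 2
    )
    return numerator / denominator
```

For cases (10, 20, 30) and controls (30, 20, 10) the case proportion rises linearly across genotypes and both statistics equal 20. When the pattern departs from linearity, as in a recessive model, the trend statistic captures only part of the Pearson statistic.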


Finding Meaning In A Multivariable World: A Conceptual Approach To An Algebra-Based Second Course In Statistics, Karen McGaughey, Beth Chance, Nathan L. Tintle, Soma Roy, Todd Swanson, Jill Vander Stoep Jul 2018

Faculty Work Comprehensive List

Although the teaching of the first course in statistics has improved dramatically in recent years, there has been less focus on a similarly conceptual-based second course aimed at non-majors. We present a curriculum for the second course, designed to expand statistical literacy across disciplines, which focuses on conceptual understanding of multivariable relationships through data visualization, study design, the role of confounding variables, reduction of unexplained variation, and simulation-based inference, rather than the mathematically-based discourse often used in the second course. Our curriculum uses a student-centered pedagogical approach, utilizing guided discovery activities based on real-world case studies, facilitated by student-focused technology …


Pseudo Power Law Statistics In A Jammed, Amorphous Solid, Jacob Brian Hass Jun 2018

Physics

Simulations have shown that in many solid materials, rearrangements within the solid obey power-law statistics. A connection has been proposed between these statistics and the ability of a system to reach a limit cycle under cyclic driving. We study experimentally a 2D jammed solid that reaches such a limit cycle. Our solid consists of microscopic plastic beads adsorbed at an oil-water interface and cyclically sheared by a magnetically driven needle. We track each particle's trajectory in the solid to identify rearrangements. By associating particles both spatially and temporally, we can measure the extent of each rearrangement. We study specifically the …


A 3D Characteristics Database Of Land Engraved Areas With Known Subclass, Entni Lin Jun 2018

Student Theses

Subclass characteristics on bullets may mislead firearm examiners when they rely on traditional 2D images. In order to provide indelible examples for training and help avoid identification errors, 3D topography surface maps and statistical methods of pattern recognition are applied to toolmarks on bullets containing known subclass characteristics. This research was conducted by collecting 3D topography surface map data from land engraved areas of bullets fired through known barrels. This data was processed and used to train the statistical algorithms to predict their origin. The results from the algorithm are compared with the “right answers” (i.e. correct IDs) of the …


Discrete Ranked Set Sampling, Heng Cui May 2018

Statistical Science Theses and Dissertations

Ranked set sampling (RSS) is an efficient data collection framework compared to simple random sampling (SRS). It is widely used in various application areas such as agriculture, environment, sociology, and medicine, especially in situations where measurement is expensive but ranking is less costly. Most past research in RSS focused on situations where the underlying distribution is continuous. However, it is not unusual to have a discrete data generation mechanism. Estimating statistical functionals is challenging as ties may truly exist in discrete RSS. In this thesis, we started with estimating the cumulative distribution function (CDF) in discrete RSS. We proposed two …
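The basic balanced RSS procedure can be sketched as below. This is a simplified illustration and not the estimators proposed in the thesis: it assumes perfect ranking by the true values, draws with replacement from a hypothetical `population` list, and resolves ties by draw order. The names `ranked_set_sample` and `empirical_cdf` are made up for this example.

```python
import random


def ranked_set_sample(population, set_size, cycles, rng):
    """Draw a balanced ranked set sample of size set_size * cycles.

    In each cycle, for every rank position, draw a fresh set of set_size units,
    rank them, and keep (i.e. "measure") only the unit at that rank position.
    """
    sample = []
    for _ in range(cycles):
        for rank in range(set_size):
            candidates = [rng.choice(population) for _ in range(set_size)]
            # Perfect ranking is assumed; ties keep their draw order.
            candidates.sort()
            sample.append(candidates[rank])
    return sample


def empirical_cdf(sample, x):
    """Naive CDF estimate: the fraction of measured units not exceeding x."""
    return sum(1 for value in sample if value <= x) / len(sample)


if __name__ == "__main__":
    rng = random.Random(0)
    discrete_population = [0, 1, 1, 2, 2, 2, 3]  # hypothetical discrete values
    rss = ranked_set_sample(discrete_population, set_size=3, cycles=5, rng=rng)
    print(len(rss), empirical_cdf(rss, 1))
```

Only one unit per set is measured, which is why RSS suits settings where ranking is cheap and measurement is costly. With discrete data, several candidates in a set can share the same value, which is the tie problem the abstract refers to.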


Analysis Of 2016-17 Major League Soccer Season Data Using Poisson Regression With R, Ian D. Campbell May 2018

Undergraduate Theses and Capstone Projects

To the outside observer, soccer is chaotic with no given pattern or scheme to follow, a random conglomeration of passes and shots that go on for 90 minutes. Yet what if there were a pattern to the chaos, or a way to describe the events that occur in the game quantifiably? Sports statistics is a critical part of baseball and a variety of today’s other sports, but we see very little statistics and data analysis done on soccer. Existing research has looked into the effect of possession time on the outcome of a game, the difference …


Mindset, Attitudes, And Success In Statistics, Matthew Isaac May 2018

Undergraduate Honors Capstone Projects

Students in many disciplines are required to take an introductory statistics course while pursuing a college education. Despite the utility of statistical methods in future research and career pursuits, many students have negative views of statistics. We are interested in how students' mindsets and attitudes towards statistics impact their performance in an undergraduate statistics course. We administered a survey to students in several undergraduate statistics courses at Utah State University. This survey included questions addressing mathematics experience, attitudes towards statistics, mindset, and course performance. We observed that the majority of students indicated the presence of a growth mindset and positive …


Forecasting Labor Force Participation At The Regional Level In The United States: The Case Of Maine, Maryam Kashkooli May 2018

Honors College

This project attempts to investigate the future of labor force participation in Maine using an econometric forecasting approach. Forecasting has become an increasingly popular form of statistical analysis which uses historical distributions to help estimate future distributions of econometric models. There exists extensive literature on forecasting employment; however, the literature on forecasting labor force participation is relatively small. I adapt existing econometric models and make use of time series information on sociodemographic factors such as age and net migration in order to determine how Maine’s changing demographic structure is affecting its labor force and how these effects will carry on …


FM Radio Signal Propagation Evaluation And Creating Statistical Models For Signal Strength Prediction In Differing Topographic Environments, Timothy Land May 2018

Electronic Theses and Dissertations

Radio wave signal strength and associated propagation models are rarely analyzed across individual geographic provinces. This study evaluates the effectiveness of the Radio Mobile model to predict radio wave signal strength in the Blue Ridge and Valley and Ridge physiographic provinces. A spectrum analyzer was used on 19 FM transmitters to determine model accuracy. Statistical analysis determined the significance between different terrain factors and signal strength. Field signal strength was found to be related to test site elevation, transmitter azimuth, elevation angle, transmitter elevation, path loss, and distance. Using 76 signal strength receiver sites, Ordinary Least Square regression models predicted …


The Psychology Of Baseball: How The Mental Game Impacts The Physical Game, Kiera Dalmass Apr 2018

Honors Scholar Theses

The purpose of this study was to find whether or not sports psychology can be effective. Baseball was chosen as the sport for the study because baseball can be analyzed for nearly every single factor of the game, with the exception of the mental readiness or state of the player when he steps onto the field. It therefore provides the optimal atmosphere to provide clinical and statistical support to the field of sports psychology. Despite the numerous pieces of literature that praise and show support for sports psychology, there hasn’t been clinical research to support it. Additionally, multiple sports …


Developing Methods Of Processing And Analyzing Genetic Data To Examine Tiger Salamander Population Structure, Dennis Dongmin Kim Apr 2018

Undergraduate Research Symposium 2018

Professor Heather Waye and her colleagues conducted a pilot study in 2014 to measure genetic diversity and dispersal pattern in a population of tiger salamanders in west-central Minnesota. The ultimate goal of this research was to analyze the genetic differences between tiger salamander larvae captured in breeding ponds within Pepperton Waterfowl Production Area to understand the population structure and movement patterns. They expected that ponds closer to each other would have more similar genetic information, and that genetic differences between ponds would increase with geographic distance. However, the initial analysis using standard techniques failed to uncover useful patterns in the …