Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons


Statistics


Articles 1 - 30 of 825

Full-Text Articles in Physical Sciences and Mathematics

Comparing Performance Of Gene Set Test Methods Using Biologically Relevant Simulated Data, Richard M. Lambert Dec 2018


All Graduate Theses and Dissertations

Today we know that there are many genetically driven diseases and health conditions. These problems often manifest only when a set of genes is either active or inactive. Recent technology allows us to measure the activity level of genes in cells, which we call gene expression. It is of great interest to society to be able to statistically compare the gene expression of a large number of genes between two or more groups. For example, we may want to compare the gene expression of a group of cancer patients with a group of non-cancer patients to better understand the genetic ...


Rfviz: An Interactive Visualization Package For Random Forests In R, Christopher Beckett Dec 2018


All Graduate Plan B and other Reports

Random forests are very popular tools for predictive analysis and data science. They work for both classification (where there is a categorical response variable) and regression (where the response is continuous). Random forests provide proximities, and both local and global measures of variable importance. However, these quantities require special tools to be effectively used to interpret the forest. Rfviz is a sophisticated interactive visualization package and toolkit in R, specially designed for interpreting the results of a random forest in a user-friendly way. Rfviz uses a recently developed R package (loon) from the Comprehensive R Archive Network (CRAN) to create ...


Minimizing The Perceived Financial Burden Due To Cancer, Hassan Azhar, Zoheb Allam, Gino Varghese, Daniel W. Engels, Sajiny John Aug 2018


SMU Data Science Review

In this paper, we present a regression model that predicts the perceived financial burden that a cancer patient experiences in the treatment and management of the disease. Cancer patients do not fully understand the burden associated with the cost of cancer, and their lack of understanding can increase the difficulties associated with living with the disease, in particular coping with the cost. The relationship between demographic characteristics and financial burden was examined in order to better understand the characteristics of a cancer patient and their burden, while all-subsets regression was used to determine the best predictors of financial burden. Age ...


Secondary Data Analysis Project, Jonathan M. Gallimore Aug 2018


SF 420 PR - Gallimore - Fall 2018

This activity is designed to give students an opportunity to apply what they have learned in statistics to a real dataset and to answer their own research questions. Students will also practice reporting their results in a paper using APA format.


Deep Machine Learning For Mechanical Performance And Failure Prediction, Elijah Reber, Nickolas D. Winovich, Guang Lin Aug 2018


The Summer Undergraduate Research Fellowship (SURF) Symposium

Deep learning has provided opportunities for advancement in many fields. One such opportunity is being able to accurately predict real-world events. Being able to ensure proper motor function and predict energy output is valuable for owners of wind turbines. In this paper, we look at how effective a deep neural network is at predicting the failure or energy output of a wind turbine. A data set was obtained that contained sensor data from 17 wind turbines over 13 months, measuring numerous variables, such as spindle speed and blade position and whether or not the wind turbine experienced ...


Implementing The Use Of Personal Activity Data In An Introductory Statistics Course, Lacy Christensen Aug 2018


All Graduate Theses and Dissertations

Integrating real data into a classroom is one of the recommendations in the Guidelines for Assessment and Instruction in Statistics Education (GAISE) college report, which lays out guidelines for an introductory statistics course (Committee, GAISE College Report ASA Revision, 2016). In order to assess the effect of using real data in a classroom, the students received physical activity trackers to wear during an undergraduate introductory statistics course taught in the summer. This tracker, a Fitbit, enabled students to monitor and record their steps, calories, and active time throughout the class. Collecting personal activity data (PAD) creates a large database which ...


Bayesian Analytical Approaches For Metabolomics : A Novel Method For Molecular Structure-Informed Metabolite Interaction Modeling, A Novel Diagnostic Model For Differentiating Myocardial Infarction Type, And Approaches For Compound Identification Given Mass Spectrometry Data., Patrick J. Trainor Aug 2018


Electronic Theses and Dissertations

Metabolomics, the study of small molecules in biological systems, has enjoyed great success in enabling researchers to examine disease-associated metabolic dysregulation and has been utilized for the discovery of biomarkers of disease and phenotypic states. In spite of recent technological advances in the analytical platforms utilized in metabolomics and the proliferation of tools for the analysis of metabolomics data, significant challenges in metabolomics data analyses remain. In this dissertation, we present three of these challenges and Bayesian methodological solutions for each. In the first part we develop a new methodology to serve as a basis for making higher order inferences in metabolomics ...


International Data Sources & Data Literacy, Lisa DeLuca Jul 2018


Lisa DeLuca, MLIS, MPA

No abstract provided.


Calculus Of The Impossible: Review Of The Improbability Principle (2014) By David Hand And The Logic Of Miracles (2018) By László Mérő, Samuel L. Tunstall Jul 2018


Numeracy

David J. Hand. 2014. The Improbability Principle: Why Coincidences, Miracles, and Rare Events Happen Every Day (New York, NY: Scientific American/Farrar, Straus and Giroux) 288 pp. ISBN: 978-0374175344.

László Mérő. 2018. The Logic of Miracles: Making Sense of Rare, Really Rare, and Impossibly Rare Events (New Haven, CT: Yale University Press) 288 pp. ISBN: 978-0300224153.

David Hand and László Mérő both grapple with the occurrence of seemingly impossible events in these two popular science books. In this comparative review, I describe the two books and explain why I prefer Hand's treatment of the impossible.


Hierarchical Bayesian Data Fusion Using Autoencoders, Yevgeniy Vladimirovich Reznichenko Jul 2018


Master's Theses (2009 -)

In this thesis, a novel method for tracker fusion is proposed and evaluated for vision-based tracking. This work combines three distinct popular techniques into a recursive Bayesian estimation algorithm. First, semi-supervised learning approaches are used to partition data and to train a deep neural network that is capable of capturing normal visual tracking operation and is able to detect anomalous data. We compare various methods by examining their respective receiver operating characteristic (ROC) curves, which represent the trade-off between specificity and sensitivity for various detection threshold levels. Next, we incorporate the trained neural networks into an existing data ...
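
As a rough illustration of the ROC curves mentioned in this abstract, the following Python sketch computes one (false positive rate, true positive rate) point per detection threshold and the area under the resulting curve. It is not the thesis's code: the function names `roc_points` and `auc` and the toy scores and labels are hypothetical, chosen only to show how sensitivity (the true positive rate) trades off against specificity (one minus the false positive rate).

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) pairs, one per distinct threshold, starting at (0, 0).

    scores: anomaly scores, higher means "more likely anomalous" (assumption).
    labels: 1 for a true anomaly, 0 for normal data; both classes must occur.
    """
    pos = sum(labels)
    neg = len(labels) - pos
    points = [(0.0, 0.0)]
    # Lower the threshold step by step; everything scoring >= t is flagged.
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 0)
        points.append((fp / neg, tp / pos))
    return points


def auc(points):
    """Trapezoidal area under ROC points ordered by increasing FPR."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Hypothetical toy data: two anomalies scored above two normal samples.
toy_scores = [0.9, 0.8, 0.3, 0.1]
toy_labels = [1, 1, 0, 0]
toy_curve = roc_points(toy_scores, toy_labels)
print(toy_curve, auc(toy_curve))
```

A detector whose curve lies closer to the top-left corner (higher area) separates anomalous from normal data better across thresholds, which is one common way to compare methods.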


Ratchet Mechanisms In Macroevolutionary Processes, Trevor J. Dimartino Jun 2018


Computer Science Graduate Theses & Dissertations

How have we arrived at the diverse set of complex species that we currently find in our world? Using statistical simulations of evolutionary processes, this study investigates how the fundamental minimum sizes of species increase irreversibly over time, and how complexities evolved along the way compound throughout that process. Our results imply that unless a random mutation opens up a new dimension of nichespace for the clade to expand within, the mutation will eventually become extinct due to inherent genetic drift.


A 3d Characteristics Database Of Land Engraved Areas With Known Subclass, Entni Lin Jun 2018


Student Theses

Subclass characteristics on bullets may mislead firearm examiners when they rely on traditional 2D images. In order to provide indelible examples for training and help avoid identification errors, 3D topography surface maps and statistical methods of pattern recognition are applied to toolmarks on bullets containing known subclass characteristics. This research was conducted by collecting 3D topography surface map data from land engraved areas of bullets fired through known barrels. This data was processed and used to train the statistical algorithms to predict their origin. The results from the algorithm are compared with the “right answers” (i.e. correct IDs) of ...


Forecasting Labor Force Participation At The Regional Level In The United States: The Case Of Maine, Maryam Kashkooli May 2018


Honors College

This project attempts to investigate the future of labor force participation in Maine using an econometric forecasting approach. Forecasting has become an increasingly popular form of statistical analysis which uses historical distributions to help estimate future distributions of econometric models. There exists extensive literature on forecasting employment; however, the literature on forecasting labor force participation is relatively small. I adapt existing econometric models and make use of time series information on sociodemographic factors such as age and net migration in order to determine how Maine’s changing demographic structure is affecting its labor force and how these effects will carry ...


Analysis Of 2016-17 Major League Soccer Season Data Using Poisson Regression With R, Ian D. Campbell May 2018


Undergraduate Theses and Capstone Projects

To the outside observer, soccer is chaotic with no given pattern or scheme to follow, a random conglomeration of passes and shots that go on for 90 minutes. Yet, what if there was a pattern to the chaos, or a way to describe the events that occur in the game quantifiably? Sports statistics is a critical part of baseball and a variety of today’s other sports, but we see very little statistics and data analysis done on soccer. Of this research, some has looked into the effect of possession time on the outcome of a game, the ...


The Psychology Of Baseball: How The Mental Game Impacts The Physical Game, Kiera Dalmass Apr 2018


Honors Scholar Theses

The purpose of this study was to find whether or not sports psychology can be effective. Baseball was chosen as the sport for the study because baseball can be analyzed for nearly every single factor of the game, with the exception of the mental readiness or state of the player when he steps onto the field. It therefore provides the optimal atmosphere to provide clinical and statistical support to the field of sports psychology. Despite the numerous pieces of literature that praise and show support for sports psychology, there hasn’t been clinical research to support it. Additionally, multiple ...


Developing Methods Of Processing And Analyzing Genetic Data To Examine Tiger Salamander Population Structure, Dennis Dongmin Kim Apr 2018


Undergraduate Research Symposium 2018

Professor Heather Waye and her colleagues conducted a pilot study in 2014 to measure genetic diversity and dispersal pattern in a population of tiger salamanders in west-central Minnesota. The ultimate goal of this research was to analyze the genetic differences between tiger salamander larvae captured in breeding ponds within Pepperton Waterfowl Production Area to understand the population structure and movement patterns. They expected that ponds closer to each other would have more similar genetic information, and that genetic differences between ponds would increase with geographic distance. However, the initial analysis using standard techniques failed to uncover useful patterns in the ...


Introduction To Statistics (Ga Southern), Scott Kersey, Stephen Carden Apr 2018


Mathematics Grants Collections

This Grants Collection for Introduction to Statistics was created under a Round Eight ALG Textbook Transformation Grant.

Affordable Learning Georgia Grants Collections are intended to provide faculty with the frameworks to quickly implement or revise the same materials as a Textbook Transformation Grants team, along with the aims and lessons learned from project teams during the implementation process.

Documents are in .pdf format, with a separate .docx (Word) version available for download. Each collection contains the following materials:

  • Linked Syllabus
  • Initial Proposal
  • Final Report


Mathematics In Contemporary Society - Chapter 5 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Mathematics In Contemporary Society - Chapter 7 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Mathematics In Contemporary Society - Chapter 6 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Mathematics In Contemporary Society - Chapter 8 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Mathematics In Contemporary Society - Chapter 10 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Mathematics In Contemporary Society - Chapter 9 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Mathematics In Contemporary Society - Chapter 1 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Mathematics In Contemporary Society - Chapter 2 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Mathematics In Contemporary Society - Chapter 3 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Mathematics In Contemporary Society - Chapter 4 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Mathematics In Contemporary Society - Chapter 11 (Spring 2018), Patrick J. Wallach Apr 2018


Open Educational Resources

Mathematics in Contemporary Society is the textbook that corresponds to MA-321, the course of the same name. The course is designed to provide students with mathematical ideas and methods found in the social sciences, the arts, and in business. Topics will include fundamentals of statistics, scatterplots, graphics in the media, problem solving strategies, dimensional analysis, mathematics in music and art, and mathematical modeling. EXCEL is used to explore real world applications.


Webwork Problems For Openstax Introductory Statistics, Scott Kersey, Stephen Carden Apr 2018


Mathematics Ancillary Materials

These open-source mathematics homework problems are programmed for the WeBWorK mathematics platform and correspond to chapters in OpenStax Introductory Statistics. They were created through a Round Eight Textbook Transformation Grant.


Runs Of Identical Outcomes In A Sequence Of Bernoulli Trials, Matthew Riggle Apr 2018


Masters Theses & Specialist Projects

The Bernoulli distribution is a basic, well-studied distribution in probability. In this thesis, we will consider repeated Bernoulli trials in order to study runs of identical outcomes. More formally, for t ∈ ℕ, we let X_t ∼ Bernoulli(p), where p is the probability of success, q = 1 − p is the probability of failure, and all X_t are independent. Then X_t gives the outcome of the t-th trial, which is 1 for success or 0 for failure. For n, m ∈ ℕ, we define T_n to be the number of trials needed to first observe n consecutive successes (where the n-th ...
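
To make the definition of the waiting time for n consecutive successes concrete, here is a small Python sketch that simulates it and compares the sample mean with the standard closed form for its expectation, (1 − pⁿ) / (q · pⁿ), valid for 0 < p < 1. This formula is a well-known textbook result and is not quoted from the thesis; the function names `trials_until_run`, `expected_trials`, and `simulated_mean`, as well as the seed and repetition count, are hypothetical choices made for illustration.

```python
import random


def trials_until_run(n, p, rng):
    """Draw Bernoulli(p) trials until n consecutive successes; return the trial count."""
    run = 0      # length of the current streak of successes
    trials = 0   # total number of trials so far
    while run < n:
        trials += 1
        if rng.random() < p:
            run += 1
        else:
            run = 0  # a failure resets the streak
    return trials


def expected_trials(n, p):
    """Standard closed form for the expected waiting time, assuming 0 < p < 1."""
    q = 1 - p
    return (1 - p**n) / (q * p**n)


def simulated_mean(n, p, reps=20000, seed=0):
    """Monte Carlo estimate of the expected waiting time (seed and reps are arbitrary)."""
    rng = random.Random(seed)
    return sum(trials_until_run(n, p, rng) for _ in range(reps)) / reps


# Fair coin, two successes in a row: the closed form gives 6 expected trials.
print(expected_trials(2, 0.5), simulated_mean(2, 0.5))
```

For n = 1 the closed form reduces to 1/p, the mean of a geometric waiting time, which is a quick sanity check on the formula.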