Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Articles 1 - 30 of 114

Full-Text Articles in Entire DC Network

Using A Distributive Approach To Model Insurance Loss, Kayla Kippes Apr 2023

Student Research Submissions

Insurance loss is an unpredicted event that stands at the forefront of the insurance industry. Loss in insurance represents the costs or expenses incurred due to a claim. An insurance claim is a request for the insurance company to pay for damage caused to an individual’s property. Loss can be measured by how much money (the dollar amount) has been paid out by the insurance company to repair the damage or it can be measured by the number of claims (claim count) made to the insurance company. Insured events include property damage due to fire, theft, flood, a car accident, …


Length Bias Estimation Of Small Businesses Lifetime, Simeng Li Apr 2023

Honors Theses

Small businesses, particularly restaurants, play a crucial role in the economy by generating employment opportunities, boosting tourism, and contributing to the local economy. However, accurately estimating their lifetimes can be challenging due to the presence of length bias, which occurs when the likelihood of sampling any particular restaurant's closure is influenced by its duration in operation. To address the issue, this study conducts goodness-of-fit tests on exponential/gamma family distributions and employs the Kaplan-Meier method to more accurately estimate the average lifetime of restaurants in Carytown. By providing insights into the challenges of estimating the lifetimes of small businesses, this study …


Applications Of Machine Learning Algorithms In Materials Science And Bioinformatics, Mohammed Quazi Jun 2022

Mathematics & Statistics ETDs

The piezoelectric response has been a measure of interest in density functional theory (DFT) for micro-electromechanical systems (MEMS) since the inception of MEMS technology. Piezoelectric-based MEMS devices find wide applications in automobiles, mobile phones, healthcare devices, and silicon chips for computers, to name a few. Piezoelectric properties of doped aluminum nitride (AlN) have been under investigation in materials science for piezoelectric thin films because of its wide range of device applicability. In this research, using rigorous DFT calculations, high-throughput ab initio simulations for 23 AlN alloys are generated.

This research is the first to report strong enhancements of piezoelectric properties …


Many-Objective Evolutionary Algorithms: Objective Reduction, Decomposition And Multi-Modality., Monalisa Pal Dr. Jan 2022

Doctoral Theses

Evolutionary Algorithms (EAs) for Many-Objective Optimization (MaOO) problems are challenging in nature due to the requirement of a large population size, the difficulty of maintaining selection pressure towards global optima, and the inability to accurately visualize the high-dimensional Pareto-optimal Set (in decision space) and Pareto-Front (in objective space). The quality of the estimated set of Pareto-optimal solutions, resulting from the EAs for MaOO problems, is assessed in terms of proximity to the true surface (convergence) and uniformity and coverage of the estimated set over the true surface (diversity). As the number of objectives grows, the challenges become more profound. Thus, better strategies have …


Mathematical Formulations For Complex Resource Scheduling Problems., T. R. Lalita Dr. Jan 2022

Doctoral Theses

This thesis deals with the development of effective models for large-scale real-world resource scheduling problems. Efficient utilization of resources is crucial for any organization or industry as resources are often scarce. Scheduling them in an optimal way not only addresses the scarcity but also has potential economic benefits. Optimal utilization of resources reduces costs and thereby provides a competitive edge in the business world. Resources can be of different types such as human (personnel, skilled and unskilled), financial (budgets), materials, infrastructures (airports and seaports with designed facilities, windmills, warehouses’ area, hotel rooms, etc.) and equipment (microprocessors, cranes, machinery, aircraft simulators for …


Finding The Best Predictors For Foot Traffic In Us Seafood Restaurants, Isabel Paige Beaulieu Jan 2022

Honors Theses and Capstones

COVID-19 caused state and nation-wide lockdowns, which altered human foot traffic, especially in restaurants. The seafood sector in particular suffered greatly: illegal fishing increased, its goods are perishable, it is seasonal in some places, and imports and exports slowed. Foot traffic data helps business owners decide how much to order, how many employees to schedule, etc. One issue is that the data is very expensive, hard to get, and not available until months after it is recorded. Our goal is to not only find covariates that …


Analyzing Marriage Statistics As Recorded In The Journal Of The American Statistical Association From 1889 To 2012, Annalee Soohoo Jan 2022

CMC Senior Theses

The United States has been tracking American marriage statistics since its founding. According to the United States Census Bureau, “marital status and marital history data help federal agencies understand marriage trends, forecast future needs of programs that have spousal benefits, and measure the effects of policies and programs that focus on the well-being of families, including tax policies and financial assistance programs.”[1] With such a wide scope of applications, it is understandable why marriage statistics are so highly studied and well-documented.

This thesis will analyze American marriage patterns over the past 100 years as documented in the Journal of …


Mary Eleanor Spear's Importance To The History Of Statistical Visualization, Melanie Williams Jan 2022

CMC Senior Theses

This paper will demonstrate why Mary Eleanor Spear (1897-1986) is an important figure in the history of statistical visualization. She led an impressive career working in the federal government as a data analyst before "data analyst" became a thing. She wrote and illustrated two comprehensive textbooks which furthered the art of statistical visualization. Her textbooks cover extensive graphing knowledge still valuable to statisticians and viewers today. Most notable of her works is her development of the box plot. In addition to Spear's career and contributions, this paper will also address the lack of female representation in science, technology, engineering, and …


A Brief Treatise On Bayesian Inverse Regression., Debashis Chatterjee Dr. Dec 2021

Doctoral Theses

Inverse problems, where in a broad sense the task is to learn from the noisy response about some unknown function, usually represented as the argument of some known functional form, have received wide attention in the general scientific disciplines. However, apart from the class of traditional inverse problems, there exists another class of inverse problems, which qualifies as a more authentic class of inverse problems but unfortunately has not received as much attention. In a nutshell, the other class of inverse problems can be described as the problem of predicting the covariates corresponding to given responses and the rest of the data. …


Some Nonparametric Hybrid Predictive Models : Asymptotic Properties And Applications., Tanujit Chakraborty Dr. Nov 2021

Doctoral Theses

Prediction problems like classification, regression, and time series forecasting have always attracted both statisticians and computer scientists worldwide to take up the challenges of data science and the implementation of complicated models using modern computing facilities. But most traditional statistical and machine learning models assume the available data to be well-behaved in terms of the presence of a full set of essential features, equal size of classes, and stationary data structures in all data instances, etc. Practical data sets from the domain of business analytics, process and quality control, software reliability, and macroeconomics, to name a few, suffer from various …


On Tests Of Independence Among Multiple Random Vectors Of Arbitrary Dimensions., Angshuman Roy Dr. Apr 2021

Doctoral Theses

Measures of dependence among several random vectors and associated tests of independence play a major role in different statistical applications. Blind source separation or independent component analysis (see, e.g., Hyvärinen et al., 2001; Shen et al., 2009), feature selection and feature extraction (see, e.g., Li et al., 2012), detection of serial correlation in time series (see, e.g., Ghoudi et al., 2001) and finding the causal relationships among the variables (see, e.g., Chakraborty and Zhang, 2019) are some examples of their widespread applications. Tests of independence have vast applications in other areas of science as well. For instance, to characterize the …


Essays In Social Choice Theory., Dipjyoti Majumdar Dr. Feb 2021

Doctoral Theses

The purpose of this thesis is to explore some issues in social choice theory and decision theory. Social choice theory provides the theoretical foundations for the field of public choice and welfare economics. It tries to bring together normative aspects like perspective value judgements and positive aspects, like strategic considerations. The second feature, which is our focus, is closely related to the problem of providing appropriate incentives to agents, an issue of prime importance in economics. Consider, for example, a set of agents who must elect one among a set of candidates. These candidates may be physical agents …


Bayesian Topological Machine Learning, Christopher A. Oballe Aug 2020

Doctoral Dissertations

Topological data analysis encompasses a broad set of ideas and techniques that address 1) how to rigorously define and summarize the shape of data, and 2) how to use these constructs for inference. This dissertation addresses the second problem by developing new inferential tools for topological data analysis and applying them to solve real-world data problems. First, a Bayesian framework to approximate probability distributions of persistence diagrams is established. The key insight underpinning this framework is that persistence diagrams may be viewed as Poisson point processes with prior intensities. With this assumption in hand, one may compute posterior intensities by adopting techniques …


Market Research On Student Concert Attendance At Bgsu's College Of Musical Arts, Mary Solomon May 2019

Honors Projects

Bowling Green State University boasts a well-established College of Musical Arts which holds concerts performed by esteemed faculty, prestigious guest artists, and students. The school hosts these events in Kobacker Hall and Bryan Recital Hall, which can accommodate up to 800 and 250 audience members, respectively. However, performances in Kobacker Hall only fill one-fourth of the 800 seats, on average. Why is this so? This project aims to investigate the factors that influence students’ decisions to attend concerts at the College of Musical Arts (CMA). By methodology of survey research and statistical analysis, this project will look into …


Investigating The Factors That Best Describe Student Experience And Performance In College, Abigale Wynn Jan 2019

Undergraduate Honors Thesis Collection

The National Survey of Student Engagement (NSSE) surveys students at four-year institutions around the United States in order to offer universities accessible ways to evaluate their students' experiences and performance. The NSSE data is collected in the form of a Likert-scale survey geared towards first-year and senior-year students. It asks questions about how they spend their time throughout the academic year and how they rate their experience. This thesis looks at the NSSE survey data from Butler University in 2016 and attempts to apply classification techniques and predictive models to draw conclusions about student performance. Methods such as …


Utilizing Multi-Level Classification Techniques To Predict Adverse Drug Effects And Reactions, Victoria Puhl Jan 2019

Undergraduate Honors Thesis Collection

Multi-class classification models are used to predict categorical response variables with more than two possible outcomes. A collection of multi-class classification techniques such as Multinomial Logistic Regression, Naïve Bayes, and Support Vector Machine is used in predicting patients’ drug reactions and adverse drug effects based on patients’ demographics and drug administration. The newly released 2018 data on drug reactions and adverse drug effects from the U.S. Food and Drug Administration are tested with the models. The applicability of model evaluation measures such as sensitivity, specificity and prediction accuracy in multi-class settings is also discussed.


Modeling Stochastically Intransitive Relationships In Paired Comparison Data, Ryan Patrick Alexander Mcshane Jan 2019

Statistical Science Theses and Dissertations

If the Warriors beat the Rockets and the Rockets beat the Spurs, does that mean that the Warriors are better than the Spurs? Sophisticated fans would argue that the Warriors are better by the transitive property, but could Spurs fans make a legitimate argument that their team is better despite this chain of evidence?

We first explore the nature of intransitive (rock-scissors-paper) relationships with a graph theoretic approach to the method of paired comparisons framework popularized by Kendall and Smith (1940). Then, we focus on the setting where all pairs of items, teams, players, or objects have been compared to …


Cramer Type Moderate Deviations For Random Fields And Mutual Information Estimation For Mixed-Pair Random Variables, Aleksandr Beknazaryan Jan 2019

Electronic Theses and Dissertations

In this dissertation we first study Cramer type moderate deviation for partial sums of random fields by applying the conjugate method. In 1938 Cramer published his results on large deviations of sums of i.i.d. random variables after which a lot of research has been done on establishing Cramer type moderate and large deviation theorems for different types of random variables and for various statistics. In particular results have been obtained for independent non-identically distributed random variables for the sum of independent random to estimate the mutual information between two random variables. The estimates enjoy a central limit theorem under some …


Analysis Of 2016-17 Major League Soccer Season Data Using Poisson Regression With R, Ian D. Campbell May 2018

Undergraduate Theses and Capstone Projects

To the outside observer, soccer is chaotic with no given pattern or scheme to follow, a random conglomeration of passes and shots that go on for 90 minutes. Yet what if there were a pattern to the chaos, or a way to describe the events that occur in the game quantifiably? Sports statistics is a critical part of baseball and a variety of today’s other sports, but we see very little statistics and data analysis done on soccer. The research that exists has looked into the effect of possession time on the outcome of a game, the difference …


Mindset, Attitudes, And Success In Statistics, Matthew Isaac May 2018

Undergraduate Honors Capstone Projects

Students in many disciplines are required to take an introductory statistics course while pursuing a college education. Despite the utility of statistical methods in future research and career pursuits, many students have negative views of statistics. We are interested in how students' mindsets and attitudes towards statistics impact their performance in an undergraduate statistics course. We administered a survey to students in several undergraduate statistics courses at Utah State University. This survey included questions addressing mathematics experience, attitudes towards statistics, mindset, and course performance. We observed that the majority of students indicated the presence of a growth mindset and positive …


Students’ Interpretations Of Categorical Data Using Dynamic Graphical Representations, Adam Eide Jan 2018

Master's Theses and Doctoral Dissertations

Statistical association is an important concept in statistics. An exploratory study examined how students reason about statistical association utilizing graphical representations constructed with CODAP, a dynamic statistical graphing software. Task-based interviews were conducted with three 6th grade students prior to formal instruction. Students’ conceptions of a statistical relationship, proportional reasoning skill level, ability to interpret bivariate categorical graphs (particularly segmented bar graphs and two-way binned plots), and ability to identify association of two categorical variables were all investigated through interview tasks and responses to inquiry. Students were found to have developing proportional reasoning skills and struggled to correctly define and …


A Comparison Of Five Statistical Methods For Predicting Stream Temperature Across Stream Networks, Maike F. Holthuijzen Aug 2017

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

The health of freshwater aquatic systems, particularly stream networks, is mainly influenced by water temperature, which controls biological processes and influences species distributions and aquatic biodiversity. Thermal regimes of rivers are likely to change in the future, due to climate change and other anthropogenic impacts, and our ability to predict stream temperatures will be critical in understanding distribution shifts of aquatic biota. Spatial statistical network models take into account spatial relationships but have drawbacks, including high computation times and data pre-processing requirements. Machine learning techniques and generalized additive models (GAM) are promising alternatives to the SSN model. Two machine learning …


Neural Network Predictions Of A Simulation-Based Statistical And Graph Theoretic Study Of The Board Game Risk, Jacob Munson Jan 2017

Murray State Theses and Dissertations

We translate the RISK board into a graph which undergoes updates as the game advances. The dissection of the game into a network model in discrete time is a novel approach to examining RISK. A review of the existing statistical findings of skirmishes in RISK is provided. The graphical changes are accompanied by an examination of the statistical properties of RISK. The game is modeled as a discrete time dynamic network graph, with the various features of the game modeled as properties of the network at a given time. As the network is computationally intensive to implement, results are produced …


A Traders Guide To The Predictive Universe- A Model For Predicting Oil Price Targets And Trading On Them, Jimmie Harold Lenz Dec 2016

Doctor of Business Administration Dissertations

At heart every trader loves volatility; this is where return on investment comes from, and this is what drives the proverbial “positive alpha.” As a trader, understanding the probabilities related to the volatility of prices is key; however, if you could also predict future prices with reliability, the world would be your oyster. To this end, I have achieved three goals with this dissertation: to develop a model to predict future short-term prices (direction and magnitude), to effectively test this by generating consistent profits utilizing a trading model developed for this purpose, and to write a paper that anyone with …


Mechanism Design For Land Acquisition., Soumendu Sarkar Dr. Sep 2016

Doctoral Theses

Conversion of land use from agriculture to industry is a typical feature of economic development in many densely populated countries. Large-scale construction often requires industry or the government to acquire vast areas of land that are inhabited, and often cultivated, by hundreds and even thousands of people. For some landowners, possession signifies power and status in society, while for others, it is the only means for earning a livelihood. Adamopoulos and Restuccia (2014) use data from the World Census of Agriculture to show that average farm size in the poorest 20% of countries is 1.6 hectares, while that in the richest …


Regression Analysis Of Success In Major League Baseball, Johnathon Tyler Clark May 2016

Senior Theses

This thesis is designed to explore whether a team’s success in any given season can be predicted or explained by any number of statistics in that season. There are thirty teams in the MLB; of these thirty, ten make the postseason bracket-style playoffs. The MLB is divided up into two leagues, the American League and the National League; these two leagues are then divided up into three divisions each, the West Division, the Central Division, and the East Division. To make the playoffs a team must either have the most wins in its division after the last game of the …


Inference On Time-To-Event Distribution From Retrospective Data With Imperfect Recall., Sedigheh Salehabadi Dr. Mar 2016

Doctoral Theses

Time-to-event data arise from measurements of time until the occurrence of an event of interest. Such data are common in the fields of biology, epidemiology, public health, medical research, economics and industry. The event of interest can be the death of a human being (Klein and Moeschberger, 2003), failure of a machine (Zhiguo et al., 2007), onset of menarche in adolescent and young adult females (Bergsten-Brucefors, 1976; Chumlea et al., 2003; Mirzaei, Sengupta and Das, 2015), onset (or relapse) of a disease (Klein and Moeschberger, 2003), dental development (Demirjian, Goldstein and Tanner, 1973; Eveleth and Tanner, 1990), breast …


Some Distribution-Free Two-Sample Tests Applicable To High Dimension, Low Sample Size Data., Munmun Biswas Dr. Feb 2016

Doctoral Theses

The advancement of data acquisition technologies and computing resources has greatly facilitated the analysis of massive data sets in various fields of science. Researchers from different disciplines rigorously investigate these data sets to extract useful information for new scientific discoveries. Many of these data sets contain a large number of features but a small number of observations. For instance, in the fields of chemometrics (see e.g., Schoonover et al. (2003)), medical image analysis (see e.g., Yushkevich et al. (2001)) and microarray gene expression data analysis (see e.g., Eisen and Brown (1999), Alter et al. (2000)), we often deal with data of dimensions …


Nonparametric Methods For Data In Infinite Dimensional Space., Anirvan Chakraborty Dr. Dec 2015

Doctoral Theses

For univariate as well as finite dimensional multivariate data, there is an extensive literature on nonparametric statistical methods. One of the reasons for the popularity of nonparametric methods is that it is often difficult to justify the assumptions (e.g., Gaussian distribution of the data) made in the models used in parametric methods. Nonparametric procedures use more flexible models, which involve fewer assumptions. So, they are more robust against possible departures from the model assumptions, and are applicable to a wide variety of data. Nonparametric methods outperform their parametric competitors in many situations, where the assumptions required for the parametric methods …


Essays In Political Economy And Voting., Mihir Bhattacharya Dr. Oct 2015

Doctoral Theses

This thesis comprises three chapters on issues in political economy and voting. The first chapter considers a multilevel multidimensional aggregation problem in voting. The second chapter considers a model of party formation where citizens propose links to other candidates. The final chapter considers a model of electoral competition between regional and national parties. We provide a brief description of each chapter below.

1.1 Multilevel Multidimensional Consistent Aggregators

In this chapter we study gerrymander-proof or consistent aggregation rules in different contexts. There are several papers that have studied the structure of consistent voting rules satisfying various versions of consistency. Virtually all these papers …