Open Access. Powered by Scholars. Published by Universities.®
Other Statistics and Probability Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Institution
-
- California Polytechnic State University, San Luis Obispo (3)
- Western University (2)
- City University of New York (CUNY) (1)
- Georgia Southern University (1)
- Michigan Technological University (1)
-
- Stephen F. Austin State University (1)
- University of Denver (1)
- University of Louisville (1)
- University of Massachusetts Amherst (1)
- University of Nebraska at Omaha (1)
- University of New Hampshire (1)
- University of North Florida (1)
- University of Texas at Tyler (1)
- Virginia Commonwealth University (1)
- West Virginia University (1)
- Keyword
-
- Statistics (3)
- 60x30TX (1)
- Analysis (1)
- Bayesian methods (1)
- Bias (1)
-
- Big data (1)
- Bivariate density estimation (1)
- Bootstrap (1)
- Bootstrapping (1)
- CUSUM (1)
- Circular Statistics Data Applied Linear Circular Correlation (1)
- Community college faculty (1)
- Computational Neuroscience (1)
- Computer Science Education (1)
- Computing (1)
- Concept drift (1)
- Conditional data methods (1)
- Cormack-Jolly-Seber models (1)
- Correlated fates (1)
- Data Analysis (1)
- Data Cleaning (1)
- Data Manipulation (1)
- Data-Driven Modeling (1)
- Data-driven deep learning (1)
- Decision Trees (1)
- Differentiated log-density approximants (1)
- Distribution (1)
- Distribution theory (1)
- ENFA (1)
- Education (1)
- Publication Year
Articles 1 - 18 of 18
Full-Text Articles in Other Statistics and Probability
Imputation Strategies For Different Categories Of Missing Data, Karthik Chalumuri
Imputation Strategies For Different Categories Of Missing Data, Karthik Chalumuri
Honors Theses and Capstones
Addressing missing data in research is crucial for ensuring the reliability and validity of study findings, yet it remains a significant challenge. This study investigates the impact of missing data on research outcomes and explores the underutilization of existing tools for managing missingness, potentially leading to gaps in critical information with tangible implications for decision-making processes (Dziura et al.).
Focusing on the different categories of missing data—Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR)—this research examines various imputation strategies tailored to each category. Specifically, we compare the efficacy of several model-based imputation methods, …
A Data-Driven Multi-Regime Approach For Predicting Real-Time Energy Consumption Of Industrial Machines., Abdulgani Kahraman
A Data-Driven Multi-Regime Approach For Predicting Real-Time Energy Consumption Of Industrial Machines., Abdulgani Kahraman
Electronic Theses and Dissertations
This thesis focuses on methods for improving energy consumption prediction performance in complex industrial machines. Working with real-world industrial machines brings several challenges, including data access, algorithmic bias, data privacy, and the interpretation of machine learning algorithms. To effectively manage energy consumption in the industrial sector, it is essential to develop a framework that enhances prediction performance, reduces energy costs, and mitigates air pollution in heavy industrial machine operations. This study aims to assist managers in making informed decisions and driving the transition towards green manufacturing. The energy consumption of industrial machinery is substantial, and the recent increase in CO2 …
Addressing The Impact Of Time-Dependent Social Groupings On Animal Survival And Recapture Rates In Mark-Recapture Studies, Alexandru M. Draghici
Addressing The Impact Of Time-Dependent Social Groupings On Animal Survival And Recapture Rates In Mark-Recapture Studies, Alexandru M. Draghici
Electronic Thesis and Dissertation Repository
Mark-recapture (MR) models typically assume that individuals under study have independent survival and recapture outcomes. One such model of interest is known as the Cormack-Jolly-Seber (CJS) model. In this dissertation, we conduct three major research projects focused on studying the impact of violating the independence assumption in MR models along with presenting extensions which relax the independence assumption. In the first project, we conduct a simulation study to address the impact of failing to account for pair-bonded animals having correlated recapture and survival fates on the CJS model. We examined the impact of correlation on the likelihood ratio test (LRT), …
Attempting To Predict The Unpredictable: March Madness, Coleton Kanzmeier
Attempting To Predict The Unpredictable: March Madness, Coleton Kanzmeier
Theses/Capstones/Creative Projects
Each year, millions upon millions of individuals fill out at least one if not hundreds of March Madness brackets. People test their luck every year, whether for fun, with friends or family, or to even win some money. Some people rely on their basketball knowledge whereas others know it is called March Madness for a reason and take a shot in the dark. Others have even tried using statistics to give them an edge. I intend to follow a similar approach, using statistics to my advantage. The end goal is to predict this year’s, 2022, March Madness bracket. To achieve …
Examining The Credibility Of Story-Based Causal Methodologies, Megan E. Kauffmann
Examining The Credibility Of Story-Based Causal Methodologies, Megan E. Kauffmann
Electronic Theses and Dissertations
The purpose of this study was to explore how evaluators justify using story-based methodologies when examining causality. The two primary research questions of the study included: 1) what arguments are made by evaluators to justify the credibility of story-based causal methodologies to evaluation stakeholders; and 2) from the perspective of evaluators, how do contextual factors influence whether story-based causal methodologies are perceived as credible by evaluation stakeholders? A case study was conducted to examine the cases of four evaluators who had experience implementing a story-based methodology in an evaluation. Data collection procedures included two interviews with each participant and a …
Maximum Likelihood Estimator Method To Estimate Flaw Parameters For Different Glass Types, Nabhajit Goswami
Maximum Likelihood Estimator Method To Estimate Flaw Parameters For Different Glass Types, Nabhajit Goswami
Dissertations, Master's Theses and Master's Reports
Glass is commonly used in architectural applications, such as windows and in-fill panels and structural applications, such as beams and staircases. Despite the popularity of structural glass use in buildings, an engineering design standard to determine the required component or member strength for design loads does not exist. Glass is a brittle material that lacks a well-defined yield or ultimate stress, unlike ductile materials. The traditional engineering methods used to design a ductile material cannot be used to design a glass component. Glass fails in tension primarily due to the presence of microscopic flaws present on the surface that acts …
Role Of Inhibition And Spiking Variability In Ortho- And Retronasal Olfactory Processing, Michelle F. Craft
Role Of Inhibition And Spiking Variability In Ortho- And Retronasal Olfactory Processing, Michelle F. Craft
Theses and Dissertations
Odor perception is the impetus for important animal behaviors, most pertinently for feeding, but also for mating and communication. There are two predominate modes of odor processing: odors pass through the front of nose (ortho) while inhaling and sniffing, or through the rear (retro) during exhalation and while eating and drinking. Despite the importance of olfaction for an animal’s well-being and specifically that ortho and retro naturally occur, it is unknown whether the modality (ortho versus retro) is transmitted to cortical brain regions, which could significantly instruct how odors are processed. Prior imaging studies show different …
A Study Of Cusum Statistics On Bitcoin Transactions, Ivan Perez
A Study Of Cusum Statistics On Bitcoin Transactions, Ivan Perez
Theses and Dissertations
In this thesis, our objective is to study the relationship between transaction price and volume in the BTC/USD Coinbase exchange. In the second chapter, we develop a consecutive CUSUM algorithm to detect instantaneous changes in the arrival rate of market orders. We begin by estimating a baseline rate using the assumption of a local time-homogeneous Poisson process. Our observations lead us to reject the plausibility of a time-homogeneous Poisson model on a more global scale by using a chi squared test. We thus proceed to use CUSUM-based alarms to detect consecutive upward and downward changes in the arrival rate of …
Design Of Experiment And Analysis Techniques For Fuel Consumption Data Using Heavy-Duty Diesel Vehicles And On-Road Testing, Sarah Ann Mills
Design Of Experiment And Analysis Techniques For Fuel Consumption Data Using Heavy-Duty Diesel Vehicles And On-Road Testing, Sarah Ann Mills
Graduate Theses, Dissertations, and Problem Reports
Chassis dynamometer and on-road testing are usually employed to test vehicle operation. Testing on a chassis dynamometer reduces data variability compared to on-road testing due to the controlled environment but it does not account for other important variables that affects real-world vehicle operation. This study used on-road testing to investigate the differences between two test fuels under real-world conditions. Three heavy-duty diesel vehicles were driven on different routes for a period of three months. Each vehicle was instrumented with flow meters to gather fuel consumption data, which was then compared to the fuel rate broadcasted by the engine control unit …
Evaluation Of Using The Bootstrap Procedure To Estimate The Population Variance, Nghia Trong Nguyen
Evaluation Of Using The Bootstrap Procedure To Estimate The Population Variance, Nghia Trong Nguyen
Electronic Theses and Dissertations
The bootstrap procedure is widely used in nonparametric statistics to generate an empirical sampling distribution from a given sample data set for a statistic of interest. Generally, the results are good for location parameters such as population mean, median, and even for estimating a population correlation. However, the results for a population variance, which is a spread parameter, are not as good due to the resampling nature of the bootstrap method. Bootstrap samples are constructed using sampling with replacement; consequently, groups of observations with zero variance manifest in these samples. As a result, a bootstrap variance estimator will carry a …
Initial Evidence Of Construct Validity Of Data From A Self-Assessment Instrument Of Technological Pedagogical Content Knowledge (Tpack) In 2-Year Public College Faculty In Texas, Kristin C. Scott
Human Resource Development Theses and Dissertations
Technological pedagogical content knowledge (TPACK) has been studied in K-12 faculty in the U.S. and around the world using survey methodology. Very few studies of TPACK in post-secondary faculty have been conducted and no peer-reviewed studies in U.S. post-secondary faculty have been published to date. The present study is the first reliability and validity of data from a TPACK survey to be conducted with a large sample of U.S. post-secondary faculty. The professorate of 2-year public college faculty in Texas will help their institutions meet the goals of the state’s higher education strategic plan, 60x30TX. In order to do …
Advances In Semi-Nonparametric Density Estimation And Shrinkage Regression, Hossein Zareamoghaddam
Advances In Semi-Nonparametric Density Estimation And Shrinkage Regression, Hossein Zareamoghaddam
Electronic Thesis and Dissertation Repository
This thesis advocates the use of shrinkage and penalty techniques for estimating the parameters of a regression model that comprises both parametric and nonparametric components and develops semi-nonparametric density estimation methodologies that are applicable in a regression context.
First, a moment-based approach whereby a univariate or bivariate density function is approximated by means of a suitable initial density function that is adjusted by a linear combination of orthogonal polynomials is introduced. Such adjustments are shown to be mathematically equivalent to making use of standard polynomials in one or two variables. Once extended to apply to density estimation, in which case …
Some New And Generalized Distributions Via Exponentiation, Gamma And Marshall-Olkin Generators With Applications, Hameed Abiodun Jimoh
Some New And Generalized Distributions Via Exponentiation, Gamma And Marshall-Olkin Generators With Applications, Hameed Abiodun Jimoh
Electronic Theses and Dissertations
Three new generalized distributions developed via completing risk, gamma generator, Marshall-Olkin generator and exponentiation techniques are proposed and studied. Structural properties including quantile functions, hazard rate functions, moment, conditional moments, mean deviations, R\'enyi entropy, distribution of order statistics and maximum likelihood estimates are presented. Monte Carlo simulation is employed to examine the performance of the proposed distributions. Applications of the generalized distributions to real lifetime data are presented to illustrate the usefulness of the models.
A New Right Tailed Test Of The Ratio Of Variances, Elizabeth Rochelle Lesser
A New Right Tailed Test Of The Ratio Of Variances, Elizabeth Rochelle Lesser
UNF Graduate Theses and Dissertations
It is important to be able to compare variances efficiently and accurately regardless of the parent populations. This study proposes a new right tailed test for the ratio of two variances using the Edgeworth’s expansion. To study the Type I error rate and Power performance, simulation was performed on the new test with various combinations of symmetric and skewed distributions. It is found to have more controlled Type I error rates than the existing tests. Additionally, it also has sufficient power. Therefore, the newly derived test provides a good robust alternative to the already existing methods.
Niche-Based Modeling Of Japanese Stiltgrass (Microstegium Vimineum) Using Presence-Only Information, Nathan Bush
Niche-Based Modeling Of Japanese Stiltgrass (Microstegium Vimineum) Using Presence-Only Information, Nathan Bush
Masters Theses
The Connecticut River watershed is experiencing a rapid invasion of aggressive non-native plant species, which threaten watershed function and structure. Volunteer-based monitoring programs such as the University of Massachusetts’ OutSmart Invasives Species Project, Early Detection Distribution Mapping System (EDDMapS) and the Invasive Plant Atlas of New England (IPANE) have gathered valuable invasive plant data. These programs provide a unique opportunity for researchers to model invasive plant species utilizing citizen-sourced data. This study took advantage of these large data sources to model invasive plant distribution and to determine environmental and biophysical predictors that are most influential in dispersion, and to identify …
A Study Of The Parametric And Nonparametric Linear-Circular Correlation Coefficient, Robin Tu
A Study Of The Parametric And Nonparametric Linear-Circular Correlation Coefficient, Robin Tu
Statistics
Circular statistics are specialized statistical methods that deal specifically with directional data. Data that is angular require specialized techniques due to the modulo 2π (in radians) or modulo 360◦ (in degrees) nature of angles.
Correlation, typically in terms of Pearson’s correlation coefficient, is a measure of association between two linear random variables x and y. In this paper, the specific circular technique of the parametric and nonparametric linear-circular correlation coefficient will be explored where correlation is no longer between two linear variables x and y, but between a linear random variable x and circular random variable θ.
A simulation …
Using The R Library Rpanel For Gui-Based Simulations In Introductory Statistics Courses, Ryan M. Allison
Using The R Library Rpanel For Gui-Based Simulations In Introductory Statistics Courses, Ryan M. Allison
Statistics
As a student, I noticed that the statistical package R (http://www.r-project.org) would have several benefits of its usage in the classroom. One benefit to the package is its free and open-source nature. This would be a great benefit for instructors and students alike since it would be of no cost to use, unlike other statistical packages. Due to this, students could continue using the program after their statistical courses and into their professional careers. It would be good to expose students while they are in school to a tool that professionals use in industry. R also has powerful …
Software Internationalization: A Framework Validated Against Industry Requirements For Computer Science And Software Engineering Programs, John Huân Vũ
Master's Theses
View John Huân Vũ's thesis presentation at http://youtu.be/y3bzNmkTr-c.
In 2001, the ACM and IEEE Computing Curriculum stated that it was necessary to address "the need to develop implementation models that are international in scope and could be practiced in universities around the world." With increasing connectivity through the internet, the move towards a global economy and growing use of technology places software internationalization as a more important concern for developers. However, there has been a "clear shortage in terms of numbers of trained persons applying for entry-level positions" in this area. Eric Brechner, Director of Microsoft Development Training, suggested …