Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability Commons

Open Access. Powered by Scholars. Published by Universities.®

12,115 Full-Text Articles 18,591 Authors 3,366,482 Downloads 247 Institutions

All Articles in Statistics and Probability

Faceted Search

12,115 full-text articles. Page 4 of 368.

Structure-Activity Relationship Of Novel Diphenyl Ureas Targeting Mycobacterium, Piper Burghduf 2020 Grand Valley State University

Structure-Activity Relationship Of Novel Diphenyl Ureas Targeting Mycobacterium, Piper Burghduf

Student Scholars Day Posters

In 2017, the World Health Organization reported that 10 million people were infected with tuberculosis, 1.6 million of whom died. Tuberculosis is caused by a bacterium called Mycobacterium tuberculosis, which primarily infects an individual’s lungs. Unfortunately, failure to adhere to the long and arduous drug regimen has contributed to the emergence of antibiotic-resistant strains of M. tuberculosis. Therefore, the need for novel antibiotics is imperative to saving millions of lives. Our lab has recently developed a family of diphenyl ureas that exhibited increased antimicrobial activity toward Mycobacterium. Reported herein is the continuation of our previous research involving the ...


D-Vine Copula Model For Dependent Binary Data, Huihui Lin, N. Rao Chaganty 2020 Old Dominion University

D-Vine Copula Model For Dependent Binary Data, Huihui Lin, N. Rao Chaganty

College of Sciences Posters

High-dimensional dependent binary data are prevalent in a wide range of scientific disciplines. A popular method for analyzing such data is the Multivariate Probit (MP) model. But the MP model sometimes fails even within a feasible range of binary correlations, because the underlying correlation matrix of the latent variables may not be positive definite. In this research, we proposed pair copula models, assuming the dependence between the binary variables is first order autoregressive (AR(1))or equicorrelated structure. Also, when Archimediean copula is used, most paper converted Kendall Tau to corresponding copula parameter, there is no explicit function of Pearson ...


An Analysis Of Dredge Efficiency For Surfclam And Ocean Quahog Commercial Dredges, Leanne Poussard 2020 The University of Southern Mississippi

An Analysis Of Dredge Efficiency For Surfclam And Ocean Quahog Commercial Dredges, Leanne Poussard

Master's Theses

Between 1997 and 2011, The National Marine Fisheries Service conducted 50 depletion experiments to estimate survey gear efficiency and stock density for Atlantic surfclam (Spisula solidissima) and ocean quahog (Arctica islandica) populations using commercial hydraulic dredges. The Patch Model was formulated to estimate gear efficiency and organism density from the data. The range of efficiencies estimated is substantial, leading to uncertainty in the application of these estimates in stock assessment. Analysis of depletion experiment simulations showed that uncertainty in the estimates of gear efficiency from depletion experiments was reduced by higher numbers of dredge tows per experiment, more tow overlap ...


How Data Is Changing The World Of Healthcare, Cameron Marous 2020 Ohio Northern University

How Data Is Changing The World Of Healthcare, Cameron Marous

Honors Capstone Enhancement Presentations

No abstract provided.


Group Testing Identification: Objective Functions, Implementation, And Multiplex Assays, Brianna D. Hitt 2020 University of Nebraska-Lincoln

Group Testing Identification: Objective Functions, Implementation, And Multiplex Assays, Brianna D. Hitt

Dissertations and Theses in Statistics

Group testing is the process of combining items into groups to test for a binary characteristic. One of its most widely used applications is infectious disease testing. In this context, specimens (e.g., blood, urine) are amalgamated into groups and tested. For groups that test positive, there are many algorithmic retesting procedures available to identify positive individuals. The appeal of group testing is that the overall number of tests needed is significantly less than for individual testing when disease prevalence is small and an appropriate algorithm is chosen. Group testing has a number of applications beyond infectious disease testing, such ...


Boom Or Bust: Examining The Relationship Between High School Recruiting Rankings And The Nfl Draft, Nicholas E. Tice 2020 University of South Carolina

Boom Or Bust: Examining The Relationship Between High School Recruiting Rankings And The Nfl Draft, Nicholas E. Tice

Senior Theses

The goal of this thesis is to model the probability of a high school football player’s chance of being drafted based on information taken from their recruiting profile. The response variable is binary and defined as drafted (1) or undrafted (0). The independent variables were collected by scraping data from the recruiting websites including height, weight, position, hometown, recruiting grade and other socioeconomic factors based on the player’s high school. 247Sports and ESPN were the two recruiting services used and compared in this study. Because of the binary nature of the dependent variable, logistic regression and decision trees ...


Alarm Forecasting In Natural Gas Pipelines, Colin Quinn 2020 Marquette University

Alarm Forecasting In Natural Gas Pipelines, Colin Quinn

Master's Theses (2009 -)

This thesis examines alarm forecasting methods for a natural gas production pipeline to assure the efficient transportation of high-quality natural gas. Natural gas production companies use pipelines to transport natural gas from the extraction well to a distribution point. Forecasting natural gas pipeline pressure alarms helps control room operators maintain a functioning pipeline and avoid costly down time. As gas enters the pipeline and travels to the distribution point, it is expected that the gas meets certain specifications set in place by either state law or the customer receiving the gas. If the gas meets these standards and is accepted ...


Using Zero-Inflated Poisson Model And Zero-Inflated Negative Binomial Model On Dental Services Of Wisconsin, 2014 Data, Ke Xu 2020 Marquette University

Using Zero-Inflated Poisson Model And Zero-Inflated Negative Binomial Model On Dental Services Of Wisconsin, 2014 Data, Ke Xu

Master's Theses (2009 -)

Professional dental care to ensure optimum oral health of public plays an important role in the public health system. Facing the truth that there has been decline in dental care utilization for decade in 20th century, more and more attention has been paid to the oral health of children from this century. Children from age 0 to 21 years old experience rapid physical and oral development. Investigating the utilization of dental service for these children will provide useful information for the future study of the insurance system. In this thesis, two regression methods will be studied, the Zero-Inflated Poisson model ...


Exact Distribution Of Linkage Disequilibrium In The Presence Of Mutation, Selection, Or Minor Allele Frequency Filtering, Jiayi Qu, Stephen D. Kachman, Dorian Garrick, Rohan L. Fernando, Hao Cheng 2020 University of California, Davis

Exact Distribution Of Linkage Disequilibrium In The Presence Of Mutation, Selection, Or Minor Allele Frequency Filtering, Jiayi Qu, Stephen D. Kachman, Dorian Garrick, Rohan L. Fernando, Hao Cheng

Faculty Publications, Department of Statistics

Linkage disequilibrium (LD), often expressed in terms of the squared correlation (r2) between allelic values at two loci, is an important concept in many branches of genetics and genomics. Genetic drift and recombination have opposite effects on LD, and thus r2 will keep changing until the effects of these two forces are counterbalanced. Several approximations have been used to determine the expected value of r2 at equilibrium in the presence or absence of mutation. In this paper, we propose a probability-based approach to compute the exact distribution of allele frequencies at two loci in a finite population at any generation ...


Investigating Major League Baseball Pitchers And Quality Of Contact Through Cluster Analysis, Charlie Marcou 2020 Grand Valley State University

Investigating Major League Baseball Pitchers And Quality Of Contact Through Cluster Analysis, Charlie Marcou

Honors Projects

This paper investigates the quality of contact that a pitcher allows. Not much is currently known about quality of contact, but if factors determining quality of contact could be determined it could assist teams in identifying and developing pitching talent. There are many problems that come with investigating the control pitchers have over contact allowed, but one area to investigate is whether quality of contact is a repeatable skill. Furthermore, if it is a repeatable skill, then it is important to investigate what kind of benefit controlling contact allowed brings a pitcher. Along with this, groundball and flyball tendencies, and ...


Using Alteryx Designer In Audit, Nolan Asiala 2020 Grand Valley State University

Using Alteryx Designer In Audit, Nolan Asiala

Honors Projects

My senior project was built around data analysis and how it relates to the auditing profession. Initially, I was planning on attending a data analytics competition, but that was canceled due to the events of COVID-19. This project utilized the Alteryx Designer program to demonstrate how it can be used during an audit engagement. By creating a workflow in Alteryx Designer, a report from a client can be cleaned and reformatted into a working dataset. My project includes two Excel files, a Microsoft Word document that serves as a brief introduction to the program, and a video describing the workflow ...


Dice Questions Answered, Warren Campbell, William P. Dolan 2020 Civil Engineering

Dice Questions Answered, Warren Campbell, William P. Dolan

SEAS Faculty Publications

Superstitious discussion of fair and unfair dice has pervaded the tabletop gaming industry since its inception. Many of these are not based on any quantitative data or studies. Consequently, misconceptions have been spread widely. One dice float test video on Youtube currently has 925,000 views (Fisher, 2015a). To combat the flood of misconceptions we investigated the following questions: 1) Are dice cursed? 2) Are D20s (20-sided dice) less fair than D6s (6-sided dice)? 3) Do float tests tell anything about the fairness of dice? 4) Are some dice systems inherently fairer than others? 5) Are density differences or dimensions ...


Rmse-Minimizing Confidence Intervals For The Binomial Parameter, Kexin Feng 2020 William & Mary

Rmse-Minimizing Confidence Intervals For The Binomial Parameter, Kexin Feng

Undergraduate Honors Theses

Let X1, X2, . . . , Xn be independent and identically distributed Bernoulli(p) random variables with unknown parameter p satisfying 0 < p < 1. Let X = Pn i=1 Xi be the number of successes in the n mutually independent Bernoulli trials. The maximum likelihood estimator of p is ˆp = X/n. For fixed n and α, there are n + 1 distinct 100(1 − α)% confidence intervals associated with X = 0, 1, 2, . . . , n. Currently there is no known exact confidence interval for p. Our goal is to construct the confidence interval for p whose actual coverage is closest to the stated coverage, using the root mean squared error, RMSE, to measure the difference between the actual coverage and the stated coverage. The approximate confidence interval for p developed here minimizes the RMSE for a sample size n and a significance level α.


On The Authentic Notion, Relevance, And Solution Of The Jeffreys-Lindley Paradox In The Zettabyte Era, Miodrag M. Lovric 2020 Radford University

On The Authentic Notion, Relevance, And Solution Of The Jeffreys-Lindley Paradox In The Zettabyte Era, Miodrag M. Lovric

Journal of Modern Applied Statistical Methods

The Jeffreys-Lindley paradox is the most quoted divergence between the frequentist and Bayesian approaches to statistical inference. It is embedded in the very foundations of statistics and divides frequentist and Bayesian inference in an irreconcilable way. This paradox is the Gordian Knot of statistical inference and Data Science in the Zettabyte Era. If statistical science is ready for revolution confronted by the challenges of massive data sets analysis, the first step is to finally solve this anomaly. For more than sixty years, the Jeffreys-Lindley paradox has been under active discussion and debate. Many solutions have been proposed, none entirely satisfactory ...


Conflicts In Bayesian Statistics Between Inference Based On Credible Intervals And Bayes Factors, Miodrag M. Lovric 2020 Radford University

Conflicts In Bayesian Statistics Between Inference Based On Credible Intervals And Bayes Factors, Miodrag M. Lovric

Journal of Modern Applied Statistical Methods

In frequentist statistics, point-null hypothesis testing based on significance tests and confidence intervals are harmonious procedures and lead to the same conclusion. This is not the case in the domain of the Bayesian framework. An inference made about the point-null hypothesis using Bayes factor may lead to an opposite conclusion if it is based on the Bayesian credible interval. Bayesian suggestions to test point-nulls using credible intervals are misleading and should be dismissed. A null hypothesized value may be outside a credible interval but supported by Bayes factor (a Type I conflict), or contrariwise, the null value may be inside ...


Nanoindentation Characterization Of Elastic Properties Of Shales And Swelling Clay Minerals, Shengmin Luo 2020 University of Massachusetts Amherst

Nanoindentation Characterization Of Elastic Properties Of Shales And Swelling Clay Minerals, Shengmin Luo

Doctoral Dissertations

Oil and gas shales are a class of multiscale, multiphase, hybrid inorganic-organic sedimentary rocks that consist of a generally uniform, preferentially oriented clay matrix with randomly embedded silt and sand particles as solid inclusions. A thorough understanding of the mechanical properties of shales is crucial for the exploration and production of oil and gas in the unconventional shale reservoirs, but it can be a challenging task due to their nature of compositional heterogeneity and microstructural anisotropy. In efforts to better characterize the mechanical properties of shales across different length scales and to fundamentally understand the laws of upscaling from individual ...


Ground Weather Radar Signal Characterization Through Application Of Convolutional Neural Networks, Stephen M. Lee 2020 Air Force Institute of Technology

Ground Weather Radar Signal Characterization Through Application Of Convolutional Neural Networks, Stephen M. Lee

Theses and Dissertations

The 45th Weather Squadron supports the space launch efforts out of the Kennedy Space Center and Cape Canaveral Air Force Station for the Department of Defense, NASA, and commercial customers through weather assessments. Their assessment of the Lightning Launch Commit Criteria (LLCC) for avoidance of natural and rocket triggered lightning to launch vehicles is critical in approving space shuttle and rocket launches. The LLCC includes standards for cloud formations, which requires proper cloud identification and characterization methods. Accurate reflectivity measurements for ground weather radar are important to meet the LLCC for rocket triggered lightning. Current linear interpolation methods for ground ...


Conceptualization And Application Of Deep Learning And Applied Statistics For Flight Plan Recommendation, Nicholas C. Forrest 2020 Air Force Institute of Technology

Conceptualization And Application Of Deep Learning And Applied Statistics For Flight Plan Recommendation, Nicholas C. Forrest

Theses and Dissertations

The Air Forces Pilot Training Next (PTN) program seeks a more efficient pilot training environment emphasizing the use of virtual reality flight simulators alongside periodic real aircraft experience. The PTN program wants to accelerate the training pace and progress in undergraduate pilot training compared to traditional undergraduate pilot training. Currently, instructor pilots spend excessive time planning and scheduling flights. This research focuses on methods to auto-generate the planning of in-flight events using hybrid filtering and deep learning techniques. The resulting approach captures temporal trends of user-specific and program-wide student performance to recommend a feasible set of graded flight events for ...


An Analysis Of A Lighting Prediction Threshold For 45th Weather Squadron Electric Field Mill Data, Charles A. Skrovan 2020 Air Force Institute of Technology

An Analysis Of A Lighting Prediction Threshold For 45th Weather Squadron Electric Field Mill Data, Charles A. Skrovan

Theses and Dissertations

The mission of the 45th Weather Squadron (45 WS) is to “exploit the weather to assure safe access to air and space” for Patrick Air Force Base, Cape Canaveral Air Force Station (CCAFS), and Kennedy Space Center (KSC) in support of various operations (United States Air Force, n.d.). To support that mission the 45 WS hosts a suite of weather detection instruments that include a lightning warning system that consists of an array of 31 electric field mills (EFM) and a lightning detection and ranging system (Department of the Air Force, 1976). Electric field mills at Cape Canaveral continuously ...


Sampling The Porridge: A Comparison Of Ordered Variable Regression With F And R2 And Multiple Linear Regression With Corrected F And R2 In The Presence Of Multicollinearity, Grayson L. Baird, Stephen L. Bieber 2020 Brown University

Sampling The Porridge: A Comparison Of Ordered Variable Regression With F And R2 And Multiple Linear Regression With Corrected F And R2 In The Presence Of Multicollinearity, Grayson L. Baird, Stephen L. Bieber

Journal of Modern Applied Statistical Methods

Differences between the multiple linear regression model with Corrected R2 and Corrected F and the ordered variable regression model with R2 and F when intercorrelation is present are illustrated with simulated and real-world data.


Digital Commons powered by bepress