Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics Commons

Open Access. Powered by Scholars. Published by Universities.®

3,520 Full-Text Articles 4,903 Authors 2,834,925 Downloads 168 Institutions

All Articles in Applied Statistics

Faceted Search

3,520 full-text articles. Page 1 of 107.

Multi-Type Branching Processes In Time-Varying Environments, Arash Jamshidpey 2024 Columbia University

Multi-Type Branching Processes In Time-Varying Environments, Arash Jamshidpey

Biology and Medicine Through Mathematics Conference

No abstract provided.


An Improved Bayesian Pick-The-Winner (Ibpw) Design For Randomized Phase Ii Clinical Trials, Wanni Lei, Maosen Peng, Xi K. Zhou 2024 Weill Cornell Medicine

An Improved Bayesian Pick-The-Winner (Ibpw) Design For Randomized Phase Ii Clinical Trials, Wanni Lei, Maosen Peng, Xi K. Zhou

COBRA Preprint Series

Phase II clinical trials play a pivotal role in drug development by screening a large number of drug candidates to identify those with promising preliminary efficacy for phase III testing. Trial designs that enable efficient decision-making with small sample sizes and early futility stopping while controlling for type I and II errors in hypothesis testing, such as Simon’s two-stage design, are preferred. Randomized multi-arm trials are increasingly used in phase II settings to overcome the limitations associated with using historical controls as the reference. However, how to effectively balance efficiency and accurate decision-making continues to be an important research topic. …


A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang 2024 Chapman University

A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang

Computational and Data Sciences (PhD) Dissertations

This research introduces an analytical improvement to the Multivariate Ljung-Box test that addresses significant deviations of the original test from the nominal Type I error rates under almost all scenarios. Prior attempts to mitigate this issue have been directed at modification of the test statistics or correction of the test distribution to achieve precise results in finite samples. In previous studies, focused on designing corrections to the univariate Ljung-Box, a method that specifically adjusts the test rejection region has been the most successful of attaining the best Type I error rates. We adopt the same approach for the more complex, …


"Who Wrote The Epistle, God Only Knows": A Statistical Authorial Analysis Of Hebrews In Comparison With Pauline And Lukan Literature, Benjamin J. Erickson 2024 Liberty University

"Who Wrote The Epistle, God Only Knows": A Statistical Authorial Analysis Of Hebrews In Comparison With Pauline And Lukan Literature, Benjamin J. Erickson

Senior Honors Theses

The authorship of Hebrews has been a point of contention for scholars for the past two millennia. While the epistle is traditionally attributed to Paul, many scholars assert that it carries thematic, structural, and stylistic differences from the remainder of his extant epistles; therefore, many other possible authors have been proposed. Of these, only Luke has other New Testament writings. Therefore, this project conducts a statistical comparison of Hebrews to the Pauline and Lukan corpora using stylometric authorial analysis methods. This analysis demonstrates that Hebrews is stylistically closer to Lukan literature than Pauline (but not to a significant degree), and …


The Relationship Between Fatalities In Police Violence And Their Identifying Characteristics: Age, Gender, Race, And Region, Yuechu Hu 2024 University of Minnesota - Morris

The Relationship Between Fatalities In Police Violence And Their Identifying Characteristics: Age, Gender, Race, And Region, Yuechu Hu

Undergraduate Research Symposium 2024

Police violence, highlighted by the George Floyd incident in 2020, has intensified concerns about police brutality and perceived racism in U.S. law enforcement (AP News, 2022). Therefore, we intend to analyze Fatal Encounters data, which documents non-police deaths that occur in the presence of the police in the United States. By creating statistical tables and graphs, as well as applying time-series methods, classification and regression trees, and a multinomial logistic regression model, we find that males and transgender people are more likely than females to encounter victimization during police brutality enforcement for any cause of death. Victims older than 19 …


Application And Effectiveness Of Artificial Intelligence For The Border Management Of Imported Frozen Fish In Taiwan, Wen-Chin Tu, Wan-Ling Tsai, Chi-Hao Lee, Chia-Fen Tsai, Jen-Ting Wei, King-Fu Lin, Shou-Mei Wu, Yih-Ming Weng 2024 Food and Drug Administration, Ministry of Health and Welfare, Taipei, Taiwan

Application And Effectiveness Of Artificial Intelligence For The Border Management Of Imported Frozen Fish In Taiwan, Wen-Chin Tu, Wan-Ling Tsai, Chi-Hao Lee, Chia-Fen Tsai, Jen-Ting Wei, King-Fu Lin, Shou-Mei Wu, Yih-Ming Weng

Journal of Food and Drug Analysis

In Taiwan, the number of applications for inspecting imported food has grown annually and noncompliant products must be accurately detected in these border sampling inspections. Previously, border management has used an automated border inspection system (import food inspection (IFI) system) to select batches via a random sampling method to manage the risk levels of various food products complying with regulatory inspection procedures. Several countries have implemented artificial intelligence (AI) technology to improve domestic governmental processes, social service, and public feedback. AI technologies are applied in border inspection by the Taiwan Food and Drug Administration (TFDA). Risk management of border inspections …


Session 6: Model-Based Clustering Analysis On The Spatial-Temporal And Intensity Patterns Of Tornadoes, Yana Melnykov, Yingying Zhang, Rong Zheng 2024 University of Alabama - Tuscaloosa

Session 6: Model-Based Clustering Analysis On The Spatial-Temporal And Intensity Patterns Of Tornadoes, Yana Melnykov, Yingying Zhang, Rong Zheng

SDSU Data Science Symposium

Tornadoes are one of the nature’s most violent windstorms that can occur all over the world except Antarctica. Previous scientific efforts were spent on studying this nature hazard from facets such as: genesis, dynamics, detection, forecasting, warning, measuring, and assessing. While we want to model the tornado datasets by using modern sophisticated statistical and computational techniques. The goal of the paper is developing novel finite mixture models and performing clustering analysis on the spatial-temporal and intensity patterns of the tornadoes. To analyze the tornado dataset, we firstly try a Gaussian distribution with the mean vector and variance-covariance matrix represented as …


Session 6: The Size-Biased Lognormal Mixture With The Entropy Regularized Algorithm, Tatjana Miljkovic, Taehan Bae 2024 Miami University - Oxford

Session 6: The Size-Biased Lognormal Mixture With The Entropy Regularized Algorithm, Tatjana Miljkovic, Taehan Bae

SDSU Data Science Symposium

A size-biased left-truncated Lognormal (SB-ltLN) mixture is proposed as a robust alternative to the Erlang mixture for modeling left-truncated insurance losses with a heavy tail. The weak denseness property of the weighted Lognormal mixture is studied along with the tail behavior. Explicit analytical solutions are derived for moments and Tail Value at Risk based on the proposed model. An extension of the regularized expectation–maximization (REM) algorithm with Shannon's entropy weights (ewREM) is introduced for parameter estimation and variability assessment. The left-truncated internal fraud data set from the Operational Riskdata eXchange is used to illustrate applications of the proposed model. Finally, …


A Causal Inference Approach For Spike Train Interactions, Zach Saccomano 2024 The Graduate Center, City University of New York

A Causal Inference Approach For Spike Train Interactions, Zach Saccomano

Dissertations, Theses, and Capstone Projects

Since the 1960s, neuroscientists have worked on the problem of estimating synaptic properties, such as connectivity and strength, from simultaneously recorded spike trains. Recent years have seen renewed interest in the problem coinciding with rapid advances in experimental technologies, including an approximate exponential increase in the number of neurons that can be recorded in parallel and perturbation techniques such as optogenetics that can be used to calibrate and validate causal hypotheses about functional connectivity. This thesis presents a mathematical examination of synaptic inference from two perspectives: (1) using in vivo data and biophysical models, we ask in what cases the …


Statistical Consulting In Academia: A Review, Ke Xiao 2024 University of Windsor

Statistical Consulting In Academia: A Review, Ke Xiao

Major Papers

This paper reviews the state of statistical consulting in academia by performing a literature review on this topic in chapters 1 and 2. Chapter 1 overviews general aspects of statistical consulting and types of centers that conduct such services in academia. In Chapter 2 we summarise the literature about the common logistics and processes for conducting statistical consulting in academia. In Chapters 3 and 4, we analyze data on statistical consulting centers for the largest 100 universities in the USA. We also review the literature on the future of statistical consulting in academia in the era of big data and …


Formulating An Efficient Statistical Test Using The Goodness Of Fit Approach With Applications To Real-Life Data, S. A. Qaid, S. E. Abo Youssef Prof., Mahmoud Mansour 2024 Department of Mathematics, Faculty of Education, Abyan University, Abyan, Yemen

Formulating An Efficient Statistical Test Using The Goodness Of Fit Approach With Applications To Real-Life Data, S. A. Qaid, S. E. Abo Youssef Prof., Mahmoud Mansour

Basic Science Engineering

Statistical tests are very important for researchers to make decisions. In particular, when the tests are non-parametric, they are of greater importance because they can be applied to a wide range of data sets regardless of knowing the distribution of these data. Researchers are therefore racing to obtain efficient tests for making good decisions based on the results of these tests. In this study, NBU (2)L was used based on the goodness of fit approach to present an efficient statistical test. The efficiency of the proposed test was computed, and the results were compared to those of other tests. Critical …


Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe 2024 University of Central Florida

Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe

Data Science and Data Mining

Cyberbullying refers to the act of bullying using electronic means and the internet. In recent years, this act has been identifed to be a major problem among young people and even adults. It can negatively impact one’s emotions and lead to adverse outcomes like depression, anxiety, harassment, and suicide, among others. This has led to the need to employ machine learning techniques to automatically detect cyberbullying and prevent them on various social media platforms. In this study, we want to analyze the combination of some Natural Language Processing (NLP) algorithms (such as Bag-of-Words and TFIDF) with some popular machine learning …


Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe 2024 University of Central Florida

Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe

Data Science and Data Mining

This project estimates a regression model to predict the superconducting critical temperature based on variables extracted from the superconductor’s chemical formula. The regression model along with the stepwise variable selection gives a reasonable and good predictive model with a lower prediction error (MSE). Variables extracted based on atomic radius, valence, atomic mass and thermal conductivity appeared to have the most contribution to the predictive model.


Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen 2024 Wilfrid Laurier University

Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen

Theses and Dissertations (Comprehensive)

The complex nature of the human brain, with its intricate organic structure and multiscale spatio-temporal characteristics ranging from synapses to the entire brain, presents a major obstacle in brain modelling. Capturing this complexity poses a significant challenge for researchers. The complex interplay of coupled multiphysics and biochemical activities within this intricate system shapes the brain's capacity, functioning within a structure-function relationship that necessitates a specific mathematical framework. Advanced mathematical modelling approaches that incorporate the coupling of brain networks and the analysis of dynamic processes are essential for advancing therapeutic strategies aimed at treating neurodegenerative diseases (NDDs), which afflict millions of …


Applications Of Independent And Identically Distributed (Iid) Random Processes In Polarimetry And Climatology, Dan Kestner 2024 Michigan Technological University

Applications Of Independent And Identically Distributed (Iid) Random Processes In Polarimetry And Climatology, Dan Kestner

Dissertations, Master's Theses and Master's Reports

The unifying theme of this thesis is the characterization of “perfect randomness,” i.e., independent and identically distributed (IID) stochastic processes as these are applied in physical science. Two specific and mathematically distinct applications are chosen: (i) Radar and optical polarimetry; (ii) Analysis of time series in meteorology. In (i), IID process of a special kind, namely, with a distribution defined by symmetry, is used to link its multivariate Gaussian density to uniformity on the Poincaré sphere. This “statistical ellipsometry” approach is then used to relate polarimetric mismatches or imbalances to ellipsometric variables and suitably chosen cross-correlation measures. In (ii), recently …


Classification In Supervised Statistical Learning With The New Weighted Newton-Raphson Method, Toma Debnath 2024 Georgia Southern University

Classification In Supervised Statistical Learning With The New Weighted Newton-Raphson Method, Toma Debnath

Electronic Theses and Dissertations

In this thesis, the Weighted Newton-Raphson Method (WNRM), an innovative optimization technique, is introduced in statistical supervised learning for categorization and applied to a diabetes predictive model, to find maximum likelihood estimates. The iterative optimization method solves nonlinear systems of equations with singular Jacobian matrices and is a modification of the ordinary Newton-Raphson algorithm. The quadratic convergence of the WNRM, and high efficiency for optimizing nonlinear likelihood functions, whenever singularity in the Jacobians occur allow for an easy inclusion to classical categorization and generalized linear models such as the Logistic Regression model in supervised learning. The WNRM is thoroughly investigated …


The Distribution Of The Significance Level, Paul O. Monnu 2024 Georgia Southern University

The Distribution Of The Significance Level, Paul O. Monnu

Electronic Theses and Dissertations

Reporting the p-value is customary when conducting a test of hypothesis or significance. The likelihood of getting a fictitious second sample and presuming the null hypothesis is correct is the p-value. The significance level is a statistic that interests us to investigate. Being a statistic, it has a distribution. For the F-test in a one-way ANOVA and the t-tests for population means, we define the significance level, its observed value, and the observed significance level. It is possible to derive the significance level distribution. The t-test and the F-test are not without controversy. Specifically, we demonstrate that as sample size …


An Analysis Of Corporate Social Responsibility And Real Earnings Management, Rachel Brassine 2024 Marshall University

An Analysis Of Corporate Social Responsibility And Real Earnings Management, Rachel Brassine

Theses, Dissertations and Capstones

Real earnings management (REM) is costly in the form of intense loan restrictions, increased interest expense, and public scrutiny. Nevertheless, companies still practice REM. Based on agency and stakeholder theories, this research predicts that as a company’s CSR score increases, REM will decrease, and this association will become more negative when a critical mass of females on the board of directors exists and when a board-level CSR committee is present. This study also predicts that when a company offers an executive incentive plan based on CSR metrics, REM will decrease, and the relationship will become more negative with a critical …


Measuring The Performance Of Sdgs In Provincial Level Using Regional Sustainable Development Index, Nurafiza Thamrin, Ika Yuni Wulansari, Puguh Bodro Irawan 2023 BPS - Statistics Solok Regency

Measuring The Performance Of Sdgs In Provincial Level Using Regional Sustainable Development Index, Nurafiza Thamrin, Ika Yuni Wulansari, Puguh Bodro Irawan

Journal of Environmental Science and Sustainable Development

Measuring the national and sub-national progress in achieving such globally adopted development agendas as Sustainable Development Goals (SDGs) is particularly challenging due to data availability and compatibility of indicators to measure SDGs, especially in Indonesia. This paper attempts to measure the performance of sustainable development at the regional level in Indonesia by newly constructing a multidimensional composite index called the Regional Sustainable Development Index (RSDI). RSDI comprises four dimensions, covering comprehensive economic, social, environmental, and governance indicators. By applying factor analysis, the paper assesses the uncertainty of RSDI and the sensitivity of its composing indicators, then further investigates the relationship …


Reducing Food Scarcity: The Benefits Of Urban Farming, S.A. Claudell, Emilio Mejia 2023 Brigham Young University

Reducing Food Scarcity: The Benefits Of Urban Farming, S.A. Claudell, Emilio Mejia

Journal of Nonprofit Innovation

Urban farming can enhance the lives of communities and help reduce food scarcity. This paper presents a conceptual prototype of an efficient urban farming community that can be scaled for a single apartment building or an entire community across all global geoeconomics regions, including densely populated cities and rural, developing towns and communities. When deployed in coordination with smart crop choices, local farm support, and efficient transportation then the result isn’t just sustainability, but also increasing fresh produce accessibility, optimizing nutritional value, eliminating the use of ‘forever chemicals’, reducing transportation costs, and fostering global environmental benefits.

Imagine Doris, who is …


Digital Commons powered by bepress