Open Access. Powered by Scholars. Published by Universities.®

Statistical Models Commons

Open Access. Powered by Scholars. Published by Universities.®

1,202 Full-Text Articles 1,796 Authors 680,874 Downloads 136 Institutions

All Articles in Statistical Models

Faceted Search

1,202 full-text articles. Page 1 of 45.

Classification Of Breast Cancer Histopathological Images Using Semi-Supervised Gans, Balaji Avvaru, Nibhrat Lohia, Sowmya Mani, Vijayasrikanth kaniti 2022 Southern Methodist University

Classification Of Breast Cancer Histopathological Images Using Semi-Supervised Gans, Balaji Avvaru, Nibhrat Lohia, Sowmya Mani, Vijayasrikanth Kaniti

SMU Data Science Review

Breast cancer is diagnosed more frequently than skin cancer in women in the United States. Most breast cancer cases are diagnosed in women, while children and men are less likely to develop the disease. Various tissues in the breast grow uncontrollably, resulting in breast cancer. Different treatments analyze microscopic histopathology images for diagnosis that help accurately detect cancer cells. Deep learning is one of the evolving techniques to classify images where accuracy depends on the volume and quality of labeled images. This study used various pre-trained models to train the histopathological images and analyze these models to create a new ...


Predicting Insulin Pump Therapy Settings, Riccardo L. Ferraro, David Grijalva, Alex Trahan 2022 Southern Methodist University & Tandem Diabetes Care, Inc

Predicting Insulin Pump Therapy Settings, Riccardo L. Ferraro, David Grijalva, Alex Trahan

SMU Data Science Review

Millions of people live with diabetes worldwide [7]. To mitigate some of the many symptoms associated with diabetes, an estimated 350,000 people in the United States rely on insulin pumps [17]. For many of these people, how effectively their insulin pump performs is the difference between sleeping through the night and a life threatening emergency treatment at a hospital. Three programmed insulin pump therapy settings governing effective insulin pump function are: Basal Rate (BR), Insulin Sensitivity Factor (ISF), and Carbohydrate Ratio (ICR). For many people using insulin pumps, these therapy settings are often not correct, given their physiological needs ...


Application Of Probabilistic Ranking Systems On Women’S Junior Division Beach Volleyball, Cameron Stewart, Michael Mazel, Bivin Sadler 2022 Southern Methodist University

Application Of Probabilistic Ranking Systems On Women’S Junior Division Beach Volleyball, Cameron Stewart, Michael Mazel, Bivin Sadler

SMU Data Science Review

Women’s beach volleyball is one of the fastest growing collegiate sports today. The increase in popularity has come with an increase in valuable scholarship opportunities across the country. With thousands of athletes to sort through, college scouts depend on websites that aggregate tournament results and rank players nationally. This project partnered with the company Volleyball Life, who is the current market leader in the ranking space of junior beach volleyball players. Utilizing the tournament information provided by Volleyball Life, this study explored replacements to the current ranking systems, which are designed to aggregate player points from recent tournament placements ...


Regression-Based Methods For Dynamic Treatment Regimes With Mismeasured Covariates Or Misclassified Response, Dan Liu 2022 The University of Western Ontario

Regression-Based Methods For Dynamic Treatment Regimes With Mismeasured Covariates Or Misclassified Response, Dan Liu

Electronic Thesis and Dissertation Repository

The statistical study of dynamic treatment regimes (DTRs) focuses on estimating sequential treatment decision rules tailored to patient-level information across multiple stages of intervention. Regression-based methods in DTR have been studied in the literature with a critical assumption that all the observed variables are precisely measured. However, this assumption is often violated in many applications. One example is the STAR*D study, in which the patient's depressive score is subject to measurement error. In this thesis, we explore problems in the context of DTR with measurement error or misclassification considered in the observed data.

The first project deals with ...


Between “Breaking” And “Building”: The Bridge Theory Of Research Evaluation, Fang XU, Xiaoxuan LI 2022 Institutes of Science and Development, Chinese Academy of Sciences, Beijing 100190, China School of Public Policy and Management, University of Chinese Academy of Sciences, Beijing 100049, China

Between “Breaking” And “Building”: The Bridge Theory Of Research Evaluation, Fang Xu, Xiaoxuan Li

Bulletin of Chinese Academy of Sciences (Chinese Version)

How to build "new standards" after breaking "Siwei" is a hot and difficult issue in the current reform of research evaluation, which urgently needs good theoretical and methodological support. In this context, this study puts forward the BRIDGE theory of research evaluation of scientific researchers' achievements, which is to integrate the reasonable elements in the quantitative evaluation based on SCI papers into the "new standard" based on peer review, so as to build a bridge between quantitative analysis and qualitative evaluation. The practical application of BRIDGE theory is expressed as "Six Steps", in which the second step "Recode" and the ...


Exploring Human-Caused Fire Occurrence Prediction, Ruyi Jin 2022 Western University

Exploring Human-Caused Fire Occurrence Prediction, Ruyi Jin

Undergraduate Student Research Internships Conference

Wildland Fire Science has become an increasingly hot topic in recent years. The goal of this report is to investigate human-caused wildland fire occurrence prediction. The two main predictors of interest are the mean value of the Fine Fuel Moisture Code (FFMC) and the month when a fire ignites. An Exploratory Data Analysis is presented first, after which we fit models to predict daily fire counts. We first consider Poisson models to fit the count data, but also attempt to fit Negative Binomial models to deal with overdispersion. We compare these models in the following ways: plotting the difference in ...


An Analysis Of Weighted Least Squares Monte Carlo, Xiaotian Zhu 2022 The University of Western Ontario

An Analysis Of Weighted Least Squares Monte Carlo, Xiaotian Zhu

Electronic Thesis and Dissertation Repository

Since Longstaff and Schwartz [2001] brought the amazing Regression-based Monte Carlo (LSMC) method in pricing American options, it has received heated discussion. Based on the research done by Fabozzi et al. [2017] that applies the heteroscedasticity correction method to LSMC, we further extend the study by introducing the methods from Park [1966] and Harvey [1976]. Our work shows that for a single stock American Call option modelled by GBM with two exercise opportunities, WLSMC or IRLSMC provides better estimates in continuation value than LSMC. However, they do not lead to better exercise decisions and hence have little to no effect ...


A Transformer-Based Classification System For Volcanic Seismic Signals, Anthony P. Rinaldi, Cindy Mora Stock, Cristián Bravo Roman, Alexander Hemming 2022 Western University

A Transformer-Based Classification System For Volcanic Seismic Signals, Anthony P. Rinaldi, Cindy Mora Stock, Cristián Bravo Roman, Alexander Hemming

Undergraduate Student Research Internships Conference

Monitoring volcanic events as they occur is a task that, to this day, requires significant human capital. The current process requires geologists to monitor seismographs around the clock, making it extremely labour-intensive and inefficient. The ability to automatically classify volcanic events as they happen in real-time would allow for quicker responses to these events by the surrounding communities. Timely knowledge of the type of event that is occurring can allow these surrounding communities to prepare or evacuate sooner depending on the magnitude of the event. Up until recently, not much research has been conducted regarding the potential for machine learning ...


Bias-Corrected Bagging In Active Learning With An Actuarial Application, Yangxuan Xu 2022 Western University

Bias-Corrected Bagging In Active Learning With An Actuarial Application, Yangxuan Xu

Undergraduate Student Research Internships Conference

The variable annuity (VA) is a modern insurance product that offers certain guaranteed protection and tax-deferred treatment. Because of the inherent complexity of guarantees’ payoff, the closed-form solution of fair market values (FMVs) is often not available. Most insurance companies depend on Monte Carlo (MC) simulation to price the FMVs of these products, which is an extremely computational intensive and time-consuming approach. The metamodeling approach can be used to circumvent the heavy computation.

In the modeling stage, the bagged tree method has proved to outperform other parametric approaches. Also, a bias-corrected (BC) bagging model was tried and showed significant improvement ...


Investigating Distributions Of Epochs In Wildland Fire Lifetimes, Xinlei Wang 2022 Western University

Investigating Distributions Of Epochs In Wildland Fire Lifetimes, Xinlei Wang

Undergraduate Student Research Internships Conference

The objective of my research project is to explore the relationship between variables related to wildland fire and to model distributions of epochs in wildland fire lifetimes. Several distributional families are considered for modeling these epochs, including the exponential distribution, gamma distribution, Weibull distribution and continuous phase-type distribution. I explain each of these distributions in short terms and illustrate how they are fit. Visual results of my exploratory data analysis are illustrated in two parts, data visualization and data modeling, along with my interpretation of each. Since this work is preliminary, I conclude the report with a discussion on what ...


The Q-Analogue Of The Extended Generalized Gamma Distribution, Wenhao Chen 2022 Western University

The Q-Analogue Of The Extended Generalized Gamma Distribution, Wenhao Chen

Undergraduate Student Research Internships Conference

This project introduces a flexible univariate probability model referred to as the q-analogue of the Extended Generalized Gamma (or q-EGG) distribution, which encompasses the majority of the most frequently used continuous distributions, including the gamma, Weibull, logistic, type-1 and type-2 beta, Gaussian, Cauchy, Student-t and F. Closed form representations of its moments and cumulative distribution function are provided. Additionally, computational techniques are proposed for determining estimates of its parameters. Both the method of moments and the maximum likelihood approach are utilized. The effect of each parameter is also graphically illustrated. Certain data sets are modeled with q-EGG distributions; goodness of ...


Investigation Of Key Factors To Earthquake Insurance Take-Up Rates In Quebec And British Columbia Households And Prediction Model Building, Yongcheng Jiang 2022 Western University

Investigation Of Key Factors To Earthquake Insurance Take-Up Rates In Quebec And British Columbia Households And Prediction Model Building, Yongcheng Jiang

Undergraduate Student Research Internships Conference

Maintaining an adequate level of earthquake take-up rate could protect the insurance industry from systemic failure. Past research has shown that British Columbia and Quebec have significant differences in earthquake insurance take-up rate. This report investigates key factors from the structure (default options and various types) of the insurance plan and personal characteristics along with socioeconomic/demographic profiles that affect the demand for earthquake protection in the form of insurance. The report also provides a prediction model for earthquake insurance take-up rate. The results show an importance ranking of key factors of earthquake insurance take up, the most important three ...


Functional Structure Of Excess Return And Volatility, Chenxi Zhao 2022 Western University

Functional Structure Of Excess Return And Volatility, Chenxi Zhao

Undergraduate Student Research Internships Conference

Capturing the relation between excess returns and volatility can help making better decisions in the stock market in terms of portfolio allocation and assets risk management. This paper takes the data of a minute-by-minute series of S&P500 from January 2009 to January 2021 as the research object and explores the best structural representation for the excess return as a function of the volatility, for a well-known index. This is implemented via regression models for volatility and excess returns. The results reveal that there’s a structural break in the relationship between the excess return and volatility based on the ...


Practical T-Test Power Analysis With R, Teck Kiang Tan 2022 National University of Singapore

Practical T-Test Power Analysis With R, Teck Kiang Tan

Practical Assessment, Research, and Evaluation

Power analysis based on the analytical t-test is an important aspect of a research study to determine the sample size required to detect the effect for the comparison of two means. The current paper presents a reader-friendly procedure for carrying out the t-test power analysis using the various R add-on packages. While there is a growing of R users in the academic that uses R as the base for carrying out research, there is a lack of reference that discusses both frequentist and Bayesian approaches and point out their distinct features for t-test power analysis. The practical aspects of the ...


Exploration In Mental Performance For Division 1 Sec College Football Student Athletes, Alex Burgdorf 2022 Nova Southeastern University

Exploration In Mental Performance For Division 1 Sec College Football Student Athletes, Alex Burgdorf

Department of Occupational Therapy Entry-Level Capstone Projects

The stigma surrounding mental health in sports has made intervention difficult. “There is a need for various actors to provide more effective strategies to overcome the stigma that surrounds mental illness, increase mental health literacy in the athlete/coach community, and address athlete-specific barriers to seeking treatment for mental illness” (Castadelli-Maia et.al 2019). The athletes in the football program at the University of Tennessee face more pressure today than ever in history. They have their class schedule, practice and training every day, and meetings with their position coaches. Now, with the introduction of name, image, and likeness (NIL) allowing ...


Dynamic Prediction For Alternating Recurrent Events Using A Semiparametric Joint Frailty Model, Jaehyeon Yun 2022 Southern Methodist University

Dynamic Prediction For Alternating Recurrent Events Using A Semiparametric Joint Frailty Model, Jaehyeon Yun

Statistical Science Theses and Dissertations

Alternating recurrent events data arise commonly in health research; examples include hospital admissions and discharges of diabetes patients; exacerbations and remissions of chronic bronchitis; and quitting and restarting smoking. Recent work has involved formulating and estimating joint models for the recurrent event times considering non-negligible event durations. However, prediction models for transition between recurrent events are lacking. We consider the development and evaluation of methods for predicting future events within these models. Specifically, we propose a tool for dynamically predicting transition between alternating recurrent events in real time. Under a flexible joint frailty model, we derive the predictive probability of ...


Advanced High Dimensional Regression Techniques, Yuan Yang 2022 Clemson University

Advanced High Dimensional Regression Techniques, Yuan Yang

All Dissertations

This dissertation focuses on developing high dimensional regression techniques to analyze large scale data using both Bayesian and frequentist approaches, motivated by data sets from various disciplines, such as public health and genetics. More specifically, Chapters 2 and Chapter 4 take a Bayesian approach to achieve modeling and parameter estimation simultaneously while Chapter 3 takes a frequentist approach. The main aspects of these techniques are that they perform variable selection and parameter estimation simultaneously, while also being easily adaptable to large-scale data. In particular, by embedding a logistic model into traditional spike and slab framework and selecting of proper prior ...


New Developments On The Estimability And The Estimation Of Phase-Type Actuarial Models, Cong Nie 2022 The University of Western Ontario

New Developments On The Estimability And The Estimation Of Phase-Type Actuarial Models, Cong Nie

Electronic Thesis and Dissertation Repository

This thesis studies the estimability and the estimation methods for two models based on Markov processes: the phase-type aging model (PTAM), which models the human aging process, and the discrete multivariate phase-type model (DMPTM), which can be used to model multivariate insurance claim processes.

The principal contributions of this thesis can be categorized into two areas. First, an objective measure of estimability is proposed to quantify estimability in the context of statistical models. Existing methods for assessing estimability require the subjective specification of thresholds, which potentially limits their usefulness. Unlike these methods, the proposed measure of estimability is objective. In ...


The Short-Term Effects Of Fine Airborne Particulate Matter And Climate On Covid-19 Disease Dynamics, El Hussain Shamsa, Kezhong Zhang 2022 Wayne State University

The Short-Term Effects Of Fine Airborne Particulate Matter And Climate On Covid-19 Disease Dynamics, El Hussain Shamsa, Kezhong Zhang

Medical Student Research Symposium

Background: Despite more than 60% of the United States population being fully vaccinated, COVID-19 cases continue to spike in a temporal pattern. These patterns in COVID-19 incidence and mortality may be linked to short-term changes in environmental factors.

Methods: Nationwide, county-wise measurements for COVID-19 cases and deaths, fine-airborne particulate matter (PM2.5), and maximum temperature were obtained from March 20, 2020 to March 20, 2021. Multivariate Linear Regression was used to analyze the association between environmental factors and COVID-19 incidence and mortality rates in each season. Negative Binomial Regression was used to analyze daily fluctuations of COVID-19 cases and ...


Adjusting Community Survey Data Benchmarks For External Factors, Allen Miller, Nicole M. Norelli, Robert Slater, Mingyang N. Yu 2022 Southern Methodist University

Adjusting Community Survey Data Benchmarks For External Factors, Allen Miller, Nicole M. Norelli, Robert Slater, Mingyang N. Yu

SMU Data Science Review

Abstract. Using U.S. resident survey data from the National Community Survey in combination with public data from the U.S. Census and additional sources, a Voting Regressor Model was developed to establish fair benchmark values for city performance. These benchmarks were adjusted for characteristics the city cannot easily influence that contribute to confidence in local government, such as population size, demographics, and income. This adjustment allows for a more meaningful comparison and interpretation of survey results among individual cities. Methods explored for the benchmark adjustment included cluster analysis, anomaly detection, and a variety of regression techniques, including random forest ...


Digital Commons powered by bepress