Open Access. Powered by Scholars. Published by Universities.®
Articles 1 - 30 of 372
Full-Text Articles in Statistical Models
Session 6: The Size-Biased Lognormal Mixture With The Entropy Regularized Algorithm, Tatjana Miljkovic, Taehan Bae
SDSU Data Science Symposium
A size-biased left-truncated Lognormal (SB-ltLN) mixture is proposed as a robust alternative to the Erlang mixture for modeling left-truncated insurance losses with a heavy tail. The weak denseness property of the weighted Lognormal mixture is studied along with the tail behavior. Explicit analytical solutions are derived for moments and Tail Value at Risk based on the proposed model. An extension of the regularized expectation–maximization (REM) algorithm with Shannon's entropy weights (ewREM) is introduced for parameter estimation and variability assessment. The left-truncated internal fraud data set from the Operational Riskdata eXchange is used to illustrate applications of the proposed model. Finally, …
Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe
Data Science and Data Mining
Cyberbullying refers to the act of bullying using electronic means and the internet. In recent years, this act has been identified as a major problem among young people and even adults. It can negatively impact one’s emotions and lead to adverse outcomes like depression, anxiety, harassment, and suicide, among others. This has led to the need to employ machine learning techniques to automatically detect cyberbullying and prevent it on various social media platforms. In this study, we want to analyze the combination of some Natural Language Processing (NLP) algorithms (such as Bag-of-Words and TFIDF) with some popular machine learning …
Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe
Data Science and Data Mining
This project estimates a regression model to predict the superconducting critical temperature based on variables extracted from the superconductor’s chemical formula. The regression model, combined with stepwise variable selection, gives a reasonable predictive model with a lower prediction error (MSE). Variables extracted based on atomic radius, valence, atomic mass and thermal conductivity appeared to contribute most to the predictive model.
Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen
Theses and Dissertations (Comprehensive)
The complex nature of the human brain, with its intricate organic structure and multiscale spatio-temporal characteristics ranging from synapses to the entire brain, presents a major obstacle in brain modelling. Capturing this complexity poses a significant challenge for researchers. The complex interplay of coupled multiphysics and biochemical activities within this intricate system shapes the brain's capacity, functioning within a structure-function relationship that necessitates a specific mathematical framework. Advanced mathematical modelling approaches that incorporate the coupling of brain networks and the analysis of dynamic processes are essential for advancing therapeutic strategies aimed at treating neurodegenerative diseases (NDDs), which afflict millions of …
Reducing Food Scarcity: The Benefits Of Urban Farming, S.A. Claudell, Emilio Mejia
Journal of Nonprofit Innovation
Urban farming can enhance the lives of communities and help reduce food scarcity. This paper presents a conceptual prototype of an efficient urban farming community that can be scaled for a single apartment building or an entire community across all global geoeconomic regions, including densely populated cities and rural, developing towns and communities. When deployed in coordination with smart crop choices, local farm support, and efficient transportation, the result isn’t just sustainability, but also increasing fresh produce accessibility, optimizing nutritional value, eliminating the use of ‘forever chemicals’, reducing transportation costs, and fostering global environmental benefits.
Imagine Doris, who is …
Exploration And Statistical Modeling Of Profit, Caleb Gibson
Undergraduate Honors Theses
For any company involved in sales, maximization of profit is the driving force that guides all decision-making. Many factors can influence how profitable a company can be, including external factors like changes in inflation or consumer demand or internal factors like pricing and product cost. By understanding specific trends in its own internal data, a company can readily identify problem areas or potential growth opportunities to help increase profitability.
In this discussion, we use an extensive data set to examine how a company might analyze their own data to identify potential changes the company might investigate to drive better performance. Based …
The Private Pilot Check Ride: Applying The Spacing Effect Theory To Predict Time To Proficiency For The Practical Test, Michael Scott Harwin
Theses and Dissertations
This study examined the relationship between a set of targeted factors and the total flight time students needed to become ready to take the private pilot check ride. The study was grounded in Ebbinghaus’s (1885/1913/2013) forgetting curve theory and spacing effect, and Ausubel’s (1963) theory of meaningful learning. The research factors included (a) training time to proficiency, which represented the number of training days needed to become check-ride ready; (b) flight training program (Part 61 vs. Part 141); (c) organization offering the training program (2- or 4-year college/university vs. FBO); (d) scheduling policy (mandated vs. student-driven); and demographical variables, which …
Bayesian Strategies For Propensity Score Estimation In Causal Inference., Uthpala I. Wanigasekara
Electronic Theses and Dissertations
Causal inference is a method used in various fields to draw causal conclusions based on data. It involves using assumptions, study designs, and estimation strategies to minimize the impact of confounding variables. Propensity scores are used to estimate outcome effects, through matching methods, stratification, weighting methods, and the Covariate Balancing Propensity Score method. However, they can be sensitive to estimation techniques and can lead to unstable findings. Researchers have proposed integrating weighting with regression adjustment in parametric models to improve causal inference validity. The first project focuses on Bayesian joint and two-stage methods for propensity score analysis. Propensity score modeling …
Bayesian Learning Of Spatiotemporal Source Distribution For Beached Microplastic In The Gulf Of Mexico, David Pojunas
Graduate Theses and Dissertations
Over the last several decades, plastic waste has gradually accumulated while slowly degrading in terrestrial and oceanic environments. Recently, there has been an increased effort to identify the possible sources of plastic to understand how they affect vulnerable beaches. This issue is of particular concern in the Gulf of Mexico due to the presence of oil, natural gas, and plastic production. In this thesis, we expand upon existing Bayesian plastic attribution models and develop a rigorous statistical framework to map observed beached microplastics to their sources. Within this framework, we combine Lagrangian backtracking simulations of floating particles using nurdle beaching …
Nonparametric Derivative Estimation Using Penalized Splines: Theory And Application, Bright Antwi Boasiako
Doctoral Dissertations
This dissertation is in the field of nonparametric derivative estimation using penalized splines. It is conducted in two parts. In the first part, we study the L2 convergence rates of estimating derivatives of mean regression functions using penalized splines. In 1982, Stone provided the optimal rates of convergence for estimating derivatives of mean regression functions using nonparametric methods. Using these rates, Zhou et al. in their 2000 paper showed that the MSE of derivative estimators based on regression splines approaches zero at the optimal rate of convergence. Also, in 2019, Xiao showed that, under some general conditions, penalized spline estimators …
Parameter Estimation For Normally Distributed Grouped Data And Clustering Single-Cell RNA Sequencing Data Via The Expectation-Maximization Algorithm, Zahra Aghahosseinalishirazi
Electronic Thesis and Dissertation Repository
The Expectation-Maximization (EM) algorithm is an iterative algorithm for finding the maximum likelihood estimates in problems involving missing data or latent variables. The EM algorithm can be applied to problems consisting of evidently incomplete data or missingness situations, such as truncated distributions, censored or grouped observations, and also to problems in which the missingness of the data is not natural or evident, such as mixed-effects models, mixture models, log-linear models, and latent variables. In Chapter 2 of this thesis, we apply the EM algorithm to grouped data, a problem in which incomplete data are evident. Nowadays, data confidentiality is of …
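As a generic illustration of the E-step/M-step iteration described in this abstract, the sketch below fits a two-component Gaussian mixture (one of the latent-variable settings the abstract lists) to simulated data. It is not the grouped-data method developed in the thesis; the function names, starting values, and simulated sample are all assumptions made for the example.

```python
import math
import random


def normal_pdf(x, mu, sigma):
    """Density of a normal distribution with mean mu and standard deviation sigma."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def em_two_gaussians(data, n_iter=200):
    """EM for a two-component univariate Gaussian mixture (illustrative only)."""
    # Hypothetical starting values: place the two means at the sample extremes.
    mu = [min(data), max(data)]
    sigma = [1.0, 1.0]
    weight = [0.5, 0.5]
    n = len(data)
    for _ in range(n_iter):
        # E-step: posterior probability that each point belongs to component 0.
        resp = []
        for x in data:
            p0 = weight[0] * normal_pdf(x, mu[0], sigma[0])
            p1 = weight[1] * normal_pdf(x, mu[1], sigma[1])
            total = p0 + p1
            resp.append(p0 / total if total > 0.0 else 0.5)
        # M-step: weighted maximum likelihood updates given the responsibilities.
        n0 = sum(resp)
        n1 = n - n0
        mu[0] = sum(r * x for r, x in zip(resp, data)) / n0
        mu[1] = sum((1.0 - r) * x for r, x in zip(resp, data)) / n1
        var0 = sum(r * (x - mu[0]) ** 2 for r, x in zip(resp, data)) / n0
        var1 = sum((1.0 - r) * (x - mu[1]) ** 2 for r, x in zip(resp, data)) / n1
        # Floor the standard deviations to avoid a degenerate component.
        sigma = [max(math.sqrt(var0), 1e-6), max(math.sqrt(var1), 1e-6)]
        weight = [n0 / n, n1 / n]
    return weight, mu, sigma


# Simulated sample: half from N(0, 1), half from N(5, 1).
random.seed(0)
sample = [random.gauss(0.0, 1.0) for _ in range(300)]
sample += [random.gauss(5.0, 1.0) for _ in range(300)]

weights, means, sds = em_two_gaussians(sample)
print("weights:", weights)
print("means:", means)
print("standard deviations:", sds)
```

The component label of each observation plays the role of the missing data here: the E-step fills it in probabilistically and the M-step re-estimates the parameters as if those labels were observed.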
Using Geographic Information To Explore Player-Specific Movement And Its Effects On Play Success In The NFL, Hayley Horn, Eric Laigaie, Alexander Lopez, Shravan Reddy
SMU Data Science Review
American Football is a billion-dollar industry in the United States. The analytical aspect of the sport is an ever-growing domain, with open-source competitions like the NFL Big Data Bowl accelerating this growth. With the amount of player movement during each play, tracking data can prove valuable in many areas of football analytics. While concussion detection, catch recognition, and completion percentage prediction are all existing use cases for this data, player-specific movement attributes, such as speed and agility, may be helpful in predicting play success. This research calculates player-specific speed and agility attributes from tracking data and supplements them with descriptive …
A Framework For Statistical Modeling Of Wind Speed And Wind Direction, Eva Murphy
All Dissertations
Atmospheric near-surface wind speed and wind direction play an important role in many applications, ranging from air quality modeling, building design, and wind turbine placement to climate change research. It is therefore crucial to accurately estimate the joint probability distribution of wind speed and direction. This dissertation aims to provide a modeling framework for studying the variation of wind speed and wind direction. To this end, three projects are conducted to address some of the key issues for modeling wind vectors.
First, a conditional decomposition approach is developed to model the joint distribution of wind speed and direction. Specifically, the …
Addressing The Impact Of Time-Dependent Social Groupings On Animal Survival And Recapture Rates In Mark-Recapture Studies, Alexandru M. Draghici
Electronic Thesis and Dissertation Repository
Mark-recapture (MR) models typically assume that individuals under study have independent survival and recapture outcomes. One such model of interest is known as the Cormack-Jolly-Seber (CJS) model. In this dissertation, we conduct three major research projects focused on studying the impact of violating the independence assumption in MR models along with presenting extensions which relax the independence assumption. In the first project, we conduct a simulation study to address the impact of failing to account for pair-bonded animals having correlated recapture and survival fates on the CJS model. We examined the impact of correlation on the likelihood ratio test (LRT), …
Analytical Approach For Monitoring The Behavior Of Patients With Pancreatic Adenocarcinoma At Different Stages As A Function Of Time, Aditya Chakaborty Dr, Chris P. Tsokos Dr
Biology and Medicine Through Mathematics Conference
No abstract provided.
Optimizing Tumor Xenograft Experiments Using Bayesian Linear And Nonlinear Mixed Modelling And Reinforcement Learning, Mary Lena Bleile
Statistical Science Theses and Dissertations
Tumor xenograft experiments are a popular tool of cancer biology research. In a typical such experiment, one implants a set of animals with an aliquot of the human tumor of interest, applies various treatments of interest, and observes the subsequent response. Efficient analysis of the data from these experiments is therefore of utmost importance. This dissertation proposes three methods for optimizing cancer treatment and data analysis in the tumor xenograft context. The first of these is applicable to tumor xenograft experiments in general, and the second two seek to optimize the combination of radiotherapy with immunotherapy in the tumor xenograft …
Movie Recommender System Using Matrix Factorization, Roland Fiagbe
Data Science and Data Mining
Recommendation systems are a popular and beneficial field that can help people make informed decisions automatically. This technique assists users in selecting relevant information from an overwhelming amount of available data. When it comes to movie recommendations, two common methods are collaborative filtering, which compares similarities between users, and content-based filtering, which takes a user’s specific preferences into account. However, our study focuses on the collaborative filtering approach, specifically matrix factorization. Various similarity metrics are used to identify user similarities for recommendation purposes. Our project aims to predict movie ratings for unwatched movies using the MovieLens rating dataset. We developed …
Employee Attrition: Analyzing Factors Influencing Job Satisfaction Of IBM Data Scientists, Graham Nash
Symposium of Student Scholars
Employee attrition is a relevant issue that every business employer must consider when gauging the effectiveness of their employees. Whether or not an employee chooses to leave their job can come from a multitude of factors. As a result, employers need to develop methods in which they can measure attrition by calculating the several qualities of their employees. Factors like their age, years with the company, which department they work in, their level of education, their job role, and even their marital status are all considered by employers to assist in predicting employee attrition. This project will be analyzing a …
That’s My Deity: An Examination Of Online Lokean Cultures Through Log-Linear Modeling, Mary Bernstein
Senior Theses
A rise in online religious communities and the growth of so-called ‘Old World’ religions are reflected in the internet’s subcultures of Neopaganism, a growing religious movement that has been documented in America since the 1960s. The religions under this umbrella movement vary drastically and include belief systems such as Wicca, Druidry, and deity worship. Belief systems under this movement lack the traditional hierarchy found in structured religion and lack a singular sacred text. As such, believers usually find and support one another not through a physical sacred place of meeting, but through an online community that acts as sacred space. …
Biasing Estimator To Mitigate Multicollinearity In Linear Regression Model, Abdulrasheed Bello Badawaire, Issam Dawoud, Adewale Folaranmi Lukman, Victoria Laoye, Arowolo Olatunji
Al-Bahir Journal for Engineering and Pure Sciences
A new two-parameter estimator was developed to combat the threat of multicollinearity for the linear regression model. Some necessary and sufficient conditions for the dominance of the proposed estimator over ordinary least squares (OLS) estimator, ridge regression estimator, Liu estimator, KL estimator, and some two-parameter estimators are obtained in the matrix mean square error sense. Theory and simulation results show that, under some conditions, the proposed two-parameter estimator consistently dominates other estimators considered in this study. The real-life application result follows suit.
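For readers unfamiliar with the comparison criterion, the standard textbook definitions of two of the benchmark estimators named in this abstract, and of dominance in the matrix mean square error (MSEM) sense, can be written as below. The notation ($X$, $y$, $k$, $I_p$) is assumed for illustration; the paper's own two-parameter estimator is not reproduced here.

```latex
% Linear model: y = X\beta + \varepsilon, with X an n x p design matrix.
\hat{\beta}_{\mathrm{OLS}} = (X^{\top}X)^{-1}X^{\top}y,
\qquad
\hat{\beta}_{\mathrm{ridge}}(k) = (X^{\top}X + kI_p)^{-1}X^{\top}y, \quad k > 0.

% Matrix mean square error of an estimator \hat{\beta}:
\mathrm{MSEM}(\hat{\beta})
  = \mathrm{Cov}(\hat{\beta})
  + \bigl(E[\hat{\beta}] - \beta\bigr)\bigl(E[\hat{\beta}] - \beta\bigr)^{\top}.

% \hat{\beta}_1 dominates \hat{\beta}_2 in the MSEM sense when
\mathrm{MSEM}(\hat{\beta}_2) - \mathrm{MSEM}(\hat{\beta}_1)
  \ \text{is positive semidefinite.}
```

When $X^{\top}X$ is nearly singular because of multicollinearity, adding $kI_p$ stabilizes the inverse at the cost of some bias, which is why such estimators are called biasing estimators.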
Finite Mixtures Of Mean-Parameterized Conway-Maxwell-Poisson Models, Dongying Zhan
Theses and Dissertations--Statistics
For modeling count data, the Conway-Maxwell-Poisson (CMP) distribution is a popular generalization of the Poisson distribution due to its ability to characterize data over- or under-dispersion. While the classic parameterization of the CMP has been well-studied, its main drawback is that it does not directly model the mean of the counts. This is mitigated by using a mean-parameterized version of the CMP distribution. In this work, we are concerned with the setting where count data may be comprised of subpopulations, each possibly having varying degrees of data dispersion. Thus, we propose a finite mixture of mean-parameterized CMP distributions. An …
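The drawback mentioned in this abstract is easiest to see from the classic CMP probability mass function, given below in its standard form. The symbols $\lambda$, $\nu$, $Z$, and $\mu$ are the usual textbook notation and are assumed here, not taken from the dissertation.

```latex
% Classic CMP parameterization with rate \lambda > 0 and dispersion \nu \ge 0:
P(Y = y \mid \lambda, \nu) = \frac{\lambda^{y}}{(y!)^{\nu}\, Z(\lambda, \nu)},
\qquad
Z(\lambda, \nu) = \sum_{j=0}^{\infty} \frac{\lambda^{j}}{(j!)^{\nu}},
\qquad y = 0, 1, 2, \ldots

% \nu = 1 recovers the Poisson distribution; \nu < 1 gives over-dispersion
% and \nu > 1 gives under-dispersion.

% The mean has no closed form in (\lambda, \nu):
\mu = E[Y] = \sum_{y=0}^{\infty} \frac{y\, \lambda^{y}}{(y!)^{\nu}\, Z(\lambda, \nu)}.
```

Because $\mu$ is only defined implicitly through this series, $\lambda$ is not itself the mean, which is what motivates a parameterization indexed directly by $\mu$ and $\nu$.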
Classification Of Adult Income Using Decision Tree, Roland Fiagbe
Data Science and Data Mining
The decision tree is a commonly used data mining methodology for performing classification tasks. It is a tree-based supervised machine learning algorithm that classifies or makes predictions based on how a sequence of previous questions is answered. Generally, the decision tree algorithm categorizes data into branch-like segments that develop into a tree that contains a root, nodes, and leaves. This project seeks to explore the decision tree methodology and apply it to the Adult Income dataset from the UCI Machine Learning Repository, to determine whether a person makes over 50K per year and determine the necessary factors that improve …
Bayesian Structural Time Series Methods For Modeling Cattle Body Temperature In Heat-Stressed Animals, Lacey Quandt
Murray State Theses and Dissertations
Climate change has had devastating effects globally, most commonly talked about during natural disasters and rising temperatures. Notably, the climate concern is turning towards agriculture and livestock. With rising temperatures, the prolonged amount of heat stress put on animals, specifically cattle, is becoming more apparent. Heat stress has been linked to a reduction in cattle growing and fattening, feed intake, productivity, reproduction, and fertility; increased heart rates and respiration; changes in behavior; and mortality in severe cases. There are abatement strategies put in place to lower heat stress in cattle, such as improvements in shading and cooling, nutritional management, and …
Statistical Methods For Gene Selection And Genetic Association Studies, Xuewei Cao
Dissertations, Master's Theses and Master's Reports
This dissertation includes five chapters. A brief description of each chapter follows.
In Chapter One, we propose a signed bipartite genotype and phenotype network (GPN) by linking phenotypes and genotypes based on the statistical associations. It provides a new insight to investigate the genetic architecture among multiple correlated phenotypes and explore where phenotypes might be related at a higher level of cellular and organismal organization. We show that multiple phenotypes association studies by considering the proposed network are improved by incorporating the genetic information into the phenotype clustering.
In Chapter Two, we first illustrate the proposed GPN …
High Dimensional Data Analysis: Variable Screening And Inference, Lei Fang
Theses and Dissertations--Statistics
This dissertation focuses on the problem of high dimensional data analysis, which arises in many fields including genomics, finance, and social sciences. In such settings, the number of features or variables is much larger than the number of observations, posing significant challenges to traditional statistical methods.
To address these challenges, this dissertation proposes novel methods for variable screening and inference. The first part of the dissertation focuses on variable screening, which aims to identify a subset of important variables that are strongly associated with the response variable. Specifically, we propose a robust nonparametric screening method to effectively select the predictors …
Statistical Methods For Modern Threats, Brandon Lumsden
All Dissertations
More than ever before, technology is evolving at a rapid pace across the broad spectrum of biological sciences. As data collection becomes more precise, efficient, and standardized, a demand for appropriate statistical modeling grows as well. Throughout this dissertation, we examine a variety of new age data arising from modern technology of the 21st century. We begin by employing a suite of existing statistical techniques to address research questions surrounding three medical conditions presenting in public health sciences. Here we describe the techniques used, including generalized linear models and longitudinal models, and we summarize the significant associations identified between research …
Learning Graphical Models Of Multivariate Functional Data With Applications To Neuroimaging, Jiajing Niu
All Dissertations
This dissertation investigates functional graphical models that infer functional connectivity from neuroimaging data, which are noisy and high dimensional and have limited samples. The dissertation provides two recipes to infer the functional graphical model: 1) a fully Bayesian framework and 2) an end-to-end deep model.
We first propose a fully Bayesian regularization scheme to estimate functional graphical models. We consider a direct Bayesian analog of the functional graphical lasso proposed by Qiao et al. (2019). We then propose a regularization strategy via the graphical horseshoe. We compare both Bayesian approaches to the frequentist functional graphical lasso, and compare the …
Bayesian Methods For Graphical Models With Neighborhood Selection., Sagnik Bhadury
Electronic Theses and Dissertations
Graphical models determine associations between variables through the notion of conditional independence. Gaussian graphical models are a widely used class of such models, where the relationships are formalized by non-null entries of the precision matrix. However, in high-dimensional cases, covariance estimates are typically unstable. Moreover, it is natural to expect only a few significant associations to be present in many realistic applications. This necessitates the injection of sparsity techniques into the estimation method. Classical frequentist methods, like GLASSO, use penalization techniques for this purpose. Fully Bayesian methods, on the contrary, are slow because they require iteratively sampling over a quadratic …
Functional Data Analysis Of COVID-19, Nichole L. Fluke
Mathematics & Statistics ETDs
This thesis applies Functional Data Analysis (FDA) to COVID data. The data consist of counts of new COVID cases, hospitalized COVID patients, and new COVID deaths for all the states and regions in the United States. The data start on March 1st, 2020 and run through March 31st, 2021. The FDA smooths the data and examines whether there are similarities or differences between the states and regions. The analysis also shows which states and regions stand out from the others and which ones are similar. Also shown …
Regression-Based Methods For Dynamic Treatment Regimes With Mismeasured Covariates Or Misclassified Response, Dan Liu
Electronic Thesis and Dissertation Repository
The statistical study of dynamic treatment regimes (DTRs) focuses on estimating sequential treatment decision rules tailored to patient-level information across multiple stages of intervention. Regression-based methods in DTR have been studied in the literature with a critical assumption that all the observed variables are precisely measured. However, this assumption is often violated in many applications. One example is the STAR*D study, in which the patient's depressive score is subject to measurement error. In this thesis, we explore problems in the context of DTR with measurement error or misclassification considered in the observed data.
The first project deals with covariate measurement …