Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Articles 1 - 30 of 33

Full-Text Articles in Physical Sciences and Mathematics

Statistical Approaches For The Early Detection Of Colorectal Cancer Using Longitudinal Biomarkers, Emily Berry May 2024

Statistical Science Theses and Dissertations

Colorectal cancer (CRC) is the third leading cause of cancer-related death in the United States [45]. CRC is believed to advance from adenomatous polyps, creating a unique opportunity for both early detection and cancer prevention [4, 23]. As with screening for other diseases, CRC screening reduces mortality by detecting cancer at earlier, more treatable stages; however, it can also reduce incidence through the removal of precancerous lesions [4]. As a result, screening is recommended for average-risk adults ≥ 45 years of age and includes a variety of tests [4, 12]. Despite alternate screening options, colonoscopy capacity is often cited as a barrier to …


Factors Predictive Of The Development Of Surgical Site Infection In Thyroidectomy, A Replication Study Of Myssiorek (2018), Kaitlyn M. Kenig May 2024

Capstone Experience

The original study aimed to show that thyroidectomy does not result in surgical site infection (SSI) in most cases, and thus that routine prescription of antibiotics is not necessary. The study examined which risk factors could predict the incidence of SSI. This would identify the individuals at greatest risk of developing SSI, so that antibiotics could be prescribed only to them instead of to all or most individuals who undergo thyroidectomy.

This study used NSQIP data to examine the incidence of SSI and to identify risk factors that may be predictive of SSI. Only surgeries that were …


Bayesian Statistical Modeling Of Spatially Resolved Transcriptomics Data, Xi Jiang Oct 2023

Statistical Science Theses and Dissertations

Spatially resolved transcriptomics (SRT) quantifies expression levels at different spatial locations, providing a new and powerful tool for gaining biological insights. As experimental technologies improve in both capacity and efficiency, there is a growing demand for new analytical methodologies.

One question in SRT data analysis is how to identify genes whose expression exhibits spatially correlated patterns, called spatially variable (SV) genes. Most current methods to identify SV genes are built upon a geostatistical model with a Gaussian process, which could limit the models' ability to identify complex spatial patterns. In order to overcome this challenge and capture more types …


Forecasting Covid-19 With Temporal Hierarchies And Ensemble Methods, Li Shandross Aug 2023

Masters Theses

Infectious disease forecasting efforts underwent rapid growth during the COVID-19 pandemic, providing guidance on pandemic response and potential future trends. Yet despite their importance, short-term forecasting models often struggled to produce accurate real-time predictions of this complex and rapidly changing system. This gap in accuracy persisted into the pandemic and warrants the exploration and testing of new methods to glean fresh insights.

In this work, we examined the application of the temporal hierarchical forecasting (THieF) methodology to probabilistic forecasts of COVID-19 incident hospital admissions in the United States. THieF is an innovative forecasting technique that aggregates time-series data into …


Prevalence Of Sars-Cov-2 Antibodies In Liberty University Student Population, Emily Bonus Apr 2023

Senior Honors Theses

In 2020, the virus SARS-CoV-2 gained attention as it spread around the world. Its antibodies are poorly understood, and little research focuses on those with few COVID-19 complications yet large numbers of close contacts: university students. This longitudinal study recorded SARS-CoV-2 antibody presence in 107 undergraduate Liberty University students twice during early 2021. After extensive data cleaning and the application of various statistical tests and ANOVAs, the data seems to show that in the case of COVID-19 infections, SARS-CoV-2 IgM antibodies are immediately produced, and then IgG antibodies follow later. However, the COVID-19 vaccine causes the production of both IgM …


Regression Modeling Of Complex Survival Data Based On Pseudo-Observations, Rong Rong Dec 2022

Statistical Science Theses and Dissertations

The restricted mean survival time (RMST) is a clinically meaningful summary measure in studies with survival outcomes. Statistical methods have been developed for regression analysis of RMST to investigate impacts of covariates on RMST, which is a useful alternative to the Cox regression analysis. However, existing methods for regression modeling of RMST are not applicable to left-truncated right-censored data that arise frequently in prevalent cohort studies, for which the sampling bias due to left truncation and informative censoring induced by the prevalent sampling scheme must be properly addressed. Meanwhile, statistical methods have been developed for regression modeling of the cumulative …
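
As a rough illustration of the quantity named in this abstract, the RMST up to a horizon tau is the area under the survival curve on [0, tau]. The sketch below estimates it from a Kaplan-Meier curve for ordinary right-censored data only. It does not handle left truncation and is not the pseudo-observation method of the dissertation; the function name `km_rmst` and its arguments are hypothetical.

```python
def km_rmst(times, events, tau):
    """Area under the Kaplan-Meier curve on [0, tau] (right-censored data only).

    times  -- observed follow-up times
    events -- 1 if the event was observed, 0 if the time is censored
    tau    -- restriction horizon (an assumed, user-chosen value)
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    surv, last_t, area = 1.0, 0.0, 0.0
    i = 0
    while i < len(data) and data[i][0] <= tau:
        t = data[i][0]
        # Collect all subjects sharing this time and count observed events.
        j, deaths = i, 0
        while j < len(data) and data[j][0] == t:
            deaths += data[j][1]
            j += 1
        # The curve is flat since the previous time point: add that rectangle.
        area += surv * (t - last_t)
        if deaths:
            surv *= 1.0 - deaths / n_at_risk
        n_at_risk -= j - i  # events and censorings both leave the risk set
        last_t = t
        i = j
    # Final rectangle from the last observed time up to tau.
    area += surv * (tau - last_t)
    return area
```

For example, `km_rmst([1, 2, 3], [1, 0, 1], tau=3)` treats the subject at time 2 as censored, so that subject leaves the risk set without lowering the curve.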


Estimation Of Causal Effects In Complex Clustered Data, Joshua R. Nugent Oct 2022

Doctoral Dissertations

Analysis of clustered data from randomized trials or observational data often poses theoretical and practical statistical challenges, including but not limited to small numbers of independent units, many adjustment variables, continuous exposures, and/or differential clustering across trial arms. Further, commonly used parametric methods rely on assumptions that may be violated in practice. Motivated by three scientific questions in public health, methods are developed and/or demonstrated for non-parametric estimation of causal effects. In Chapter 1, methods are elaborated for a cluster randomized trial (CRT) with missing individual-level data at baseline and follow-up, a complex sampling strategy, and a limited number of clusters. Chapter …


Medical Outcomes, Quality Of Life, And Family Perceptions For Outpatient Vs Inpatient Neutropenia Management After Chemotherapy For Pediatric Acute Myeloid Leukemia, Kelly D Getz, Julia E Szymczak, Yimei Li, Rachel Madding, Yuan-Shung V Huang, Catherine Aftandilian, Staci D Arnold, Kira O Bona, Emi Caywood, Anderson B Collier, M Monica Gramatges, Meret Henry, Craig Lotterman, Kelly Maloney, Amir Mian, Rajen Mody, Elaine Morgan, Elizabeth A Raetz, Jeffrey Rubnitz, Anupam Verma, Naomi Winick, Jennifer J Wilkes, Jennifer C Yu, Brian T Fisher, Richard Aplenc Oct 2021

Department of Medicine Faculty Papers

Importance: Pediatric acute myeloid leukemia (AML) requires multiple courses of intensive chemotherapy that result in neutropenia, with significant risk for infectious complications. Supportive care guidelines recommend hospitalization until neutrophil recovery. However, there are little data to support inpatient over outpatient management.

Objective: To evaluate outpatient vs inpatient neutropenia management for pediatric AML.

Design, setting, and participants: This cohort study used qualitative and quantitative methods to compare medical outcomes, patient health-related quality of life (HRQOL), and patient and family perceptions between outpatient and inpatient neutropenia management. The study included patients from 17 US pediatric hospitals with frontline chemotherapy start dates ranging …


A Bayesian Hierarchical Mixture Model With Continuous-Time Markov Chains To Capture Bumblebee Foraging Behavior, Max Thrush Hukill Jan 2021

Honors Projects

The standard statistical methodology for analyzing complex case-control studies in ethology is often limited by approaches that force researchers to model distinct aspects of biological processes in a piecemeal, disjointed fashion. By developing a hierarchical Bayesian model, this work demonstrates that statistical inference in this context can be done using a single coherent framework. To do this, we construct a continuous-time Markov chain (CTMC) to model bumblebee foraging behavior. To connect the experimental design with the CTMC, we employ a mixture model controlled by a logistic regression on the two-factor design matrix. We then show how to infer these model …


Bayesian Semi-Supervised Keyphrase Extraction And Jackknife Empirical Likelihood For Assessing Heterogeneity In Meta-Analysis, Guanshen Wang Dec 2020

Statistical Science Theses and Dissertations

This dissertation investigates: (1) A Bayesian Semi-supervised Approach to Keyphrase Extraction with Only Positive and Unlabeled Data, (2) Jackknife Empirical Likelihood Confidence Intervals for Assessing Heterogeneity in Meta-analysis of Rare Binary Events.

In the big data era, people are blessed with a huge amount of information. However, the availability of information may also pose great challenges. One big challenge is how to extract useful yet succinct information in an automated fashion. As one of the first few efforts, keyphrase extraction methods summarize an article by identifying a list of keyphrases. Many existing keyphrase extraction methods focus on the unsupervised setting, …


Causal Inference And Prediction On Observational Data With Survival Outcomes, Xiaofei Chen Jul 2020

Statistical Science Theses and Dissertations

Infants with hypoplastic left heart syndrome require an initial Norwood operation, followed some months later by a stage 2 palliation (S2P). The timing of S2P is critical for the operation’s success and the infant’s survival, but the optimal timing, if one exists, is unknown. We attempt to estimate the optimal timing of S2P by analyzing data from the Single Ventricle Reconstruction Trial (SVRT), which randomized patients between two different types of Norwood procedure. In the SVRT, the timing of the S2P was chosen by the medical team; thus with respect to this exposure, the trial constitutes an observational study, and …


Inference Of Heterogeneity In Meta-Analysis Of Rare Binary Events And Rss-Structured Cluster Randomized Studies, Chiyu Zhang Dec 2019

Statistical Science Theses and Dissertations

This dissertation contains two topics: (1) A Comparative Study of Statistical Methods for Quantifying and Testing Between-study Heterogeneity in Meta-analysis with Focus on Rare Binary Events; (2) Estimation of Variances in Cluster Randomized Designs Using Ranked Set Sampling.

Meta-analysis, the statistical procedure for combining results from multiple studies, has been widely used in medical research to evaluate intervention efficacy and safety. In many practical situations, the variation of treatment effects among the collected studies, often measured by the heterogeneity parameter, may exist and can greatly affect the inference about effect sizes. Comparative studies have been done for only one or …


Sample Size Calculation Of Clinical Trials With Correlated Outcomes, Dateng Li Aug 2019

Statistical Science Theses and Dissertations

In this thesis, we investigate sample size calculation for three kinds of clinical trials: (1) randomized controlled trials (RCTs) with longitudinal count outcomes; (2) cluster randomized trials (CRTs) with count outcomes; (3) CRTs with multiple binary co-primary endpoints.


Factors Associated With Eosinophilic Esophagitis In Nevada, Julia Lorraine Anderson Aug 2019

UNLV Theses, Dissertations, Professional Papers, and Capstones

Eosinophilic esophagitis (EoE) is a rare immune-mediated illness with symptoms that range from difficulty swallowing to food impaction of the esophagus. Most published studies have been documented among patients residing in cool regions with significant annual rainfall. No published studies to our knowledge have been performed examining the healthcare utilization trends of EoE in Nevada. Utilizing two unique databases, the factors associated with EoE healthcare utilization patterns in Nevada were examined. All analyses were performed in R version 3.5.1. This study included a demographic and regional analysis identifying risk factors associated with having an EoE healthcare visit in Nevada. Several …


Methods For Making Policy-Relevant Forecasts Of Infectious Disease Incidence, Stephen A. Lauer Jul 2019

Doctoral Dissertations

Infectious diseases place an enormous burden on the people of the developing world and their governments. When, where, and how to allocate resources in order to slow the spread of a virus or deal with the aftermath of an outbreak is often the responsibility of local public health officials. In this thesis, we develop statistical methods for forecasting future incidence of infectious diseases and estimating the effects of interventions designed to reduce future incidence, bearing in mind the needs and concerns of those public health officials. While most infectious disease forecasting models focus on short-term horizons (i.e. weeks or …


Robust And Adaptive Design Approaches For Stepped Wedge Cluster Randomized Trials, Jijia Wang Jan 2019

Statistical Science Theses and Dissertations

The stepped wedge (SW) cluster randomized design has been increasingly employed by pragmatic trials in health services research. In this study, based on the GEE approach, I present a closed-form sample size formula that is applicable to both closed-cohort and cross-sectional SW trials with outcomes from the exponential family. In addition, I propose a Bayesian adaptive design for cross-sectional SW cluster randomized trials. It is more adaptable than traditional designs because it allows early termination of the trial when interim data indicate that the intervention is sufficiently efficacious or inefficacious. A decision to terminate or continue the trial will …


Spectral Methods For The Detection And Characterization Of Topologically Associated Domains, Kellen Garrison Cresswell Jan 2019

Theses and Dissertations

The three-dimensional (3D) structure of the genome plays a crucial role in gene expression regulation. Chromatin conformation capture technologies (Hi-C) have revealed that the genome is organized in a hierarchy of topologically associated domains (TADs), sub-TADs, and chromatin loops that is relatively stable across cell lines and even across species. These TADs dynamically reorganize during the development of disease and exhibit cell- and condition-specific differences. Identifying such hierarchical structures and how they change between conditions is a critical step in understanding genome regulation and disease development. Despite their importance, there are relatively few tools for identification of TADs and even fewer for …


Angiostrongylus Cantonensis: Epidemiologic Review, Location-Specific Habitat Modelling, And Surveillance In Hillsborough County, Florida, U.S.A., Brad Christian Perich Mar 2018

USF Tampa Graduate Theses and Dissertations

Angiostrongylus cantonensis is a parasitic nematode endemic to tropical and subtropical regions and is the leading cause of human eosinophilic meningitis. The parasite is commonly known as rat lungworm because the primary host in its lifecycle is the rat. A clinical overview of rat lungworm infection is presented, followed by a literature review of rat lungworm epidemiology, risk factors, and surveillance projects. Data collected from previous snail surveys in Florida was considered alongside elevation, population per square kilometer, median household income by zip code territory, and normalized difference vegetation index specific to the geographic coordinates from which the snail samples …


Distance-Based Analysis Of Variance For Brain Connectivity, Russell T. Shinohara, Haochang Shou, Marco Carone, Robert Schultz, Birkan Tunc, Drew Parker, Ragini Verma Aug 2016

UPenn Biostatistics Working Papers

The field of neuroimaging dedicated to mapping connections in the brain is increasingly being recognized as key for understanding neurodevelopment and pathology. Networks of these connections are quantitatively represented using complex structures including matrices, functions, and graphs, which require specialized statistical techniques for estimation and inference about developmental and disorder-related changes. Unfortunately, classical statistical testing procedures are not well suited to high-dimensional testing problems. In the context of global or regional tests for differences in neuroimaging data, traditional analysis of variance (ANOVA) is not directly applicable without first summarizing the data into univariate or low-dimensional features, a process that may …


A Weighted Gene Co-Expression Network Analysis For Streptococcus Sanguinis Microarray Experiments, Erik C. Dvergsten Jan 2016

Theses and Dissertations

Streptococcus sanguinis is a gram-positive, non-motile bacterium native to human mouths. It is the primary cause of endocarditis and is also responsible for tooth decay. Two-component systems (TCSs) are commonly found in bacteria. In response to environmental signals, TCSs may regulate the expression of virulence factor genes.

Gene co-expression networks are exploratory tools used to analyze system-level gene functionality. A gene co-expression network consists of gene expression profiles represented as nodes and gene connections, which occur if two genes are significantly co-expressed. An adjacency function transforms the similarity matrix containing co-expression similarities into the adjacency matrix containing connection strengths. Gene …
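
As a hedged sketch of the adjacency-function step described above, the code below raises the absolute Pearson correlation between two expression profiles to a power `beta`, one common choice in weighted co-expression analysis. The exponent `beta=6` and the function names are assumptions for illustration, not values taken from the thesis.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def power_adjacency(profiles, beta=6):
    """Map co-expression similarities |cor| to connection strengths |cor|**beta.

    profiles -- list of per-gene expression vectors (one list per gene)
    beta     -- soft-threshold exponent (assumed value, chosen for illustration)
    """
    genes = len(profiles)
    return [
        [abs(pearson(profiles[i], profiles[j])) ** beta for j in range(genes)]
        for i in range(genes)
    ]
```

Raising the similarity to a power keeps every pair connected but shrinks weak correlations toward zero, which is what distinguishes a weighted network from one built with a hard significance cutoff.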


Statistical Handling Of Medical Data - An Ethical Perspective, Ajay Kumar Bansal Dr Dec 2015

COBRA Preprint Series

Medical science is a delicate subject, and the clinical data generated from medical trials must be reliable and of good quality. Not only is the quality of the generated data important, but its management is also crucial and must be handled very carefully. In this paper, the ethical aspect of statistical handling of such data is discussed.

Every profession has a set of norms to follow to achieve its objectives. These norms are called professional ethics, which reflect the essence of human behaviour. In the same way, the field of medical research is expected to follow ethical norms, to obtain reliable …


Developing A Weibull Model Extension To Estimate Cancer Latency Times, Diana L. Nadler Jan 2015

Legacy Theses & Dissertations (2009 - 2024)

More than one-third of all Americans will be diagnosed with cancer sometime in their lives. Though their illness may be invisible now, it presents a great, and largely unexamined, opportunity to find and treat their cancers early. Early detection represents one of the most promising approaches to reduce the growing cancer burden by identifying cancer while it is localized and curable, preventing not only mortality, but also reducing morbidity and costs.


Estimating Prevalence From Complex Surveys, Sophie O'Brien Nov 2014

Masters Theses

Massachusetts passed legislation in the fall of 2012 to allow the construction of three casinos and a slot parlor in the state. The prevalence of problem gambling in the state and in areas where casinos will be constructed is of particular interest. The goal is to evaluate the change in prevalence after construction of the casinos, using a multi-mode address-based sample survey. The objective of this thesis is to evaluate and describe ways of using statistical inference to estimate prevalence rates in finite populations. Four methods were considered in an attempt to evaluate the prevalence of problem gambling in …


Normalization Techniques For Statistical Inference From Magnetic Resonance Imaging, Russell T. Shinohara, Elizabeth M. Sweeney, Jeff Goldsmith, Navid Shiee, Farrah J. Mateen, Peter A. Calabresi, Samson Jarso, Dzung L. Pham, Daniel S. Reich, Ciprian M. Crainiceanu Aug 2013

UPenn Biostatistics Working Papers

While computed tomography and other imaging techniques are measured in absolute units with physical meaning, magnetic resonance images are expressed in arbitrary units that are difficult to interpret and differ between study visits and subjects. Much work in the image processing literature on intensity normalization has focused on histogram matching and other histogram mapping techniques, with little emphasis on normalizing images to have biologically interpretable units. Furthermore, there are no formalized principles or goals for the crucial comparability of image intensities within and across subjects. To address this, we propose a set of criteria necessary for the normalization of images. …


Detecting And Correcting Batch Effects In High-Throughput Genomic Experiments, Sarah Reese Apr 2013

Theses and Dissertations

Batch effects are due to probe-specific systematic variation between groups of samples (batches) resulting from experimental features that are not of biological interest. Principal components analysis (PCA) is commonly used as a visual tool to determine whether batch effects exist after applying a global normalization method. However, PCA yields linear combinations of the variables that contribute maximum variance and thus will not necessarily detect batch effects if they are not the largest source of variability in the data. We present an extension of principal components analysis to quantify the existence of batch effects, called guided PCA (gPCA). We describe a …


Characterization Of A Weighted Quantile Score Approach For Highly Correlated Data In Risk Analysis Scenarios, Caroline Carrico Mar 2013

Theses and Dissertations

In risk evaluation, the effect of mixtures of environmental chemicals on a common adverse outcome is of interest. However, due to the high dimensionality and inherent correlations among chemicals that occur together, the traditional methods (e.g. ordinary or logistic regression) are unsuitable. We extend and characterize a weighted quantile score (WQS) approach to estimating an index for a set of highly correlated components. In the case with environmental chemicals, we use the WQS to identify “bad actors” and estimate body burden. The accuracy of the WQS was evaluated through extensive simulation studies in terms of validity (ability of the WQS …


Is Obesity Socially Contagious?, Ciani Jean Sparks Mar 2013

Statistics

The main objective of this paper is to analyze three different articles that discuss whether obesity could be socially contagious. According to the World Health Organization in 2013, obesity is the fifth leading risk for deaths around the world. This disease has dramatically increased in the last decade, which has led scientists to believe there are other factors contributing to the epidemic besides genetics. The first article I analyzed, written by Nicholas Christakis and James Fowler, provided a logistic regression model to estimate the odds of a person becoming obese. The model included the explanatory variables: age, sex, education, smoking …


Interactions Between Serotypes Of Dengue Highlight Epidemiological Impact Of Cross-Immunity, Nicholas Reich, Sourya Shrestha, Aaron King, Pejman Rohani, Justin Lessler, Siripen Kalayanarooj, In-Kyu Yoon, Robert Gibbons, Donald Burke, Derek Cummings Jan 2013

Nicholas G Reich

Dengue, a mosquito-borne virus of humans, infects over 50 million people annually. Infection with any of the four dengue serotypes induces protective immunity to that serotype, but does not confer long-term protection against infection by other serotypes. The immunological interactions between serotypes are of central importance in understanding epidemiological dynamics and anticipating the impact of dengue vaccines. We analysed a 38-year time series with 12 197 serotyped dengue infections from a hospital in Bangkok, Thailand. Using novel mechanistic models to represent different hypothesized immune interactions between serotypes, we found strong evidence that infection with dengue provides substantial short-term …


Models And Software Development For Interval-Censored Data, Chun Pan Jan 2013

Theses and Dissertations

Interval-censored time-to-event data occur naturally in studies of diseases where the symptoms are not directly observable and periodic clinical examinations are required for detection. Due to the lack of well-established procedures, interval-censored data have conventionally been treated as right-censored data; however, this introduces bias from the outset. This dissertation focuses on methodological research and software development for interval-censored data. Specifically, it consists of three projects. The first project is to create an R package for regression analysis and survival curve estimation of interval-censored data based on several published papers by our research team. In the second project, a Bayesian …


Advanced Methodology Developments In Mixture Cure Models, Chao Cai Jan 2013

Theses and Dissertations

Modern medical treatments have substantially improved cure rates for many chronic diseases and have generated increasing interest in appropriate statistical models to handle survival data with non-negligible cure fractions. Mixture cure models are designed for such data; they assume that the studied population is a mixture of cured and uncured individuals. In this dissertation, I will develop two programs named smcure and NPHMC in R. The first program aims to facilitate estimating two popular mixture cure models: the proportional hazards (PH) mixture cure model and the accelerated failure time (AFT) mixture cure model. The second program focuses on designing …