Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 34

Full-Text Articles in Physical Sciences and Mathematics

High-Dimensional Mediation Analysis Of Multi-Omics Data, Sunyi Chi May 2024

High-Dimensional Mediation Analysis Of Multi-Omics Data, Sunyi Chi

Dissertations & Theses (Open Access)

Environmental exposures such as cigarette smoking influence health outcomes through intermediate molecular phenotypes, such as the methylome, transcriptome, and metabolome. Mediation analysis is a useful tool for investigating the role of potentially high-dimensional intermediate phenotypes in the relationship between environmental exposures and health outcomes. Rapid development of high-throughput technologies have made mediation analysis of multi-omics data critical to gain groundbreaking insights into the biological mechanisms underlying the disease etiology. This dissertation aims to develop mediation analysis methods that utilize the enormous amount of multi-omics data in assessing mechanisms of disease etiology. It contains three projects where I propose advanced mediation …


Bayesian Adaptive Clinical Trial Design, Mengyi Lu Dec 2022

Bayesian Adaptive Clinical Trial Design, Mengyi Lu

Dissertations & Theses (Open Access)

The landscape of drug development in oncology has changed from conventional chemotherapies to molecular targeted therapies and immunotherapies, which provide innovative therapeutic modalities for treating cancers. These novel therapeutic agents work through mechanisms that fundamentally differ from standard chemotherapeutic agents, making the conventional trial design paradigm inefficient and dysfunctional. Specifically, the focus of dose-finding trials has shifted from finding the maximum tolerated dose (MTD) to the optimal biological dose (OBD), defined as the dose that optimizes the risk–benefit tradeoff. How to accurately identify the OBD and its dosing schedule is of great importance to maximize efficacy and safety of targeted …


Bayesian Adaptive Designs For Proof-Of-Concept Trials And Platform Trials, Yujie Zhao Aug 2022

Bayesian Adaptive Designs For Proof-Of-Concept Trials And Platform Trials, Yujie Zhao

Dissertations & Theses (Open Access)

With the revolutionary achievement in molecular targeted therapies and cancer immunotherapies, the traditional drug development paradigm in phase II trials becomes increasingly inefficient due to its slow progress, high cost, and high failure rate. Fitting one standard strategy to all different trials also harms its reliability in decision-making because it doesn’t fully use all available resources and information in each trial. It’s crucial to develop novel phase II trial designs to accomplish different objectives for different types of trials. This research mainly focuses on Bayesian adaptive designs for phase II trials. Three types of trials are discussed in which traditional …


Statistical Modeling Of Longitudinal Medical Cost Data, Shikun Wang Jun 2022

Statistical Modeling Of Longitudinal Medical Cost Data, Shikun Wang

Dissertations & Theses (Open Access)

Projecting the future cancer care cost is critical in health economics research and policy making. An indispensable step is to estimate cost trajectories from an incident cohort of cancer patients using longitudinal medical cost data, accounting for terminal events such as death, and right censoring due to loss of follow-up. Since the cost of cancer care and survival are correlated, a scientifically meaningful quantity for inference in this context is the mean cost trajectory conditional on survival. Many standard approaches for longitudinal and survival analysis are not valid for the problem. The research for my Ph.D. dissertation consists of three …


Modeling Of Cns Cancer With A Focus On The Immune Component, Daniel Zamler May 2022

Modeling Of Cns Cancer With A Focus On The Immune Component, Daniel Zamler

Dissertations & Theses (Open Access)

The knowledge surrounding cancers of the central nervous system remains poorly developed, in particular with regard to the immune component. The works contained in this thesis look at craniopharyngioma, glioblastoma, and several forms of brain metastasis. While some attention is given to the tumor cells themselves, as well as the patient setting which these studies model, the immune component of disease progression and treatment plays a strong role in each and is the primary focus of the works contained.

Craniopharyngioma is a relatively rare tumor in adults. Although histologically benign, it can be locally aggressive and may require additional therapeutic …


Identification And Characterization Of De Novo Germline Tp53 Mutation Carriers In Families With Li-Fraumeni Syndrome, Carlos C. Vera Recio Aug 2021

Identification And Characterization Of De Novo Germline Tp53 Mutation Carriers In Families With Li-Fraumeni Syndrome, Carlos C. Vera Recio

Dissertations & Theses (Open Access)

Li-Fraumeni syndrome (LFS) is an inherited cancer syndrome caused by a deleterious mutation in TP53. An estimated 48% of LFS patients present due to a de novo mutation (DNM) in TP53. The knowledge of DNM status, DNM or familial mutation (FM), of an LFS patient requires genetic testing of both parents which is often inaccessible, making de novo LFS patients difficult to study. Famdenovo.TP53 is a Mendelian Risk prediction model used to predict DNM status of TP53 mutation carriers based on the cancer-family history and several input genetic parameters, including disease-gene penetrance. The good predictive performance of Famdenovo.TP53 was demonstrated …


Biases And Blind-Spots In Genome-Wide Crispr-Cas9 Knockout Screens, Merve Dede May 2021

Biases And Blind-Spots In Genome-Wide Crispr-Cas9 Knockout Screens, Merve Dede

Dissertations & Theses (Open Access)

Adaptation of the bacterial CRISPR-Cas9 system to mammalian cells revolutionized the field of functional genomics, enabling genome-scale genetic perturbations to study essential genes, whose loss of function results in a severe fitness defect. There are two types of essential genes in a cell. Core essential genes are absolutely required for growth and proliferation in every cell type. On the other hand, context-dependent essential genes become essential in an environmental or genetic context. The concept of context-dependent gene essentiality is particularly important in cancer, since killing cancer cells selectively without harming surrounding healthy tissue remains a major challenge. The toxicity of …


Mixture Model Approaches To Integrative Analysis Of Multi-Omics Data And Spatially Correlated Genomic Data, Ziqiao Wang May 2021

Mixture Model Approaches To Integrative Analysis Of Multi-Omics Data And Spatially Correlated Genomic Data, Ziqiao Wang

Dissertations & Theses (Open Access)

Integrative genomic data analysis is a powerful tool to study the complex biological processes behind a disease. Statistical methods can model the interrelationships of the involved gene activities through jointly analyzing multiple types of genomic data from different platforms (vertical integration), or improve the power of a study through aggregating the same type of genomic data across studies (horizontal integration). In this dissertation, we propose statistical methods and strategies for integrative multi-omics data in association analysis of disease phenotypes, with an emphasis on cancer applications.

We develop a new strategy based on horizontal integration by leveraging publicly available datasets into …


Statistical Methods For Resolving Intratumor Heterogeneity With Single-Cell Dna Sequencing, Alexander Davis Aug 2020

Statistical Methods For Resolving Intratumor Heterogeneity With Single-Cell Dna Sequencing, Alexander Davis

Dissertations & Theses (Open Access)

Tumor cells have heterogeneous genotypes, which drives progression and treatment resistance. Such genetic intratumor heterogeneity plays a role in the process of clonal evolution that underlies tumor progression and treatment resistance. Single-cell DNA sequencing is a promising experimental method for studying intratumor heterogeneity, but brings unique statistical challenges in interpreting the resulting data. Researchers lack methods to determine whether sufficiently many cells have been sampled from a tumor. In addition, there are no proven computational methods for determining the ploidy of a cell, a necessary step in the determination of copy number. In this work, software for calculating probabilities from …


A Signature Enrichment Design With Bayesian Adaptive Randomization For Cancer Clinical Trials, Fang Xia Dec 2019

A Signature Enrichment Design With Bayesian Adaptive Randomization For Cancer Clinical Trials, Fang Xia

Dissertations & Theses (Open Access)

Clinical trials in the era of precision medicine demand more flexible and efficient trial designs. Adaptive clinical trial designs allow pre-specified modifications of an on-going clinical trial and could shorten the trial duration. We reviewed five common types of adaptive clinical trials based on adaptation methods. In particular, outcome-randomization becomes more popular as it can assign more patients to the promising treatments based on the accumulated trial data. This data-driven allocation allows more patients to benefit from the trial, which is especially important for cancer patients. We compared different Bayesian outcome-adaptive randomization methods and discussed them from both methodological and …


Novel Bayesian Adaptive Clinical Trial Designs In Early Phases, Haitao Pan Aug 2017

Novel Bayesian Adaptive Clinical Trial Designs In Early Phases, Haitao Pan

Dissertations & Theses (Open Access)

Early phase, or phase I and phase II, trials are the first step in testing new medicines that have been developed in the lab. The main goal of phase I clinical trials is to establish the recommended dose of new drugs for phase II trials. For the cytotoxic drugs, the goal is to find maximum tolerated dose (MTD). The guiding principle for dose escalation in phase I trials is to avoid exposing too many patients to subtherapeutic doses while preserving safety and maintaining rapid accrual. Therefore, dose escalation methods, especially Bayesian designs, are recommended to be used in phase I …


A Tail-Based Test For Differential Expression Analysis And Pathway Analysis In Rna-Sequencing Data, Jiong Chen Aug 2017

A Tail-Based Test For Differential Expression Analysis And Pathway Analysis In Rna-Sequencing Data, Jiong Chen

Dissertations & Theses (Open Access)

RNA sequencing data have been abundantly generated in biomedical research for biomarker discovery and pathway analysis. Such data at the exon-level are usually heavily tailed and correlated. Conventional statistical tests based on the mean or median difference for differential expression likely suffer from low power when the between-group difference occurs mostly in the upper or lower tail of the distribution of gene expression. We propose a tail-based test to make comparisons between groups in terms of a specific distribution area rather than a single location. The proposed test, which is derived from quantile regression, adjusts for covariates and accounts for …


Statistical Methods For Two Problems In Cancer Research: Analysis Of Rna-Seq Data From Archival Samples And Characterization Of Onset Of Multiple Primary Cancers, Jialu Li May 2017

Statistical Methods For Two Problems In Cancer Research: Analysis Of Rna-Seq Data From Archival Samples And Characterization Of Onset Of Multiple Primary Cancers, Jialu Li

Dissertations & Theses (Open Access)

My dissertation is focused on quantitative methodology development and application for two important topics in translational and clinical cancer research.

The first topic was motivated by the challenge of applying transcriptome sequencing (RNA-seq) to formalin-fixation and paraffin-embedding (FFPE) tumor samples for reliable diagnostic development. We designed a biospecimen study to directly compare gene expression results from different protocols to prepare libraries for RNA-seq from human breast cancer tissues, with randomization to fresh-frozen (FF) or FFPE conditions. To comprehensively evaluate the FFPE RNA-seq data quality for expression profiling, we developed multiple computational methods for assessment, such as the uniformity and continuity …


Detecting And Evaluating Therapy Induced Changes In Radiomics Features Measured From Non-Small Cell Lung Cancer To Predict Patient Outcomes, Xenia J. Fave May 2017

Detecting And Evaluating Therapy Induced Changes In Radiomics Features Measured From Non-Small Cell Lung Cancer To Predict Patient Outcomes, Xenia J. Fave

Dissertations & Theses (Open Access)

The purpose of this study was to investigate whether radiomics features measured from weekly 4-dimensional computed tomography (4DCT) images of non-small cell lung cancers (NSCLC) change during treatment and if those changes are prognostic for patient outcomes or dependent on treatment modality. Radiomics features are quantitative metrics designed to evaluate tumor heterogeneity from routine medical imaging. Features that are prognostic for patient outcome could be used to monitor tumor response and identify high-risk patients for adaptive treatment. This would be especially valuable for NSCLC due to the high prevalence and mortality of this disease.

A novel process was designed to …


Further Advances For The Sequential Multiple Assignment Randomized Trial (Smart), Tianjiao Dai Feb 2017

Further Advances For The Sequential Multiple Assignment Randomized Trial (Smart), Tianjiao Dai

Dissertations & Theses (Open Access)

ABSTRACT

FURTHER ADVANCES FOR THE SEQUENTIAL MULTIPLE ASSIGNMENT RANDOMIZED TRIAL (SMART)

Tianjiao Dai, M.S.

Advisory Professor: Sanjay Shete, Ph.D.

Sequential multiple assignment randomized trial (SMART) designs have been developed these years for studying adaptive interventions. In my Ph.D. study, I mainly investigate how to further improve SMART designs and optimize the interventions for each individual in the trial. My dissertation has focused on two topics of SMART designs.

1) Developing a novel SMART design that can reduce the cost and side effects associated with the interventions and proposing the corresponding analytic methods. I have developed a time-varying SMART design in …


Utilizing Computed Tomography Image Features To Advance Prediction Of Radiation Pneumonitis, Shane P. Krafft Aug 2016

Utilizing Computed Tomography Image Features To Advance Prediction Of Radiation Pneumonitis, Shane P. Krafft

Dissertations & Theses (Open Access)

Improving outcomes for non-small-cell lung cancer patients treated with radiation therapy (RT) requires optimizing the balance between local tumor control and risk of normal tissue toxicity. In approximately 20% of patients, severe acute symptomatic lung toxicity, termed radiation pneumonitis (RP), still occurs. Identifying the individuals at risk of RP prior to or early during treatment offers tremendous potential to improve RT by providing the physician with information to assist in making clinical decisions that enhance therapy. Our central goal for this work was to demonstrate the potential gain in predictive accuracy of normal tissue complication probability models for RP by …


Integration Of Multi-Platform High-Dimensional Omic Data, Xuebei An May 2016

Integration Of Multi-Platform High-Dimensional Omic Data, Xuebei An

Dissertations & Theses (Open Access)

The development of high-throughput biotechnologies have made data accessible from different platforms, including RNA sequencing, copy number variation, DNA methylation, protein lysate arrays, etc. The high-dimensional omic data derived from different technological platforms have been extensively used to facilitate comprehensive understanding of disease mechanisms and to determine personalized health treatments. Although vital to the progress of clinical research, the high dimensional multi-platform data impose new challenges for data analysis. Numerous studies have been proposed to integrate multi-platform omic data; however, few have efficiently and simultaneously addressed the problems that arise from high dimensionality and complex correlations.

In my dissertation, I …


Germline Mutation Detection In Next Generation Sequencing Data And Tp53 Mutation Carrier Probability Estimation For Li-Fraumeni Syndrome, Gang Peng Aug 2015

Germline Mutation Detection In Next Generation Sequencing Data And Tp53 Mutation Carrier Probability Estimation For Li-Fraumeni Syndrome, Gang Peng

Dissertations & Theses (Open Access)

Next generation sequencing technology has been widely used in genomic analysis, but its application has been compromised by the missing true variants, especially when these variants are rare. We proposed a family-based variant calling method, FamSeq, integrating Mendelian transmission information with de novo mutation and sequencing data to improve the variant calling accuracy. We investigated the factors impacting the improvement of family-based variant calling in simulation data and validated it in real sequencing data. In both simulation and real data, FamSeq works better than the single individual based method.

In FamSeq, we implemented four different methods for the Mendelian genetic …


Computational Modeling Of Rna-Small Molecule And Rna-Protein Interactions, Lu Chen Aug 2015

Computational Modeling Of Rna-Small Molecule And Rna-Protein Interactions, Lu Chen

Dissertations & Theses (Open Access)

The past decade has witnessed an era of RNA biology; despite the considerable discoveries nowadays, challenges still remain when one aims to screen RNA-interacting small molecule or RNA-interacting protein. These challenges imply an immediate need for cost-efficient while predictive computational tools capable of generating insightful hypotheses to discover novel RNA-interacting small molecule or RNA-interacting protein. Thus, we implemented novel computational models in this dissertation to predict RNA-ligand interactions (Chapter 1) and RNA-protein interactions (Chapter 2).

Targeting RNA has not garnered comparable interest as protein, and is restricted by lack of computational tools for structure-based drug design. To test the potential …


Genetics Of Obesity In Starr County, Texas Mexican Americans, Heather M. Highland May 2015

Genetics Of Obesity In Starr County, Texas Mexican Americans, Heather M. Highland

Dissertations & Theses (Open Access)

Currently, over two-thirds of Americans are classified as over-weight or obese. Obesity increases risk for many other diseases including type 2 diabetes, heart disease, stroke, and cancer, making obesity the largest public health problem in America and most other Westernized nations. Hispanics have a higher rate of both obesity and type 2 diabetes, making them a particularly interesting population in which to study obesity. For the last 33 years, the Starr County Health Studies has collected an array of phenotypes and biological samples from residents of Starr County, along Texas-Mexico border. This study includes 825 subjects who were not known …


Genetic Predictors Of Metabolic Side Effects Of Diuretic Therapy, Jorge L. Del Aguila Aug 2014

Genetic Predictors Of Metabolic Side Effects Of Diuretic Therapy, Jorge L. Del Aguila

Dissertations & Theses (Open Access)

Thiazide diuretics are a recommended first-line monotherapy for hypertension (i.e.SBP>140 mmHg or DBP>90 mmHg). Even so, diuretics are associated with adverse metabolic side effects, such as hyperlipidemia, hyperglycemia and hypokalemia which increase the risk of developing type II diabetes. This thesis used three analytical strategies to identify and quantify genetic factors that contribute to the development of adverse metabolic effects due to thiazide diuretic treatment. I performed a genome-wide association study (GWAS) and meta-analysis of the change in fasting plasma glucose and triglycerides in response to HCTZ from two different clinical trials: the Pharmacogenomic Evaluation of Antihypertensive Responses …


The Association Between The Il-1 Pathway, Isaac C. Wun May 2014

The Association Between The Il-1 Pathway, Isaac C. Wun

Dissertations & Theses (Open Access)

Cutaneous malignant melanoma (CMM) is a potentially lethal malignancy that warrants attention and further research, as it is known to that there is an increasing rate of incidence in theUnited States, and it is also known that exposure to UV light is its most crucial risk factor, and family history of melanoma is also an important risk factor. Melanoma is an aggressive and lethal cancer in humans. There are an estimated new 132,000 melanoma cases annually worldwide, and the trend has doubled in the past 20 years. However, attempts to treat melanoma have encountered considerable resistance and remained ineffective. The …


Renal Cryoablation: Investigation Of Periprocedural Visualization Tools And Treatment Response Quantification, Katherine L. Dextraze Aug 2013

Renal Cryoablation: Investigation Of Periprocedural Visualization Tools And Treatment Response Quantification, Katherine L. Dextraze

Dissertations & Theses (Open Access)

Cryoablation for small renal tumors has demonstrated sufficient clinical efficacy over the past decade as a non-surgical nephron-sparing approach for treating renal masses for patients who are not surgical candidates. Minimally invasive percutaneous cryoablations have been performed with image guidance from CT, ultrasound, and MRI. During the MRI-guided cryoablation procedure, the interventional radiologist visually compares the iceball size on monitoring images with respect to the original tumor on separate planning images. The comparisons made during the monitoring step are time consuming, inefficient and sometimes lack the precision needed for decision making, requiring the radiologist to make further changes later in …


Bayesian Statistical Methods In Gene-Environment And Gene-Gene Interaction Studies, Changlu Liu Aug 2013

Bayesian Statistical Methods In Gene-Environment And Gene-Gene Interaction Studies, Changlu Liu

Dissertations & Theses (Open Access)

Complex diseases such as cancer result from multiple genetic changes and environmental exposures. Due to the rapid development of genotyping and sequencing technologies, we are now able to more accurately assess causal effects of many genetic and environmental factors. Genome-wide association studies have been able to localize many causal genetic variants predisposing to certain diseases. However, these studies only explain a small portion of variations in the heritability of diseases. More advanced statistical models are urgently needed to identify and characterize some additional genetic and environmental factors and their interactions, which will enable us to better understand the causes of …


Integrative Biomarker Identification And Classification Using High Throughput Assays, Pan Tong May 2013

Integrative Biomarker Identification And Classification Using High Throughput Assays, Pan Tong

Dissertations & Theses (Open Access)

It is well accepted that tumorigenesis is a multi-step procedure involving aberrant functioning of genes regulating cell proliferation, differentiation, apoptosis, genome stability, angiogenesis and motility. To obtain a full understanding of tumorigenesis, it is necessary to collect information on all aspects of cell activity. Recent advances in high throughput technologies allow biologists to generate massive amounts of data, more than might have been imagined decades ago. These advances have made it possible to launch comprehensive projects such as (TCGA) and (ICGC) which systematically characterize the molecular fingerprints of cancer cells using gene expression, methylation, copy number, microRNA and SNP microarrays …


Development Of Novel Methods To Minimize The Impact Of Sequencing Errors In The Next-Generation Sequencing Data Analysis, Xiaofeng Zheng May 2013

Development Of Novel Methods To Minimize The Impact Of Sequencing Errors In The Next-Generation Sequencing Data Analysis, Xiaofeng Zheng

Dissertations & Theses (Open Access)

Next-generation sequencing (NGS) technology has become a prominent tool in biological and biomedical research. However, NGS data analysis, such as de novo assembly, mapping and variants detection is far from maturity, and the high sequencing error-rate is one of the major problems. .

To minimize the impact of sequencing errors, we developed a highly robust and efficient method, MTM, to correct the errors in NGS reads. We demonstrated the effectiveness of MTM on both single-cell data with highly non-uniform coverage and normal data with uniformly high coverage, reflecting that MTM’s performance does not rely on the coverage of the sequencing …


Bayesian Adaptive Designs For Early Phase Clinical Trials, Chunyan Cai Aug 2012

Bayesian Adaptive Designs For Early Phase Clinical Trials, Chunyan Cai

Dissertations & Theses (Open Access)

My dissertation focuses mainly on Bayesian adaptive designs for phase I and phase II clinical trials. It includes three specific topics: (1) proposing a novel two-dimensional dose-finding algorithm for biological agents, (2) developing Bayesian adaptive screening designs to provide more efficient and ethical clinical trials, and (3) incorporating missing late-onset responses to make an early stopping decision.

Treating patients with novel biological agents is becoming a leading trend in oncology. Unlike cytotoxic agents, for which toxicity and efficacy monotonically increase with dose, biological agents may exhibit non-monotonic patterns in their dose-response relationships. Using a trial with two biological agents as …


The Role Of Cell Sterilization In Population Based Studies Of Radiogenic Second Cancers Following Radiation Therapy, Annelise Giebeler Dec 2011

The Role Of Cell Sterilization In Population Based Studies Of Radiogenic Second Cancers Following Radiation Therapy, Annelise Giebeler

Dissertations & Theses (Open Access)

Advances in radiotherapy have generated increased interest in comparative studies of treatment techniques and their effectiveness. In this respect, pediatric patients are of specific interest because of their sensitivity to radiation induced second cancers. However, due to the rarity of childhood cancers and the long latency of second cancers, large sample sizes are unavailable for the epidemiological study of contemporary radiotherapy treatments. Additionally, when specific treatments are considered, such as proton therapy, sample sizes are further reduced due to the rareness of such treatments. We propose a method to improve statistical power in micro clinical trials. Specifically, we use a …


Development Of A Bayesian Joint Logistic Model To Better Study The Association Between Haplotypes And Disease, Anthony M. D'Amelio Jr Dec 2011

Development Of A Bayesian Joint Logistic Model To Better Study The Association Between Haplotypes And Disease, Anthony M. D'Amelio Jr

Dissertations & Theses (Open Access)

In 2011, there will be an estimated 1,596,670 new cancer cases and 571,950 cancer-related deaths in the US. With the ever-increasing applications of cancer genetics in epidemiology, there is great potential to identify genetic risk factors that would help identify individuals with increased genetic susceptibility to cancer, which could be used to develop interventions or targeted therapies that could hopefully reduce cancer risk and mortality.

In this dissertation, I propose to develop a new statistical method to evaluate the role of haplotypes in cancer susceptibility and development. This model will be flexible enough to handle not only haplotypes of any …


Bayesian Phase I Dose Finding In Cancer Trials, Lin Yang Aug 2011

Bayesian Phase I Dose Finding In Cancer Trials, Lin Yang

Dissertations & Theses (Open Access)

This dissertation explores phase I dose-finding designs in cancer trials from three perspectives: the alternative Bayesian dose-escalation rules, a design based on a time-to-dose-limiting toxicity (DLT) model, and a design based on a discrete-time multi-state (DTMS) model.

We list alternative Bayesian dose-escalation rules and perform a simulation study for the intra-rule and inter-rule comparisons based on two statistical models to identify the most appropriate rule under certain scenarios. We provide evidence that all the Bayesian rules outperform the traditional ``3+3'' design in the allocation of patients and selection of the maximum tolerated dose.

The design based on a time-to-DLT model …