Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 25 of 25

Full-Text Articles in Physical Sciences and Mathematics

High-Dimensional Mediation Analysis Of Multi-Omics Data, Sunyi Chi May 2024

High-Dimensional Mediation Analysis Of Multi-Omics Data, Sunyi Chi

Dissertations & Theses (Open Access)

Environmental exposures such as cigarette smoking influence health outcomes through intermediate molecular phenotypes, such as the methylome, transcriptome, and metabolome. Mediation analysis is a useful tool for investigating the role of potentially high-dimensional intermediate phenotypes in the relationship between environmental exposures and health outcomes. Rapid development of high-throughput technologies have made mediation analysis of multi-omics data critical to gain groundbreaking insights into the biological mechanisms underlying the disease etiology. This dissertation aims to develop mediation analysis methods that utilize the enormous amount of multi-omics data in assessing mechanisms of disease etiology. It contains three projects where I propose advanced mediation …


Bayesian Adaptive Clinical Trial Design, Mengyi Lu Dec 2022

Bayesian Adaptive Clinical Trial Design, Mengyi Lu

Dissertations & Theses (Open Access)

The landscape of drug development in oncology has changed from conventional chemotherapies to molecular targeted therapies and immunotherapies, which provide innovative therapeutic modalities for treating cancers. These novel therapeutic agents work through mechanisms that fundamentally differ from standard chemotherapeutic agents, making the conventional trial design paradigm inefficient and dysfunctional. Specifically, the focus of dose-finding trials has shifted from finding the maximum tolerated dose (MTD) to the optimal biological dose (OBD), defined as the dose that optimizes the risk–benefit tradeoff. How to accurately identify the OBD and its dosing schedule is of great importance to maximize efficacy and safety of targeted …


Bayesian Adaptive Designs For Proof-Of-Concept Trials And Platform Trials, Yujie Zhao Aug 2022

Bayesian Adaptive Designs For Proof-Of-Concept Trials And Platform Trials, Yujie Zhao

Dissertations & Theses (Open Access)

With the revolutionary achievement in molecular targeted therapies and cancer immunotherapies, the traditional drug development paradigm in phase II trials becomes increasingly inefficient due to its slow progress, high cost, and high failure rate. Fitting one standard strategy to all different trials also harms its reliability in decision-making because it doesn’t fully use all available resources and information in each trial. It’s crucial to develop novel phase II trial designs to accomplish different objectives for different types of trials. This research mainly focuses on Bayesian adaptive designs for phase II trials. Three types of trials are discussed in which traditional …


Identification And Characterization Of De Novo Germline Tp53 Mutation Carriers In Families With Li-Fraumeni Syndrome, Carlos C. Vera Recio Aug 2021

Identification And Characterization Of De Novo Germline Tp53 Mutation Carriers In Families With Li-Fraumeni Syndrome, Carlos C. Vera Recio

Dissertations & Theses (Open Access)

Li-Fraumeni syndrome (LFS) is an inherited cancer syndrome caused by a deleterious mutation in TP53. An estimated 48% of LFS patients present due to a de novo mutation (DNM) in TP53. The knowledge of DNM status, DNM or familial mutation (FM), of an LFS patient requires genetic testing of both parents which is often inaccessible, making de novo LFS patients difficult to study. Famdenovo.TP53 is a Mendelian Risk prediction model used to predict DNM status of TP53 mutation carriers based on the cancer-family history and several input genetic parameters, including disease-gene penetrance. The good predictive performance of Famdenovo.TP53 was demonstrated …


Mixture Model Approaches To Integrative Analysis Of Multi-Omics Data And Spatially Correlated Genomic Data, Ziqiao Wang May 2021

Mixture Model Approaches To Integrative Analysis Of Multi-Omics Data And Spatially Correlated Genomic Data, Ziqiao Wang

Dissertations & Theses (Open Access)

Integrative genomic data analysis is a powerful tool to study the complex biological processes behind a disease. Statistical methods can model the interrelationships of the involved gene activities through jointly analyzing multiple types of genomic data from different platforms (vertical integration), or improve the power of a study through aggregating the same type of genomic data across studies (horizontal integration). In this dissertation, we propose statistical methods and strategies for integrative multi-omics data in association analysis of disease phenotypes, with an emphasis on cancer applications.

We develop a new strategy based on horizontal integration by leveraging publicly available datasets into …


A Signature Enrichment Design With Bayesian Adaptive Randomization For Cancer Clinical Trials, Fang Xia Dec 2019

A Signature Enrichment Design With Bayesian Adaptive Randomization For Cancer Clinical Trials, Fang Xia

Dissertations & Theses (Open Access)

Clinical trials in the era of precision medicine demand more flexible and efficient trial designs. Adaptive clinical trial designs allow pre-specified modifications of an on-going clinical trial and could shorten the trial duration. We reviewed five common types of adaptive clinical trials based on adaptation methods. In particular, outcome-randomization becomes more popular as it can assign more patients to the promising treatments based on the accumulated trial data. This data-driven allocation allows more patients to benefit from the trial, which is especially important for cancer patients. We compared different Bayesian outcome-adaptive randomization methods and discussed them from both methodological and …


A Tail-Based Test For Differential Expression Analysis And Pathway Analysis In Rna-Sequencing Data, Jiong Chen Aug 2017

A Tail-Based Test For Differential Expression Analysis And Pathway Analysis In Rna-Sequencing Data, Jiong Chen

Dissertations & Theses (Open Access)

RNA sequencing data have been abundantly generated in biomedical research for biomarker discovery and pathway analysis. Such data at the exon-level are usually heavily tailed and correlated. Conventional statistical tests based on the mean or median difference for differential expression likely suffer from low power when the between-group difference occurs mostly in the upper or lower tail of the distribution of gene expression. We propose a tail-based test to make comparisons between groups in terms of a specific distribution area rather than a single location. The proposed test, which is derived from quantile regression, adjusts for covariates and accounts for …


Novel Bayesian Adaptive Clinical Trial Designs In Early Phases, Haitao Pan Aug 2017

Novel Bayesian Adaptive Clinical Trial Designs In Early Phases, Haitao Pan

Dissertations & Theses (Open Access)

Early phase, or phase I and phase II, trials are the first step in testing new medicines that have been developed in the lab. The main goal of phase I clinical trials is to establish the recommended dose of new drugs for phase II trials. For the cytotoxic drugs, the goal is to find maximum tolerated dose (MTD). The guiding principle for dose escalation in phase I trials is to avoid exposing too many patients to subtherapeutic doses while preserving safety and maintaining rapid accrual. Therefore, dose escalation methods, especially Bayesian designs, are recommended to be used in phase I …


Statistical Methods For Two Problems In Cancer Research: Analysis Of Rna-Seq Data From Archival Samples And Characterization Of Onset Of Multiple Primary Cancers, Jialu Li May 2017

Statistical Methods For Two Problems In Cancer Research: Analysis Of Rna-Seq Data From Archival Samples And Characterization Of Onset Of Multiple Primary Cancers, Jialu Li

Dissertations & Theses (Open Access)

My dissertation is focused on quantitative methodology development and application for two important topics in translational and clinical cancer research.

The first topic was motivated by the challenge of applying transcriptome sequencing (RNA-seq) to formalin-fixation and paraffin-embedding (FFPE) tumor samples for reliable diagnostic development. We designed a biospecimen study to directly compare gene expression results from different protocols to prepare libraries for RNA-seq from human breast cancer tissues, with randomization to fresh-frozen (FF) or FFPE conditions. To comprehensively evaluate the FFPE RNA-seq data quality for expression profiling, we developed multiple computational methods for assessment, such as the uniformity and continuity …


Further Advances For The Sequential Multiple Assignment Randomized Trial (Smart), Tianjiao Dai Feb 2017

Further Advances For The Sequential Multiple Assignment Randomized Trial (Smart), Tianjiao Dai

Dissertations & Theses (Open Access)

ABSTRACT

FURTHER ADVANCES FOR THE SEQUENTIAL MULTIPLE ASSIGNMENT RANDOMIZED TRIAL (SMART)

Tianjiao Dai, M.S.

Advisory Professor: Sanjay Shete, Ph.D.

Sequential multiple assignment randomized trial (SMART) designs have been developed these years for studying adaptive interventions. In my Ph.D. study, I mainly investigate how to further improve SMART designs and optimize the interventions for each individual in the trial. My dissertation has focused on two topics of SMART designs.

1) Developing a novel SMART design that can reduce the cost and side effects associated with the interventions and proposing the corresponding analytic methods. I have developed a time-varying SMART design in …


Integration Of Multi-Platform High-Dimensional Omic Data, Xuebei An May 2016

Integration Of Multi-Platform High-Dimensional Omic Data, Xuebei An

Dissertations & Theses (Open Access)

The development of high-throughput biotechnologies have made data accessible from different platforms, including RNA sequencing, copy number variation, DNA methylation, protein lysate arrays, etc. The high-dimensional omic data derived from different technological platforms have been extensively used to facilitate comprehensive understanding of disease mechanisms and to determine personalized health treatments. Although vital to the progress of clinical research, the high dimensional multi-platform data impose new challenges for data analysis. Numerous studies have been proposed to integrate multi-platform omic data; however, few have efficiently and simultaneously addressed the problems that arise from high dimensionality and complex correlations.

In my dissertation, I …


Germline Mutation Detection In Next Generation Sequencing Data And Tp53 Mutation Carrier Probability Estimation For Li-Fraumeni Syndrome, Gang Peng Aug 2015

Germline Mutation Detection In Next Generation Sequencing Data And Tp53 Mutation Carrier Probability Estimation For Li-Fraumeni Syndrome, Gang Peng

Dissertations & Theses (Open Access)

Next generation sequencing technology has been widely used in genomic analysis, but its application has been compromised by the missing true variants, especially when these variants are rare. We proposed a family-based variant calling method, FamSeq, integrating Mendelian transmission information with de novo mutation and sequencing data to improve the variant calling accuracy. We investigated the factors impacting the improvement of family-based variant calling in simulation data and validated it in real sequencing data. In both simulation and real data, FamSeq works better than the single individual based method.

In FamSeq, we implemented four different methods for the Mendelian genetic …


Genetics Of Obesity In Starr County, Texas Mexican Americans, Heather M. Highland May 2015

Genetics Of Obesity In Starr County, Texas Mexican Americans, Heather M. Highland

Dissertations & Theses (Open Access)

Currently, over two-thirds of Americans are classified as over-weight or obese. Obesity increases risk for many other diseases including type 2 diabetes, heart disease, stroke, and cancer, making obesity the largest public health problem in America and most other Westernized nations. Hispanics have a higher rate of both obesity and type 2 diabetes, making them a particularly interesting population in which to study obesity. For the last 33 years, the Starr County Health Studies has collected an array of phenotypes and biological samples from residents of Starr County, along Texas-Mexico border. This study includes 825 subjects who were not known …


Genetic Predictors Of Metabolic Side Effects Of Diuretic Therapy, Jorge L. Del Aguila Aug 2014

Genetic Predictors Of Metabolic Side Effects Of Diuretic Therapy, Jorge L. Del Aguila

Dissertations & Theses (Open Access)

Thiazide diuretics are a recommended first-line monotherapy for hypertension (i.e.SBP>140 mmHg or DBP>90 mmHg). Even so, diuretics are associated with adverse metabolic side effects, such as hyperlipidemia, hyperglycemia and hypokalemia which increase the risk of developing type II diabetes. This thesis used three analytical strategies to identify and quantify genetic factors that contribute to the development of adverse metabolic effects due to thiazide diuretic treatment. I performed a genome-wide association study (GWAS) and meta-analysis of the change in fasting plasma glucose and triglycerides in response to HCTZ from two different clinical trials: the Pharmacogenomic Evaluation of Antihypertensive Responses …


The Association Between The Il-1 Pathway, Isaac C. Wun May 2014

The Association Between The Il-1 Pathway, Isaac C. Wun

Dissertations & Theses (Open Access)

Cutaneous malignant melanoma (CMM) is a potentially lethal malignancy that warrants attention and further research, as it is known to that there is an increasing rate of incidence in theUnited States, and it is also known that exposure to UV light is its most crucial risk factor, and family history of melanoma is also an important risk factor. Melanoma is an aggressive and lethal cancer in humans. There are an estimated new 132,000 melanoma cases annually worldwide, and the trend has doubled in the past 20 years. However, attempts to treat melanoma have encountered considerable resistance and remained ineffective. The …


Bayesian Statistical Methods In Gene-Environment And Gene-Gene Interaction Studies, Changlu Liu Aug 2013

Bayesian Statistical Methods In Gene-Environment And Gene-Gene Interaction Studies, Changlu Liu

Dissertations & Theses (Open Access)

Complex diseases such as cancer result from multiple genetic changes and environmental exposures. Due to the rapid development of genotyping and sequencing technologies, we are now able to more accurately assess causal effects of many genetic and environmental factors. Genome-wide association studies have been able to localize many causal genetic variants predisposing to certain diseases. However, these studies only explain a small portion of variations in the heritability of diseases. More advanced statistical models are urgently needed to identify and characterize some additional genetic and environmental factors and their interactions, which will enable us to better understand the causes of …


Integrative Biomarker Identification And Classification Using High Throughput Assays, Pan Tong May 2013

Integrative Biomarker Identification And Classification Using High Throughput Assays, Pan Tong

Dissertations & Theses (Open Access)

It is well accepted that tumorigenesis is a multi-step procedure involving aberrant functioning of genes regulating cell proliferation, differentiation, apoptosis, genome stability, angiogenesis and motility. To obtain a full understanding of tumorigenesis, it is necessary to collect information on all aspects of cell activity. Recent advances in high throughput technologies allow biologists to generate massive amounts of data, more than might have been imagined decades ago. These advances have made it possible to launch comprehensive projects such as (TCGA) and (ICGC) which systematically characterize the molecular fingerprints of cancer cells using gene expression, methylation, copy number, microRNA and SNP microarrays …


Development Of Novel Methods To Minimize The Impact Of Sequencing Errors In The Next-Generation Sequencing Data Analysis, Xiaofeng Zheng May 2013

Development Of Novel Methods To Minimize The Impact Of Sequencing Errors In The Next-Generation Sequencing Data Analysis, Xiaofeng Zheng

Dissertations & Theses (Open Access)

Next-generation sequencing (NGS) technology has become a prominent tool in biological and biomedical research. However, NGS data analysis, such as de novo assembly, mapping and variants detection is far from maturity, and the high sequencing error-rate is one of the major problems. .

To minimize the impact of sequencing errors, we developed a highly robust and efficient method, MTM, to correct the errors in NGS reads. We demonstrated the effectiveness of MTM on both single-cell data with highly non-uniform coverage and normal data with uniformly high coverage, reflecting that MTM’s performance does not rely on the coverage of the sequencing …


Bayesian Adaptive Designs For Early Phase Clinical Trials, Chunyan Cai Aug 2012

Bayesian Adaptive Designs For Early Phase Clinical Trials, Chunyan Cai

Dissertations & Theses (Open Access)

My dissertation focuses mainly on Bayesian adaptive designs for phase I and phase II clinical trials. It includes three specific topics: (1) proposing a novel two-dimensional dose-finding algorithm for biological agents, (2) developing Bayesian adaptive screening designs to provide more efficient and ethical clinical trials, and (3) incorporating missing late-onset responses to make an early stopping decision.

Treating patients with novel biological agents is becoming a leading trend in oncology. Unlike cytotoxic agents, for which toxicity and efficacy monotonically increase with dose, biological agents may exhibit non-monotonic patterns in their dose-response relationships. Using a trial with two biological agents as …


Development Of A Bayesian Joint Logistic Model To Better Study The Association Between Haplotypes And Disease, Anthony M. D'Amelio Jr Dec 2011

Development Of A Bayesian Joint Logistic Model To Better Study The Association Between Haplotypes And Disease, Anthony M. D'Amelio Jr

Dissertations & Theses (Open Access)

In 2011, there will be an estimated 1,596,670 new cancer cases and 571,950 cancer-related deaths in the US. With the ever-increasing applications of cancer genetics in epidemiology, there is great potential to identify genetic risk factors that would help identify individuals with increased genetic susceptibility to cancer, which could be used to develop interventions or targeted therapies that could hopefully reduce cancer risk and mortality.

In this dissertation, I propose to develop a new statistical method to evaluate the role of haplotypes in cancer susceptibility and development. This model will be flexible enough to handle not only haplotypes of any …


Bayesian Phase I Dose Finding In Cancer Trials, Lin Yang Aug 2011

Bayesian Phase I Dose Finding In Cancer Trials, Lin Yang

Dissertations & Theses (Open Access)

This dissertation explores phase I dose-finding designs in cancer trials from three perspectives: the alternative Bayesian dose-escalation rules, a design based on a time-to-dose-limiting toxicity (DLT) model, and a design based on a discrete-time multi-state (DTMS) model.

We list alternative Bayesian dose-escalation rules and perform a simulation study for the intra-rule and inter-rule comparisons based on two statistical models to identify the most appropriate rule under certain scenarios. We provide evidence that all the Bayesian rules outperform the traditional ``3+3'' design in the allocation of patients and selection of the maximum tolerated dose.

The design based on a time-to-DLT model …


Gene By Bmi Interactions Influencing C-Reactive Protein Levels In European-Americans, Sarah Tudor Aug 2011

Gene By Bmi Interactions Influencing C-Reactive Protein Levels In European-Americans, Sarah Tudor

Dissertations & Theses (Open Access)

C-Reactive Protein (CRP) is a biomarker indicating tissue damage, inflammation, and infection. High-sensitivity CRP (hsCRP) is an emerging biomarker often used to estimate an individual’s risk for future coronary heart disease (CHD). hsCRP levels falling below 1.00 mg/l indicate a low risk for developing CHD, levels ranging between 1.00 mg/l and 3.00 mg/l indicate an elevated risk, and levels exceeding 3.00 mg/l indicate high risk. Multiple Genome-Wide Association Studies (GWAS) have identified a number of genetic polymorphisms which influence CRP levels. SNPs implicated in such studies have been found in or near genes of interest including: CRP, APOE, APOC, IL-6, …


Prediction Of Dna Methylation Based On Genomic Architecture And Applications Of Positional Weight Matrices, Juan Gallegos Aug 2011

Prediction Of Dna Methylation Based On Genomic Architecture And Applications Of Positional Weight Matrices, Juan Gallegos

Dissertations & Theses (Open Access)

Gene silencing due to epigenetic mechanisms shows evidence of significant contributions to cancer development. We hypothesis that the genetic architecture based on retrotransposon elements surrounding the transcription start site, plays an important role in the suppression and promotion of DNA methylation. In our investigation we found a high rate of SINE and LINEs retrotransposon elements near the transcription start site of unmethylated genes when compared to methylated genes. The presence of these elements were positively associated with promoter methylation, contrary to logical expectations, due to the malicious effects of retrotransposon elements which insert themselves randomly into the genome causing possible …


A Bayesian Approach To Dose-Response Assessment And Drug-Drug Interaction Analysis: Application To In Vitro Studies, Violeta G. Hennessey Aug 2010

A Bayesian Approach To Dose-Response Assessment And Drug-Drug Interaction Analysis: Application To In Vitro Studies, Violeta G. Hennessey

Dissertations & Theses (Open Access)

The considerable search for synergistic agents in cancer research is motivated by the therapeutic benefits achieved by combining anti-cancer agents. Synergistic agents make it possible to reduce dosage while maintaining or enhancing a desired effect. Other favorable outcomes of synergistic agents include reduction in toxicity and minimizing or delaying drug resistance. Dose-response assessment and drug-drug interaction analysis play an important part in the drug discovery process, however analysis are often poorly done. This dissertation is an effort to notably improve dose-response assessment and drug-drug interaction analysis.

The most commonly used method in published analysis is the Median-Effect Principle/Combination Index method …


Survival Prediction For Brain Tumor Patients Using Gene Expression Data, Vinicius Bonato May 2010

Survival Prediction For Brain Tumor Patients Using Gene Expression Data, Vinicius Bonato

Dissertations & Theses (Open Access)

Brain tumor is one of the most aggressive types of cancer in humans, with an estimated median survival time of 12 months and only 4% of the patients surviving more than 5 years after disease diagnosis. Until recently, brain tumor prognosis has been based only on clinical information such as tumor grade and patient age, but there are reports indicating that molecular profiling of gliomas can reveal subgroups of patients with distinct survival rates. We hypothesize that coupling molecular profiling of brain tumors with clinical information might improve predictions of patient survival time and, consequently, better guide future treatment decisions. …