Open Access. Powered by Scholars. Published by Universities.®

Biostatistics Commons

2020

Articles 31 - 60 of 128

Full-Text Articles in Biostatistics

Compressed DNA Representation For Efficient AMR Classification, John Partee, Robert Hazell, Anjli Solsi, John Santerre Aug 2020

SMU Data Science Review

In this paper, we explore a representation methodology for the compression of DNA isolates. Using lossless string compression via tokenization of frequently repeated segments of DNA, we reduce the length of the isolates to be counted as k-mers for classification. With this new representation, we apply a previously established feature sampling method to dramatically reduce the feature space. In understanding the genetic diversity, we also look at conserving biological function across these spaces. Using a random forest model we were able to predict the resistance or susceptibility of bacteria with 85-90% accuracy, with a 30-50% reduction in overall isolate length, …
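The two ideas named in this abstract, replacing a repeated segment with a token and then counting k-mers, can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the toy sequence, and the single-character token `Z` are assumptions made here for illustration only.

```python
from collections import Counter


def count_kmers(sequence: str, k: int) -> Counter:
    """Count every overlapping substring of length k in the sequence."""
    if k <= 0:
        raise ValueError("k must be positive")
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


def tokenize_repeat(sequence: str, repeat: str, token: str) -> str:
    """Replace each non-overlapping occurrence of `repeat` with `token`.

    Assumption: `token` does not occur in the original sequence, so the
    substitution can be undone by replacing the token with the repeat.
    """
    if token in sequence:
        raise ValueError("token must not already occur in the sequence")
    return sequence.replace(repeat, token)


# Hypothetical toy isolate; real isolates are far longer.
toy = "ACGTACGTTTACGT"
compressed = tokenize_repeat(toy, "ACGT", "Z")

# The shorter string yields fewer k-mers to count as features.
features = count_kmers(compressed, 2)
```

In this toy case the substitution is reversible because `Z` is outside the DNA alphabet, which is what makes the compression lossless.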


A Differential Geometry-Based Machine Learning Algorithm For The Brain Age Problem, Justin Asher, Khoa Tan Dang, Maxwell Masters Aug 2020

The Journal of Purdue Undergraduate Research

No abstract provided.


Robust Inference On Effects Attributable To Mediators: A Controlled-Direct-Effect-Based Approach For Causal Effect Decomposition With Multiple Mediators, An-Shun Tai, Yi-Juan Du, Sheng-Hsuan Lin Aug 2020

Harvard University Biostatistics Working Paper Series

Effect decomposition is a critical technique for mechanism investigation in settings with multiple causally ordered mediators. Causal mediation analysis is a standard method for effect decomposition, but the assumptions required for the identification process are extremely strong. By extending the framework of controlled direct effects, this study proposes the effect attributable to mediators (EAM) as a novel measure for effect decomposition. For policy making, EAM represents how much an effect can be eliminated by setting mediators to certain values. From the perspective of mechanism investigation, EAM contains information about how much a particular mediator or set of mediators is involved …


Interplay Of tRNA-Derived Fragments And T Cell Activation In Breast Cancer Patient Survival, Nayang Shan, Ningshan Li, Qile Dai, Lin Hou, Xiting Yan, Amei Amei, Lingeng Lu, Zuoheng Wang Aug 2020

Mathematical Sciences Faculty Research

Effector CD8+ T cell activation and its cytotoxic function are positively correlated with improved survival in breast cancer. tRNA-derived fragments (tRFs) have recently been found to be involved in gene regulation in cancer progression. However, it is unclear how interactions between expression of tRFs and T cell activation affect breast cancer patient survival. We used Kaplan–Meier survival and multivariate Cox regression models to evaluate the effect of interactions between expression of tRFs and T cell activation on survival in 1081 breast cancer patients. Spearman correlation analysis and weighted gene co-expression network analysis were conducted to identify genes and pathways that …


The Influence Of Environmental Variables On The Height Growth Of Loblolly Pine (Pinus taeda) In The Western Gulf, Osakpamwan Edo-Iyasere Aug 2020

Electronic Theses and Dissertations

Understanding the effects of environmental factors on stand growth is important in optimizing forest management plans. This study investigated the effects of soil and climate factors on the height growth (site index) of loblolly pine (Pinus taeda L.) using data collected from permanent plots established in intensively managed plantations across East Texas and Western Louisiana. The Chapman-Richards model was selected as the base model to describe the height-age relationships, and important soil and climate variables were incorporated into the models as model parameter coefficient adjustors. Our results showed that the most important factors for predicting site index were nitrogen …


Classification-Based Method For Estimating Dynamic Treatment Regimes, Junwei Shen Aug 2020

Electronic Thesis and Dissertation Repository

Dynamic treatment regimes are sequential decision rules dictating how to individualize treatments to patients based on evolving treatments and covariate history. In this thesis, we investigate two methods of estimating dynamic treatment regimes. The first method extends outcome weighted learning from two treatments to multiple treatments and allows for negative treatment outcomes. We show that under two different sets of assumptions, Fisher consistency can be maintained. The second method estimates treatment rules by a neural classification tree. A weighted squared loss function is defined to approximate the indicator function to maintain the smoothness. A method of tree reconstruction and pruning is …


Risk Of New Bloodstream Infections And Mortality Among People Who Inject Drugs With Infective Endocarditis., Charlie Tan, Esfandiar Shojaei, Joshua C. Wiener, Meera Shah, Sharon Koivu, Michael Silverman Aug 2020

Epidemiology and Biostatistics Publications

IMPORTANCE: People who inject drugs (PWID) who are being treated for infective endocarditis remain at risk of new bloodstream infections (BSIs) due to ongoing intravenous drug use (IVDU).

OBJECTIVES: To characterize new BSIs in PWID receiving treatment for infective endocarditis, to determine the clinical factors associated with their development, and to determine whether new BSIs and treatment setting are associated with mortality.

DESIGN, SETTING, AND PARTICIPANTS: This retrospective cohort study was performed at 3 tertiary care hospitals in London, Ontario, Canada, from April 1, 2007, to March 31, 2018. Participants included a consecutive sample of all PWID 18 years or …


Machine-Learning-Based Prediction Of Sepsis Events From Vertical Clinical Trial Data: A Naïve Approach, Tyler Michael Gaddis Aug 2020

Theses and Dissertations

Sepsis is a potentially life-threatening condition characterized by a dysregulated, disproportionate immune response to infection by which the afflicted body attacks its own tissues, sometimes to the point of organ failure, and in the worst cases, death. According to the Centers for Disease Control and Prevention (CDC), sepsis is reported to kill upwards of 270,000 Americans annually, though this figure may be greater given certain ambiguities in the currently accepted diagnostic framework of the disease.

This study attempted to first establish an understanding of past definitions of sepsis, and to then recommend use of machine learning as integral in an …


A Novel Correction For The Adjusted Box-Pierce Test — New Risk Factors For Emergency Department Return Visits Within 72 Hours For Children With Respiratory Conditions — General Pediatric Model For Understanding And Predicting Prolonged Length Of Stay, Sidy Danioko Aug 2020

Computational and Data Sciences (PhD) Dissertations

This thesis represents the results of three research projects that underline the breadth and depth of my interests.

Firstly, I devoted some effort to the well-known Box-Pierce goodness-of-fit tests for time series models, which have been an important research topic over the last few decades. All previously proposed tests are focused on changes of the test statistics. Instead, I adopted a different approach that takes the best performing test and modifies the rejection region. Thus, I developed a semiparametric correction of the Adjusted Box-Pierce test that attains the best type I error rates for all sample sizes and lags and outperforms …


Activation Of TRPA1 Nociceptor Promotes Systemic Adult Mammalian Skin Regeneration, Jenny J. Wei, Hali S. Kim, Casey A. Spencer, Donna Brennan-Crispi, Ying Zheng, Nicolette M. Johnson, Misha Rosenbach, Christopher Miller, Denis H. Y. Leung, George Cotsarelis, Thomas H. Leung Aug 2020

Research Collection School Of Economics

Adult mammalian wounds, with rare exception, heal with fibrotic scars that severely disrupt tissue architecture and function. Regenerative medicine seeks methods to avoid scar formation and restore the original tissue structures. We show in three adult mouse models that pharmacologic activation of the nociceptor TRPA1 on cutaneous sensory neurons reduces scar formation and can also promote tissue regeneration. Local activation of TRPA1 induces tissue regeneration on distant untreated areas of injury, demonstrating a systemic effect. Activated TRPA1 stimulates local production of interleukin-23 (IL-23) by dermal dendritic cells, leading to activation of circulating dermal IL-17–producing γδ T cells. Genetic ablation of …


Severe Acute Respiratory Syndrome Coronavirus 2 Transmission Potential, Iran, 2020, Kamalich Muniz-Rodriguez, Isaac Fung, Shayesteh R. Ferdosi, Sylvia Ofori, Yiseul Lee, Amna Tariq, Gerardo Chowell Aug 2020

Department of Biostatistics, Epidemiology, and Environmental Health Sciences Faculty Publications

To determine the transmission potential of severe acute respiratory syndrome coronavirus 2 in Iran in 2020, we estimated the reproduction number as 4.4 (95% CI 3.9–4.9) by using a generalized growth model and 3.5 (95% CI 1.3–8.1) by using epidemic doubling time. The reproduction number decreased to 1.55 after social distancing interventions were implemented.
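As a rough illustration of how an epidemic doubling time relates to a reproduction number, the sketch below uses two standard textbook relationships between the exponential growth rate r and R. The abstract does not say which formula or generation interval the authors used, so the function names and the choice of formulas are assumptions for illustration, not a reconstruction of their estimate.

```python
import math


def growth_rate_from_doubling_time(doubling_time_days: float) -> float:
    """Exponential growth rate r (per day) implied by a doubling time."""
    if doubling_time_days <= 0:
        raise ValueError("doubling time must be positive")
    return math.log(2) / doubling_time_days


def reproduction_number_exponential(r: float, mean_generation_days: float) -> float:
    """R = 1 + r * Tg, assuming an exponentially distributed generation interval."""
    return 1 + r * mean_generation_days


def reproduction_number_fixed(r: float, generation_days: float) -> float:
    """R = exp(r * Tg), assuming every generation interval has the same length."""
    return math.exp(r * generation_days)


# Hypothetical inputs, chosen only to show the arithmetic.
r = growth_rate_from_doubling_time(5.0)
r_exponential = reproduction_number_exponential(r, 5.0)
r_fixed = reproduction_number_fixed(r, 5.0)
```

The two assumptions give different answers for the same doubling time, which is one reason estimates based on doubling time can carry wide intervals.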


Marginal Methods And Software For Clustered Data With Cluster- And Group-Size Informativeness., Mary Elizabeth Gregg Aug 2020

Electronic Theses and Dissertations

Clustered data result when observations have some natural organizational association. In such data, cluster size is defined as the number of observations belonging to a cluster. A phenomenon termed informative cluster size (ICS) occurs when observation outcomes vary in a systematic way related to the cluster size. An additional form of informativeness, termed informative within-cluster group size (IWCGS), arises when the distribution of group-defining categorical covariates within clusters similarly carries information related to outcomes. Standard methods for the marginal analysis of clustered data can produce biased estimates and inference when data have informativeness. A reweighting methodology has been developed that …


Linear Methods For Regression With Small Sample Sizes Relative To The Number Of Variables., Rajesh Sikder Aug 2020

Electronic Theses and Dissertations

In data sets where there are a small number of observations but a large number of variables observed for each observation, ordinary least squares estimation cannot be used for regression models. There are many alternatives, including stepwise regression, penalized methods such as ridge regression and the LASSO, and methods based on derived inputs such as principal components regression and partial least squares regression. In this thesis, these five methods are described. K-fold cross validation is also discussed as a way to determine regularization parameters for each method. The performance of these methods in estimation and prediction is also examined through …


The Impact Of Improved Access To After-Hours Primary Care On Emergency Department And Primary Care Utilization: A Systematic Review., Michael Hong, Amardeep Thind, Gregory S Zaric, Sisira Sarma Aug 2020

Epidemiology and Biostatistics Publications

Access to after-hours primary care is problematic in many developed countries, leading patients to instead visit the emergency department for non-urgent conditions. However, emergency department utilization for conditions treatable in primary care settings may contribute to emergency department overcrowding and increased health system costs. This systematic review examines the impact of various initiatives by developed countries to improve access to after-hours primary care on emergency department and primary care utilization. We performed a systematic review on the impact of improved access to after-hours primary care and searched CINAHL, EMBASE, MEDLINE, and Scopus. We identified 20 studies that examined the impact of …


SARS-CoV-2 Viral And Serological Testing When College Campuses Reopen: Some Practical Considerations, Isaac Chun-Hai Fung, Chi-Ngai Cheung, Andreas Handel Jul 2020

Department of Biostatistics, Epidemiology, and Environmental Health Sciences Faculty Publications

The coronavirus disease 2019 (COVID-19) pandemic prompted universities across the United States to close campuses in Spring 2020. Universities are deliberating whether, when, and how they should resume in-person instruction in Fall 2020. In this essay, we discuss some practical considerations for the use of 2 potentially useful control strategies based on testing: (1) severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) reverse transcriptase-polymerase chain reaction (RT-PCR) testing followed by case-patient isolation and quarantine of close contacts, and (2) serological testing followed by an “immune shield” approach, that is, low social distancing requirements for seropositive persons. The isolation of case-patients and …


Interacting Effects Of Climate And Biotic Factors On Mesocarnivore Distribution And Snowshoe Hare Demography Along The Boreal-Temperate Ecotone, Alexej P. Siren Jul 2020

Doctoral Dissertations

The motivation of my dissertation research was to understand the influence of climate and biotic factors on range limits with a focus on winter-adapted species, including the Canada lynx (Lynx canadensis), American marten (Martes americana), and snowshoe hare (Lepus americanus). I investigated range dynamics along the boreal-temperate ecotone of the northeastern US. Through an integrative literature review, I developed a theoretical framework building from existing thinking on range limits and ecological theory. I used this theory for my second chapter to evaluate direct and indirect causes of carnivore range limits in the northeastern US, …


Maximum Likelihood Estimation Of Species Trees And Anomaly Zone Detection Using Ranked Gene Trees, Anastasiia Kim Jul 2020

Mathematics & Statistics ETDs

A phylogenetic tree represents the evolutionary relationships among a set of organisms. Gene trees can be used to reconstruct phylogenetic trees. The methods in this dissertation focus on gene tree topologies, with emphasis on ranked gene tree topologies. A ranked tree depicts the order in which nodes appear in the tree together with topological relationships among gene lineages. One challenge that arises during phylogenetic inference is the existence of anomaly zones, the regions of branch-length space in the species tree that can produce gene trees that have topologies differing from the species tree topology but are more probable …


Epidemiology Of Cancers In Men Who Have Sex With Men (MSM): A Protocol For Umbrella Review Of Systematic Reviews, Manoj Kumar Honaryar, Yelena Tarasenko, Maribel Almonte, Vitaly Smelov Jul 2020

Department of Biostatistics, Epidemiology, and Environmental Health Sciences Faculty Publications

While earlier studies on men having sex with men (MSM) tended to examine infection-related cancers, an increasing number of studies have been focusing on effects of sexual orientation on other cancers and social and cultural causes for cancer disparities. As a type of tertiary research, this umbrella review (UR) aims to synthesize findings from existing review studies on the effects of sexual orientation on cancer. Relevant peer-reviewed systematic reviews (SRs) will be identified without date or language restrictions using MEDLINE, Cochrane Database of Systematic Reviews, and the International Prospective Register for Systematic Reviews, among others. The research team members will …


A Study Of The Efficacy Of Machine Learning For Diagnosing Obstructive Coronary Artery Disease In Non-Diabetic Patients, Demond Larae Handley Jul 2020

Theses and Dissertations

According to the Centers for Disease Control and Prevention, about 18.2 million adults age 20 and older have Coronary Artery Disease in the United States. Early diagnosis is therefore of crucial importance to help prevent debilitating consequences, and principally death for many patients. In this study we use data containing gene expression values from peripheral blood samples in 198 non-diabetic patients, with the goal of developing an age and sex gene expression model for diagnosis of Coronary Artery Disease. We employ machine learning methods to obtain a classification based on genetic information, age and sex. Our implementation uses feed forward …


Improving The Quality And Design Of Retrospective Clinical Outcome Studies That Utilize Electronic Health Records, Oliwier Dziadkowiec, Jeffery Durbin, Vignesh Jayaraman Muralidharan, Megan Novak, Brendon Cornett Jul 2020

HCA Healthcare Journal of Medicine

Electronic health records (EHRs) are an excellent source for secondary data analysis. Studies based on EHR-derived data, if designed properly, can answer previously unanswerable clinical research questions. In this paper we highlight the benefits of large retrospective studies from secondary sources such as EHRs and examine retrospective cohort and case-control study design challenges, as well as methodological and statistical adjustments that can be made to overcome some of the inherent design limitations, in order to increase the generalizability, validity and reliability of the results obtained from these studies.


Causal Inference And Prediction On Observational Data With Survival Outcomes, Xiaofei Chen Jul 2020

Statistical Science Theses and Dissertations

Infants with hypoplastic left heart syndrome require an initial Norwood operation, followed some months later by a stage 2 palliation (S2P). The timing of S2P is critical for the operation’s success and the infant’s survival, but the optimal timing, if one exists, is unknown. We attempt to estimate the optimal timing of S2P by analyzing data from the Single Ventricle Reconstruction Trial (SVRT), which randomized patients between two different types of Norwood procedure. In the SVRT, the timing of the S2P was chosen by the medical team; thus with respect to this exposure, the trial constitutes an observational study, and …


The Practical Advantages And Disadvantages Of Laplace Regression As An Alternative To Cox Proportional Hazards Model: A Comparison Via Simulation, Sydney Smith Jul 2020

Theses and Dissertations

The Cox proportional hazards model is the most common regression technique for survival analysis. However, the proportional hazards assumption restricts its use to a limited group of multiplicative models. Laplace regression is a flexible quantile regression technique for censored observations that is appropriate in a wider variety of applications than the Cox proportional hazards model. Instead of estimating a hazard ratio, Laplace regression, which is free from a proportionality assumption, can be used to estimate many adjusted percentiles of survival time, allowing for a more complete description of the association of interest. This paper compares the performance of …


Bayesian Zero-Inflated Model For Ordinal Data, Huizhong Yang Jul 2020

Theses and Dissertations

Datasets with a relatively large number of zeros are commonly seen in medical applications. Although models like the zero-inflated Poisson (ZIP) model have been proposed for count data, there are still some issues with ordinal data that have excess zeros. In this paper, we developed a Bayesian approach to accommodate the excess zeros in ordinal data. Intellectual disability (ID), also known as mental retardation (MR), is a disability characterized by below-average intelligence or mental ability and a lack of the skills necessary for daily life. A person with intellectual disability has limitations in intellectual functioning and adaptive behaviors. Intellectual disability is a …


Network-Based Statistical Analysis Of Functional Magnetic Resonance Imaging Data From Aphasia Patients, Xingpei Zhao Jul 2020

Theses and Dissertations

Functional magnetic resonance imaging (fMRI) is a neuroimaging technique that provides insight into brain function and activity. Network models of fMRI signals can reveal functional connectivity related to certain brain disorders, such as post-stroke aphasia. This thesis aims to identify the functional connections that distinguish anomic and Broca’s aphasia by comparing the resting-state fMRI from the patients with these two types of aphasia. The network-based statistic (NBS) approach is used to detect such connections. After the analytic pipeline is applied to the fMRI data, the NBS approach identifies a distinct subnetwork between the two types of aphasia, which involves the …


Evaluating The Importation Of Yellow Fever Cases Into China In 2016 And Strategies Used To Prevent And Control The Spread Of The Disease, Chao Li, Dan Li, Shirley Joann Smart, Lei Zhou, Peng Yang, Jianming Ou, Yi He, Ruiqi Ren, Tao Ma, Nijuan Xiang, Haitian Sui, Yali Wang, Jian Zhao, Chaonan Wang, Yeping Wang, Daxin Ni, Isaac Chun-Hai Fung, Dexin Li, Yangmu Huang, Qun Li Jun 2020

Department of Biostatistics, Epidemiology, and Environmental Health Sciences Faculty Publications

During the yellow fever epidemic in Angola in 2016, cases of yellow fever were reported in China for the first time. The 11 cases, all Chinese nationals returning from Angola, were identified in March and April 2016, one to two weeks after the peak of the Angolan epidemic. One patient died; the other 10 cases recovered after treatment. This paper reviews the epidemiological characteristics of the 11 yellow fever cases imported into China. It examines case detection and disease control and surveillance, and presents recommendations for further action to prevent additional importation of yellow fever into China.


Spermine Synthase And MYC Cooperate To Maintain Colorectal Cancer Cell Survival By Repressing Bim Expression, Yubin Guo, Qing Ye, Pan Deng, Yanan Cao, Daheng He, Zhaohe Zhou, Chi Wang, Yekaterina Y. Zaytseva, Charles E. Schwartz, Eun Young Lee, B. Mark Evers, Andrew J. Morris, Side Liu, Qing-Bai She Jun 2020

Markey Cancer Center Faculty Publications

Dysregulation of polyamine metabolism has been linked to the development of colorectal cancer (CRC), but the underlying mechanism is incompletely characterized. Here, we report that spermine synthase (SMS), a polyamine biosynthetic enzyme, is overexpressed in CRC. Targeted disruption of SMS in CRC cells results in spermidine accumulation, which inhibits FOXO3a acetylation and allows subsequent translocation to the nucleus to transcriptionally induce expression of the proapoptotic protein Bim. However, this induction is blunted by MYC-driven expression of miR-19a and miR-19b that repress Bim production. Pharmacological or genetic inhibition of MYC activity in SMS-depleted CRC cells dramatically induces Bim expression and apoptosis …


On Variable Selections In High-Dimensional Incomplete Data, Tao Sun Jun 2020

Major Papers

Modern statistics has entered the era of Big Data, wherein data sets are too large, high-dimensional, incomplete and complex for most classical statistical methods. This analysis of Big Data first focuses on missing data. We compare different multiple imputation methods. Combining the characteristics of medical high-throughput experiments, we compared multivariate imputation by chained equations (MICE), missing forest (missForest), as well as self-training selection (STS) methods. A phenotypic data set of common lung disease was assessed. Moreover, in terms of improving the interpretability and predictability of the model, variable selection plays a pivotal role in the following analysis. Taking the Lasso-Poisson …


Excess Mortality From COVID-19: A Commentary On The Italian Experience, Paolo Pasquariello, Saverio Stranges Jun 2020

Epidemiology and Biostatistics Publications

No abstract provided.


A Web-Based, Positive Emotion Skills Intervention For Enhancing Posttreatment Psychological Well-Being In Young Adult Cancer Survivors (EMPOWER): Protocol For A Single-Arm Feasibility Trial, John M. Salsman, Laurie E. Mclouth, Michael Cohn, Janet A. Tooze, Mia Sorkin, Judith T. Moskowitz May 2020

Behavioral Science Faculty Publications

BACKGROUND: Adolescent and young adult cancer survivors (AYAs) experience clinically significant distress and have limited access to supportive care services. Interventions to enhance psychological well-being have improved positive affect and reduced depression in clinical and healthy populations but have not been routinely tested in AYAs.

OBJECTIVE: The aim of this protocol is to (1) test the feasibility and acceptability of a Web-based positive emotion skills intervention for posttreatment AYAs called Enhancing Management of Psychological Outcomes With Emotion Regulation (EMPOWER) and (2) examine proof of concept for reducing psychological distress and enhancing psychological well-being.

METHODS: The intervention development and testing are …


Integrated Multiple Mediation Analysis: A Robustness–Specificity Trade-Off In Causal Structure, An-Shun Tai, Sheng-Hsuan Lin May 2020

Harvard University Biostatistics Working Paper Series

Recent methodological developments in causal mediation analysis have addressed several issues regarding multiple mediators. However, these developed methods differ in their definitions of causal parameters, assumptions for identification, and interpretations of causal effects, making it unclear which method ought to be selected when investigating a given causal effect. Thus, in this study, we construct an integrated framework, which unifies all existing methodologies, as a standard for mediation analysis with multiple mediators. To clarify the relationship between existing methods, we propose four strategies for effect decomposition: two-way, partially forward, partially backward, and complete decompositions. This study reveals how the direct and …