Open Access. Powered by Scholars. Published by Universities.®

Bioinformatics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 16 of 16

Full-Text Articles in Bioinformatics

Statistical Modeling Of Extracellular Vesicle Cargo To Predict Clinical Trial Outcomes For Hypoplastic Left Heart Syndrome, Jessica R Hoffman, Hyun-Ji Park, Sruti Bheri, Manu O Platt, Joshua M Hare, Sunjay Kaushal, Judith L Bettencourt, Dejian Lai, Timothy C Slesnick, William T Mahle, Michael E Davis Oct 2023

Statistical Modeling Of Extracellular Vesicle Cargo To Predict Clinical Trial Outcomes For Hypoplastic Left Heart Syndrome, Jessica R Hoffman, Hyun-Ji Park, Sruti Bheri, Manu O Platt, Joshua M Hare, Sunjay Kaushal, Judith L Bettencourt, Dejian Lai, Timothy C Slesnick, William T Mahle, Michael E Davis

Journal Articles

Cardiac-derived c-kit+ progenitor cells (CPCs) are under investigation in the CHILD phase I clinical trial (NCT03406884) for the treatment of hypoplastic left heart syndrome (HLHS). The therapeutic efficacy of CPCs can be attributed to the release of extracellular vesicles (EVs). To understand sources of cell therapy variability we took a machine learning approach: combining bulk CPC-derived EV (CPC-EV) RNA sequencing and cardiac-relevant


Measuring And Controlling Medical Record Abstraction (Mra) Error Rates In An Observational Study., Maryam Y Garza, Tremaine Williams, Sahiti Myneni, Susan H Fenton, Songthip Ounpraseuth, Zhuopei Hu, Jeannette Lee, Jessica Snowden, Meredith N Zozus, Anita C Walden, Alan E Simon, Barbara Mcclaskey, Sarah G Sanders, Sandra S Beauman, Sara R Ford, Lacy Malloch, Amy Wilson, Lori A Devlin, Leslie W Young Aug 2022

Measuring And Controlling Medical Record Abstraction (Mra) Error Rates In An Observational Study., Maryam Y Garza, Tremaine Williams, Sahiti Myneni, Susan H Fenton, Songthip Ounpraseuth, Zhuopei Hu, Jeannette Lee, Jessica Snowden, Meredith N Zozus, Anita C Walden, Alan E Simon, Barbara Mcclaskey, Sarah G Sanders, Sandra S Beauman, Sara R Ford, Lacy Malloch, Amy Wilson, Lori A Devlin, Leslie W Young

Journal Articles

BACKGROUND: Studies have shown that data collection by medical record abstraction (MRA) is a significant source of error in clinical research studies relying on secondary use data. Yet, the quality of data collected using MRA is seldom assessed. We employed a novel, theory-based framework for data quality assurance and quality control of MRA. The objective of this work is to determine the potential impact of formalized MRA training and continuous quality control (QC) processes on data quality over time.

METHODS: We conducted a retrospective analysis of QC data collected during a cross-sectional medical record review of mother-infant dyads with Neonatal …


Comprehensive Characterization Of Covid-19 Patients With Repeatedly Positive Sars-Cov-2 Tests Using A Large U.S. Electronic Health Record Database., Xiao Dong, Yujia Zhou, Xiao-Ou Shu, Elmer V Bernstam, Rebecca Stern, David M Aronoff, Hua Xu, Loren Lipworth Sep 2021

Comprehensive Characterization Of Covid-19 Patients With Repeatedly Positive Sars-Cov-2 Tests Using A Large U.S. Electronic Health Record Database., Xiao Dong, Yujia Zhou, Xiao-Ou Shu, Elmer V Bernstam, Rebecca Stern, David M Aronoff, Hua Xu, Loren Lipworth

Journal Articles

In the absence of genome sequencing, two positive molecular tests for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) separated by negative tests, prolonged time, and symptom resolution remain the best surrogate measure of possible reinfection. Using a large electronic health record database, we characterized clinical and testing data for 23 patients with repeatedly positive SARS-CoV-2 PCR test results ≥60 days apart, separated by ≥2 consecutive negative test results. The prevalence of chronic medical conditions, symptoms, and severe outcomes related to coronavirus disease 19 (COVID-19) illness were ascertained. The median age of patients was 64.5 years, 40% were Black, and 39% …


Digital Technology Needs In Maternal Mental Health: A Qualitative Inquiry., Alexandra Zingg, Laura Carter, Deevakar Rogith, Amy Franklin, Sudhakar Selvaraj, Jerrie Refuerzo, Sahiti Myneni May 2021

Digital Technology Needs In Maternal Mental Health: A Qualitative Inquiry., Alexandra Zingg, Laura Carter, Deevakar Rogith, Amy Franklin, Sudhakar Selvaraj, Jerrie Refuerzo, Sahiti Myneni

Journal Articles

Digital technologies offer many opportunities to improve mental healthcare management for women seeking pre- and-postnatal care. They provide a discrete, practical medium that is well-suited for the sensitive nature of mental health. Women who are more prone to experiencing peripartum depression (PPD), such as those of low-socioeconomic background or in high-risk pregnancies, can benefit the most from such technologies. However, current digital interventions directed towards this population provide suboptimal support, and their responsiveness to end user needs is quite limited. Our objective is to understand the digital terrain of information needs for low-socioeconomic status women with high-risk pregnancies, specifically within …


Med-Bert: Pretrained Contextualized Embeddings On Large-Scale Structured Electronic Health Records For Disease Prediction., Laila Rasmy, Yang Xiang, Ziqian Xie, Cui Tao, Degui Zhi May 2021

Med-Bert: Pretrained Contextualized Embeddings On Large-Scale Structured Electronic Health Records For Disease Prediction., Laila Rasmy, Yang Xiang, Ziqian Xie, Cui Tao, Degui Zhi

Journal Articles

Deep learning (DL)-based predictive models from electronic health records (EHRs) deliver impressive performance in many clinical tasks. Large training cohorts, however, are often required by these models to achieve high accuracy, hindering the adoption of DL-based models in scenarios with limited training data. Recently, bidirectional encoder representations from transformers (BERT) and related models have achieved tremendous successes in the natural language processing domain. The pretraining of BERT on a very large training corpus generates contextualized embeddings that can boost the performance of models trained on smaller datasets. Inspired by BERT, we propose Med-BERT, which adapts the BERT framework originally developed …


Generalized And Transferable Patient Language Representation For Phenotyping With Limited Data., Yuqi Si, Elmer V Bernstam, Kirk Roberts Apr 2021

Generalized And Transferable Patient Language Representation For Phenotyping With Limited Data., Yuqi Si, Elmer V Bernstam, Kirk Roberts

Journal Articles

The paradigm of representation learning through transfer learning has the potential to greatly enhance clinical natural language processing. In this work, we propose a multi-task pre-training and fine-tuning approach for learning generalized and transferable patient representations from medical language. The model is first pre-trained with different but related high-prevalence phenotypes and further fine-tuned on downstream target tasks. Our main contribution focuses on the impact this technique can have on low-prevalence phenotypes, a challenging task due to the dearth of data. We validate the representation from pre-training, and fine-tune the multi-task pre-trained models on low-prevalence phenotypes including 38 circulatory diseases, 23 …


Exchanges In A Virtual Environment For Diabetes Self-Management Education And Support: Social Network Analysis., Carlos A Pérez-Aldana, Allison A Lewinski, Constance M Johnson, Allison A Vorderstrasse, Sahiti Myneni Jan 2021

Exchanges In A Virtual Environment For Diabetes Self-Management Education And Support: Social Network Analysis., Carlos A Pérez-Aldana, Allison A Lewinski, Constance M Johnson, Allison A Vorderstrasse, Sahiti Myneni

Journal Articles

BACKGROUND: Diabetes remains a major health problem in the United States, affecting an estimated 10.5% of the population. Diabetes self-management interventions improve diabetes knowledge, self-management behaviors, and clinical outcomes. Widespread internet connectivity facilitates the use of eHealth interventions, which positively impacts knowledge, social support, and clinical and behavioral outcomes. In particular, diabetes interventions based on virtual environments have the potential to improve diabetes self-efficacy and support, while being highly feasible and usable. However, little is known about the patterns of social interactions and support taking place within type 2 diabetes-specific virtual communities.

OBJECTIVE: The objective of this study was to …


Representation Of Ehr Data For Predictive Modeling: A Comparison Between Umls And Other Terminologies., Laila Rasmy, Firat Tiryaki, Yujia Zhou, Yang Xiang, Cui Tao, Hua Xu, Degui Zhi Oct 2020

Representation Of Ehr Data For Predictive Modeling: A Comparison Between Umls And Other Terminologies., Laila Rasmy, Firat Tiryaki, Yujia Zhou, Yang Xiang, Cui Tao, Hua Xu, Degui Zhi

Journal Articles

OBJECTIVE: Predictive disease modeling using electronic health record data is a growing field. Although clinical data in their raw form can be used directly for predictive modeling, it is a common practice to map data to standard terminologies to facilitate data aggregation and reuse. There is, however, a lack of systematic investigation of how different representations could affect the performance of predictive models, especially in the context of machine learning and deep learning.

MATERIALS AND METHODS: We projected the input diagnoses data in the Cerner HealthFacts database to Unified Medical Language System (UMLS) and 5 other terminologies, including CCS, CCSR, …


Understanding Spatial Language In Radiology: Representation Framework, Annotation, And Spatial Relation Extraction From Chest X-Ray Reports Using Deep Learning., Surabhi Datta, Yuqi Si, Laritza Rodriguez, Sonya E Shooshan, Dina Demner-Fushman, Kirk Roberts Aug 2020

Understanding Spatial Language In Radiology: Representation Framework, Annotation, And Spatial Relation Extraction From Chest X-Ray Reports Using Deep Learning., Surabhi Datta, Yuqi Si, Laritza Rodriguez, Sonya E Shooshan, Dina Demner-Fushman, Kirk Roberts

Journal Articles

Radiology reports contain a radiologist's interpretations of images, and these images frequently describe spatial relations. Important radiographic findings are mostly described in reference to an anatomical location through spatial prepositions. Such spatial relationships are also linked to various differential diagnoses and often described through uncertainty phrases. Structured representation of this clinically significant spatial information has the potential to be used in a variety of downstream clinical informatics applications. Our focus is to extract these spatial representations from the reports. For this, we first define a representation framework based on the Spatial Role Labeling (SpRL) scheme, which we refer to as …


Covid-19 Testnorm: A Tool To Normalize Covid-19 Testing Names To Loinc Codes., Xiao Dong, Jianfu Li, Ekin Soysal, Jiang Bian, Scott L Duvall, Elizabeth Hanchrow, Hongfang Liu, Kristine E Lynch, Michael Matheny, Karthik Natarajan, Lucila Ohno-Machado, Serguei Pakhomov, Ruth Madeleine Reeves, Amy M Sitapati, Swapna Abhyankar, Theresa Cullen, Jami Deckard, Xiaoqian Jiang, Robert Murphy, Hua Xu Jul 2020

Covid-19 Testnorm: A Tool To Normalize Covid-19 Testing Names To Loinc Codes., Xiao Dong, Jianfu Li, Ekin Soysal, Jiang Bian, Scott L Duvall, Elizabeth Hanchrow, Hongfang Liu, Kristine E Lynch, Michael Matheny, Karthik Natarajan, Lucila Ohno-Machado, Serguei Pakhomov, Ruth Madeleine Reeves, Amy M Sitapati, Swapna Abhyankar, Theresa Cullen, Jami Deckard, Xiaoqian Jiang, Robert Murphy, Hua Xu

Journal Articles

Large observational data networks that leverage routine clinical practice data in electronic health records (EHRs) are critical resources for research on coronavirus disease 2019 (COVID-19). Data normalization is a key challenge for the secondary use of EHRs for COVID-19 research across institutions. In this study, we addressed the challenge of automating the normalization of COVID-19 diagnostic tests, which are critical data elements, but for which controlled terminology terms were published after clinical implementation. We developed a simple but effective rule-based tool called COVID-19 TestNorm to automatically normalize local COVID-19 testing names to standard LOINC (Logical Observation Identifiers Names and Codes) …


Deep Learning In Clinical Natural Language Processing: A Methodical Review., Stephen Wu, Kirk Roberts, Surabhi Datta, Jingcheng Du, Zongcheng Ji, Yuqi Si, Sarvesh Soni, Qiong Wang, Qiang Wei, Yang Xiang, Bo Zhao, Hua Xu Mar 2020

Deep Learning In Clinical Natural Language Processing: A Methodical Review., Stephen Wu, Kirk Roberts, Surabhi Datta, Jingcheng Du, Zongcheng Ji, Yuqi Si, Sarvesh Soni, Qiong Wang, Qiang Wei, Yang Xiang, Bo Zhao, Hua Xu

Journal Articles

OBJECTIVE: This article methodically reviews the literature on deep learning (DL) for natural language processing (NLP) in the clinical domain, providing quantitative analysis to answer 3 research questions concerning methods, scope, and context of current research.

MATERIALS AND METHODS: We searched MEDLINE, EMBASE, Scopus, the Association for Computing Machinery Digital Library, and the Association for Computational Linguistics Anthology for articles using DL-based approaches to NLP problems in electronic health records. After screening 1,737 articles, we collected data on 25 variables across 212 papers.

RESULTS: DL in clinical NLP publications more than doubled each year, through 2018. Recurrent neural networks (60.8%) …


Digilego For Peripartum Depression: A Novel Patient-Facing Digital Health Instantiation, J Rodin, C Timko, S Harris Jan 2020

Digilego For Peripartum Depression: A Novel Patient-Facing Digital Health Instantiation, J Rodin, C Timko, S Harris

Journal Articles

Digital health technologies offer unique opportunities to improve health outcomes for mental health conditions such as peripartum depression (PPD), a disorder that affects approximately 10-15% of women in the U.S. every year. In this paper, we present the adaption of a digital technology development framework, Digilego, in the context of PPD. Methods include mapping of the Behavior Intervention Technology (BIT) model and the Patient Engagement Framework (PEF) to translate patient needs captured through focus groups. This informs formative development and implementation of digital health features for optimal patient engagement in PPD screening and management. Results show an array ofPPD-specific Digilego …


Enhancing Clinical Concept Extraction With Contextual Embeddings., Yuqi Si, Jingqi Wang, Hua Xu, Kirk Roberts Nov 2019

Enhancing Clinical Concept Extraction With Contextual Embeddings., Yuqi Si, Jingqi Wang, Hua Xu, Kirk Roberts

Journal Articles

OBJECTIVE: Neural network-based representations ("embeddings") have dramatically advanced natural language processing (NLP) tasks, including clinical NLP tasks such as concept extraction. Recently, however, more advanced embedding methods and representations (eg, ELMo, BERT) have further pushed the state of the art in NLP, yet there are no common best practices for how to integrate these representations into clinical tasks. The purpose of this study, then, is to explore the space of possible options in utilizing these new models for clinical concept extraction, including comparing these to traditional word embedding methods (word2vec, GloVe, fastText).

MATERIALS AND METHODS: Both off-the-shelf, open-domain embeddings and …


Deep Patient Representation Of Clinical Notes Via Multi-Task Learning For Mortality Prediction., Yuqi Si, Kirk Roberts Jan 2019

Deep Patient Representation Of Clinical Notes Via Multi-Task Learning For Mortality Prediction., Yuqi Si, Kirk Roberts

Journal Articles

We propose a deep learning-based multi-task learning (MTL) architecture focusing on patient mortality predictions from clinical notes. The MTL framework enables the model to learn a patient representation that generalizes to a variety of clinical prediction tasks. Moreover, we demonstrate how MTL enables small but consistent gains on a single classification task (e.g., in-hospital mortality prediction) simply by incorporating related tasks (e.g., 30-day and 1-year mortality prediction) into the MTL framework. To accomplish this, we utilize a multi-level Convolutional Neural Network (CNN) associated with a MTL loss component. The model is evaluated with 3, 5, and 20 tasks and is …


Effects Of A Community Population Health Initiative On Blood Pressure Control In Latinos., James R Langabeer, Timothy D Henry, Carlos Perez Aldana, Larissa Deluna, Nora Silva, Tiffany Champagne-Langabeer Nov 2018

Effects Of A Community Population Health Initiative On Blood Pressure Control In Latinos., James R Langabeer, Timothy D Henry, Carlos Perez Aldana, Larissa Deluna, Nora Silva, Tiffany Champagne-Langabeer

Journal Articles

Background Hypertension remains one of the most important, modifiable cardiovascular risk factors. Yet, the largest minority ethnic group (Hispanics/Latinos) often have different health outcomes and behavior, making hypertension management more difficult. We explored the effects of an American Heart Association-sponsored population health intervention aimed at modifying behavior of Latinos living in Texas. Methods and Results We enrolled 8071 patients, and 5714 (65.7%) completed the 90-day program (58.5 years ±11.7; 59% female) from July 2016 to June 2018. Navigators identified patients with risk factors; initial and final blood pressure ( BP ) readings were performed in the physician's office; and interim …


A Frame-Based Nlp System For Cancer-Related Information Extraction., Yuqi Si, Kirk Roberts Jan 2018

A Frame-Based Nlp System For Cancer-Related Information Extraction., Yuqi Si, Kirk Roberts

Journal Articles

We propose a frame-based natural language processing (NLP) method that extracts cancer-related information from clinical narratives. We focus on three frames: cancer diagnosis, cancer therapeutic procedure, and tumor description. We utilize a deep learning-based approach, bidirectional Long Short-term Memory (LSTM) Conditional Random Field (CRF), which uses both character and word embeddings. The system consists of two constituent sequence classifiers: a frame identification (lexical unit) classifier and a frame element classifier. The classifier achieves an F