Open Access. Powered by Scholars. Published by Universities.®

Bioinformatics Commons

Articles 1 - 30 of 326

Full-Text Articles in Bioinformatics

MMAPPR2: An Improved Bioinformatics Approach To Find Novel Genes, Aiden Cardall, Jonathon T. Hill, Kyle Johnsen, Connor Ward, Maliha Tasnim, Jared Taylor Mar 2024

Library/Life Sciences Undergraduate Poster Competition 2024

Introduction

• New genes are commonly found by randomly inducing mutations in model organisms.

• Mapping the mutations to the genome to find novel genes is difficult, time-consuming, and expensive.

• We created a bioinformatics program, MMAPPR, to automate this process.

• Here, we introduce a new algorithm, MMAPPR2, which requires little to no bioinformatics knowledge to use.

• MMAPPR2 makes several improvements that allow it to identify genes more rapidly and precisely.

• MMAPPR2 will aid the rapid identification of genes in a wide range of species and developmental systems.


Determining The Role Of Noncoding Insertion And Deletion Mutations In Lung Cancer, Zachary Everton, Matthew H. Bailey Mar 2024

Library/Life Sciences Undergraduate Poster Competition 2024

Background

● Cancer is a disease in which cells grow and divide uncontrollably, damaging surrounding tissue; it is caused by mutations in the cells’ DNA.

● Though some cancer-causing mutations are inherited from parents, most cancer-causing mutations emerge over the course of a person’s life and are localized to the tumor. These localized mutations are also known as somatic mutations.

● The human genome is over 6.27 billion base pairs long and cannot be read from end to end; instead it is read in small pieces that are aligned to best-matching sequences in the …


Protein-Protein Interactions In Cell Cycle Proteins: An In Silico Investigation Of Two Important Players, Andriele Eichner Feb 2024

Dissertations, Theses, and Capstone Projects

The examination of the cell cycle carries significant implications for the biology, health, and overall existence of all living things. These implications span from the development and growth of these organisms to the aging process and cancer, as well as the potential of stem cell therapies to repair diseases and injuries. Numerous proteins of the cell cycle are essential for cellular division and proliferation and are widely conserved over the course of evolution. In this work, we aimed to investigate the molecular processes of protein-protein interactions in cell cycle proteins, centering on two key players: Cdc6 in budding yeast and …


Using Single Cell Genomics To Explore The Impact Of Marine Viruses On Microbial Respiration., Paxton Tomko Jan 2024

MCB Articles

Viral metabolic reprogramming of marine prokaryotes, through the use of virally encoded auxiliary metabolic genes (AMGs), plays a critical role in marine ecosystem function by influencing biochemical cycles and genetic diversity in these environments. Despite the fundamental role viruses play in global environmental ecosystems, they remain an understudied aspect of microbial ecology and evolution, in part due to the methods available for studying virus-host interactions in natural systems. Thus far, metagenomic analyses have been used to study the interactions of virus-host pairs, but these types of analyses have their limitations in accurately linking viruses to hosts, or culture-based …


Genomic Characterization Of Adolescent And Young Adult Cancers: Investigation Of Ewing Sarcoma Susceptibility And Chornobyl Thyroid Tumors, Olivia Lee Dec 2023

Dissertations & Theses (Open Access)

Adolescent and young adult (AYA) cancers, diagnosed between the ages of 15 and 39, can exhibit distinctive genetic and molecular characteristics. Reported epidemiologic findings and treatment outcomes based on pediatric and adult cancer studies are often not suitable for application to the AYA population, underscoring the need for more thorough genomic research. Advances in sequencing technologies have enabled comprehensive analyses of complex genomic characteristics of AYA cancers, crucial for understanding the underlying biology of these malignancies. Here, I have utilized advanced sequencing techniques and integrated analytic approaches to describe important genomic features in two different AYA cancer types: Ewing Sarcoma …


Convolutional Neural Network-Based Gene Prediction Using Buffalograss As A Model System, Michael Morikone Nov 2023

Complex Biosystems PhD Program: Dissertations

The task of gene prediction has been largely stagnant in algorithmic improvements compared to when algorithms were first developed for predicting genes thirty years ago. Rather than iteratively improving the underlying algorithms in gene prediction tools by utilizing better performing models, most current approaches update existing tools through incorporating increasing amounts of extrinsic data to improve gene prediction performance. The traditional method of predicting genes is done using Hidden Markov Models (HMMs). These HMMs are constrained by having strict assumptions made about the independence of genes that do not always hold true. To address this, a Convolutional Neural Network (CNN) …


Generative AI-Assisted Pathway Analysis And Interpretation Of RNA-Seq Experiment Data, Junguk Hur Aug 2023

AI Assignment Library

No abstract provided.


Predicting Marine Teleost Responses To Ocean Warming And Pollution, Akila Harishchandra Aug 2023

Electronic Theses and Dissertations

Ocean warming and pollution are two detrimental anthropogenic factors causing rapid marine ecosystem degradation recorded in the past decades. These factors render the marine environment intolerable for many marine species, forcing them to either adapt or shift their contemporary habitat ranges to reduce the extinction risk associated with environmental degradation. Estimating marine species’ habitat range shifts and their potential for developing adaptive mechanisms is critical for ecosystem conservation and management, human health risk assessment, and climate change vulnerability assessments. Given that, for the first chapter of this thesis, we focused on developing a species distribution model (SDM) integrating marine species …


Evolution Of Overlapping Reading Frames In Virus Genomes, Laura Muñoz Baena Aug 2023

Electronic Thesis and Dissertation Repository

Viruses are formidable pathogens that represent the majority of biological entities on our planet, and their genomes are a source of interesting enigmas. One feature in which virus genomes are usually rich is the presence of overlapping reading frames (OvRFs), portions of the genome where the same nucleotide sequence encodes more than one protein. OvRFs are hypothesized to be used by viruses to encode proteins more compactly and to regulate transcription. In addition, OvRFs might be a source of gene novelty, facilitating the creation of new open reading frames (ORFs) within the transcriptional context of existing ones.

To characterize …


Exploration Of The Immune Landscape Of EBV-Associated Gastric Cancers, Mikhail Salnikov Jun 2023

Electronic Thesis and Dissertation Repository

Epstein–Barr virus (EBV) is a gammaherpesvirus associated with 9% of all gastric cancers (GCs). EBV-associated GCs (EBVaGCs) are pathologically and clinically distinct entities from EBV-negative GCs (EBVnGCs), with EBVaGCs exhibiting differential molecular pathology and patient prognosis. The purpose of this thesis is to investigate the tumor microenvironment (TME) of EBVaGCs, which has not been explored in-depth. We hypothesize that EBVaGCs and EBVnGCs are also distinct in terms of the molecular immune landscape. We employed over 400 stomach adenocarcinoma (STAD) samples from The Cancer Genome Atlas (TCGA), as well as a single cell dataset, for the construction of a web suite …


A Review Of How Bioinformatics And Genome Sequencing Are Affecting Precision Medicine, Taylor S. Hickey May 2023

Honors Theses

Advances in genomic sequencing and bioinformatics methods have been affecting biomedical research through precision medicine, especially in the area of cancer. Vaccine therapies can be developed using neoantigens that target specific mutations in tumors. The goals of this research are to identify mutations that lead to cancer and then define subpopulations in which patients can easily be identified. The future goal is to have targeted vaccines that are specific to each subpopulation ready to be used in treatment of their cancer. Limitations to reaching these goals have been due to tumor heterogeneity, cancer location, and difficulty in creating neoantigens for …


Understanding Host-Microbe Interactions In Maize Kernel And Sweetpotato Leaf Metagenomic Profiles., Alison K. Adams May 2023

Doctoral Dissertations

Functional and quantitative metagenomic profiling remains challenging and limits our understanding of host-microbe interactions. This body of work aims to mediate these challenges by using a novel quantitative reduced representation sequencing strategy (OmeSeq-qRRS), developing fully automated software for quantitative metagenomic/microbiome profiling (Qmatey: quantitative metagenomic alignment and taxonomic identification using exact-matching), and implementing these tools for understanding plant-microbe-pathogen interactions in maize and sweetpotato. The next generation sequencing-based OmeSeq-qRRS leverages the strengths of shotgun whole genome sequencing and costs less than the more affordable amplicon sequencing method. The novel FASTQ data compression/indexing and enhanced-multithreading of the MegaBLAST in Qmatey allows …


DeepHTLV: A Deep Learning Framework For Detecting Human T-Lymphotrophic Virus 1 Integration Sites, Johnathan Jia May 2023

Dissertations & Theses (Open Access)

In the 1980s, researchers found the first human oncogenic retrovirus, called human T-lymphotrophic virus type 1 (HTLV-1). Since then, HTLV-1 has been identified as the causative agent behind several diseases such as adult T-cell leukemia/lymphoma (ATL) and HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP). As part of its normal replication cycle, the viral RNA genome is converted into DNA and integrated into the host genome. With several hundreds to thousands of unique viral integration sites (VISs) distributed with indeterminate preference throughout the genome, detection of HTLV-1 VISs is a challenging task. Experimental studies typically use molecular biology …


Computational Analysis Of Microbial Sequence Data Using Statistics And Machine Learning, Zhixiu Lu May 2023

Doctoral Dissertations

Since the discovery of the double helix of DNA in 1953, modern molecular biology has opened the door to a better understanding of how genes control chemical processes within cells, including protein synthesis. Although we are still far from claiming a complete understanding, recent advances in sequencing technologies, increased computational capacity, and more sophisticated computational methods have allowed the development of various new applications that provide further insight into DNA sequence data and how the information they encode impacts living organisms and their environment. Sequencing data can now be used to start identifying the relationships between microorganisms, where they live, …


Investigating The Role Of Spatial Compartmentalization And Genomic Translocations In Metastatic Cancer: A Multi-Omic Analysis, Joshua Harris Garretson May 2023

Chancellor’s Honors Program Projects

No abstract provided.


The Genomics Of Autism-Related Genes IL1RAPL1 And IL1RAPL2: Insights Into Their Cortical Distribution, Cell-Type Specificity, And Developmental Trajectories, Jacob Weaver Apr 2023

MUSC Theses and Dissertations

Neuropsychiatric disorders have a significant impact on modern society. These disorders affect a large percentage of the population: schizophrenia has a world-wide prevalence of 1% and autism spectrum disorders (ASD) affect 1 in 59 school-aged children in the US. There is substantial evidence that most neuropsychiatric disorders have a genetic component. Thus, with the advent of high throughput sequencing much effort has gone into identifying genetic variants associated with these disorders. The emerging picture from these studies is a complex one where hundreds of genes with small effects interact with a varied landscape of common variants to result in disease. …


Cancer/Testis Gene Expression Changes In Metastatic Cancer, Clara M. Mosentine Jan 2023

Dissertations, Master's Theses and Master's Reports

Metastasis is the movement of cancerous cells to new parts of the body, often through the blood or lymph systems. Metastasis is classified as stage IV cancer, a prognosis that is significantly more difficult to effectively treat compared to earlier cancer stages. We are interested in assessing whether expression of Cancer/testis (CT) genes, a class of genes that are predominantly expressed in germ cells while also being abnormally expressed in a large percentage of cancers, is associated with cancer metastasis. Germ cells make up an organism’s reproductive system, such as the testis and ovaries, and exhibit cellular immortality and, in …


Taxonomic Classification Of Viral And Bacterial DNA Following 2021 Avian Mass Mortality Event, Tessa Baillargeon Jan 2023

Honors Theses and Capstones

From May through July 2021, an unusual mortality event occurred along the eastern coast and Midwest of the United States. Thousands of birds, mostly from the order Passeriformes, were part of the die-off, including blue jays (Cyanocitta cristata), common grackles (Quiscalus quiscula), European starlings (Sturnus vulgaris), and American robins (Turdus migratorius). Clinical signs included crusted eyes, swollen conjunctiva, otitis, seizures, and ataxia.

The New Hampshire Veterinary Diagnostic Laboratory (NHVDL) received over 100 affected birds from various collaborators throughout the United States including Washington DC, NJ, CT, MD, and OH. Given the timing and geologic …


Respire: A Technological Tool To Navigate Mechanical Ventilation In Patient Care And Educational Settings, Swara Chokshi Jan 2023

Undergraduate Research Posters

Around the world, more than 20 million patients rely on mechanical ventilators annually; however, not enough individuals understand how to operate ventilators, posing a risk to the health of many. Moreover, it is increasingly difficult to determine optimal mechanical ventilator settings in a timely fashion, especially in low-resource countries and critical care areas. Respire is a mobile application that bridges this gap in a twofold manner: it is designed to assist healthcare workers around the world in navigating and using mechanical ventilators effectively as well as to educate the general public about mechanical ventilation. Respire offers a user-friendly yet educational interface that …


Identification Of Novel Biosynthetic Gene Clusters Encoding For Polyketide/NRPS-Producing Chemotherapeutic Compounds From Marine-Derived Streptomyces Hygroscopicus From A Marine Sanctuary, Hannah Ruth Flaherty Jan 2023

Honors Theses and Capstones

Nearly one out of six deaths in 2020, around ten million people, were caused by cancer, making it a leading cause of death worldwide (WHO, 2022). This major public health issue, in addition to the rise of multidrug-resistant (MDR) pathogens, creates a high demand for the discovery of new pharmaceutical drugs to be used clinically to treat these conditions. The Streptomyces genus produces 39% of all microbial metabolites currently approved for human health, indicating its potential as an important genus to study for antimicrobial and anticancer agents. The long linear genome of Streptomyces contains specialized sequences known as …


Structural Analysis Of Predicted Proteins Using AlphaFold, Brydon P. Wall Jan 2023

Undergraduate Research Posters

The function of around 67% of predicted proteins from genes in Mycobacteriophage CheetoDust cannot be confidently predicted using traditional techniques, and these proteins can only be functionally labeled “hypothetical proteins”. However, a new approach using AlphaFold, an artificial intelligence tool to generate a structural prediction from a sequence, can take advantage of structurally conserved regions that were previously obfuscated to gain new insights and visualize data in new ways.

Since amino acid sequences are more conserved than their corresponding DNA sequences, amino acid sequences are used when predicting the function of the corresponding translated protein. Until recently, predicting structure from an …


VIBES: A Workflow For Annotating And Visualizing Viral Sequences Integrated Into Bacterial Genomes, Conner J. Copeland Jan 2023

Graduate Student Theses, Dissertations, & Professional Papers

Bacteriophages are viruses that infect bacteria. Many bacteriophages integrate their genomes into the bacterial chromosome and become prophages. Prophages may substantially burden or benefit host bacteria fitness, acting in some cases as parasites and in others as mutualists, and have been demonstrated to increase host virulence. The increasing ease of bacterial genome sequencing provides an opportunity to deeply explore prophage prevalence and insertion sites. Here we present VIBES, a workflow intended to automate prophage annotation in complete bacterial genome sequences. VIBES provides additional context to prophage annotations by annotating bacterial genes and viral proteins in user-provided bacterial and …


Bioinformatic Analysis Of Proteomic And Genomic Data From NSCLC Tumors On Prognostic And Predictive Factors Of Immunotherapy Treatment, Mark Wuenschel Jan 2023

Theses and Dissertations--Pharmacy

Recent lung cancer research has led to advancements in molecular immunology, resulting in development of small molecule inhibitors, or immune checkpoint inhibitors, that propagate an anti-tumor T cell response. Despite increased overall and progression-free survival with reduced adverse effects compared to traditional chemotherapy, treating advanced stage lung adenocarcinoma patients remains non-curative, and evidence of non-responders or tumor recurrence to immune checkpoint inhibitor therapy is growing. Also, compared to traditional chemotherapy, there is a lower percentage of patients who respond to small molecule inhibitors. In this analysis of proteomic and genomic data from The Cancer Proteome Atlas and Global Data Commons …


Gene Regulatory Context Of Honey Bee Worker Sterility, Rahul Choorakkat Unnikrishnan Dec 2022

Electronic Thesis and Dissertation Repository

Honey bee workers deactivate their ovaries and are functionally sterile when a queen is present in the colony. I adopt a bioinformatics approach to update a model transcriptional regulatory network (TRN) to study gene-regulatory processes that regulate fecundity in workers. On splitting the network, I obtained nine clusters and each cluster conformed to properties associated with real-world networks. Two of the nine clusters are enriched for 'sterility genes' and contained single well-connected hub genes (GB44769, ftz-f1). The genes in the two clusters were functionally enriched for nucleic acid binding (GO:0003676) and nucleotide binding (GO:0000166). I identified homologous genes for …


Accurate Simulation Of Reads And Improved Strategies For Abundance Estimation Supporting Reduced Representation Sequencing For Metagenomics, Ryan Kuster Dec 2022

Doctoral Dissertations

Next generation sequencing has impacted all areas of biology by providing affordable investigations into some of the most complex processes underpinning life. With its ubiquitous application, there is still benefit in considering the nuances of the technology and its downstream analysis. Sequencing libraries produced by fragmenting DNA with restriction enzyme digests limit the scope of sequencing to a reduced set of genomic loci, allowing for deeper sequencing of those regions at a reduced cost per sample. These sequencing libraries have been used to determine genetic markers within populations of closely related individuals due to their sensitivity and preservation within populations. …


NGLY1 Deficiency Affects Glycosaminoglycan Biosynthesis And Wnt Signaling Pathway In Mice, Amy Batten Oct 2022

PANDION: The Osprey Journal of Research and Ideas

Individuals affected by NGLY1 Deficiency cannot properly deglycosylate and recycle certain proteins. Even though fewer than 100 people worldwide have been diagnosed with this rare autosomal recessive condition, thousands are affected by similar glycosylation disorders. Common phenotypic manifestations of NGLY1 Deficiency include severe neural and intellectual delay, impaired muscle and liver function, and seizures that may become intractable. Very little is currently known about the various mechanisms through which NGLY1 Deficiency affects the body, and this has led to a lack of viable treatment options for those afflicted. This experiment uses a loss-of-function (LOF) mouse model of NGLY1 Deficiency homologous …


Towards More Complete Metagenomic Analyses Through Circularized Genomes And Conjugative Elements, Benjamin R. Joris Aug 2022

Electronic Thesis and Dissertation Repository

Advancements in sequencing technologies have revolutionized biological sciences and led to the emergence of a number of fields of research. One such field of research is metagenomics, which is the study of the genomic content of complex communities of bacteria. The goal of this thesis was to contribute computational methodology that can maximize the data generated in these studies and to apply these protocols to human and environmental metagenomic samples.

Standard metagenomic analyses include a step for binning of assembled contigs, which has previously been shown to exclude mobile genetic elements, and I demonstrated that this phenomenon extends to all conjugative …


Methods And Tools To Improve Performance Of Plant Genome Analysis, Drew Ferrell Aug 2022

Theses and Dissertations

Multi-omics data analysis and integration facilitates hypothesis building toward an understanding of genes and pathway responses driven by environments. Methods designed to estimate and analyze gene expression, with regard to treatments or conditions, can be leveraged to understand gene-level responses in the cell. However, genes often interact and signal within larger structures such as pathways and networks. Complex studies guided toward describing dynamic genetic pathways and networks require algorithms or methods designed for inference based on gene interactions and related topologies. Classes of algorithms and methods may be integrated into generalized workflows for comparative genomics studies, as multi-omics …


Population Genetics Of Populus Trichocarpa For Targeted Breeding As A Biofuel Crop, Cai John Aug 2022

Doctoral Dissertations

Populus trichocarpa (poplar) is a woody species native to the western U.S. and Canada. As a fast-growing crop, it has been under investigation by the Department of Energy as a resource for liquid biofuel production. Having recently expanded the collection of poplar whole-genome sequences so that it spans the entire natural species range, we have the novel opportunity to study adaptive responses across this range. This work starts with an initial proof of concept study in a well-studied portion of the species range that has complete whole-genome sequences and RNA expression. The completeness of these data allow robust validation of …


Modeling Electrostatics In Molecular Biology And Its Relevance With Molecular Mechanisms Of Diseases, Mahesh Koirala Aug 2022

All Dissertations

Electrostatics plays an essential role in molecular biology. Modeling electrostatics in molecular biology is complicated due to the water phase, mobile ions, and irregularly shaped inhomogeneous biological macromolecules. This dissertation presents the popular DelPhi package, which solves the Poisson-Boltzmann equation (PBE) and delivers the electrostatic potential distribution of biomolecules. We used the newly developed DelPhiForce steered Molecular Dynamics (DFMD) approach to model the binding of barstar to barnase and demonstrated that the first-principles method could also model the binding. This dissertation also reflects the use of existing computational approaches to model the effects of Single Amino Acid Variations (SAVs) to reveal molecular mechanisms …