Open Access. Powered by Scholars. Published by Universities.®

Life Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 87

Full-Text Articles in Life Sciences

State-Of-The-Art Approaches For Sequencing, Assembling And Annotating Naphthenic Acid Degrading Bacterial Metagenomes, Henry H. Say Aug 2023

State-Of-The-Art Approaches For Sequencing, Assembling And Annotating Naphthenic Acid Degrading Bacterial Metagenomes, Henry H. Say

Electronic Thesis and Dissertation Repository

Naphthenic acids (NAs) are the main toxic component of oil refinery wastewater and require special processes to be removed. Harnessing bacterial biodegradation for NA removal has the potential to be effective, yet NA-degrading bacteria and pathways are poorly understood and uncharacterized. To improve our understanding of NA degradation, I characterize the metagenomes of novel NA-degrading bacterial communities seeded in NA-enriched granulated activated carbon (GAC) filters. I demonstrate methods that maximize the throughput of extraction, sequencing, and annotation of novel metagenomes - producing 72 MAGs and other 5432 circular contigs - 226 of which were putative phages. I also include state-of-the-art …


Evolution Of Overlapping Reading Frames In Virus Genomes, Laura Muñoz Baena Aug 2023

Evolution Of Overlapping Reading Frames In Virus Genomes, Laura Muñoz Baena

Electronic Thesis and Dissertation Repository

Viruses are formidable pathogens that represent the majority of biological entities in our planet, and their genomes are a source of interesting enigmas. One feature in which virus genomes are usually rich, is the presence of overlapping reading frames (OvRFs) — portions of the genome where the same nucleotide sequence encodes more than one protein. OvRFs are hypothesized to be used by viruses to encode proteins more compactly and to regulate transcription. In addition, OvRFs might be a source of gene novelty, facilitating the creation of new open reading frames (ORF) within the transcriptional context of existing ones.

To characterize …


Decoy-Target Database Strategy And False Discovery Rate Analysis For Glycan Identification, Xiaoou Li Jul 2023

Decoy-Target Database Strategy And False Discovery Rate Analysis For Glycan Identification, Xiaoou Li

Electronic Thesis and Dissertation Repository

In recent years, the technology of glycopeptide sequencing through MS/MS mass spectrometry data has achieved remarkable progress. Various software tools have been developed and widely used for protein identification. Estimation of false discovery rate (FDR) has become an essential method for evaluating the performance of glycopeptide scoring algorithms. The target-decoy strategy, which involves constructing decoy databases, is currently the most popular utilized method for FDR calculation. In this study, we applied various decoy construction algorithms to generate decoy glycan databases and proposed a novel approach to calculate the FDR by using the EM algorithm and mixture model.


Exploration Of The Immune Landscape Of Ebv-Associated Gastric Cancers, Mikhail Salnikov Jun 2023

Exploration Of The Immune Landscape Of Ebv-Associated Gastric Cancers, Mikhail Salnikov

Electronic Thesis and Dissertation Repository

Epstein–Barr virus (EBV) is a gammaherpesvirus associated with 9% of all gastric cancers (GCs). EBV-associated GCs (EBVaGCs) are pathologically and clinically distinct entities from EBV-negative GCs (EBVnGCs), with EBVaGCs exhibiting differential molecular pathology and patient prognosis. The purpose of this thesis is to investigate the tumor microenvironment (TME) of EBVaGCs, which has not been explored in-depth. We hypothesize that EBVaGCs and EBVnGCs are also distinct in terms of the molecular immune landscape. We employed over 400 stomach adenocarcinoma (STAD) samples from The Cancer Genome Atlas (TCGA), as well as a single cell dataset, for the construction of a web suite …


De Novo Sequencing Of Multiple Tandem Mass Spectra Of Peptide Containing Silac Labeling, Fang Han Mar 2023

De Novo Sequencing Of Multiple Tandem Mass Spectra Of Peptide Containing Silac Labeling, Fang Han

Electronic Thesis and Dissertation Repository

The systematic studies of proteins has gradually become fundamental in the research related to molecular biology. Shotgun proteomics use bottom-up proteomics techniques in identifying proteins contained in complex mixtures using a combination of high performance liquid chromatography coupled with mass spectrometry technology. Current mass spectrometers equipped with high sensitivity and accuracy can produce thousands of tandem mass spectrometry (MS/MS) spectra in a single run. The large amount of data collected in a single LC-MS/MS run requires effective computational approaches to automate the process of spectra interpretation. De novo peptide sequencing from tandem mass spectrometry (MS/MS) has emerged as an important …


Characterizing The Function Of B Cells That Accumulate In The Inflamed Central Nervous System In Anti-Myelin Autoimmunity, Lika Chowdhury Dec 2022

Characterizing The Function Of B Cells That Accumulate In The Inflamed Central Nervous System In Anti-Myelin Autoimmunity, Lika Chowdhury

Electronic Thesis and Dissertation Repository

While the role of autoimmune T cells has been extensively studied in anti-myelin

autoimmunity, little is known about the function of B cells in multiple sclerosis (MS), a chronic inflammatory disease of the central nervous system (CNS). B cells form clusters with T cells in the meninges directly adjacent to demyelinating lesions. Previous studies have shown that disease progression is dependent on the depletion of specific populations of B cells, but it is not clear which contributes to pathology or how. The purpose of this thesis is to characterize the population of meningeal B cells to determine how they differ …


Selection Pressure On Surface Exposed Virus Proteins, Sareh Bagherichimeh Dec 2022

Selection Pressure On Surface Exposed Virus Proteins, Sareh Bagherichimeh

Electronic Thesis and Dissertation Repository

Viral infection requires the interaction between virus surface-exposed (SE) proteins and host cell receptors. This can result in an “arms race” that is assumed to drive accelerated rates of evolution, and some well known examples of diversifying selection involve surface pro- teins (HIV-1 env, influenza hemagglutinin). We conducted a systematic analysis to determine whether this is truly a distinctive feature of SE virus proteins, in comparison to non-SE proteins encoded by the same genomes.

We obtained reference and all neighbour genomes of 52 human viruses from the NCBI Viral Genomes database. The coding sequences (CDS) of each genome extracted by …


Gene Regulatory Context Of Honey Bee Worker Sterility, Rahul Choorakkat Unnikrishnan Dec 2022

Gene Regulatory Context Of Honey Bee Worker Sterility, Rahul Choorakkat Unnikrishnan

Electronic Thesis and Dissertation Repository

Honey bee workers deactivate their ovaries and are functionally sterile when a queen is present in the colony. I adopt a bioinformatics approach to up-date a model transcriptional regulatory network (TRN) to study gene-regulatory processes that regulate fecundity in workers. On splitting the network, I obtained nine clusters and each cluster conformed to properties associated with real-world networks. Two of the nine clusters are enriched for 'sterility genes' and contained single well-connected hub genes (GB44769, ftz-f1). The genes in the two clusters were functionally enriched for nucleic acid binding (GO:0003676) and nucleotide binding (GO:0000166). I identified homologous genes for …


Radiation Exposure Determination In A Secure, Cloud-Based Online Environment, Ben C. Shirley, Eliseos J. Mucaki, Peter Rogan Oct 2022

Radiation Exposure Determination In A Secure, Cloud-Based Online Environment, Ben C. Shirley, Eliseos J. Mucaki, Peter Rogan

Biochemistry Publications

Rapid sample processing and interpretation of estimated exposures will be critical for triaging exposed individuals after a major radiation incident. The dicentric chromosome (DC) assay assesses absorbed radiation using metaphase cells from blood. The Automated Dicentric Chromosome Identifier and Dose Estimator System (ADCI) identifies DCs and determines radiation doses. This study aimed to broaden accessibility and speed of this system, while protecting data and software integrity. ADCI Online is a secure web-streaming platform accessible worldwide from local servers. Cloud-based systems containing data and software are separated until they are linked for radiation exposure estimation. Dose estimates are identical to ADCI …


Capturing Within Host Hiv-1 Evolution Dynamics Using Simulation Methods, Emmanuel Wong Aug 2022

Capturing Within Host Hiv-1 Evolution Dynamics Using Simulation Methods, Emmanuel Wong

Electronic Thesis and Dissertation Repository

The persistent latent reservoir of long-lived cells carrying integrated HIV DNA is the source of reinfection upon treatment interruption, and a primary focus for cure research. The reservoir is difficult to study because these cells are relatively rare or located in tissues that are difficult to sample. Sequencing proviral DNA in the latent reservoir is an important source of information about reservoir establishment and persistence, especially from the presence of identical (clonal) sequences. I evaluated the relationship between select measures of these clonal sequences and drivers of reservoir persistence, e.g., clonal expansion, by implementing a simulation model of within-host HIV …


Towards More Complete Metagenomic Analyses Through Circularized Genomes And Conjugative Elements, Benjamin R. Joris Aug 2022

Towards More Complete Metagenomic Analyses Through Circularized Genomes And Conjugative Elements, Benjamin R. Joris

Electronic Thesis and Dissertation Repository

Advancements in sequencing technologies have revolutionized biological sciences and led to the emergence of a number of fields of research. One such field of research is metagenomics, which is the study of the genomic content of complex communities of bacteria. The goal of this thesis was to contribute computational methodology that can maximize the data generated in these studies and to apply these protocols human and environmental metagenomic samples.

Standard metagenomic analyses include a step for binning of assembled contigs, which has previously been shown to exclude mobile genetic elements, and I demonstrated that this phenomenon extends to all conjugative …


A Bacterial Microbiome Analysis Of Solarized Ginseng Garden Soils, Anka Colo Aug 2022

A Bacterial Microbiome Analysis Of Solarized Ginseng Garden Soils, Anka Colo

Undergraduate Student Research Internships Conference

American ginseng (Panax quinquefolius) is a highly valued perennial crop grown for its roots during a four-year cultivation cycle. American ginseng is subject to ginseng replant disease (GRD) in which severe root rot develops in newly planted ginseng grown in a former ginseng garden. A common strategy to mitigating GRD is not available and techniques such as fumigation, fungicides, and biocontrol are ineffective, banned, or are slowly being phased out. Alternatively, soil solarization is a pre-plant technique used to treat soil to reduce disease inoculum and alter soil microbiomes. In summer 2019, a six-week soil solarization experiment was …


Multigene Phylogeny Of Mushroom Genus Hohenbuehelia (Fungi = Pleurotaceae), Beau Claude Daigneault Aug 2022

Multigene Phylogeny Of Mushroom Genus Hohenbuehelia (Fungi = Pleurotaceae), Beau Claude Daigneault

Undergraduate Student Research Internships Conference

A multigene phylogenetic study of mushroom genus Hohenbuehelia. Four gene loci (ITS, LSU, Tef1, RPB2) were examined, sequence data collected and available Genbank data was concatenated into a supermatrix alignment, with a RAxML phylogenetic tree as output.


Manipulating The Root Mycobiome To Improve Plant Performance And Reduce Pathogen Pressure In Corn (Zea Mays), Noor F. Saeed Cheema Jun 2022

Manipulating The Root Mycobiome To Improve Plant Performance And Reduce Pathogen Pressure In Corn (Zea Mays), Noor F. Saeed Cheema

Electronic Thesis and Dissertation Repository

Crop yield often varies within a field of a single genetically uniform crop plant, with the causes presumed to be a mix of both biotic and abiotic factors. Manipulating crop root mycobiomes could potentially increase yield by reducing pathogen impacts and improving access to soil water and nutrients. This study aimed to identify different fungal inoculation treatments that could increase the growth of corn seedlings sown in low productivity soils to that in high productivity soils and shift the root mycobiome composition. Fungal inoculation treatments did not have significantly different root mycobiome composition than seedlings grown in low yield control …


Identification Of Dna Methylation Episignatures For Classification And Phenotype/Genotype Correlation In Mendelian Neurodevelopmental Disorders, John Reilly Apr 2022

Identification Of Dna Methylation Episignatures For Classification And Phenotype/Genotype Correlation In Mendelian Neurodevelopmental Disorders, John Reilly

Electronic Thesis and Dissertation Repository

ABSTRACT: Diagnosis for neurodevelopmental disorders poses numerous challenges, related to the lack of specific findings and limited understanding of clinical impact of the majority of genetic variation. Epigenomics mechanisms involve chemical modifications in DNA that involve a range of cellular mechanisms. DNA methylation is an epigenetic mechanism involving addition and removal of methyl groups to cytosine residues. These methylation signals form episignatures; patterns of methylation that can be used as biomarkers capable of differentiating neurodevelopmental disorders. EpiSigns have enabled molecular diagnosis of a number of genetic conditions, classification of variants of unknown significance, and provided insights into the pathophysiology of …


Radiation Exposure Determination In A Secure, Cloudbased Online Environment, Ben C. Shirley, Eliseos J. Mucaki, Joan H.M. Knoll, Peter Rogan Jan 2022

Radiation Exposure Determination In A Secure, Cloudbased Online Environment, Ben C. Shirley, Eliseos J. Mucaki, Joan H.M. Knoll, Peter Rogan

Biochemistry Publications

Rapid sample processing and interpretation of estimated exposures will be critical for triaging exposed individuals after a major radiation incident. The dicentric chromosome (DC) assay assesses absorbed radiation using metaphase cells from blood. The Automated Dicentric Chromosome Identifier and Dose Estimator System (ADCI) identifies DCs and determines radiation doses. This study aimed to broaden accessibility and speed of this system, while protecting data and software integrity. ADCI Online is a secure web-streaming platform accessible worldwide from local servers. Cloud-based systems containing data and software are separated until they are linked for radiation exposure estimation. Dose estimates are identical to ADCI …


Improved Radiation Expression Profiling In Blood By Sequential Application Of Sensitive And Specific Gene Signatures, Eliseos J. Mucaki, Ben C. Shirley, Peter K. Rogan Oct 2021

Improved Radiation Expression Profiling In Blood By Sequential Application Of Sensitive And Specific Gene Signatures, Eliseos J. Mucaki, Ben C. Shirley, Peter K. Rogan

Biochemistry Publications

Purpose. Combinations of expressed genes can discriminate radiation-exposed from normal control blood samples by machine learning based signatures (with 8 to 20% misclassification rates). These signatures can quantify therapeutically-relevant as well as accidental radiation exposures. The prodromal symptoms of Acute Radiation Syndrome (ARS) overlap those present in Influenza and Dengue Fever infections. Surprisingly, these human radiation signatures misclassified gene expression profiles of virally infected samples as false positive exposures. The present study investigates these and other confounders, and then mitigates their impact on signature accuracy.

Methods. This study investigated recall by previous and novel radiation signatures independently derived …


Visualization And Interpretation Of Protein Interactions, Dipanjan Chatterjee Apr 2021

Visualization And Interpretation Of Protein Interactions, Dipanjan Chatterjee

Electronic Thesis and Dissertation Repository

Visualization and interpretation of deep learning models' prediction is a very important area of research in machine learning nowadays. Researchers are not only focused on generating a model with good performance, but also they want to trust the model. Our aim in this thesis is to adapt existing interpretation methods to a protein-protein binding site prediction problem to visualize and understand the model's prediction and learning pattern.

We present three deep learning-based interpretation methods: sensitivity analysis, saliency map and integrated gradients to analyze the amino acid residues which create positive and negative relevance to the deep learning models' prediction. As …


Sequencing And Assembling The Nuclear Genome Of The Antarctic Psychrophilic Green Alga Chlamydomonas Sp. Uwo241: Unravelling The Evolution Of Cold Adaptation, Xi Zhang Jan 2021

Sequencing And Assembling The Nuclear Genome Of The Antarctic Psychrophilic Green Alga Chlamydomonas Sp. Uwo241: Unravelling The Evolution Of Cold Adaptation, Xi Zhang

Electronic Thesis and Dissertation Repository

DNA sequencing technologies have undergone tremendous advancements in recent years, but assembling, annotating, and analyzing a nuclear genome is still a huge undertaking, especially for small laboratory groups, partly because many eukaryotic genomes are repeat-rich and contain thousands of genes and introns. The Antarctic harbors a variety of algae that can withstand extreme cold but do not grow at warmer temperatures (psychrophiles), including the unicellular green alga Chlamydomonas sp. UWO241 (a.k.a. UWO241). Little is known, however, about how psychrophilic algae evolved from their respective mesophilic ancestors by adapting to particular cold environments. To present insights into this issue,I critically determined …


Pathway‐Extended Gene Expression Signatures Integrate Novel Biomarkers That Improve Predictions Of Patient Responses To Kinase Inhibitors, Ashis Bagchee‐Clark, Eliseos J. Mucaki, Tyson Whitehead, Peter Rogan Dec 2020

Pathway‐Extended Gene Expression Signatures Integrate Novel Biomarkers That Improve Predictions Of Patient Responses To Kinase Inhibitors, Ashis Bagchee‐Clark, Eliseos J. Mucaki, Tyson Whitehead, Peter Rogan

Biochemistry Publications

Cancer chemotherapy responses have been related to multiple pharmacogenetic biomarkers, often for the same drug. This study utilizes machine learning to derive multi‐gene expression signatures that predict individual patient responses to specific tyrosine kinase inhibitors, including erlotinib, gefitinib, sorafenib, sunitinib, lapatinib and imatinib. Support vector machine (SVM) learning was used to train mathematical models that distinguished sensitivity from resistance to these drugs using a novel systems biology‐based approach. This began with expression of genes previously implicated in specific drug responses, then expanded to evaluate genes whose products were related through biochemical pathways and interactions. Optimal pathway‐extended SVMs predicted responses in …


Pathway-Extended Gene Expression Signatures Integrate Novel Biomarkers That Improve Predictions Of Patient Responses To Kinase Inhibitors, Ashis Jem Bagchee-Clark, Eliseos J. Mucaki, Tyson Whitehead, Peter Rogan Nov 2020

Pathway-Extended Gene Expression Signatures Integrate Novel Biomarkers That Improve Predictions Of Patient Responses To Kinase Inhibitors, Ashis Jem Bagchee-Clark, Eliseos J. Mucaki, Tyson Whitehead, Peter Rogan

Biochemistry Publications

No abstract provided.


Deciphering The Ck2-Dependent Phosphoproteome And Its Integration With Regulatory Ptm Networks, Teresa Nunez De Villavicencio Diaz Nov 2020

Deciphering The Ck2-Dependent Phosphoproteome And Its Integration With Regulatory Ptm Networks, Teresa Nunez De Villavicencio Diaz

Electronic Thesis and Dissertation Repository

Protein functions are regulated by the post-translational addition of covalent modifications on certain amino acids. Depending on their distance within the 3-dimensional structure, addition/removal of individual post translational modifications (PTMs) can be impacted by others. This PTM interplay constitutes an essential regulatory mechanism that interconnects the molecular networks in the cell. Protein CK2, a clinically relevant acidophilic Ser/Thr kinase, may be responsible for 10-20% of the human phosphoproteome. Such estimates agree with the number of known substrates, which continues to expand. Furthermore, the demonstration that CK2 participates in hierarchical phosphorylation and has similar sequence determinants to caspases suggest extensive PTM …


Multiple Roles Of Nup1 In Arabidopsis Growth And Development, Raj K. Thapa Nov 2020

Multiple Roles Of Nup1 In Arabidopsis Growth And Development, Raj K. Thapa

Electronic Thesis and Dissertation Repository

The nuclear pore complex (NPC) is the gateway between the nucleus and cytoplasm, which provides the passage for transport of RNA, protein, and other molecules into and out of the nucleus. NPC is conserved across all eukaryotes and plays a vital role in various cellular processes. However, compared to other organisms, the study of NPC in plants is limited. Although more than 30 different types of nucleoporin proteins in the model plant Arabidopsis thaliana have been identified, none of those proteins has been studied in detail. In this thesis, I focused on one such protein named NUCLEOPORIN1 (NUP1) and investigated …


Pan-Cancer Analysis Of Telomerase Reverse Transcriptase (Tert) Isoforms, Mathushan Subasri Oct 2020

Pan-Cancer Analysis Of Telomerase Reverse Transcriptase (Tert) Isoforms, Mathushan Subasri

Electronic Thesis and Dissertation Repository

Reactivation of the multi-subunit ribonucleoprotein telomerase is the primary telomere maintenance mechanism in cancer, but it is rate-limited by the enzymatic component, telomerase reverse transcriptase (TERT). While regulatory in nature, TERT alternative splice variant/isoform regulation and functions are not fully elucidated and are further complicated by their highly diverse expression. In this thesis, I characterized TERT expression across normal and neoplastic tissues using TCGA and GTEx RNA-sequencing data. In doing so, I demonstrated the global overexpression and splicing shift towards full-length TERT in neoplastic tissue. Furthermore, my studies identified tumour subtype expression differences possibly regulated by subtype-specific characteristics, detailed heterogeneity …


Estimating Partial Body Ionizing Radiation Exposure By Automated Cytogenetic Biodosimetry, Ben Shirley, Peter Rogan Oct 2020

Estimating Partial Body Ionizing Radiation Exposure By Automated Cytogenetic Biodosimetry, Ben Shirley, Peter Rogan

Biochemistry Publications

Purpose: Inhomogeneous exposures to ionizing radiation can be detected and quantified with the dicentric chromosome assay (DCA) of metaphase cells. Complete automation of interpretation of the DCA for whole-body irradiation has significantly improved throughput without compromising accuracy, however, low levels of residual false positive dicentric chromosomes (DCs) have confounded its application for partial-body exposure determination.

Materials and methods: We describe a method of estimating and correcting for false positive DCs in digitally processed images of metaphase cells. Nearly all DCs detected in unirradiated calibration samples are introduced by digital image processing. DC frequencies of irradiated calibration samples and those exposed …


Computational Methods For Predicting Protein-Protein Interactions And Binding Sites, Yiwei Li Aug 2020

Computational Methods For Predicting Protein-Protein Interactions And Binding Sites, Yiwei Li

Electronic Thesis and Dissertation Repository

Proteins are essential to organisms and participate in virtually every process within cells. Quite often, they keep the cells functioning by interacting with other proteins. This process is called protein-protein interaction (PPI). The bonding amino acid residues during the process of protein-protein interactions are called PPI binding sites. Identifying PPIs and PPI binding sites are fundamental problems in system biology.

Experimental methods for solving these two problems are slow and expensive. Therefore, great efforts are being made towards increasing the performance of computational methods.

We present DELPHI, a deep learning based program for PPI site prediction and SPRINT, an algorithmic …


Regulators Of Ectopic Calcification In A Mouse Model Of Dish: A Multi-Omics Perspective, Matthew A. Veras Jun 2020

Regulators Of Ectopic Calcification In A Mouse Model Of Dish: A Multi-Omics Perspective, Matthew A. Veras

Electronic Thesis and Dissertation Repository

Diffuse idiopathic skeletal hyperostosis (DISH) is a non-inflammatory spondyloarthropathy and the second most common form of arthritis characterized by formation of ectopic mineral along the spine. Pathological findings in DISH include regional calcification of the anterior longitudinal ligament, paraspinal connective tissues, and annulus fibrosus (AF) of the intervertebral disc (IVD). Clinical symptoms of DISH include increased spine stiffness, decreased spinal range of motion, and in severe cases dysphagia and spinal cord/nerve root compression. The molecular pathways responsible for DISH have not been delineated and as such, there are no disease-modifying treatments. Clinical treatment for DISH is limited to surgical resection …


B Cell Acute Lymphoblastic Leukemia Is Driven By Activating Janus Kinase Mutations Cooperating With Spi1 And Spib Deletions In A Murine Model, Michelle Lim Jun 2020

B Cell Acute Lymphoblastic Leukemia Is Driven By Activating Janus Kinase Mutations Cooperating With Spi1 And Spib Deletions In A Murine Model, Michelle Lim

Electronic Thesis and Dissertation Repository

B cell acute lymphoblastic leukemia (B-ALL) is caused by genetic lesions in developing B cells that function as drivers for accumulation of additional mutations in an evolutionary selection process. We investigated secondary drivers of leukemogenesis and their mechanism(s) of arising in a mouse model of B-ALL driven by PU.1/Spi-B deletion (Mb1-CreDPB). Whole exome sequencing revealed recurrent mutations in Jak3 (encoding Janus Kinase 3) and Jak1. Mutations with high variant allele frequency (VAF) were dominated by C->T transition mutations that were compatible with AID, whereas the majority of mutations, with low VAF, were dominated by C->A transversions associated with …


Machine Learning With Digital Signal Processing For Rapid And Accurate Alignment-Free Genome Analysis: From Methodological Design To A Covid-19 Case Study, Gurjit Singh Randhawa Jun 2020

Machine Learning With Digital Signal Processing For Rapid And Accurate Alignment-Free Genome Analysis: From Methodological Design To A Covid-19 Case Study, Gurjit Singh Randhawa

Electronic Thesis and Dissertation Repository

In the field of bioinformatics, taxonomic classification is the scientific practice of identifying, naming, and grouping of organisms based on their similarities and differences. The problem of taxonomic classification is of immense importance considering that nearly 86% of existing species on Earth and 91% of marine species remain unclassified. Due to the magnitude of the datasets, the need exists for an approach and software tool that is scalable enough to handle large datasets and can be used for rapid sequence comparison and analysis. We propose ML-DSP, a stand-alone alignment-free software tool that uses Machine Learning and Digital Signal Processing to …


Designing A Novel Hiv-1 Candidate Vaccine, Rahul Pawa Apr 2020

Designing A Novel Hiv-1 Candidate Vaccine, Rahul Pawa

Electronic Thesis and Dissertation Repository

Currently no vaccine has been developed that can prevent the spread of HIV-1. During sexual transmission, a single viral variant called the Transmitted/Founder (T/F) purportedly with unique physical properties, establishes infection in 70-80% of individuals. Unlike previous studies that have tried to identify T/F viruses based on their structure glycan composition and amino acid sequence, we have analyzed the RNA sequences of HIV-1 to help identify T/F variants. Using a combination of both in silico data analysis and in vitro assays, we have identified that T/F viruses have higher numbers of immunostimulatory motifs than HIV virions that fail to infect. …