Open Access. Powered by Scholars. Published by Universities.®
- Institution
- Keyword
-
- Computer Science (3)
- Bioinformatics (2)
- AdaBoost (1)
- Agent-based modeling (1)
- Algorithm (1)
-
- Amino acids (1)
- Applied Sciences (1)
- Assembly finishing (1)
- Assembly quality (1)
- Authentication (1)
- BLOSUM (1)
- Behavioral economics (1)
- Best management practices (1)
- Bet-hedging (1)
- Biology (1)
- Biometrics (1)
- Cancer risk prediction (1)
- Cancerous (1)
- Clinicla Trail (1)
- Co-assembly (1)
- Colored De Bruijn Graph (1)
- Computational Proteomics (1)
- Computational biology (1)
- Cryo-EM (1)
- Data mining (1)
- Database Search (1)
- Diffusion of innovation (1)
- Disease stratification (1)
- Energy function (1)
- Engagement (1)
- Publication
-
- Theses (5)
- Computer Science Theses & Dissertations (3)
- Theses and Dissertations (3)
- CCE Theses and Dissertations (2)
- Graduate College Dissertations and Theses (2)
-
- Wayne State University Dissertations (2)
- Electronic Thesis and Dissertation Repository (1)
- Master's Theses (1)
- Open Access Theses (1)
- Senior Independent Study Theses (1)
- Theses and Dissertations--Computer Science (1)
- USF Tampa Graduate Theses and Dissertations (1)
- Williams Honors College, Honors Research Projects (1)
Articles 1 - 24 of 24
Full-Text Articles in Life Sciences
Computational Methods For Biomarker Identification In Complex Disease, Amin Ahmadi Adl
Computational Methods For Biomarker Identification In Complex Disease, Amin Ahmadi Adl
USF Tampa Graduate Theses and Dissertations
In a modern systematic view of biology, cell functions arise from the interaction between molecular components. One of the challenging problems in systems biology with high-throughput measurements is discovering the important components involved in the development and progression of complex diseases, which may serve as biomarkers for accurate predictive modeling and as targets for therapeutic purposes. Due to the non-linearity and heterogeneity of these complex diseases, traditional biomarker identification approaches have had limited success at finding clinically useful biomarkers. In this dissertation we propose novel methods for biomarker identification that explicitly take into account the non-linearity and heterogeneity of complex …
Efficient Algorithms For Prokaryotic Whole Genome Assembly And Finishing, Abhishek Biswas
Efficient Algorithms For Prokaryotic Whole Genome Assembly And Finishing, Abhishek Biswas
Computer Science Theses & Dissertations
De-novo genome assembly from DNA fragments is primarily based on sequence overlap information. In addition, mate-pair reads or paired-end reads provide linking information for joining gaps and bridging repeat regions. Genome assemblers in general assemble long contiguous sequences (contigs) using both overlapping reads and linked reads until the assembly runs into an ambiguous repeat region. These contigs are further bridged into scaffolds using linked read information. However, errors can be made in both phases of assembly due to high error threshold of overlap acceptance and linking based on too few mate reads. Identical as well as similar repeat regions can …
Algorithms For Peptide Identification From Mixture Tandem Mass Spectra, Yi Liu
Algorithms For Peptide Identification From Mixture Tandem Mass Spectra, Yi Liu
Electronic Thesis and Dissertation Repository
The large amount of data collected in an mass spectrometry experiment requires effective computational approaches for the automated analysis of those data. Though extensive research has been conducted for such purpose by the proteomics community, there are still remaining challenges, among which, one particular challenge is that the identification rate of the MS/MS spectra collected is rather low. One significant reason that contributes to this situation is the frequently observed mixture spectra, which result from the concurrent fragmentation of multiple precursors in a single MS/MS spectrum. However, nearly all the mainstream computational methods still take the assumption that the acquired …
Three Essays On Enhancing Clinical Trial Subject Recruitment Using Natural Language Processing And Text Mining, Euisung Jung
Three Essays On Enhancing Clinical Trial Subject Recruitment Using Natural Language Processing And Text Mining, Euisung Jung
Theses and Dissertations
Patient recruitment and enrollment are critical factors for a successful clinical trial; however, recruitment tends to be the most common problem in most clinical trials. The success of a clinical trial depends on efficiently recruiting suitable patients to conduct the trial. Every clinical trial research has a protocol, which describes what will be done in the study and how it will be conducted. Also, the protocol ensures the safety of the trial subjects and the integrity of the data collected. The eligibility criteria section of clinical trial protocols is important because it specifies the necessary conditions that participants have to …
Naturalists’ Perspectives On The Use Of Mobile Technology During A Nature Hike, Aubin Marishka Radzewicz St. Clair
Naturalists’ Perspectives On The Use Of Mobile Technology During A Nature Hike, Aubin Marishka Radzewicz St. Clair
Master's Theses
Naturalists act as our link between scientific knowledge and the public’s understanding of natural history and conservation efforts. In order for them to succeed, they need access to reference materials as well as up-to-date information (Mankin, Warner, & Anderson, 1999). Incorporating mobile technology (i.e. tablets) into naturalists’ endeavors in natural history and environmental education can be used as supportive and educational tools. My project investigated how newly trained naturalists used tablet technology while leading groups of children on nature hikes. I investigated naturalists’ views on the use of mobile technology as a tool during the hikes. My research was guided …
Unsupervised Gene Regulatory Network Inference On Microarray Data, Nidhi Radia
Unsupervised Gene Regulatory Network Inference On Microarray Data, Nidhi Radia
Theses
Obtaining gene regulatory networks (GRNs) from expression data is a challenging and crucial task. Many computational methods and algorithms have been developed to infer gene networks for gene expression data, which are usually obtained from microarray experiments. A gene network is a method to depict the relation among clusters of genes. To infer gene networks, the unsupervised method is used in this study. The two types of data used are time-series data and steady-state data. The data is analyzed using various tools containing different algorithms and concepts. GRNs from time-series data tools are obtained using the Time-delayed Algorithm for the …
Exact Genome Alignment, Nandini Ghosh
Exact Genome Alignment, Nandini Ghosh
Theses
The increase in the volume of genomic data due to the decrease in the cost of whole genome sequencing techniques has opened up new avenues of research in the field of Bioinformatics, like comparative genomics and evolutionary dynamics. The fundamental task in these studies is to align the genome sequences accurately. Sequence alignment helps to identify regions of similarity between the sequences to establish their functional, evolutionary and structural relationship. The thesis investigates the performance of two sequence alignment programs LASTZ, a hash table based faster method and SSEARCH, a slower but more rigorous Smith-Waterman based approach, on whole genome …
Identifying Modifier Genes In Sma Model Mice, Weiting Xu
Identifying Modifier Genes In Sma Model Mice, Weiting Xu
Theses
Spinal Muscular Atrophy (SMA) involves the loss of nerve cells called motor neurons in the spinal cord and is classified as a motor neuron disease, it affects 1 in 5000-10000 newborns, one of the leading genetic causes of infant death in USA. Mutations in the SMN1, UBA1, DYNC1H1 and VAPB genes cause spinal muscular atrophy. Extra copies of the SMN2 gene modify the severity of spinal muscular atrophy. Mutations in SMN1 (Motor Neuron 1) mainly causes SMA (Autosomal recessive inheritance). SMN1 gene mutations lead to a shortage of the SMN protein and SMN protein forms SMN complex …
A Dynamic Behavioral Biometric Approach To Authenticate Users Employing Their Fingers To Interact With Touchscreen Devices, Arturo Ponce
A Dynamic Behavioral Biometric Approach To Authenticate Users Employing Their Fingers To Interact With Touchscreen Devices, Arturo Ponce
CCE Theses and Dissertations
The use of mobile devices has extended to all areas of human life and has changed the way people work and socialize. Mobile devices are susceptible to getting lost, stolen, or compromised. Several approaches have been adopted to protect the information stored on these devices. One of these approaches is user authentication. The two most popular methods of user authentication are knowledge based and token based methods but they present different kinds of problems.
Biometric authentication methods have emerged in recent years as a way to deal with these problems. They use an individual’s unique characteristics for identification and have …
Scattering Correction Methods Of Infrared Spectra Using Graphics Processing Units, Asher Imtiaz
Scattering Correction Methods Of Infrared Spectra Using Graphics Processing Units, Asher Imtiaz
Theses and Dissertations
Fourier transform infrared (FTIR) microspectroscopy has been used for many years as a technique that provides distinctive structure-specific infrared spectra for a wide range of materials (e.g., biological (tissues, cells, bacteria, viruses), polymers, energy related, composites, minerals). The mid-infrared radiation can strongly scatter from distinct particles, with diameters ranging between 2-20 micrometer. Transmission measurements of samples (approximately 100 micrometers x 100 micrometers x 10 micrometers) with distinct particles. will be dominated by this scattering (Mie scattering). The scattering distorts the measured spectra, and the absorption spectra appear different from pure absorbance spectra. This thesis presents development and implementation of two …
Image Enhancement Of Cancerous Tissue In Mammography Images, Richard Thomas Richardson
Image Enhancement Of Cancerous Tissue In Mammography Images, Richard Thomas Richardson
CCE Theses and Dissertations
This research presents a framework for enhancing and analyzing time-sequenced mammographic images for detection of cancerous tissue, specifically designed to assist radiologists and physicians with the detection of breast cancer. By using computer aided diagnosis (CAD) systems as a tool to help in the detection of breast cancer in computed tomography (CT) mammography images, previous CT mammography images will enhance the interpretation of the next series of images. The first stage of this dissertation applies image subtraction to images from the same patient over time. Image types are defined as temporal subtraction, dual-energy subtraction, and Digital Database for Screening Mammography …
Framework For Functional Tree Simulation Applied To 'Golden Delicious' Apple Trees, Marek Fiser
Framework For Functional Tree Simulation Applied To 'Golden Delicious' Apple Trees, Marek Fiser
Open Access Theses
This research aims to develop a framework to model realistic and functional trees. Our modular framework is easily controllable and extensible by the user. The framework provides powerful features such as per-leaf light simulation and a source-sink based resources transport model. Our novel 3D reconstruction algorithm generates realistic tree models with a good 3D polygon topology. ^ The second part of this research is a functional model of a Golden Delicious apple tree. We use our simulation framework and data collected in Purdue Meigs farm over the year 2014 to create a realistic data-driven model. The model reacts on temperature …
Computational Development For Secondary Structure Detection From Three-Dimensional Images Of Cryo-Electron Microscopy, Dong Si
Computer Science Theses & Dissertations
Electron cryo-microscopy (cryo-EM) as a cutting edge technology has carved a niche for itself in the study of large-scale protein complex. Although the protein backbone of complexes cannot be derived directly from the medium resolution (5-10 Å) of amino acids from three-dimensional (3D) density images, secondary structure elements (SSEs) such as alpha-helices and beta-sheets can still be detected. The accuracy of SSE detection from the volumetric protein density images is critical for ab initio backbone structure derivation in cryo-EM. So far it is challenging to detect the SSEs automatically and accurately from the density images at these resolutions. This dissertation …
Rice And Mouse Quantitative Phenotype Prediction In Genome-Wide Association Studies With Support Vector Regression, Abdulrhman Fahad M. Aljouie
Rice And Mouse Quantitative Phenotype Prediction In Genome-Wide Association Studies With Support Vector Regression, Abdulrhman Fahad M. Aljouie
Theses
Quantitative phenotypes prediction from genotype data is significant for pathogenesis, crop yields, and immunity tests. The scientific community conducted many studies to find unobserved quantitative phenotype high predictive ability models. Early genome-wide association studies (GWAS) focused on genetic variants that are associated with disease or phenotype, however, these variants manly covers small portion of the whole genetic variance, and therefore, the effectiveness of predictions obtained using this information may possibly be circumscribed [ 1 ].
Instead, this study shows prediction ability from whole genome single nucleotide polymorphisms (SNPs) data of 1940 genotyped stoke mouse with - 12k SNPs, and 413 …
Cancer Risk Prediction With Next Generation Sequencing Data Using Machine Learning, Nihir Patel
Cancer Risk Prediction With Next Generation Sequencing Data Using Machine Learning, Nihir Patel
Theses
The use of computational biology for next generation sequencing (NGS) analysis is rapidly increasing in genomics research. However, the effectiveness of NGS data to predict disease abundance is yet unclear. This research investigates the problem in the whole exome NGS data of the chronic lymphocytic leukemia (CLL) available at dbGaP. Initially, raw reads from samples are aligned to the human reference genome using burrows wheeler aligner. From the samples, structural variants, namely, Single Nucleotide Polymorphism (SNP) and Insertion Deletion (INDEL) are identified and are filtered using SAMtools as well as with Genome Analyzer Tool Kit (GATK). Subsequently, the variants are …
Efficient Synergistic De Novo Co-Assembly Of Bacterial Genomes From Single Cells Using Colored De Bruijn Graph, Narjes Sadat Movahedi Tabrizi
Efficient Synergistic De Novo Co-Assembly Of Bacterial Genomes From Single Cells Using Colored De Bruijn Graph, Narjes Sadat Movahedi Tabrizi
Wayne State University Dissertations
Recent progress in DNA amplification techniques, particularly multiple displacement
amplification (MDA), has made it possible to sequence and assemble bacterial
genomes from a single cell. However, the quality of single cell genome assembly has
not yet reached the quality of normal multi-cell genome assembly due to the coverage
bias (including uneven depth of coverage and region blackout) and errors caused by
MDA. Computational methods try to mitigates the amplification bias. In this document
we introduce a de novo co-assembly method using colored de Bruijn graph,
which can overcome the problem of blackout regions due to amplification bias. The
algorithm is …
Novel Computational Methods For Transcript Reconstruction And Quantification Using Rna-Seq Data, Yan Huang
Novel Computational Methods For Transcript Reconstruction And Quantification Using Rna-Seq Data, Yan Huang
Theses and Dissertations--Computer Science
The advent of RNA-seq technologies provides an unprecedented opportunity to precisely profile the mRNA transcriptome of a specific cell population. It helps reveal the characteristics of the cell under the particular condition such as a disease. It is now possible to discover mRNA transcripts not cataloged in existing database, in addition to assessing the identities and quantities of the known transcripts in a given sample or cell. However, the sequence reads obtained from an RNA-seq experiment is only a short fragment of the original transcript. How to recapitulate the mRNA transcriptome from short RNA-seq reads remains a challenging problem. We …
Applications Of Machine Learning In Biology And Medicine, Saied Haidarian Shahri
Applications Of Machine Learning In Biology And Medicine, Saied Haidarian Shahri
Wayne State University Dissertations
Machine learning as a field is defined to be the set of computational algorithms that improve their performance by assimilating data.
As such, the field as a whole has found applications in many diverse disciplines from robotics and communication in engineering to economics and finance, and also biology and medicine.
It should not come as a surprise that many popular methods in use today have completely different origins.
Despite this heterogeneity, different methods can be divided into standard tasks, such as supervised, unsupervised, semi-supervised and reinforcement learning.
Although machine learning as a field can be formalized as methods trying to …
Learning Emotions: A Software Engine For Simulating Realistic Emotion In Artificial Agents, Douglas Code
Learning Emotions: A Software Engine For Simulating Realistic Emotion In Artificial Agents, Douglas Code
Senior Independent Study Theses
This paper outlines a software framework for the simulation of dynamic emotions in simulated agents. This framework acts as a domain-independent, black-box solution for giving actors in games or simulations realistic emotional reactions to events. The emotion management engine provided by the framework uses a modified Fuzzy Logic Adaptive Model of Emotions (FLAME) model, which lets it manage both appraisal of events in relation to an individual’s emotional state, and learning mechanisms through which an individual’s emotional responses to a particular event or object can change over time. In addition to the FLAME model, the engine draws on the design …
Graph-Based Regularization In Machine Learning: Discovering Driver Modules In Biological Networks, Xi Gao
Graph-Based Regularization In Machine Learning: Discovering Driver Modules In Biological Networks, Xi Gao
Theses and Dissertations
Curiosity of human nature drives us to explore the origins of what makes each of us different. From ancient legends and mythology, Mendel's law, Punnett square to modern genetic research, we carry on this old but eternal question. Thanks to technological revolution, today's scientists try to answer this question using easily measurable gene expression and other profiling data. However, the exploration can easily get lost in the data of growing volume, dimension, noise and complexity. This dissertation is aimed at developing new machine learning methods that take data from different classes as input, augment them with knowledge of feature relationships, …
De Novo Protein Structure Modeling And Energy Function Design, Lin Chen
De Novo Protein Structure Modeling And Energy Function Design, Lin Chen
Computer Science Theses & Dissertations
The two major challenges in protein structure prediction problems are (1) the lack of an accurate energy function and (2) the lack of an efficient search algorithm. A protein energy function accurately describing the interaction between residues is able to supervise the optimization of a protein conformation, as well as select native or native-like structures from numerous possible conformations. An efficient search algorithm must be able to reduce a conformational space to a reasonable size without missing the native conformation. My PhD research studies focused on these two directions.
A protein energy function—the distance and orientation dependent energy function of …
Evaluation Of The Signature Molecular Descriptor With Blosum62 And An All-Atom Description For Use In Sequence Alignment Of Proteins, Lindsay M. Aichinger
Evaluation Of The Signature Molecular Descriptor With Blosum62 And An All-Atom Description For Use In Sequence Alignment Of Proteins, Lindsay M. Aichinger
Williams Honors College, Honors Research Projects
This Honors Project focused on a few aspects of this topic. The second is comparing the molecular signature kernels to three of the BLOSUM matrices (30, 62, and 90) to test the accuracy of the mathematical model. The kernel matrix was manipulated in order to improve the relationship by focusing on side groups and also by changing how the structure was represented in the matrix by increasing the initial height distance from the central atom (Height 1 and Height 2 included).
There were multiple design constraints for this project. The first was the comparison with the BLOSUM matrices (30, 62, …
Gene Expression Noise In Stress Response As A Survival Strategy In Fluctuating Environments, Javier Garcia-Bernardo
Gene Expression Noise In Stress Response As A Survival Strategy In Fluctuating Environments, Javier Garcia-Bernardo
Graduate College Dissertations and Theses
Populations of cells live in uncertain environments, where they encounter large variations in nutrients, oxygen and toxic compounds. In the fluctuating environment, cells can sense their surroundings and express proteins to protect themselves against harmful substances. However, if the stressor appears infrequently or abruptly, sensing can be too costly or too slow, and cells cannot rely solely on it. To hedge against the sudden appearance of a stressor, cell populations can also rely on phenotypic diversification through bet-hedging. In bet-hedging, cells exploit noise in gene expression or use multistable genetic networks to produce an heterogeneous distribution of resistance-conferring protein levels. …
Grass-Based Dairy In Vermont: Benefits, Barriers, And Effective Public Policies, Serge William Wiltshire
Grass-Based Dairy In Vermont: Benefits, Barriers, And Effective Public Policies, Serge William Wiltshire
Graduate College Dissertations and Theses
A comprehensive literature review was undertaken in order to define and assess the sustainability and resiliency characteristics associated with grass-based and confinement dairy farming. Primarily as a result of reduced input costs, grass-based dairy farming often enhances profitability over confinement systems, especially on small farms. Further, conversion of tilled soil to permanent pasture has been shown to significantly reduce harmful sediment and nutrient transport into waterways. Perennial forage also acts as a carbon sink, curtailing or even negating a grass-based farm's carbon footprint. Finally, social benefits derived from enhanced nutrition and higher quality of life are also associated with grass-based …