Open Access. Powered by Scholars. Published by Universities.®

Life Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences

Theses/Dissertations

2015

Institution
Keyword
Publication

Articles 1 - 24 of 24

Full-Text Articles in Life Sciences

Computational Methods For Biomarker Identification In Complex Disease, Amin Ahmadi Adl Nov 2015

Computational Methods For Biomarker Identification In Complex Disease, Amin Ahmadi Adl

USF Tampa Graduate Theses and Dissertations

In a modern systematic view of biology, cell functions arise from the interaction between molecular components. One of the challenging problems in systems biology with high-throughput measurements is discovering the important components involved in the development and progression of complex diseases, which may serve as biomarkers for accurate predictive modeling and as targets for therapeutic purposes. Due to the non-linearity and heterogeneity of these complex diseases, traditional biomarker identification approaches have had limited success at finding clinically useful biomarkers. In this dissertation we propose novel methods for biomarker identification that explicitly take into account the non-linearity and heterogeneity of complex …


Efficient Algorithms For Prokaryotic Whole Genome Assembly And Finishing, Abhishek Biswas Oct 2015

Efficient Algorithms For Prokaryotic Whole Genome Assembly And Finishing, Abhishek Biswas

Computer Science Theses & Dissertations

De-novo genome assembly from DNA fragments is primarily based on sequence overlap information. In addition, mate-pair reads or paired-end reads provide linking information for joining gaps and bridging repeat regions. Genome assemblers in general assemble long contiguous sequences (contigs) using both overlapping reads and linked reads until the assembly runs into an ambiguous repeat region. These contigs are further bridged into scaffolds using linked read information. However, errors can be made in both phases of assembly due to high error threshold of overlap acceptance and linking based on too few mate reads. Identical as well as similar repeat regions can …


Algorithms For Peptide Identification From Mixture Tandem Mass Spectra, Yi Liu Aug 2015

Algorithms For Peptide Identification From Mixture Tandem Mass Spectra, Yi Liu

Electronic Thesis and Dissertation Repository

The large amount of data collected in an mass spectrometry experiment requires effective computational approaches for the automated analysis of those data. Though extensive research has been conducted for such purpose by the proteomics community, there are still remaining challenges, among which, one particular challenge is that the identification rate of the MS/MS spectra collected is rather low. One significant reason that contributes to this situation is the frequently observed mixture spectra, which result from the concurrent fragmentation of multiple precursors in a single MS/MS spectrum. However, nearly all the mainstream computational methods still take the assumption that the acquired …


Three Essays On Enhancing Clinical Trial Subject Recruitment Using Natural Language Processing And Text Mining, Euisung Jung Aug 2015

Three Essays On Enhancing Clinical Trial Subject Recruitment Using Natural Language Processing And Text Mining, Euisung Jung

Theses and Dissertations

Patient recruitment and enrollment are critical factors for a successful clinical trial; however, recruitment tends to be the most common problem in most clinical trials. The success of a clinical trial depends on efficiently recruiting suitable patients to conduct the trial. Every clinical trial research has a protocol, which describes what will be done in the study and how it will be conducted. Also, the protocol ensures the safety of the trial subjects and the integrity of the data collected. The eligibility criteria section of clinical trial protocols is important because it specifies the necessary conditions that participants have to …


Naturalists’ Perspectives On The Use Of Mobile Technology During A Nature Hike, Aubin Marishka Radzewicz St. Clair Aug 2015

Naturalists’ Perspectives On The Use Of Mobile Technology During A Nature Hike, Aubin Marishka Radzewicz St. Clair

Master's Theses

Naturalists act as our link between scientific knowledge and the public’s understanding of natural history and conservation efforts. In order for them to succeed, they need access to reference materials as well as up-to-date information (Mankin, Warner, & Anderson, 1999). Incorporating mobile technology (i.e. tablets) into naturalists’ endeavors in natural history and environmental education can be used as supportive and educational tools. My project investigated how newly trained naturalists used tablet technology while leading groups of children on nature hikes. I investigated naturalists’ views on the use of mobile technology as a tool during the hikes. My research was guided …


Unsupervised Gene Regulatory Network Inference On Microarray Data, Nidhi Radia May 2015

Unsupervised Gene Regulatory Network Inference On Microarray Data, Nidhi Radia

Theses

Obtaining gene regulatory networks (GRNs) from expression data is a challenging and crucial task. Many computational methods and algorithms have been developed to infer gene networks for gene expression data, which are usually obtained from microarray experiments. A gene network is a method to depict the relation among clusters of genes. To infer gene networks, the unsupervised method is used in this study. The two types of data used are time-series data and steady-state data. The data is analyzed using various tools containing different algorithms and concepts. GRNs from time-series data tools are obtained using the Time-delayed Algorithm for the …


Exact Genome Alignment, Nandini Ghosh May 2015

Exact Genome Alignment, Nandini Ghosh

Theses

The increase in the volume of genomic data due to the decrease in the cost of whole genome sequencing techniques has opened up new avenues of research in the field of Bioinformatics, like comparative genomics and evolutionary dynamics. The fundamental task in these studies is to align the genome sequences accurately. Sequence alignment helps to identify regions of similarity between the sequences to establish their functional, evolutionary and structural relationship. The thesis investigates the performance of two sequence alignment programs LASTZ, a hash table based faster method and SSEARCH, a slower but more rigorous Smith-Waterman based approach, on whole genome …


Identifying Modifier Genes In Sma Model Mice, Weiting Xu May 2015

Identifying Modifier Genes In Sma Model Mice, Weiting Xu

Theses

Spinal Muscular Atrophy (SMA) involves the loss of nerve cells called motor neurons in the spinal cord and is classified as a motor neuron disease, it affects 1 in 5000-10000 newborns, one of the leading genetic causes of infant death in USA. Mutations in the SMN1, UBA1, DYNC1H1 and VAPB genes cause spinal muscular atrophy. Extra copies of the SMN2 gene modify the severity of spinal muscular atrophy. Mutations in SMN1 (Motor Neuron 1) mainly causes SMA (Autosomal recessive inheritance). SMN1 gene mutations lead to a shortage of the SMN protein and SMN protein forms SMN complex …


A Dynamic Behavioral Biometric Approach To Authenticate Users Employing Their Fingers To Interact With Touchscreen Devices, Arturo Ponce May 2015

A Dynamic Behavioral Biometric Approach To Authenticate Users Employing Their Fingers To Interact With Touchscreen Devices, Arturo Ponce

CCE Theses and Dissertations

The use of mobile devices has extended to all areas of human life and has changed the way people work and socialize. Mobile devices are susceptible to getting lost, stolen, or compromised. Several approaches have been adopted to protect the information stored on these devices. One of these approaches is user authentication. The two most popular methods of user authentication are knowledge based and token based methods but they present different kinds of problems.

Biometric authentication methods have emerged in recent years as a way to deal with these problems. They use an individual’s unique characteristics for identification and have …


Scattering Correction Methods Of Infrared Spectra Using Graphics Processing Units, Asher Imtiaz May 2015

Scattering Correction Methods Of Infrared Spectra Using Graphics Processing Units, Asher Imtiaz

Theses and Dissertations

Fourier transform infrared (FTIR) microspectroscopy has been used for many years as a technique that provides distinctive structure-specific infrared spectra for a wide range of materials (e.g., biological (tissues, cells, bacteria, viruses), polymers, energy related, composites, minerals). The mid-infrared radiation can strongly scatter from distinct particles, with diameters ranging between 2-20 micrometer. Transmission measurements of samples (approximately 100 micrometers x 100 micrometers x 10 micrometers) with distinct particles. will be dominated by this scattering (Mie scattering). The scattering distorts the measured spectra, and the absorption spectra appear different from pure absorbance spectra. This thesis presents development and implementation of two …


Image Enhancement Of Cancerous Tissue In Mammography Images, Richard Thomas Richardson Apr 2015

Image Enhancement Of Cancerous Tissue In Mammography Images, Richard Thomas Richardson

CCE Theses and Dissertations

This research presents a framework for enhancing and analyzing time-sequenced mammographic images for detection of cancerous tissue, specifically designed to assist radiologists and physicians with the detection of breast cancer. By using computer aided diagnosis (CAD) systems as a tool to help in the detection of breast cancer in computed tomography (CT) mammography images, previous CT mammography images will enhance the interpretation of the next series of images. The first stage of this dissertation applies image subtraction to images from the same patient over time. Image types are defined as temporal subtraction, dual-energy subtraction, and Digital Database for Screening Mammography …


Framework For Functional Tree Simulation Applied To 'Golden Delicious' Apple Trees, Marek Fiser Apr 2015

Framework For Functional Tree Simulation Applied To 'Golden Delicious' Apple Trees, Marek Fiser

Open Access Theses

This research aims to develop a framework to model realistic and functional trees. Our modular framework is easily controllable and extensible by the user. The framework provides powerful features such as per-leaf light simulation and a source-sink based resources transport model. Our novel 3D reconstruction algorithm generates realistic tree models with a good 3D polygon topology. ^ The second part of this research is a functional model of a Golden Delicious apple tree. We use our simulation framework and data collected in Purdue Meigs farm over the year 2014 to create a realistic data-driven model. The model reacts on temperature …


Computational Development For Secondary Structure Detection From Three-Dimensional Images Of Cryo-Electron Microscopy, Dong Si Apr 2015

Computational Development For Secondary Structure Detection From Three-Dimensional Images Of Cryo-Electron Microscopy, Dong Si

Computer Science Theses & Dissertations

Electron cryo-microscopy (cryo-EM) as a cutting edge technology has carved a niche for itself in the study of large-scale protein complex. Although the protein backbone of complexes cannot be derived directly from the medium resolution (5-10 Å) of amino acids from three-dimensional (3D) density images, secondary structure elements (SSEs) such as alpha-helices and beta-sheets can still be detected. The accuracy of SSE detection from the volumetric protein density images is critical for ab initio backbone structure derivation in cryo-EM. So far it is challenging to detect the SSEs automatically and accurately from the density images at these resolutions. This dissertation …


Rice And Mouse Quantitative Phenotype Prediction In Genome-Wide Association Studies With Support Vector Regression, Abdulrhman Fahad M. Aljouie Jan 2015

Rice And Mouse Quantitative Phenotype Prediction In Genome-Wide Association Studies With Support Vector Regression, Abdulrhman Fahad M. Aljouie

Theses

Quantitative phenotypes prediction from genotype data is significant for pathogenesis, crop yields, and immunity tests. The scientific community conducted many studies to find unobserved quantitative phenotype high predictive ability models. Early genome-wide association studies (GWAS) focused on genetic variants that are associated with disease or phenotype, however, these variants manly covers small portion of the whole genetic variance, and therefore, the effectiveness of predictions obtained using this information may possibly be circumscribed [ 1 ].

Instead, this study shows prediction ability from whole genome single nucleotide polymorphisms (SNPs) data of 1940 genotyped stoke mouse with - 12k SNPs, and 413 …


Cancer Risk Prediction With Next Generation Sequencing Data Using Machine Learning, Nihir Patel Jan 2015

Cancer Risk Prediction With Next Generation Sequencing Data Using Machine Learning, Nihir Patel

Theses

The use of computational biology for next generation sequencing (NGS) analysis is rapidly increasing in genomics research. However, the effectiveness of NGS data to predict disease abundance is yet unclear. This research investigates the problem in the whole exome NGS data of the chronic lymphocytic leukemia (CLL) available at dbGaP. Initially, raw reads from samples are aligned to the human reference genome using burrows wheeler aligner. From the samples, structural variants, namely, Single Nucleotide Polymorphism (SNP) and Insertion Deletion (INDEL) are identified and are filtered using SAMtools as well as with Genome Analyzer Tool Kit (GATK). Subsequently, the variants are …


Efficient Synergistic De Novo Co-Assembly Of Bacterial Genomes From Single Cells Using Colored De Bruijn Graph, Narjes Sadat Movahedi Tabrizi Jan 2015

Efficient Synergistic De Novo Co-Assembly Of Bacterial Genomes From Single Cells Using Colored De Bruijn Graph, Narjes Sadat Movahedi Tabrizi

Wayne State University Dissertations

Recent progress in DNA amplification techniques, particularly multiple displacement

amplification (MDA), has made it possible to sequence and assemble bacterial

genomes from a single cell. However, the quality of single cell genome assembly has

not yet reached the quality of normal multi-cell genome assembly due to the coverage

bias (including uneven depth of coverage and region blackout) and errors caused by

MDA. Computational methods try to mitigates the amplification bias. In this document

we introduce a de novo co-assembly method using colored de Bruijn graph,

which can overcome the problem of blackout regions due to amplification bias. The

algorithm is …


Novel Computational Methods For Transcript Reconstruction And Quantification Using Rna-Seq Data, Yan Huang Jan 2015

Novel Computational Methods For Transcript Reconstruction And Quantification Using Rna-Seq Data, Yan Huang

Theses and Dissertations--Computer Science

The advent of RNA-seq technologies provides an unprecedented opportunity to precisely profile the mRNA transcriptome of a specific cell population. It helps reveal the characteristics of the cell under the particular condition such as a disease. It is now possible to discover mRNA transcripts not cataloged in existing database, in addition to assessing the identities and quantities of the known transcripts in a given sample or cell. However, the sequence reads obtained from an RNA-seq experiment is only a short fragment of the original transcript. How to recapitulate the mRNA transcriptome from short RNA-seq reads remains a challenging problem. We …


Applications Of Machine Learning In Biology And Medicine, Saied Haidarian Shahri Jan 2015

Applications Of Machine Learning In Biology And Medicine, Saied Haidarian Shahri

Wayne State University Dissertations

Machine learning as a field is defined to be the set of computational algorithms that improve their performance by assimilating data.

As such, the field as a whole has found applications in many diverse disciplines from robotics and communication in engineering to economics and finance, and also biology and medicine.

It should not come as a surprise that many popular methods in use today have completely different origins.

Despite this heterogeneity, different methods can be divided into standard tasks, such as supervised, unsupervised, semi-supervised and reinforcement learning.

Although machine learning as a field can be formalized as methods trying to …


Learning Emotions: A Software Engine For Simulating Realistic Emotion In Artificial Agents, Douglas Code Jan 2015

Learning Emotions: A Software Engine For Simulating Realistic Emotion In Artificial Agents, Douglas Code

Senior Independent Study Theses

This paper outlines a software framework for the simulation of dynamic emotions in simulated agents. This framework acts as a domain-independent, black-box solution for giving actors in games or simulations realistic emotional reactions to events. The emotion management engine provided by the framework uses a modified Fuzzy Logic Adaptive Model of Emotions (FLAME) model, which lets it manage both appraisal of events in relation to an individual’s emotional state, and learning mechanisms through which an individual’s emotional responses to a particular event or object can change over time. In addition to the FLAME model, the engine draws on the design …


Graph-Based Regularization In Machine Learning: Discovering Driver Modules In Biological Networks, Xi Gao Jan 2015

Graph-Based Regularization In Machine Learning: Discovering Driver Modules In Biological Networks, Xi Gao

Theses and Dissertations

Curiosity of human nature drives us to explore the origins of what makes each of us different. From ancient legends and mythology, Mendel's law, Punnett square to modern genetic research, we carry on this old but eternal question. Thanks to technological revolution, today's scientists try to answer this question using easily measurable gene expression and other profiling data. However, the exploration can easily get lost in the data of growing volume, dimension, noise and complexity. This dissertation is aimed at developing new machine learning methods that take data from different classes as input, augment them with knowledge of feature relationships, …


De Novo Protein Structure Modeling And Energy Function Design, Lin Chen Jan 2015

De Novo Protein Structure Modeling And Energy Function Design, Lin Chen

Computer Science Theses & Dissertations

The two major challenges in protein structure prediction problems are (1) the lack of an accurate energy function and (2) the lack of an efficient search algorithm. A protein energy function accurately describing the interaction between residues is able to supervise the optimization of a protein conformation, as well as select native or native-like structures from numerous possible conformations. An efficient search algorithm must be able to reduce a conformational space to a reasonable size without missing the native conformation. My PhD research studies focused on these two directions.

A protein energy function—the distance and orientation dependent energy function of …


Evaluation Of The Signature Molecular Descriptor With Blosum62 And An All-Atom Description For Use In Sequence Alignment Of Proteins, Lindsay M. Aichinger Jan 2015

Evaluation Of The Signature Molecular Descriptor With Blosum62 And An All-Atom Description For Use In Sequence Alignment Of Proteins, Lindsay M. Aichinger

Williams Honors College, Honors Research Projects

This Honors Project focused on a few aspects of this topic. The second is comparing the molecular signature kernels to three of the BLOSUM matrices (30, 62, and 90) to test the accuracy of the mathematical model. The kernel matrix was manipulated in order to improve the relationship by focusing on side groups and also by changing how the structure was represented in the matrix by increasing the initial height distance from the central atom (Height 1 and Height 2 included).

There were multiple design constraints for this project. The first was the comparison with the BLOSUM matrices (30, 62, …


Gene Expression Noise In Stress Response As A Survival Strategy In Fluctuating Environments, Javier Garcia-Bernardo Jan 2015

Gene Expression Noise In Stress Response As A Survival Strategy In Fluctuating Environments, Javier Garcia-Bernardo

Graduate College Dissertations and Theses

Populations of cells live in uncertain environments, where they encounter large variations in nutrients, oxygen and toxic compounds. In the fluctuating environment, cells can sense their surroundings and express proteins to protect themselves against harmful substances. However, if the stressor appears infrequently or abruptly, sensing can be too costly or too slow, and cells cannot rely solely on it. To hedge against the sudden appearance of a stressor, cell populations can also rely on phenotypic diversification through bet-hedging. In bet-hedging, cells exploit noise in gene expression or use multistable genetic networks to produce an heterogeneous distribution of resistance-conferring protein levels. …


Grass-Based Dairy In Vermont: Benefits, Barriers, And Effective Public Policies, Serge William Wiltshire Jan 2015

Grass-Based Dairy In Vermont: Benefits, Barriers, And Effective Public Policies, Serge William Wiltshire

Graduate College Dissertations and Theses

A comprehensive literature review was undertaken in order to define and assess the sustainability and resiliency characteristics associated with grass-based and confinement dairy farming. Primarily as a result of reduced input costs, grass-based dairy farming often enhances profitability over confinement systems, especially on small farms. Further, conversion of tilled soil to permanent pasture has been shown to significantly reduce harmful sediment and nutrient transport into waterways. Perennial forage also acts as a carbon sink, curtailing or even negating a grass-based farm's carbon footprint. Finally, social benefits derived from enhanced nutrition and higher quality of life are also associated with grass-based …