Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Biostatistics (4)
- Statistical Models (4)
- Medicine and Health Sciences (3)
- Multivariate Analysis (3)
- Bioinformatics (2)
-
- Biometry (2)
- Categorical Data Analysis (2)
- Computational Biology (2)
- Genetics and Genomics (2)
- Life Sciences (2)
- Statistical Methodology (2)
- Applied Mathematics (1)
- Applied Statistics (1)
- Artificial Intelligence and Robotics (1)
- Astrophysics and Astronomy (1)
- Biochemistry, Biophysics, and Structural Biology (1)
- Biomechanics (1)
- Clinical Epidemiology (1)
- Clinical Trials (1)
- Computational Linguistics (1)
- Computational Neuroscience (1)
- Computer Sciences (1)
- Design of Experiments and Sample Surveys (1)
- Discourse and Text Linguistics (1)
- Disease Modeling (1)
- Diseases (1)
- Environmental Sciences (1)
- Institution
- Keyword
-
- Bioinformatics (1)
- Biomedical signal processing (1)
- Biostatistics (1)
- Crossover (1)
- Finite Mixture Models (1)
-
- HSV (1)
- Hierarchical Mixture Model (1)
- High dimensional data (1)
- High-dimensional data (1)
- High-performance computing (1)
- High-throughput genomics (1)
- Homogeneity Test (1)
- In- formation Criterion (1)
- Inference on regularized coefficient estimates (1)
- Integration (1)
- Large-scale biological data analysis (1)
- Message-passing interface (1)
- Micro-array Analysis (1)
- Microarray (1)
- Mixed (1)
- Multi-platform data (1)
- Non-negative matrix factorization (1)
- Pathway information incorporation (1)
- Poisson (1)
- Shedding (1)
- Streptococcus sanguinis (1)
- Text mining (1)
- WGCNA (1)
- Publication
- Publication Type
Articles 1 - 5 of 5
Full-Text Articles in Microarrays
Integration Of Multi-Platform High-Dimensional Omic Data, Xuebei An
Integration Of Multi-Platform High-Dimensional Omic Data, Xuebei An
Dissertations & Theses (Open Access)
The development of high-throughput biotechnologies have made data accessible from different platforms, including RNA sequencing, copy number variation, DNA methylation, protein lysate arrays, etc. The high-dimensional omic data derived from different technological platforms have been extensively used to facilitate comprehensive understanding of disease mechanisms and to determine personalized health treatments. Although vital to the progress of clinical research, the high dimensional multi-platform data impose new challenges for data analysis. Numerous studies have been proposed to integrate multi-platform omic data; however, few have efficiently and simultaneously addressed the problems that arise from high dimensionality and complex correlations.
In my dissertation, I …
Hpcnmf: A High-Performance Toolbox For Non-Negative Matrix Factorization, Karthik Devarajan, Guoli Wang
Hpcnmf: A High-Performance Toolbox For Non-Negative Matrix Factorization, Karthik Devarajan, Guoli Wang
COBRA Preprint Series
Non-negative matrix factorization (NMF) is a widely used machine learning algorithm for dimension reduction of large-scale data. It has found successful applications in a variety of fields such as computational biology, neuroscience, natural language processing, information retrieval, image processing and speech recognition. In bioinformatics, for example, it has been used to extract patterns and profiles from genomic and text-mining data as well as in protein sequence and structure analysis. While the scientific performance of NMF is very promising in dealing with high dimensional data sets and complex data structures, its computational cost is high and sometimes could be critical for …
Models For Hsv Shedding Must Account For Two Levels Of Overdispersion, Amalia Magaret
Models For Hsv Shedding Must Account For Two Levels Of Overdispersion, Amalia Magaret
UW Biostatistics Working Paper Series
We have frequently implemented crossover studies to evaluate new therapeutic interventions for genital herpes simplex virus infection. The outcome measured to assess the efficacy of interventions on herpes disease severity is the viral shedding rate, defined as the frequency of detection of HSV on the genital skin and mucosa. We performed a simulation study to ascertain whether our standard model, which we have used previously, was appropriately considering all the necessary features of the shedding data to provide correct inference. We simulated shedding data under our standard, validated assumptions and assessed the ability of 5 different models to reproduce the …
A Weighted Gene Co-Expression Network Analysis For Streptococcus Sanguinis Microarray Experiments, Erik C. Dvergsten
A Weighted Gene Co-Expression Network Analysis For Streptococcus Sanguinis Microarray Experiments, Erik C. Dvergsten
Theses and Dissertations
Streptococcus sanguinis is a gram-positive, non-motile bacterium native to human mouths. It is the primary cause of endocarditis and is also responsible for tooth decay. Two-component systems (TCSs) are commonly found in bacteria. In response to environmental signals, TCSs may regulate the expression of virulence factor genes.
Gene co-expression networks are exploratory tools used to analyze system-level gene functionality. A gene co-expression network consists of gene expression profiles represented as nodes and gene connections, which occur if two genes are significantly co-expressed. An adjacency function transforms the similarity matrix containing co-expression similarities into the adjacency matrix containing connection strengths. Gene …
Development In Normal Mixture And Mixture Of Experts Modeling, Meng Qi
Development In Normal Mixture And Mixture Of Experts Modeling, Meng Qi
Theses and Dissertations--Statistics
In this dissertation, first we consider the problem of testing homogeneity and order in a contaminated normal model, when the data is correlated under some known covariance structure. To address this problem, we developed a moment based homogeneity and order test, and design weights for test statistics to increase power for homogeneity test. We applied our test to microarray about Down’s syndrome. This dissertation also studies a singular Bayesian information criterion (sBIC) for a bivariate hierarchical mixture model with varying weights, and develops a new data dependent information criterion (sFLIC).We apply our model and criteria to birth- weight and gestational …