Open Access. Powered by Scholars. Published by Universities.®

Statistical Models Commons

1,359 Full-Text Articles 2,009 Authors 853,222 Downloads 156 Institutions

All Articles in Statistical Models

Statistical And Machine Learning Methods Evaluated For Incorporating Soil And Weather Into Corn Nitrogen Recommendations, Curtis J. Ransom, Newell R. Kitchen, James J. Camberato, Paul R. Carter, Richard B. Ferguson, Fabián G. Fernández, David W. Franzen, Carrie A. M. Laboski, D. Brenton Myers, Emerson D. Nafziger, John E. Sawyer, John F. Shanahan 2019 University of Missouri

John E. Sawyer

Nitrogen (N) fertilizer recommendation tools could be improved for estimating corn (Zea mays L.) N needs by incorporating site-specific soil and weather information. However, an evaluation of analytical methods is needed to determine the success of incorporating this information. The objectives of this research were to evaluate statistical and machine learning (ML) algorithms for utilizing soil and weather information for improving corn N recommendation tools. Eight algorithms [stepwise, ridge regression, least absolute shrinkage and selection operator (Lasso), elastic net regression, principal component regression (PCR), partial least squares regression (PLSR), decision tree, and random forest] were evaluated using a dataset …


Effective Statistical Energy Function Based Protein Un/Structure Prediction, Avdesh Mishra 2019 University of New Orleans

University of New Orleans Theses and Dissertations

Proteins are an important component of living organisms, composed of one or more polypeptide chains, each containing hundreds or even thousands of amino acids of 20 standard types. The structure of a protein, which follows from its sequence, determines crucial functions such as initiating metabolic reactions, DNA replication, cell signaling, and transporting molecules. In the past, proteins were considered to always have a well-defined stable shape (structured proteins); however, it has recently been shown that there exist intrinsically disordered proteins (IDPs), which lack a fixed or ordered 3D structure, have dynamic characteristics, and therefore exist in multiple states. Based on …


Exploring The Estimability Of Mark-Recapture Models With Individual, Time-Varying Covariates Using The Scaled Logit Link Function, Jiaqi Mu 2019 The University of Western Ontario

Electronic Thesis and Dissertation Repository

Mark-recapture studies are often used to estimate the survival of individuals in a population and identify factors that affect survival in order to understand how the population might be affected by changing conditions. Factors that vary between individuals and over time, like body mass, present a challenge because they can only be observed when an individual is captured. Several models have been proposed to deal with the missing-covariate problem and commonly impose a logit link function which implies that the survival probability varies between 0 and 1. In this thesis I explore the estimability of four possible models when survival …


Split Credibility: A Two-Dimensional Semi-Linear Credibility Model, Jingbing Qiu 2019 The University of Western Ontario

Electronic Thesis and Dissertation Repository

In this thesis, we introduce a two-dimensional semi-linear credibility model, which is an extension of the classical credibility or split credibility models used by practicing actuaries. Our model predicts the future expected losses of a policyholder by considering its historical primary and excess losses. The optimal split point is derived based on the mean squared error criterion. We show analytically when and why splitting a policyholder’s historical losses into primary and excess parts works. In addition, we derive formulas for estimating our model parameters nonparametrically. Finally, we show the application of our model through three examples.


Successful Shot Locations And Shot Types Used In NCAA Men’s Division I Basketball, Olivia D. Perrin 2019 Northern Michigan University

All NMU Master's Theses

The primary purpose of the current study was to investigate the effect of court location (distance and angle from basket) and shot types used on shot success in NCAA Men’s DI basketball during the 2017-18 season. A secondary purpose was to further expand the analysis based on two additional factors: player position (guard, forward, or center) and team ranking. All statistical analyses were completed in RStudio, and three binomial logistic regression analyses were performed to evaluate factors that influence shot success: one for all two- and three-point shot attempts, one for only two-point attempts, and one for only …


Spatio-Temporal Prediction Of Arkansas Gubernatorial Election, Michael Harris 2019 University of Arkansas, Fayetteville

Graduate Theses and Dissertations

Our goal is to create spatio-temporal models for predicting future gubernatorial elections. As a concrete example of how well our models work, we use past data to predict the 2018 Arkansas gubernatorial election and use the existing 2018 election data to check our models' predictive accuracy. Gubernatorial election data was collected from the Arkansas Secretary of State website, while related covariate data was collected from the website for the Federal Reserve Bank of St. Louis. The data we collect is at the county level. For predictive purposes we fit multiple models to the data using Markov chain Monte Carlo and …


Development Of A Statistical Shape-Function Model Of The Implanted Knee For Real-Time Prediction Of Joint Mechanics, Kalin Gibbons 2019 Boise State University

Boise State University Theses and Dissertations

Outcomes of total knee arthroplasty (TKA) are dependent on surgical technique, patient variability, and implant design. Non-optimal design or alignment choices may result in undesirable contact mechanics and joint kinematics, including poor joint alignment, instability, and reduced range of motion. Implant design and surgical alignment are modifiable factors with potential to improve patient outcomes, and there is a need for robust implant designs that can accommodate patient variability. Our objective was to develop a statistical shape-function model (SFM) of a posterior stabilized implant knee to instantaneously predict output mechanics in an efficient manner. Finite element methods were combined with Latin …


Robustness Of Semi-Parametric Survival Model: Simulation Studies And Application To Clinical Data, Isaac Nwi-Mozu 2019 East Tennessee State University

Electronic Theses and Dissertations

An efficient way of analyzing clinical survival data, such as cancer data, is of great concern to health experts. In this study, we investigate and propose an efficient way of handling clinical survival data. Simulation studies were conducted to compare the performance of various survival modeling techniques using the R package "survsim". Model performance was assessed with varying sample sizes as small ($n5000$). For small and mild samples, the semi-parametric model outperforms or approximates the parametric model. However, for large samples, the parametric model outperforms the semi-parametric model. We compared the effectiveness and reliability …


Hierarchical Modeling And Differential Expression Analysis For RNA-Seq Experiments With Inbred And Hybrid Genotypes, Andrew Lithio, Dan Nettleton 2019 Iowa State University

Dan Nettleton

The performance of inbred and hybrid genotypes is of interest in plant breeding and genetics. High-throughput sequencing of RNA (RNA-seq) has proven to be a useful tool in the study of the molecular genetic responses of inbreds and hybrids to environmental stresses. Commonly used experimental designs and sequencing methods lead to complex data structures that require careful attention in data analysis. We demonstrate an analysis of RNA-seq data from a split-plot design involving drought stress applied to two inbred genotypes and two hybrids formed by crosses between the inbreds. Our generalized linear modeling strategy incorporates random effects for whole-plot experimental …


Nested Hierarchical Functional Data Modeling And Inference For The Analysis Of Functional Plant Phenotypes, Yuhang Xu, Yehua Li, Dan Nettleton 2019 University of Nebraska - Lincoln

Dan Nettleton

In a plant science Root Image Study, the process of seedling roots bending in response to gravity is recorded using digital cameras, and the bending rates are modeled as functional plant phenotype data. The functional phenotypes are collected from seeds representing a large variety of genotypes and have a three-level nested hierarchical structure, with seeds nested in groups nested in genotypes. The seeds are imaged on different days of the lunar cycle, and an important scientific question is whether there are lunar effects on root bending. We allow the mean function of the bending rate to depend on the lunar …


Root Type-Specific Reprogramming Of Maize Pericycle Transcriptomes By Local High Nitrate Results In Disparate Lateral Root Branching Patterns, Peng Yu, Jutta A. Baldauf, Andrew Lithio, Caroline Marcon, Dan Nettleton, Chunjian Li, Frank Hochholdinger 2019 China Agricultural University

Dan Nettleton

The adaptability of root system architecture to unevenly distributed mineral nutrients in soil is a key determinant of plant performance. The molecular mechanisms underlying nitrate-dependent plasticity of lateral root branching across the different root types of maize are poorly understood. In this study, detailed morphological and anatomical analyses together with cell type-specific transcriptome profiling experiments combining laser capture microdissection with RNA-seq were performed to unravel the molecular signatures of lateral root formation in primary, seminal, crown, and brace roots of maize (Zea mays) upon local high nitrate stimulation. The four maize root types displayed divergent branching …


Using Random Forests To Estimate Win Probability Before Each Play Of An NFL Game, Dennis Lock, Dan Nettleton 2019 Iowa State University

Dan Nettleton

Before any play of a National Football League (NFL) game, the probability that a given team will win depends on many situational variables (such as time remaining, yards to go for a first down, field position and current score) as well as the relative quality of the two teams as quantified by the Las Vegas point spread. We use a random forest method to combine pre-play variables to estimate Win Probability (WP) before any play of an NFL game. When a subset of NFL play-by-play data for the 12 seasons from 2001 to 2012 is used as a training dataset, …


Stability Of Single-Parent Gene Expression Complementation In Maize Hybrids Upon Water Deficit Stress, Caroline Marcon, Anja Paschold, Waqas Ahmed Malik, Andrew Lithio, Jutta A. Baldauf, Lena Altrogge, Nina Opitz, Christa Lanz, Heiko Schoof, Dan Nettleton, Hans-Peter Piepho, Frank Hochholdinger 2019 University of Bonn

Dan Nettleton

Heterosis is the superior performance of F1 hybrids compared with their homozygous, genetically distinct parents. In this study, we monitored the transcriptomic divergence of the maize (Zea mays) inbred lines B73 and Mo17 and their reciprocal F1 hybrid progeny in primary roots under control and water deficit conditions simulated by polyethylene glycol treatment. Single-parent expression (SPE) of genes is an extreme instance of gene expression complementation, in which genes are active in only one of two parents but are expressed in both reciprocal hybrids. In this study, 1,997 genes only expressed in B73 and 2,024 genes …


Genomic Neighborhoods For Arabidopsis Retrotransposons: A Role For Targeted Integration In The Distribution Of The Metaviridae, Brooke D. Peterson-Burch, Dan Nettleton, Daniel F. Voytas 2019 U.S. Department of Agriculture

Dan Nettleton

Background: Retrotransposons are an abundant component of eukaryotic genomes. The high quality of the Arabidopsis thaliana genome sequence makes it possible to comprehensively characterize retroelement populations and explore factors that contribute to their genomic distribution.

Results: We identified the full complement of A. thaliana long terminal repeat (LTR) retroelements using RetroMap, a software tool that iteratively searches genome sequences for reverse transcriptases and then defines retroelement insertions. Relative ages of full-length elements were estimated by assessing sequence divergence between LTRs: the Pseudoviridae were significantly younger than the Metaviridae. All retroelement insertions were mapped onto the genome sequence and their distribution …


Allocative Poisson Factorization For Computational Social Science, Aaron Schein 2019 University of Massachusetts Amherst

Doctoral Dissertations

Social science data often comes in the form of high-dimensional discrete data such as categorical survey responses, social interaction records, or text. These data sets exhibit high degrees of sparsity, missingness, overdispersion, and burstiness, all of which present challenges to traditional statistical modeling techniques. The framework of Poisson factorization (PF) has emerged in recent years as a natural way to model high-dimensional discrete data sets. This framework assumes that each observed count in a data set is a Poisson random variable $y \sim \mathrm{Pois}(\mu)$ whose rate parameter $\mu$ is a function of shared model parameters. This thesis examines a specific …
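
As a rough illustration of the Poisson factorization assumption described in this abstract, the sketch below scores a small count matrix under rates built from shared factors. The inner-product form of the rate, the names `theta` and `phi`, and the toy numbers are assumptions made for illustration only; they are not taken from the thesis.

```python
import math


def rate(theta_i, phi_j):
    """Rate mu_ij as an inner product of non-negative factor vectors (assumed form)."""
    return sum(t * p for t, p in zip(theta_i, phi_j))


def poisson_log_pmf(y, mu):
    """log Pois(y | mu) = y*log(mu) - mu - log(y!)."""
    if mu <= 0:
        raise ValueError("rate must be positive")
    return y * math.log(mu) - mu - math.lgamma(y + 1)


def log_likelihood(counts, theta, phi):
    """Sum log Pois(y_ij | mu_ij) over every cell of a dense count matrix."""
    return sum(
        poisson_log_pmf(y, rate(theta[i], phi[j]))
        for i, row in enumerate(counts)
        for j, y in enumerate(row)
    )


# Hypothetical toy data: 2 rows, 3 columns, 2 latent components.
theta = [[1.0, 0.5], [0.2, 2.0]]
phi = [[1.0, 0.1], [0.5, 0.5], [0.1, 1.5]]
counts = [[1, 0, 2], [0, 1, 3]]
print(log_likelihood(counts, theta, phi))
```

Because every cell shares the same factor vectors, changing one entry of `theta` or `phi` shifts the rates of a whole row or column, which is the sense in which the parameters are shared across observations.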


Complementation Contributes To Transcriptome Complexity In Maize (Zea Mays L.) Hybrids Relative To Their Inbred Parents, Anja Paschold, Yi Jia, Caroline Marcon, Steve Lund, Nick B. Larson, Cheng-Ting Yeh, Stephan Ossowski, Christa Lanz, Dan Nettleton, Patrick S. Schnable, Frank Hochholdinger 2019 University of Bonn

Dan Nettleton

Typically, F1-hybrids are more vigorous than their homozygous, genetically distinct parents, a phenomenon known as heterosis. In the present study, the transcriptomes of the reciprocal maize (Zea mays L.) hybrids B73×Mo17 and Mo17×B73 and their parental inbred lines B73 and Mo17 were surveyed in primary roots, early in the developmental manifestation of heterotic root traits. The application of statistical methods and a suitable experimental design established that 34,233 (i.e., 86%) of all high-confidence maize genes were expressed in at least one genotype. Nearly 70% of all expressed genes were differentially expressed between the two parents and 42%–55% …


Estimation And Testing Of Gene Expression Heterosis, Tieming Ji, Peng Liu, Dan Nettleton 2019 University of Missouri

Dan Nettleton

Heterosis, also known as hybrid vigor, occurs when the mean phenotype of hybrid offspring is superior to that of its two inbred parents. The heterosis phenomenon is extensively utilized in agriculture, though its molecular basis is still unknown. In an effort to understand phenotypic heterosis at the molecular level, researchers have begun to compare expression levels of thousands of genes between parental inbred lines and their hybrid offspring to search for evidence of gene expression heterosis. Standard statistical approaches for separately analyzing expression data for each gene can produce biased and highly variable estimates and unreliable tests of heterosis. …


Non-Syntenic Genes Drive RTCS-Dependent Regulation Of The Embryo Transcriptome During Formation Of Seminal Root Primordia In Maize (Zea Mays L.), Huanhuan Tai, Nina Opitz, Andrew Lithio, Xin Lu, Dan Nettleton, Frank Hochholdinger 2019 University of Bonn

Dan Nettleton

Seminal roots of maize are pivotal for early seedling establishment. The maize mutant rootless concerning crown and seminal roots (rtcs) is defective in seminal root initiation during embryogenesis. In this study, the transcriptomes of wild-type and rtcs embryos were analyzed by RNA-Seq based on histological results at three stages of seminal root primordia formation. Hierarchical clustering highlighted that samples of each genotype grouped together along development. Determination of their gene activity status revealed hundreds of genes specifically transcribed in wild-type or rtcs embryos, while K-means clustering revealed changes in gene expression dynamics between wild-type and rtcs during embryo …


Post-Weaning Blood Transcriptomic Differences Between Yorkshire Pigs Divergently Selected For Residual Feed Intake, Haibo Liu, Yet T. Nguyen, Dan Nettleton, Jack C. M. Dekkers, Christopher K. Tuggle 2019 Iowa State University

Dan Nettleton

Background: Improving feed efficiency (FE) of pigs by genetic selection is of economic and environmental significance. An increasingly accepted measure of feed efficiency is residual feed intake (RFI). Currently, the molecular mechanisms underlying RFI are largely unknown. Additionally, to incorporate RFI into animal breeding programs, feed intake must be recorded on individual pigs, which is costly and time-consuming. Thus, convenient and predictive biomarkers for RFI that can be measured at an early age are greatly desired. In this study, we aimed to explore whether differences exist in the global gene expression profiles of peripheral blood of 35 to 42 day-old …


Substantial Contribution Of Genetic Variation In The Expression Of Transcription Factors To Phenotypic Variation Revealed By eRD-GWAS, Hung-Ying Lin, Qiang Liu, Xiao Li, Jinliang Yang, Sanzhen Liu, Yinlian Huang, Michael J. Scanlon, Dan Nettleton, Patrick S. Schnable 2019 Iowa State University

Dan Nettleton

Background: There are significant limitations in existing methods for the genome-wide identification of genes whose expression patterns affect traits.

Results: The transcriptomes of five tissues from 27 genetically diverse maize inbred lines were deeply sequenced to identify genes exhibiting high and low levels of expression variation across tissues or genotypes. Transcription factors are enriched among genes with the most variation in expression across tissues, as well as among genes with higher-than-median levels of variation in expression across genotypes. In contrast, transcription factors are depleted among genes whose expression is either highly stable or highly variable across genotypes. We developed a …

