Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics Commons

Open Access. Powered by Scholars. Published by Universities.®

3,511 Full-Text Articles 4,891 Authors 2,591,145 Downloads 167 Institutions

All Articles in Applied Statistics

Faceted Search

3,511 full-text articles. Page 96 of 107.

An Analysis Of Boosted Regression Trees To Predict The Strength Properties Of Wood Composites, Dillon Matthew Carty 2011 University of Tennessee, Knoxville

An Analysis Of Boosted Regression Trees To Predict The Strength Properties Of Wood Composites, Dillon Matthew Carty

Masters Theses

The forest products industry is a significant contributor to the U.S. economy contributing six percent of the total U.S. manufacturing gross domestic product (GDP), placing it on par with the U.S. automotive and plastics industries. Sustaining business competitiveness by reducing costs and maintaining product quality will be essential in the long term for this industry. Improved production efficiency and business competitiveness is the primary rationale for this work. A challenge facing this industry is to develop better knowledge of the complex nature of process variables and their relationship with final product quality attributes. Quantifying better the relationships between process variables …


Exploring The Effectiveness Of Environmentally Sustainable Practices In Municipal Government: A Case Study Of The City Of Knoxville’S Department Of Parks And Recreation, Anthony Michael Brown 2011 University of Tennessee - Knoxville

Exploring The Effectiveness Of Environmentally Sustainable Practices In Municipal Government: A Case Study Of The City Of Knoxville’S Department Of Parks And Recreation, Anthony Michael Brown

Masters Theses

Sustainability practices produce programs and services that meet current needs while preserving the environment and natural resources for the future. City parks and recreation departments are facing budget shortfalls and increasing expectations from customers. Governments are now embracing sustainability practices to create financial savings while also fostering relations with customers.

The purpose of this single case study was twofold: (1) to examine the effectiveness of one city department’s strategies in outsourcing its environmental sustainability program through a performance contract with Ameresco; and (2) to examine the perceptions of key department employees about the effectiveness of the sustainability initiative. A …


Statistical Analysis Of Fatalities Due To Vehicle Accidents In Las Vegas, Nv, Annabelle Marie Mathis 2011 University of Nevada, Las Vegas

Statistical Analysis Of Fatalities Due To Vehicle Accidents In Las Vegas, Nv, Annabelle Marie Mathis

UNLV Theses, Dissertations, Professional Papers, and Capstones

The goal of this thesis is to investigate factors that affect the odds of having a fatality in a vehicle collision. We will be looking at characteristics of the driver that caused the accident (age, gender, behavior, actions, influences, and seat belt worn), the characteristics of the vehicle the driver drove (type of vehicle, and air bag deployment), the characteristics of the environment in which the accident occurred (weather, road condition, lighting, time of day, the day of the week, and month of the year), the characteristics of the crash (direction of accident and how many vehicles were involved), and …


Identifying Unique Neighborhood Characteristics To Guide Health Planning For Stroke And Heart Attack: Fuzzy Cluster And Discriminant Analyses Approaches, Ashley Pedigo, William Seaver, Agricola Odoi 2011 University of Tennessee, Knoxville

Identifying Unique Neighborhood Characteristics To Guide Health Planning For Stroke And Heart Attack: Fuzzy Cluster And Discriminant Analyses Approaches, Ashley Pedigo, William Seaver, Agricola Odoi

Agricola Odoi

Background: Socioeconomic, demographic, and geographic factors are known determinants of stroke and myocardial infarction (MI) risk. Clustering of these factors in neighborhoods needs to be taken into consideration during planning, prioritization and implementation of health programs intended to reduce disparities. Given the complex and multidimensional nature of these factors, multivariate methods are needed to identify neighborhood clusters of these determinants so as to better understand the unique neighborhood profiles. This information is critical for evidence-based health planning and service provision. Therefore, this study used a robust multivariate approach to classify neighborhoods and identify their socio-demographic characteristics so as to provide …


Diagnostic Checking, Time Series And Regression, Esam Mahdi 2011 The University of Western Ontario

Diagnostic Checking, Time Series And Regression, Esam Mahdi

Electronic Thesis and Dissertation Repository

In this thesis, a new univariate-multivariate portmanteau test is derived. The proposed test statistic can be used for diagnostic checking ARMA, VAR, FGN, GARCH, and TAR time series models as well as for checking randomness of series and goodness-of- fit VAR models with stable Paretian errors. The asymptotic distribution of the test statistic is derived as well as a chi-square approximation. However, the Monte-Carlo test is recommended unless the series is very long. Extensive simulation experiments demonstrate the usefulness of this test and its improved power performance compared to widely used previous multivariate portmanteau diagnostic check. The contributed R package …


Asymptotic Theory For Cross-Validated Targeted Maximum Likelihood Estimation, Wenjing Zheng, Mark J. van der Laan 2011 University of California, Berkeley, Division of Biostatistics

Asymptotic Theory For Cross-Validated Targeted Maximum Likelihood Estimation, Wenjing Zheng, Mark J. Van Der Laan

Wenjing Zheng

We consider a targeted maximum likelihood estimator of a path-wise differentiable parameter of the data generating distribution in a semi-parametric model based on observing n independent and identically distributed observations. The targeted maximum likelihood estimator (TMLE) uses V-fold sample splitting for the initial estimator in order to make the TMLE maximally robust in its bias reduction step. We prove a general theorem that states asymptotic efficiency (and thereby regularity) of the targeted maximum likelihood estimator when the initial estimator is consistent and a second order term converges to zero in probability at a rate faster than the square root of …


Using R To Create Synthetic Discrete Response Regression Models, Joseph Hilbe 2011 Arizona State University

Using R To Create Synthetic Discrete Response Regression Models, Joseph Hilbe

Joseph M Hilbe

The creation of synthetic models allows a researcher to better understand models as well as the bias that can occur when the assumptions upon which a model is based is violated. This article provides R code that can be used or amended to create a variety of discrete response regression models.


Helin Institutions' Collection Statistics From Fy 10 To Fy 11, Martha Rice Sanders 2011 HELIN Consortium

Helin Institutions' Collection Statistics From Fy 10 To Fy 11, Martha Rice Sanders

HELIN Collection Statistics

Statistical information about the total number of item and holdings (serials) records held by each HELIN member institution as of June 30, 2010, and June 30, 2011. Gives the percentage of growth for each institution. Additionally, a chart and statistics for the number of item records held by each HELIN member institution as of June 30, 2011. A Chart of e-book collection totals and the libraries to which they belong. Finally, a chart of serials holdings for both paper (plus microform, etc.) and electronic journals, including the CRIARL libraries.


Comparing Hall Of Fame Baseball Players Using Most Valuable Player Ranks, Paul Kvam 2011 University of Richmond

Comparing Hall Of Fame Baseball Players Using Most Valuable Player Ranks, Paul Kvam

Department of Math & Statistics Faculty Publications

We propose a rank-based statistical procedure for comparing performances of top major league baseball players who performed in different eras. The model is based on using the player ranks from voting results for the most valuable player awards in the American and National Leagues. The current voting procedure has remained the same since 1932, so the analysis regards only data for players whose career blossomed after that time. Because the analysis is based on quantiles, its basis is nonparametric and relies on a simple link function. Results are stratified by fielding position, and we compare 73 Hall of Fame players …


Solving The Differential Equation For The Probit Function Using A Variant Of The Carleman Embedding Technique., Kelechukwu Iroajanma Alu 2011 East Tennessee State University

Solving The Differential Equation For The Probit Function Using A Variant Of The Carleman Embedding Technique., Kelechukwu Iroajanma Alu

Electronic Theses and Dissertations

The probit function is the inverse of the cumulative distribution function associated with the standard normal distribution. It is of great utility in statistical modelling. The Carleman embedding technique has been shown to be effective in solving first order and, less efficiently, second order nonlinear differential equations. In this thesis, we show that solutions to the second order nonlinear differential equation for the probit function can be approximated efficiently using a variant of the Carleman embedding technique.


Estimating Area And Lag Associated With Thermal Hysteresis In Cattle, F. Yang, A. M. Parkhurst 2011 Kansas State University Libraries

Estimating Area And Lag Associated With Thermal Hysteresis In Cattle, F. Yang, A. M. Parkhurst

Conference on Applied Statistics in Agriculture

Thermal hysteresis in cattle becomes visible when the phase diagram of body temperature (Tb) vs ambient temperature (Ta) exhibits a loop. The hysteresis loop shows a rotated elliptical pattern which depends on the lag between Tb and Ta. The area of the loop can be used to quantify the amount of heat stress during thermal challenge. Three methods to estimate the area and lag of the elliptical hysteresis loop are: linear least squares method, ellipse-specific nonlinear least squares method, and Lapshin’s analytical method. Linear least squares method uses residual least squares to estimate the coefficients of the ellipse for which …


Comparison Of Linear Mixed Models For Multiple Environment Plant Breeding Trials, Carl A. Walker, Fabiano Pita, Kimberly Garland Campbell 2011 Kansas State University Libraries

Comparison Of Linear Mixed Models For Multiple Environment Plant Breeding Trials, Carl A. Walker, Fabiano Pita, Kimberly Garland Campbell

Conference on Applied Statistics in Agriculture

Evaluations of multiple environment trials (MET) often reveal substantial genotype by environment interactions, and the effects of genotypes within environments are often estimated using cell means, i.e. the simple mean of the observations of each genotype in each environment. However, these estimates are inaccurate, especially for unreplicated or partially replicated trials, so alternative methods of analysis are necessary. One possible approach utilizes information, often from pedigree data, about relationships among the tested genotypes through the use of a genetic relationship matrix (GRM). Predictive accuracy may also be improved by the use of factor analytic (FA) structures for environmental covariances. In …


A Hierarchical Bayesian Approach For Detecting Differential Gene Expression In Unreplicated Rna-Sequencing Data, Sanvesh Srivastava, R. W. Doerge 2011 Kansas State University Libraries

A Hierarchical Bayesian Approach For Detecting Differential Gene Expression In Unreplicated Rna-Sequencing Data, Sanvesh Srivastava, R. W. Doerge

Conference on Applied Statistics in Agriculture

Next-generation sequencing technologies have emerged as a promising technology in a variety of fields, including genomics, epigenomics, and transcriptomics. These technologies play an important role in understanding cell organization and functionality. Unlike data from earlier technologies (e.g., microarrays), data from next-generation sequencing technologies are highly replicable with little technical variation. One application of next-generation sequencing technologies is RNA-Sequencing (RNA-Seq). It is used for detecting differential gene expression between different biological conditions. While statistical methods for detecting differential expression in RNA-Seq data exist, one serious limitation to these methods is the absence of biological replication. At present, the high cost of …


Bootstrap Estimation And Comparison Of An Index Of Phylogenetic Correlation, William J. Price, Bahman Shafii, Carole B. Rapo, Sanford D. Eigenbrode, John Gaskin 2011 Kansas State University Libraries

Bootstrap Estimation And Comparison Of An Index Of Phylogenetic Correlation, William J. Price, Bahman Shafii, Carole B. Rapo, Sanford D. Eigenbrode, John Gaskin

Conference on Applied Statistics in Agriculture

A common objective of bioinformatic analyses is to assess the similarity of species, given a biological trait or characteristic. Phylogenetic correlation is one means to achieve this objective. Such measures provide a means to evaluate evolutionary models and history as well as having potential application to ecological relationships including host preference selection. Typically, these measurements are based on the deviation of an observed phylogeny from a Brownian evolutionary model. Statistical inference for this difference is assessed through likelihood ratio tests. These tests, in turn, rely on the assumption of a Normal likelihood within the phylogenetic trait. In addition, statistical comparison …


Modeling The Root-Knot Nematode/Nutsedge Pest Complex: Perspectives From Weed Science, Nematology And Statistics, Leigh Murray, Stephen H. Thomas, Jill Schroeder, Scott Kreider, Zhining Ou, J. M. Trojan, C. Fiore 2011 Kansas State University Libraries

Modeling The Root-Knot Nematode/Nutsedge Pest Complex: Perspectives From Weed Science, Nematology And Statistics, Leigh Murray, Stephen H. Thomas, Jill Schroeder, Scott Kreider, Zhining Ou, J. M. Trojan, C. Fiore

Conference on Applied Statistics in Agriculture

Previous research by the authors has established that southern root-knot nematode (SRKN, Meloidogyne incognita (Kofoid & White) Chitwood) and yellow and purple nutsedge (YNS, Cyperus esculentus L. and PNS, C. rotundus L.) form a pest-complex that adversely affects a wide variety of crops in the southern and western U.S. These pests appear to have co-evolved a mutually-beneficial relationship that promotes the survival of both nematodes and weeds to the detriment of crops. Traditional management has usually targeted one pest at a time, but managing this pest complex requires that all members of the complex be managed simultaneously. A series of …


Multi-Parental Mating Design Analysis: Model Evaluation And Application In Spring Wheat, M. Kadariya, K. D. Glover, J. Wu, J. L. Gonzalez 2011 Kansas State University Libraries

Multi-Parental Mating Design Analysis: Model Evaluation And Application In Spring Wheat, M. Kadariya, K. D. Glover, J. Wu, J. L. Gonzalez

Conference on Applied Statistics in Agriculture

Conventional quantitative genetics studies have mainly focused on bi-parental mating systems. However, genetic potential of selected individuals within a segregating population may be limited due to only two parents being used for each cross. Multiple-parental mating systems have been proposed that involve three or four diverse parents. This provides a higher potential of combining desirable genes. Due to complexity of the data structure of multi-parental mating systems, analysis of variance (ANOVA) methods are not applicable in analysis. The objective of this study is to validate and apply a mixed linear model approach, minimum norm quadratic unbiased estimation (MINQUE), to analyze …


Estimating The Subject By Treatment Interaction In Non-Replicated Crossover Diet Studies, Matthew Kramer, Shirley C. Chen, Sarah K. Gebauer, David J. Baer 2011 Kansas State University Libraries

Estimating The Subject By Treatment Interaction In Non-Replicated Crossover Diet Studies, Matthew Kramer, Shirley C. Chen, Sarah K. Gebauer, David J. Baer

Conference on Applied Statistics in Agriculture

Researchers in human nutrition commonly refer to the ‘consistent’ diet effect (i.e. the main effect of diet) and an ‘inconsistent’ diet effect (i.e. a subject by diet interaction). However, due to the non-replicated designs of most studies, one can only estimate the first part using ANOVA; the latter (interaction) is confounded with the residual noise. In many diet studies, it appears that subjects do respond differently to the same diet, so the subject by diet interaction may be large. In a search of over 40,000 published human nutrition studies, most using a crossover design, we found that in none was …


Probability Models To Study The Spatial Pattern, Abundance And Diversity Of Tree Species, D. M. Gowda 2011 Kansas State University Libraries

Probability Models To Study The Spatial Pattern, Abundance And Diversity Of Tree Species, D. M. Gowda

Conference on Applied Statistics in Agriculture

Ecological communities are composed of complex vegetation that differs from community to community and also within the community. The variability of tree species in the community in relation to their environments can be studied by using different statistical tools. The present study was conducted to describe and also to quantify the spatial pattern, abundance and diversity of tree species in the Western Ghats of Karnataka. The spatial pattern of tree species was studied by using Poisson and Negative binomial distributions. Results indicate that most of the selected tree species followed Negative binomial distribution having clumped pattern. The Species abundance distribution …


Spatio-Temporal Covariance Modeling With Some Arma Temporal Margins, Samuel Seth Demel, Juan Du 2011 Kansas State University Libraries

Spatio-Temporal Covariance Modeling With Some Arma Temporal Margins, Samuel Seth Demel, Juan Du

Conference on Applied Statistics in Agriculture

A valid covariance structure is needed to model spatio-temporal data in various disciplines, such as environmental science, climatology and agriculture. In this work we propose a collection of spatio-temporal functions whose discrete temporal margins are some autoregressive and moving average (ARMA) models, obtain a necessary and sufficient condition for them to be covariance functions. An asymmetric version of this model is also provided to account for space-time irreversibility property in practice. Finally, a spatio-temporal model with AR(2) discrete margin is fitted to wind data from Ireland for estimation and prediction, which are compared with some general existing parametric models in …


Logistic Regression Analysis To Determine Factors Contributing To Summer Feedlot Deaths, J. Clausen, A. M. Parkhurst, T. L. Mader 2011 Kansas State University Libraries

Logistic Regression Analysis To Determine Factors Contributing To Summer Feedlot Deaths, J. Clausen, A. M. Parkhurst, T. L. Mader

Conference on Applied Statistics in Agriculture

Summer heat has already been identified as a major factor for cattle deaths in the feedlot. This study attempts to assess what other factors contribute to and/or influence cattle deaths. Identifying multiple factors that contribute to summer feedlot deaths could aid feedlot managers in implementation of mitigation strategies and minimize the loss of nearly finished cattle. Daily pen, cattle, and nutritional characteristics were recorded and included in this generalized linear mixed model analysis. Cattle data were obtained from cattle pens at a single location from July 1, 2010 to July 31, 2010. Hourly weather data were acquired from this feed …


Digital Commons powered by bepress