Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics Commons

Selected Works

Articles 1 - 30 of 337

Full-Text Articles in Applied Statistics

Stability Of Single-Parent Gene Expression Complementation In Maize Hybrids Upon Water Deficit Stress, Caroline Marcon, Anja Paschold, Waqas Ahmed Malik, Andrew Lithio, Jutta A. Baldauf, Lena Altrogge, Nina Opitz, Christa Lanz, Heiko Schoof, Dan Nettleton, Hans-Peter Piepho, Frank Hochholdinger Jul 2019

Dan Nettleton

Heterosis is the superior performance of F1 hybrids compared with their homozygous, genetically distinct parents. In this study, we monitored the transcriptomic divergence of the maize (Zea mays) inbred lines B73 and Mo17 and their reciprocal F1 hybrid progeny in primary roots under control and water deficit conditions simulated by polyethylene glycol treatment. Single-parent expression (SPE) of genes is an extreme instance of gene expression complementation, in which genes are active in only one of two parents but are expressed in both reciprocal hybrids. In this study, 1,997 genes only expressed in B73 and 2,024 genes …


Genomic Neighborhoods For Arabidopsis Retrotransposons: A Role For Targeted Integration In The Distribution Of The Metaviridae, Brooke D. Peterson-Burch, Dan Nettleton, Daniel F. Voytas Jul 2019

Dan Nettleton

Background: Retrotransposons are an abundant component of eukaryotic genomes. The high quality of the Arabidopsis thaliana genome sequence makes it possible to comprehensively characterize retroelement populations and explore factors that contribute to their genomic distribution.

Results: We identified the full complement of A. thaliana long terminal repeat (LTR) retroelements using RetroMap, a software tool that iteratively searches genome sequences for reverse transcriptases and then defines retroelement insertions. Relative ages of full-length elements were estimated by assessing sequence divergence between LTRs: the Pseudoviridae were significantly younger than the Metaviridae. All retroelement insertions were mapped onto the genome sequence and their distribution …


Empirical Bayes Analysis Of RNA-Seq Data For Detection Of Gene Expression Heterosis, Jarad Niemi, Eric Mittman, Will Landau, Dan Nettleton Jun 2019

Dan Nettleton

An important type of heterosis, known as hybrid vigor, refers to the enhancements in the phenotype of hybrid progeny relative to their inbred parents. Although hybrid vigor is extensively utilized in agriculture, its molecular basis is still largely unknown. In an effort to understand phenotypic heterosis at the molecular level, researchers are measuring transcript abundance levels of thousands of genes in parental inbred lines and their hybrid offspring using RNA sequencing (RNA-seq) technology. The resulting data allow researchers to search for evidence of gene expression heterosis as one potential molecular mechanism underlying heterosis of agriculturally important traits. The null hypotheses …


The Validity Of Online Patient Ratings Of Physicians, Jennifer L. Priestley, Yiyun Zhou, Robert McGrath Mar 2019

Jennifer L. Priestley

Background: Information from ratings sites is increasingly informing patient decisions related to health care and the selection of physicians.

Objective: The current study sought to determine the validity of online patient ratings of physicians through comparison with physician peer review.

Methods: We extracted 223,715 reviews of 41,104 physicians from 10 of the largest cities in the United States, including 1142 physicians listed as “America’s Top Doctors” through physician peer review. Differences in mean online patient ratings were tested for physicians who were listed and those who were not.

Results: Overall, no differences were found between the online patient ratings based …


Multivariate Spectral Analysis Of CRISM Data To Characterize The Composition Of Mawrth Vallis, Melissa Luna Mar 2018

Melissa Luna

No abstract provided.


Econometric Modeling Of Regional Electricity Spot Prices In The Australian Market, Michael S. Smith, Thomas S. Shively Dec 2017

Michael Stanley Smith

Wholesale electricity markets are increasingly integrated via high voltage interconnectors, and inter-regional trade in electricity is growing. To model this, we consider a spatial equilibrium model of price formation, where constraints on inter-regional flows result in three distinct equilibria in prices. We use this to motivate an econometric model for the distribution of observed electricity spot prices that captures many of their unique empirical characteristics. The econometric model features supply and inter-regional trade cost functions, which are estimated using Bayesian monotonic regression smoothing methodology. A copula multivariate time series model is employed to capture additional dependence (both cross-sectional and serial) in …


Methods For Scalar-On-Function Regression, Philip T. Reiss, Jeff Goldsmith, Han Lin Shang, R. Todd Ogden Jul 2017

Philip T. Reiss

Recent years have seen an explosion of activity in the field of functional data analysis (FDA), in which curves, spectra, images, etc. are considered as basic functional data units. A central problem in FDA is how to fit regression models with scalar responses and functional data points as predictors. We review some of the main approaches to this problem, categorizing the basic model types as linear, nonlinear and nonparametric. We discuss publicly available software packages, and illustrate some of the procedures by application to a functional magnetic resonance imaging dataset.


Estimating Pay Gaps For Workers With Disabilities: Implications From Broadening Definitions And Data Sets, Kevin F. Hallock, Xin Jin, Linda Barrington Jun 2017

Kevin F Hallock

Purpose: To compare pay gap estimates across 3 different national survey data sets for people with disabilities relative to those without disabilities when pay is measured as wage and salary alone versus a (total compensation) definition that includes an estimate of the value of benefits.

Method: Estimates of the cost to the employers of employee benefits at the occupational level from an employer survey data set are matched to individual-level data in each of the 3 data sets. Multiple regression techniques are applied to estimate wage and salary and total compensation gaps between full-time men with and without …


Discrimination By Gender And Disability Status: Do Worker Perceptions Match Statistical Measures?, Kevin F. Hallock, Wallace Hendricks, Emer Broadbent Jun 2017

Kevin F Hallock

We explore whether perceptions of discrimination are related to ordinary statistical measures. The majority of disabled respondents report feeling some discrimination due to their disability, the majority of women feel some discrimination because of their gender, and a surprising number of men also report some discrimination. We do not find a strong link between perceptions of discrimination and measured discrimination perhaps because those who perceive discrimination feel that it occurs along other dimensions than pay. However, we do find a connection between whether a person feels his or her income is inadequate and measured discrimination for all groups studied.


Prediction Of Remaining Life Of Power Transformers Based On Left Truncated And Right Censored Lifetime Data, Yili Hong, William Q. Meeker, James D. McCalley Jun 2017

James McCalley

Prediction of the remaining life of high-voltage power transformers is an important issue for energy companies because of the need for planning maintenance and capital expenditures. Lifetime data for such transformers are complicated because transformer lifetimes can extend over many decades and transformer designs and manufacturing practices have evolved. We were asked to develop statistically-based predictions for the lifetimes of an energy company’s fleet of high-voltage transmission and distribution transformers. The company’s data records begin in 1980, providing information on installation and failure dates of transformers. Although the dataset contains many units that were installed before 1980, there is no …


The Engineering Admissions Partnership Program: A Navigation Strategy For Community College Students Seeking A Pathway Into Engineering, Marcia R. Laugerman, Mack C. Shelley, Steven K. Mickelson, Diane T. Rover Jun 2017

Diane Rover

This paper presents the evaluation of a program designed to improve transfer outcomes for community college students pursuing an engineering degree. The program, the Engineering Admissions Partnership Program (E-APP), was designed to improve the navigational success of community college transfer students through connections to the university. These connections include coordinated academic advising, peer-mentoring, campus visits, and online social and professional networks. The objective of the study is to determine the efficacy of the E-APP and its interventions, which will be measured by increased participation rates and increased university retention rates for E-APP participants. Outcome data for the students are analyzed …


Random Regression Models Based On The Elliptically Contoured Distribution Assumptions With Applications To Longitudinal Data, Alfred A. Bartolucci, Shimin Zheng, Sejong Bae, Karan P. Singh May 2017

Shimin Zheng

We generalize Lyles et al.’s (2000) random regression models for longitudinal data, accounting for both undetectable values and informative drop-outs in the distribution assumptions. Our models are constructed on the generalized multivariate theory which is based on the Elliptically Contoured Distribution (ECD). The estimation of the fixed parameters in the random regression models is invariant under the normal or the ECD assumptions. For the Human Immunodeficiency Virus Epidemiology Research Study data, ECD models fit the data better than classical normal models according to the Akaike (1974) Information Criterion. We also note that both univariate distributions of the random intercept and …


A Realistic Meteorological Assessment Of Perennial Biofuel Crop Deployment: A Southern Great Plains Perspective, Melissa Wagner, Meng Wang, Gonzalo Miguez-Macho, Jesse Miller, Andy VanLoocke, Justin E. Bagley, Carl J. Bernacchi, Matei Georgescu Jan 2017

Andy VanLoocke

Utility of perennial bioenergy crops (e.g., switchgrass and miscanthus) offers unique opportunities to transition toward a more sustainable energy pathway due to their reduced carbon footprint, averted competition with food crops, and ability to grow on abandoned and degraded farmlands. Studies that have examined biogeophysical impacts of these crops noted a positive feedback between near-surface cooling and enhanced evapotranspiration (ET), but also potential unintended consequences of soil moisture and groundwater depletion. To better understand hydrometeorological effects of perennial bioenergy crop expansion, this study conducted high-resolution (2-km grid spacing) simulations with a state-of-the-art atmospheric model (Weather Research and Forecasting system) dynamically …


Penalized Nonparametric Scalar-On-Function Regression Via Principal Coordinates, Philip T. Reiss, David L. Miller, Pei-Shien Wu, Wen-Yu Hua Dec 2016

Philip T. Reiss

A number of classical approaches to nonparametric regression have recently been extended to the case of functional predictors. This paper introduces a new method of this type, which extends intermediate-rank penalized smoothing to scalar-on-function regression. The core idea is to regress the response on leading principal coordinates defined by a relevant distance among the functional predictors, while applying a ridge penalty. Our publicly available implementation, based on generalized additive modeling software, allows for fast optimal tuning parameter selection and for extensions to multiple functional predictors, exponential family-valued responses, and mixed-effects models. In an application to signature verification data, the proposed …


Hilbe-PGLR-Errata-And-Comments, Joseph M. Hilbe Mar 2016

Joseph M Hilbe

Errata and Comments for Practical Guide to Logistic Regression


The Fraud Detection Triangle: A New Framework For Selecting Variables In Fraud Detection Research, Adrian Gepp, Kuldeep Kumar, Sukanto Bhattacharya Feb 2016

Kuldeep Kumar

The selection of explanatory (independent) variables is crucial to developing a fraud detection model. However, the selection process in prior financial statement fraud detection studies is not standardized. Furthermore, the categories of variables differ between studies. Consequently, the new Fraud Detection Triangle framework is proposed as an overall theory to assist in guiding the selection of variables for future fraud detection research. This new framework adapts and extends Cressey’s (1953) well-known and widely-used fraud triangle to make it more suited for use in fraud detection research. While the new framework was developed for financial statement fraud detection, it is more …


Functional CAR Models For Spatially Correlated Functional Datasets, Lin Zhang, Veerabhadran Baladandayuthapani, Hongxiao Zhu, Keith A. Baggerly, Tadeusz Majewski, Bogdan Czerniak, Jeffrey S. Morris Jan 2016

Jeffrey S. Morris

We develop a functional conditional autoregressive (CAR) model for spatially correlated data for which functions are collected on areal units of a lattice. Our model performs functional response regression while accounting for spatial correlations with potentially nonseparable and nonstationary covariance structure, in both the space and functional domains. We show theoretically that our construction leads to a CAR model at each functional location, with spatial covariance parameters varying and borrowing strength across the functional domain. Using basis transformation strategies, the nonseparable spatial-functional model is computationally scalable to enormous functional datasets, generalizable to different basis functions, and can be used on …


Modeling The Evolution Of Dynamic Brain Processes During An Associative Learning Experiment, Mark Fiecas, Hernando Ombao Dec 2015

Mark Fiecas

Our goal is to use local field potentials (LFPs) to rigorously study changes in neuronal activity in the hippocampus and the nucleus accumbens over the course of an associative learning experiment. We show that the spectral properties of the LFPs changed during the experiment. While many statistical models take into account nonstationarity within a single trial of the experiment, the evolution of brain dynamics across trials is often ignored. In this paper, we developed a novel time series model that captures both sources of nonstationarity. Under the proposed model we rigorously define the spectral density matrix so that it evolves …


Characterizing The Statistical Distribution Of Organic Carbon And Extractable Phosphorus At A Regional Scale, John J. Brejda, David W. Meek, Douglas L. Karlen Dec 2015

Douglas L Karlen

Greater awareness of potential environmental problems has created the need to monitor total organic carbon (TOC) and extractable phosphorus (P) concentrations at a regional scale. The probability distribution of these soil properties can have a significant effect on the power of statistical tests and the quality of inferences applied to these properties. The objectives of this study were to: (1) evaluate the probability distribution of TOC and extractable P at the regional scale in three Major Land Resource Areas (MLRA), and (2) identify appropriate transformations that will result in a normal distribution. Both TOC and extractable P were non-normally distributed …


Estimation Of Reliability In Multicomponent Stress-Strength Based On Generalized Rayleigh Distribution, Gadde Srinivasa Rao Nov 2015

Srinivasa Rao Gadde Dr.

A multicomponent system of k components is considered, in which the strengths are k independently and identically distributed random variables x1, x2, ..., xk and each component experiences a random stress Y. The system is regarded as alive only if at least s out of k (s < k) strengths exceed the stress. The reliability of such a system is obtained when strength and stress variates are given by a generalized Rayleigh distribution with different shape parameters. Reliability is estimated using the maximum likelihood (ML) method of estimation in samples drawn from strength and stress distributions; the reliability estimators are compared asymptotically. Monte-Carlo …


The Fraud Detection Triangle: A New Framework For Selecting Variables In Fraud Detection Research, Adrian Gepp, Kuldeep Kumar, Sukanto Bhattacharya Oct 2015

Adrian Gepp

The selection of explanatory (independent) variables is crucial to developing a fraud detection model. However, the selection process in prior financial statement fraud detection studies is not standardized. Furthermore, the categories of variables differ between studies. Consequently, the new Fraud Detection Triangle framework is proposed as an overall theory to assist in guiding the selection of variables for future fraud detection research. This new framework adapts and extends Cressey’s (1953) well-known and widely-used fraud triangle to make it more suited for use in fraud detection research. While the new framework was developed for financial statement fraud detection, it is more …


Multiple Ratio Imputation By A Novel Application Of The EMB Algorithm (高橋将宜), Masayoshi Takahashi Sep 2015

Masayoshi Takahashi

No abstract provided.


PGLR-SAS Data, Joseph M. Hilbe Jul 2015

Joseph M Hilbe

SAS data files for Practical Guide to Logistic Regression


R Code For Practical Guide To Logistic Regression, Joseph M. Hilbe Jul 2015

Joseph M Hilbe

R code for Practical Guide to Logistic Regression


PGLR-Stata Data Files, Joseph M. Hilbe Jul 2015

Joseph M Hilbe

Stata data files for Practical Guide to Logistic Regression


SAS Code Only For Practical Guide To Logistic Regression, Joseph M. Hilbe Jul 2015

Joseph M Hilbe

SAS code-only for Practical Guide to Logistic Regression


SAS Code & Output For Practical Guide To Logistic Regression, Joseph M. Hilbe Jul 2015

Joseph M Hilbe

SAS code for Practical Guide to Logistic Regression


A Study Of Missing Value Imputation In Official Statistics: Multiple Imputation And Single Imputation (高橋将宜), Masayoshi Takahashi Jun 2015

Masayoshi Takahashi

No abstract provided.


Responses To Corporate Versus Individual Wrongdoing, Valerie P. Hans, M. David Ermann Jun 2015

Valerie P. Hans

For many years, researchers assumed that the public was indifferent to corporate wrongdoing, but recent surveys have discovered evidence to the contrary. Taking insights from these data a step further, this study employed an experimental design to examine whether people responded differently to corporate versus individual wrongdoers. We varied the identity of the central actor in a scenario involving harm to workers. Half the respondents were informed that a corporation caused the harm; the remainder were told that an individual did so. Respondents applied a higher standard of responsibility to the corporate actor. For identical actions, the corporation was judged …


Judge-Jury Agreement In Criminal Cases: A Partial Replication Of Kalven And Zeisel's The American Jury, Theodore Eisenberg, Paula L. Hannaford-Agor, Valerie P. Hans, Nicole L. Waters, G. Thomas Munsterman, Stewart J. Schwab, Martin T. Wells Jun 2015

Valerie P. Hans

This study uses a new criminal case data set to partially replicate Kalven and Zeisel's classic study of judge-jury agreement. The data show essentially the same rate of judge-jury agreement as did Kalven and Zeisel for cases tried almost 50 years ago. This study also explores judge-jury agreement as a function of evidentiary strength (as reported by both judges and juries), evidentiary complexity (as reported by both judges and juries), legal complexity (as reported by judges), and locale. Regardless of which adjudicator's view of evidentiary strength is used, judges tend to convict more than juries in cases of "middle" evidentiary …