Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics Commons

Open Access. Powered by Scholars. Published by Universities.®

2012

Discipline
Institution
Keyword
Publication
Publication Type
File Type

Articles 31 - 60 of 141

Full-Text Articles in Applied Statistics

Bayesian Estimation Of Erlang Distribution Under Different Generalized Truncated Distributions As Priors, Adil H. Khan, T.R. Jan Nov 2012

Bayesian Estimation Of Erlang Distribution Under Different Generalized Truncated Distributions As Priors, Adil H. Khan, T.R. Jan

Journal of Modern Applied Statistical Methods

Various generalized truncated distributions are considered as independent informative priors for estimating shape and scale parameters of the Erlang distribution. In addition, various special cases are also discussed.


A Proposed Ridge Parameter To Improve The Least Square Estimator, Ghadban Khalaf Nov 2012

A Proposed Ridge Parameter To Improve The Least Square Estimator, Ghadban Khalaf

Journal of Modern Applied Statistical Methods

Ridge regression, a form of biased linear estimation, is a more appropriate technique than ordinary least squares (OLS) estimation in the case of highly intercorrelated explanatory variables in the linear regression model Y = β + u. Two proposed ridge regression parameters from the mean square error (MSE) perspective are evaluated. A simulation study was conducted to demonstrate the performance of the proposed estimators compared to the OLS, HK and HKB estimators. Results show that the suggested estimators outperform the OLS and the other estimators regarding the ridge parameters in all situations examined.


Comparing Two Independent Groups Via A Quantile Generalization Of The Wilcoxon-Mann-Whitney Test, Rand R. Wilcox Nov 2012

Comparing Two Independent Groups Via A Quantile Generalization Of The Wilcoxon-Mann-Whitney Test, Rand R. Wilcox

Journal of Modern Applied Statistical Methods

The Wilcoxon-Mann-Whitney test, as well as modern improvements, are based in part on an estimate of p = P(D < 0), where D = X−Y and X and Y are independent random variables; a common goal is to test H0: p = 0.5. This corresponds to testing H0: ξ0.5, where ξ0.5 is the 0.5 quantile of the distribution of D. If the distributions associated with X and Y do not differ, then D has a symmetric distribution about zero. In particular, ξq + ξ1-q = 0 for any q ≤ 0.5, where ξq is the qth quantile. Methods aimed at testing H0: p = 0.5 are generalized by …


An Extension Of Cochran-Orcutt Procedure For Generalized Linear Regression Models With Periodically Correlated Errors, Abdullah A. Smadi, Nour H. Abu-Afouna Nov 2012

An Extension Of Cochran-Orcutt Procedure For Generalized Linear Regression Models With Periodically Correlated Errors, Abdullah A. Smadi, Nour H. Abu-Afouna

Journal of Modern Applied Statistical Methods

An important assumption of ordinary regression models is independence among errors. This research considers the case of periodically correlated errors following the periodic AR model of order 1 (PAR(1)). The remedial measure for correlated errors in regression known as the Cochran-Orcutt procedure is generalized to the case of periodically correlated errors. The motivation for making such generalizations is that the response data may inhibit some seasonality, which may not be captured by the traditional AR(1) autoregressive model. The proposed procedure is described and the bias and MSE of the resulting intercept and slope parameter estimates of the simple LR model …


Obtaining Critical Values For Test Of Markov Regime Switching, Douglas G. Steigerwald, Valerie Bostwick Oct 2012

Obtaining Critical Values For Test Of Markov Regime Switching, Douglas G. Steigerwald, Valerie Bostwick

Douglas G. Steigerwald

For Markov regime-switching models, testing for the possible presence of more than one regime requires the use of a non-standard test statistic. Carter and Steigerwald (forthcoming, Journal of Econometric Methods) derive in detail the analytic steps needed to implement the test ofMarkov regime-switching proposed by Cho and White (2007, Econometrica). We summarize the implementation steps and address the computational issues that arise. A new command to compute regime-switching critical values, rscv, is introduced and presented in the context of empirical research.


A Doubling Technique For The Power Method Transformations, Mohan D. Pant, Todd C. Headrick Oct 2012

A Doubling Technique For The Power Method Transformations, Mohan D. Pant, Todd C. Headrick

Mohan Dev Pant

Power method polynomials are used for simulating non-normal distributions with specified product moments or L-moments. The power method is capable of producing distributions with extreme values of skew (L-skew) and kurtosis (L-kurtosis). However, these distributions can be extremely peaked and thus not representative of real-world data. To obviate this problem, two families of distributions are introduced based on a doubling technique with symmetric standard normal and logistic power method distributions. The primary focus of the methodology is in the context of L-moment theory. As such, L-moment based systems of equations are derived for simulating univariate and multivariate non-normal distributions with …


Modeling Martian Planetary Entry Descent And Landing Using Monte Carlo Driven Response Surface Methodology, Narcrisha S. Norman Oct 2012

Modeling Martian Planetary Entry Descent And Landing Using Monte Carlo Driven Response Surface Methodology, Narcrisha S. Norman

Mechanical & Aerospace Engineering Theses & Dissertations

Response surface methodology (RSM) is a statistical method that explores the relationships between several descriptive variables and one or more response variables. For over sixty years, among other areas, it has been utilized in quality engineering, process engineering, aircraft engineering, economics, chemical engineering, automotive engineering and design/technique optimization. In this dissertation, RSM is utilized to produce regression models that represent the planetary entry, descent and landing (EDL) process. A complete understanding of EDL process is an essential component of any planetary exploration. Research in this area is ongoing and confidence in the ability to explore known celestial bodies is growing. …


Regional Specialization: Measurement & Application, Zheng Lu Sep 2012

Regional Specialization: Measurement & Application, Zheng Lu

Zheng Lu (Chinese: 路征)

Various measure methods for regional specialization and evolution of China's regional specialization are introduced in this presentation.


International Astrostatistics Association, Joseph Hilbe Sep 2012

International Astrostatistics Association, Joseph Hilbe

Joseph M Hilbe

Overview of the history, purpose, Council and officers of the International Astrostatistics Association (IAA)


Comparative Analysis Of Dispersion Parameter Estimates In Loglinear Modeling: Applied To E-Commerce Sales And Customer Data, Scott Davis Sep 2012

Comparative Analysis Of Dispersion Parameter Estimates In Loglinear Modeling: Applied To E-Commerce Sales And Customer Data, Scott Davis

Statistics

When loglinear models are applied to count data the issue of over-dispersion often arises. Moment and maximum likelihood estimation methods in accounting for over-dispersion are widely used because they allow for model checking tools such as Chi-square, F, and likelihood ratio tests. Here is a comparison between R functions that each uses one method; glm.nb uses MLE, and glm.poisson.disp uses MME. The Index of Dissimilarity and visual model selection (ECDF plots) are also incorporated. These are applied to sales data using product and customer information compiled over the last five years that was generously provided by an e-commerce company.


Bailey/Howe Reference Analytics: What Two Years Of Data Tell Us, Elizabeth Berman Aug 2012

Bailey/Howe Reference Analytics: What Two Years Of Data Tell Us, Elizabeth Berman

UVM Libraries Conference Day

Analyzing the last two academic years (2010-2011 and 2011-2012) of reference-desk statistics, this presentation will highlight trends at the Bailey/Howe Reference Desk, and offer scenarios for the future of reference services.


An L-Moment-Based Analog For The Schmeiser-Deutsch Class Of Distributions, Todd C. Headrick, Mohan D. Pant Aug 2012

An L-Moment-Based Analog For The Schmeiser-Deutsch Class Of Distributions, Todd C. Headrick, Mohan D. Pant

Mohan Dev Pant

This paper characterizes the conventional moment-based Schmeiser-Deutsch (S-D) class of distributions through the method of L-moments. The system can be used in a variety of settings such as simulation or modeling various processes. A procedure is also described for simulating S-D distributions with specified L-moments and L-correlations. The Monte Carlo results presented in this study indicate that the estimates of L-skew, L-kurtosis, and L-correlation associated with the S-D class of distributions are substantially superior to their corresponding conventional product-moment estimators in terms of relative bias—most notably when sample sizes are small.


The Implementation Of The Shear Correlation Function And The Matter Power Spectrum In R, Allison A. Scheppelmann, Deborah J. Bard Aug 2012

The Implementation Of The Shear Correlation Function And The Matter Power Spectrum In R, Allison A. Scheppelmann, Deborah J. Bard

STAR Program Research Presentations

Weak gravitational lensing is an important tool in understanding the large-scale structure of the universe. One component in understanding the effect of weak gravitational lensing is the shear correlation function and matter power spectrum. The calculation of these values is often complicated and time consuming. In order to decrease the cost of these calculations they were implemented in R using parallelization. This resulted in the calculations completing faster and the process to be easily changed in order to fit the need of each researcher using the algorithms created in R.


諸外国のデータエディティング及び混淆正規分布モデルによる多変量外れ値検出法についての研究(高橋将宜、選択的エディティング、セレクティブエディティング), Masayoshi Takahashi Aug 2012

諸外国のデータエディティング及び混淆正規分布モデルによる多変量外れ値検出法についての研究(高橋将宜、選択的エディティング、セレクティブエディティング), Masayoshi Takahashi

Masayoshi Takahashi

No abstract provided.


Analysis Of Bank Failure And Size Of Assets, Guancun Zhong Aug 2012

Analysis Of Bank Failure And Size Of Assets, Guancun Zhong

UNLV Theses, Dissertations, Professional Papers, and Capstones

The financial health of the banking industry is an important prerequisite for economic stability and growth. Bank failures in the United States have run in cycles largely associated with the collapse of economic bubbles. The number of bank failures has increased dramatically over the last thirty years (Halling and Hayden, 2007). In this thesis, we try to address the following two questions: 1) What is the relationship, if any, between a bank's asset size and its likelihood of failures? 2) How can we use statistical tools to predict the numbers of bank failures in the future? Various modeling techniques are …


Significant Themes In 19th-Century Literature, Matthew L. Jockers, David Mimno Aug 2012

Significant Themes In 19th-Century Literature, Matthew L. Jockers, David Mimno

Department of English: Faculty Publications

External factors such as author gender, author nationality, and date of publication affect both the choice of literary themes in novels and the expression of those themes, but the extent of this association is difficult to quantify. In this work, we apply statistical methods to identify and extract hundreds of "topics" from a corpus of 3,346 works of 19th-century British, Irish, and American fiction. We use these topics as a measurable, data-driven proxy for literary themes. External factors may predict fluctuations in the use of themes and the individual word choices within themes. We use topics to measure the evidence …


From Unbiased Numerical Estimates To Unbiased Interval Estimates, Baokun Li, Gang Xiang, Vladik Kreinovich, Panagios Moscopoulos Aug 2012

From Unbiased Numerical Estimates To Unbiased Interval Estimates, Baokun Li, Gang Xiang, Vladik Kreinovich, Panagios Moscopoulos

Departmental Technical Reports (CS)

One of the main objectives of statistics is to estimate the parameters of a probability distribution based on a sample taken from this distribution. Of course, since the sample is finite, the estimate X is, in general, different from the actual value x of the corresponding parameter. What we can require is that the corresponding estimate is unbiased, i.e., that the mean value of the difference X - x is equal to 0: E[X] = x. In some problems, unbiased estimates are not possible. We show that in some such problems, it is possible to have interval unbiased estimates, i.e., …


Big Data And The Future, Sherri Rose Jul 2012

Big Data And The Future, Sherri Rose

Sherri Rose

No abstract provided.


Combined Eeg And Eye Tracking In Sports Skills Training And Performance Analysis, Keith Barfoot, Matthew Casey, Andrew J. Callaway Jul 2012

Combined Eeg And Eye Tracking In Sports Skills Training And Performance Analysis, Keith Barfoot, Matthew Casey, Andrew J. Callaway

Andrew J Callaway

No abstract provided.


Technical Factors Utilised By Elite Archers: Towards Setting An Agenda For Archery, Andrew J. Callaway, Shelley A. Broomfield Jul 2012

Technical Factors Utilised By Elite Archers: Towards Setting An Agenda For Archery, Andrew J. Callaway, Shelley A. Broomfield

Andrew J Callaway

Archery, in one form or another, has been around for thousands of years yet research into what makes an archer 'good' is still in its infancy. There are several variations over bow type and different competitions which can be competed, previous works have focused on Recurve (Olympic) bow types whilst Compound have generally been ignored. Research in the area has tended to focus on muscle activation patterns using Electromyography (EMG) and aiming based studies, where generally scores are used as a factor to correlate to.

AIM: The aim of this research is to offer a development from the use of …


Data Mining Of Portable Eeg Brain Wave Signals For Sports Performance Analysis: An Archery Case Study, Matthew Casey, Alan Yau, Andrew J. Callaway, Keith Barfoot Jul 2012

Data Mining Of Portable Eeg Brain Wave Signals For Sports Performance Analysis: An Archery Case Study, Matthew Casey, Alan Yau, Andrew J. Callaway, Keith Barfoot

Andrew J Callaway

No abstract provided.


Meta-Heuristics Analysis For Technologically Complex Programs: Understanding The Impact Of Total Constraints For Schedule, Quality And Cost, Henry Darrel Webb Jul 2012

Meta-Heuristics Analysis For Technologically Complex Programs: Understanding The Impact Of Total Constraints For Schedule, Quality And Cost, Henry Darrel Webb

Engineering Management & Systems Engineering Projects for D. Eng. Degree

Program management data associated with a technically complex radio frequency electronics base communication system has been collected and analyzed to identify heuristics which may be utilized in addition to existing processes and procedures to provide indicators that a program is trending to failure. Analysis of the collected data includes detailed schedule analysis, detailed earned value management analysis and defect analysis within the framework of a Firm Fixed Price (FFP) incentive fee contract.

This project develops heuristics and provides recommendations for analysis of complex project management efforts such as those discussed herein. The analysis of the effects of the constraints on …


Reliability Models For Hpc Applications And A Cloud Economic Model, Thanadech Thanakornworakij Jul 2012

Reliability Models For Hpc Applications And A Cloud Economic Model, Thanadech Thanakornworakij

Doctoral Dissertations

With the enormous number of computing resources in HPC and Cloud systems, failures become a major concern. Therefore, failure behaviors such as reliability, failure rate, and mean time to failure need to be understood to manage such a large system efficiently.

This dissertation makes three major contributions in HPC and Cloud studies. First, a reliability model with correlated failures in a k-node system for HPC applications is studied. This model is extended to improve accuracy by accounting for failure correlation. Marshall-Olkin Multivariate Weibull distribution is improved by excess life, conditional Weibull, to better estimate system reliability. Also, the univariate …


Response Surface Optimization Of Electron Beam Freeform Fabrication Depositions Using Design Of Experiments, Patricia A. Quigley Jul 2012

Response Surface Optimization Of Electron Beam Freeform Fabrication Depositions Using Design Of Experiments, Patricia A. Quigley

Engineering Management & Systems Engineering Theses & Dissertations

The Electron Beam Freeform Fabrication (EBF3 ) System is a material depositing, layer additive technique that produces three dimensional (3D) parts out of a wide range of metals in high vacuum, using an electron beam and wire feedstock. Screening deposition trials on a titanium alloy, Ti-6Al-4V, at the National Aeronautics Space Administration (NASA) revealed selective vaporization of the aluminum content of linear prototypes when subjected to chemical analysis. In this study, the aluminum content, bead height and bead width output responses were analyzed from a systematic study of the effects that the interactions of the EBF3 processing parameters …


A Statistical Model To Determine Multiple Binding Sites Of A Transcription Factor On Dna Using Chip-Seq Data, Rasika Jayatillake Jul 2012

A Statistical Model To Determine Multiple Binding Sites Of A Transcription Factor On Dna Using Chip-Seq Data, Rasika Jayatillake

Mathematics & Statistics Theses & Dissertations

Protein-DNA interaction is vital to many biological processes in cells such as cell division, embryo development and regulating gene expression. Chromatin Immunoprecipitation followed by massively parallel sequencing (ChIP-seq) is a new technology that can reveal protein binding sites in genome with superior accuracy. Although many methods have been proposed to find binding sites for ChIP-seq data, they can find only one binding site within a short region of the genome. In this study we introduce a statistical model to identify multiple binding sites of a transcription factor within a short region of the genome using the ChIP-seq data. Mapped sequence …


Targeted Maximum Likelihood Estimation For Dynamic Treatment Regimes In Sequential Randomized Controlled Trials, Paul Chaffee, Mark J. Van Der Laan Jun 2012

Targeted Maximum Likelihood Estimation For Dynamic Treatment Regimes In Sequential Randomized Controlled Trials, Paul Chaffee, Mark J. Van Der Laan

Paul H. Chaffee

Sequential Randomized Controlled Trials (SRCTs) are rapidly becoming essential tools in the search for optimized treatment regimes in ongoing treatment settings. Analyzing data for multiple time-point treatments with a view toward optimal treatment regimes is of interest in many types of afflictions: HIV infection, Attention Deficit Hyperactivity Disorder in children, leukemia, prostate cancer, renal failure, and many others. Methods for analyzing data from SRCTs exist but they are either inefficient or suffer from the drawbacks of estimating equation methodology. We describe an estimation procedure, targeted maximum likelihood estimation (TMLE), which has been fully developed and implemented in point treatment settings, …


Investigation Of Trends And Predictive Effectiveness Of Crash Severity Models, James E. Mooradian Jun 2012

Investigation Of Trends And Predictive Effectiveness Of Crash Severity Models, James E. Mooradian

Master's Theses

This thesis describes analysis using ordinal logistic regression to uncover temporal patterns in the severity level (fatal, serious injury, minor injury, slight injury or no injury) for persons involved in highway crashes in Connecticut, focusing on the demographic split between senior travelers (65 years and over) and non-senior travelers. Existing state sources provide data describing the time and weather conditions for each crash and the vehicles and persons involved over the time period from 1995 to 2009 as well as the traffic volumes and the characteristics of the roads on which these crashes occurred. Findings indicate an overall increase in …


A Logistic L-Moment-Based Analog For The Tukey G-H, G, H, And H-H System Of Distributions, Todd C. Headrick, Mohan D. Pant Jun 2012

A Logistic L-Moment-Based Analog For The Tukey G-H, G, H, And H-H System Of Distributions, Todd C. Headrick, Mohan D. Pant

Mohan Dev Pant

This paper introduces a standard logistic L-moment-based system of distributions. The proposed system is an analog to the standard normal conventional moment-based Tukey g-h, g, h, and h-h system of distributions. The system also consists of four classes of distributions and is referred to as (i) asymmetric γ-κ, (ii) log-logistic γ, (iii) symmetric κ, and (iv) asymmetric κL-κR. The system can be used in a variety of settings such as simulation or modeling events—most notably when heavy-tailed distributions are of interest. A procedure is also described for simulating γ-κ, γ, κ, and κL-κR distributions with specified L-moments and L-correlations. The …


Analysis Of Dietary Patterns Over Freshman Year Of College, Chelsea Lofland Jun 2012

Analysis Of Dietary Patterns Over Freshman Year Of College, Chelsea Lofland

Statistics

This analysis is an investigation of changes in Cal Poly students’ eating habits over freshman year. The motivation behind this was an interest in college students’ lifestyles; college is the first time most students live on their own and it can be an important maturation period. College is stressful, exciting, liberating, and terrifying all at the same time. This distinctive life experience, along with my desire to handle big and messy data, led me to this research question.

The response variable analyzed was food consumption and the explanatory variables were: sex, race, quarter, food group, stress, exercise, BMI, sleep quality …


Analysing Domestic Electricity Smart Metering Data Using Self Organising Maps, Fintan Mcloughlin, Aidan Duffy, Michael Conlon Jun 2012

Analysing Domestic Electricity Smart Metering Data Using Self Organising Maps, Fintan Mcloughlin, Aidan Duffy, Michael Conlon

Conference Papers

This paper investigates a method of classifying domestic electricity load profiles through Self Organising Maps (SOMs). Approximately four thousand customers are divided into groups based on their electricity demand patterns. Dwelling and occupant characteristics are then investigated for each group. The results show that SOMs are an effective way of classifying customers into groups in terms of their electrical load profile and that certain dwelling and occupant characteristics are significant factors in determining which group they end up in.