Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics Commons

Open Access. Powered by Scholars. Published by Universities.®

2012

Discipline
Institution
Keyword
Publication
Publication Type
File Type

Articles 1 - 30 of 141

Full-Text Articles in Applied Statistics

A Comparative Analysis Of Decision Trees Vis-À-Vis Other Computational Data Mining Techniques In Automotive Insurance Fraud Detection, Adrian Gepp, Kuldeep Kumar, J Holton Wilson, Sukanto Bhattacharya Jul 2014

A Comparative Analysis Of Decision Trees Vis-À-Vis Other Computational Data Mining Techniques In Automotive Insurance Fraud Detection, Adrian Gepp, Kuldeep Kumar, J Holton Wilson, Sukanto Bhattacharya

Kuldeep Kumar

No abstract provided.


Nbr2 Errata And Comments, Joseph Hilbe Dec 2012

Nbr2 Errata And Comments, Joseph Hilbe

Joseph M Hilbe

Errata and Comments for Negative Binomial Regression, 2nd edition


Multiple Subject Barycentric Discriminant Analysis (Musubada): How To Assign Scans To Categories Without Using Spatial Normalization, Hervé Abdi, Lynne J. Williams, Andrew C. Connolly, M. Ida Gobbini Dec 2012

Multiple Subject Barycentric Discriminant Analysis (Musubada): How To Assign Scans To Categories Without Using Spatial Normalization, Hervé Abdi, Lynne J. Williams, Andrew C. Connolly, M. Ida Gobbini

Dartmouth Scholarship

We present a new discriminant analysis (DA) method called Multiple Subject Barycentric Discriminant Analysis (MUSUBADA) suited for analyzing fMRI data because it handles datasets with multiple participants that each provides different number of variables (i.e., voxels) that are themselves grouped into regions of interest (ROIs). Like DA, MUSUBADA (1) assigns observations to predefined categories, (2) gives factorial maps displaying observations and categories, and (3) optimally assigns observations to categories. MUSUBADA handles cases with more variables than observations and can project portions of the data table (e.g., subtables, which can represent participants or ROIs) on the factorial maps. Therefore MUSUBADA can …


Time Series, Unit Roots, And Cointegration: An Introduction, Lonnie K. Stevans Dec 2012

Time Series, Unit Roots, And Cointegration: An Introduction, Lonnie K. Stevans

Lonnie K. Stevans

The econometric literature on unit roots took off after the publication of the paper by Nelson and Plosser (1982) that argued that most macroeconomic series have unit roots and that this is important for the analysis of macroeconomic policy. Yule (1926) suggested that regressions based on trending time series data can be spurious. This problem of spurious correlation was further pursued by Granger and Newbold (1974) and this also led to the development of the concept of cointegration (lack of cointegration implies spurious regression). The pathbreaking paper by Granger (1981), first presented at a conference at the University of Florida …


Generalized Estimating Equations, Second Edition.Pdf, James W. Hardin, Joseph M.. Hilbe Dec 2012

Generalized Estimating Equations, Second Edition.Pdf, James W. Hardin, Joseph M.. Hilbe

Joseph M Hilbe

Generalized Estimating Equations, Second edition, updates the best-selling previous edition, which has been the standard text on the subject since it was published a decade ago. Combining theory and application, the text provides readers with a comprehensive discussion of GEE and related models. Numerous examples are employed throughout the text, along with the software code used to create, run, and evaluate the models being examined. Stata is used as the primary software for running and displaying modeling output; associated R code is also given to allow R users to replicate Stata examples. Specific examples of SAS usage are provided in …


Analysis Of Alcohol Use Among Pregnant Women In San Luis Obispo County, Samantha Law Dec 2012

Analysis Of Alcohol Use Among Pregnant Women In San Luis Obispo County, Samantha Law

Statistics

Drinking alcohol during pregnancy is harmful to the fetus, and can lead to serious alcohol related developmental birth defects. Utilizing prenatal screening, such as the 4P’s Plus© screening tool, during a woman’s first prenatal doctors visit can help educate women and reduce continued alcohol use during pregnancy. Currently the CDC reports that 1 in 13 women in the US drink alcohol while pregnant compared to local reports that 1 in 3 women in San Luis Obispo County continue to drink alcohol during pregnancy. A primary concern for many local county health care experts and organizations is to raise awareness that …


An Analysis Of Risk Reduction Choices In Dcis Breast Cancer Patients, Lauren Soltesz Dec 2012

An Analysis Of Risk Reduction Choices In Dcis Breast Cancer Patients, Lauren Soltesz

Statistics

The main focus of this paper was to evaluate possible demographic and clinical characteristics associated with a woman’s choice of breast conserving surgery (BCS), unilateral mastectomy (ULM), or bilateral risk reduction mastectomy (BRRM). The cohort consisted of patients presenting to the City of Hope National Medical Center with ductal carcinoma in situ breast cancer who elected to have cancer directed surgery (N=305). Analyses to examine associations of patient characteristics with type of surgery were conducted using a multinomial logistic regression. Results showed that older women were more likely to choose breast conserving surgery over bilateral risk reduction mastectomy than younger …


Stress-Lifetime Joint Distribution Model For Performance Degradation Failure, Quan Sun, Yanzhen Tang, Jing Feng, Paul Kvam Dec 2012

Stress-Lifetime Joint Distribution Model For Performance Degradation Failure, Quan Sun, Yanzhen Tang, Jing Feng, Paul Kvam

Department of Math & Statistics Faculty Publications

The high energy density self-healing metallized film pulse capacitor has been applied to all kinds of laser facilities for their power conditioning systems under several stress levels, such as 23kV, 30kV and 35kV, whose reliability performance and maintenance costs are affected by the reliability of capacitors. Due to the costs and time restriction, how to assess the reliability of highly reliable capacitors under a certain stress level as soon as possible becomes a challenge. Accelerated degradation test provides a way to predict its lifetime and reliability effectively. A model called stress-lifetime joint distribution model and an analysis method based on …


An Economic Alternative To The C Chart, Ryan William Black Dec 2012

An Economic Alternative To The C Chart, Ryan William Black

Graduate Theses and Dissertations

Because the probability of Type I error is not evenly distributed beyond upper and lower three-sigma limits the c chart is theoretically inappropriate for a monitor of Poisson distributed phenomena. Furthermore, the normal approximation to the Poisson is of little use when c is small. These practical and theoretical concerns should motivate the computation of true error rates associated with individuals control assuming the Poisson distribution. An economic alternative to the c chart is described as a statistical model of upward shift from c0 to c1 and the two charts are compared in theory. For a range of c chart …


Investigating The Sensitivity Of Goodness-Of-Fit Indices To Detect Measurement Invariance In The Bifactor Model, Jam Khojasteh Dec 2012

Investigating The Sensitivity Of Goodness-Of-Fit Indices To Detect Measurement Invariance In The Bifactor Model, Jam Khojasteh

Graduate Theses and Dissertations

A Monte Carlo simulation study was conducted to evaluate the sensitivities of five commonly used goodness-of-fit indices to detect metric invariance properties of the bifactor model. The fit indices that performed the best in terms of power were Gamma and Mc. In addition, Gamma, Mc, CFI, and RMSEA all held Type I error to a minimum. However, only Gamma and CFI are recommended to use in the bifactor model because the other GOF indices have cutoff values that are too large. For Gamma and CFI values of -.026 to -.045 and -.004 to -.009, respectively indicate a lack of metric …


On The Distribution Of Quadratic Expressions In Various Types Of Random Vectors, Ali Akbar Mohsenipour Nov 2012

On The Distribution Of Quadratic Expressions In Various Types Of Random Vectors, Ali Akbar Mohsenipour

Electronic Thesis and Dissertation Repository

Several approximations to the distribution of indefinite quadratic expressions in possibly singular Gaussian random vectors and ratios thereof are obtained in this dissertation. It is established that such quadratic expressions can be represented in their most general form as the difference of two positive definite quadratic forms plus a linear combination of Gaussian random variables. New advances on the distribution of quadratic expressions in elliptically contoured vectors, which are expressed as scalar mixtures of Gaussian vectors, are proposed as well. Certain distributional aspects of Hermitian quadratic expressions in complex Gaussian vectors are also investigated. Additionally, approximations to the distributions of …


Selection Of Mixed Sampling Plan With Qss-1(N; CN, CT) Plan As Attribute Plan Indexed Through Mapd And Lql, R. Sampath Kumar, M. Indra, R. Radhakrishnan Nov 2012

Selection Of Mixed Sampling Plan With Qss-1(N; CN, CT) Plan As Attribute Plan Indexed Through Mapd And Lql, R. Sampath Kumar, M. Indra, R. Radhakrishnan

Journal of Modern Applied Statistical Methods

A procedure for the construction and selection of the mixed sampling plan using MAPD as a quality standard with the QSS-1 (n; cN, cT) plan as an attribute plan is presented. The plans indexed through MAPD and LQL are constructed and compared for efficiency. Tables are provided for selection of an appropriate sampling plan.


Regression Split By Levels Of The Dependent Variable, Stan Lipovetsky Nov 2012

Regression Split By Levels Of The Dependent Variable, Stan Lipovetsky

Journal of Modern Applied Statistical Methods

Multiple regression coefficients split by the levels of the dependent variable are examined. The decomposition of the coefficients can be defined by points on the ordinal scale or by levels in the numerical response using the Gifi system of binary variables. This approach permits consideration of specific values of the coefficients at each layer of the response variable. Numerical results illustrate how to identify levels of interpretable regression coefficients.


Class(Es) Of Factor-Type Estimator(S) In Presence Of Measurement Error, Diwakar Shukla, Sharad Pathak, Narendra Singh Thakur Nov 2012

Class(Es) Of Factor-Type Estimator(S) In Presence Of Measurement Error, Diwakar Shukla, Sharad Pathak, Narendra Singh Thakur

Journal of Modern Applied Statistical Methods

When data is collected via sample survey it is assumed whatever is reported by a respondent is correct. However, given the issues of prestige bias, personal respect and honor, respondents’ self-reported data often produces over- or under- estimated values as opposed to true values regarding the variables under question. This causes measurement error to be present in sample values. This article considers the factortype estimator as an estimation tool and examines its performance under a measurement error model. Expressions of optimization are derived and theoretical results are supported by numerical examples.


A Graphical Examination Of Variable Deletion Within The Mewma Statistic, Jay R. Schaffer, Shawn Vandenhul Nov 2012

A Graphical Examination Of Variable Deletion Within The Mewma Statistic, Jay R. Schaffer, Shawn Vandenhul

Journal of Modern Applied Statistical Methods

A general procedure for identifying the variable(s) that contribute(s) to the signal of the multivariate extension of the exponentially weighted moving average (MEWMA) chart is presented. The procedure systematically removes one or two variables from the MEWMA statistic calculations. Percentages are calculated for correctly identifying various shifts.


Examining Multiple Comparison Procedures According To Error Rate, Power Type And False Discovery Rate, Guven Ozkaya, Ilker Ercan Nov 2012

Examining Multiple Comparison Procedures According To Error Rate, Power Type And False Discovery Rate, Guven Ozkaya, Ilker Ercan

Journal of Modern Applied Statistical Methods

Examining pairwise differences between means is a common practice of applied researchers, and the selection of an appropriate multiple comparison procedure (MCP) is important for analyzing pairwise comparisons. This study examines the performance of MCPs under the assumption of homogeneity of variances for various numbers of groups with equal and unequal sample sizes via a simulation study. MCPs are compared according to type I error rate, power type and false discovery rate (FDR). Results show that the LSD and Duncan procedures have high error rates and Scheffe’s procedure has low power; no remarkable differences between the other procedures considered were …


Modified Edf Goodness Of Fit Tests For Logistic Distribution Under Srs And Rss, S. A. Al-Subh, M. T. Alodat, Kamaruzaman Ibrahim, Abdul Aziz Jemain Nov 2012

Modified Edf Goodness Of Fit Tests For Logistic Distribution Under Srs And Rss, S. A. Al-Subh, M. T. Alodat, Kamaruzaman Ibrahim, Abdul Aziz Jemain

Journal of Modern Applied Statistical Methods

Modified forms of goodness of fit tests are presented for the logistic distribution using statistics based on the empirical distribution function (EDF). A method to improve the power of the modified EDF goodness of fit tests is introduced based on Ranked Set sampling (RSS). Data are collected via the Ranked Set Sampling (RSS) technique (McIntyre, 1952). Critical values for the logistic distribution with unknown parameters are provided and the powers of the tests are given for a number of alternative distributions. A simulation study is presented to illustrate the power of the new method.


On Some Negative Integer Moments Of Quasi-Negative-Binomial Distribution, Anwar Hassan, Sheikh Bilal Nov 2012

On Some Negative Integer Moments Of Quasi-Negative-Binomial Distribution, Anwar Hassan, Sheikh Bilal

Journal of Modern Applied Statistical Methods

Negative integer moments of the quasi-negative-binomial distribution (QNBD) are investigated. This distribution includes recurrence relations which are helpful in the solution of many applied statistical problems, particularly in life testing and survey sampling, where ratio estimators are useful. Results study show the negative-binomial distribution when the parameter θ2 is zero and also indicate the mean of the QNBD model when its parameters are changed.


Single Sampling Plans For Variables Indexed By Aql And Aoql With Measurement Error, R. Sankle, J.R. Singh Nov 2012

Single Sampling Plans For Variables Indexed By Aql And Aoql With Measurement Error, R. Sankle, J.R. Singh

Journal of Modern Applied Statistical Methods

Single sampling plans are investigated for variables indexed by acceptable quality level (AQL) and average outgoing quality limit (AOQL) under measurement error. Procedures and tables are provided for selection of single sampling plans for variables for given AQL and AOQL when rejected lots are 100% inspected for replacement of a nonconforming unit. For a particular sampling plan in operation for an observed measurement, a method for determining true operating characteristic (OC) functions and average outgoing quality (AOQ) is described for various error sizes.


Small-To-Medium Enterprises And Economic Growth: A Comparative Study Of Clustering Techniques, Karim K. Mardaneh Nov 2012

Small-To-Medium Enterprises And Economic Growth: A Comparative Study Of Clustering Techniques, Karim K. Mardaneh

Journal of Modern Applied Statistical Methods

Small-to-medium enterprises (SMEs) in regional (non-metropolitan) areas are considered when economic planning may require large data sets and sophisticated clustering techniques. The economic growth of regional areas was investigated using four clustering algorithms. Empirical analysis demonstrated that the modified global k-means algorithm outperformed other algorithms.


Exact Logistic Regression For A Matched Pairs Case-Control Design With Polytomous Exposure Variables, Shyam S. Ganguly Nov 2012

Exact Logistic Regression For A Matched Pairs Case-Control Design With Polytomous Exposure Variables, Shyam S. Ganguly

Journal of Modern Applied Statistical Methods

Logistic regression methods are useful in estimating odds ratios under matched pairs case-control designs when the exposure variable of interest is binary or polytomous in nature. Analysis is typically performed using large sample approximation techniques. When conducting the analysis with polytomous exposure variable, situations where the numbers of discordant pairs in the resulting cells are small or the data structure is sparse can be encountered. In such situations, the asymptotic method of analysis is questionable, thus an exact method of analysis may be more suitable. A method is presented that performs exact inference in the case of pair-wise matched case-control …


Graphical Modeling For High Dimensional Data, Munni Begum, Jay Bagga, C. Ann Blakey Nov 2012

Graphical Modeling For High Dimensional Data, Munni Begum, Jay Bagga, C. Ann Blakey

Journal of Modern Applied Statistical Methods

With advances in science and information technologies, many scientific fields are able to meet the challenges of managing and analyzing high-dimensional data. A so-called large p small n problem arises when the number of experimental units, n, is equal to or smaller than the number of features, p. A methodology based on probability and graph theory, termed graphical models, is applied to study the structure and inference of such high-dimensional data.


Extreme Value Charts And Analysis Of Means (Anom) Based On The Log Logistic Distribution, B. Srinivasa Rao, J. Pratapa Reddy, G. Sarath Babu Nov 2012

Extreme Value Charts And Analysis Of Means (Anom) Based On The Log Logistic Distribution, B. Srinivasa Rao, J. Pratapa Reddy, G. Sarath Babu

Journal of Modern Applied Statistical Methods

A probability model of a quality characteristic is assumed to follow a log logistic distribution. This article proposes variable control charts, termed extreme value charts, based on the extreme values of each subgroup. The control chart constants depend on the probability model of the extreme order statistics and the size of each subgroup. The analysis of means (ANOM) technique for a skewed population is applied with respect to log logistic distribution. Results are illustrated using examples based on real data.


Weighted Cook-Johnson Copula And Their Characterizations: Application To Probably Modeling Of The Hot Spring Eruptions, Hakim Bekrizadeh, Gholam Ali Parham, Mohamd Reza Zadkarmi Nov 2012

Weighted Cook-Johnson Copula And Their Characterizations: Application To Probably Modeling Of The Hot Spring Eruptions, Hakim Bekrizadeh, Gholam Ali Parham, Mohamd Reza Zadkarmi

Journal of Modern Applied Statistical Methods

Copulas have emerged as a practical method for multivariate modeling. A limited amount of work has been conducted regarding the application of copula-based modeling in context analysis. This study generalizes the Cook-Johnson copula under the appropriate weighted function and provides examples and the properties of the generalized Cook-Johnson copula. Results show that the generalized Cook-Johnson copula is suitable for probable modeling of hot spring eruption.


Examining Growth With Statistical Shape Analysis And Comparison Of Growth Models, Deniz Sigirli, Ilker Ercan Nov 2012

Examining Growth With Statistical Shape Analysis And Comparison Of Growth Models, Deniz Sigirli, Ilker Ercan

Journal of Modern Applied Statistical Methods

Growth curves have been widely used in the fields of biology, zoology and medicine for assessing some measurable trait of an organism, such as height, weight, area or volume. In statistical shape analysis, a size measure is obtained using the geometrical information of an object as opposed to linear measurements. The performances of commonly used non-linear growth curves are compared by using centroid size as a size measure in a simulation study. An example is provided on the relationship between centroid size of the cerebellum and disease duration in multiple sclerosis patients.


Posterior Estimates Of Poisson Distribution Using R Software, Raja Sultan, S.P. Ahmad Nov 2012

Posterior Estimates Of Poisson Distribution Using R Software, Raja Sultan, S.P. Ahmad

Journal of Modern Applied Statistical Methods

The Bayesian estimation of unknown parameter of the Poisson distribution is examined under different priors. The posterior distributions for the unknown parameter of the Poisson distribution are derived using the following priors: uniform, Jeffrey’s, Gamma distribution, Gamma-Chi-square distribution, Gammaexponential distribution and Chi-square-exponential distribution. Numerical and graphical illustrations of the posterior densities of the parameters of interest were conducted using R Software.


Ferrieri's Index Of Openness Applied To Remittances To Developing Countries, Gaetano Ferrieri Nov 2012

Ferrieri's Index Of Openness Applied To Remittances To Developing Countries, Gaetano Ferrieri

Journal of Modern Applied Statistical Methods

A new methodology to measure international openness and globalization is described. This allows capacity to be effectively combined with size in a number of socio-economic areas, such as trade, migration and foreign investment. The method is applied to remittances to developing countries.


Multivariate Generalized Poisson Distribution For Interference On Selected Non-Communicable Diseases In Lagos State, Nigeria, Adewara Johnson Ademola, Mbata Ugochuckwu Ahamefula Nov 2012

Multivariate Generalized Poisson Distribution For Interference On Selected Non-Communicable Diseases In Lagos State, Nigeria, Adewara Johnson Ademola, Mbata Ugochuckwu Ahamefula

Journal of Modern Applied Statistical Methods

Multivariate Generalized Poisson Distribution (MGPD) models are applied to make inferences regarding non-communicable diseases, diabetes, hypertension, stroke and ulcer in Lagos State, Nigeria. The generalized Poisson distribution is employed due to its usefulness in modeling count data in the presence of either over- or under- dispersion. Results show that the correlation between ulcer and stroke is not significant. Other pairwise comparisons of diseases are significant, thus implying that a patient who suffers from diabetes or stroke has a high propensity to also be hypertensive.


Comparing Two Independent Groups Via A Quantile Generalization Of The Wilcoxon-Mann-Whitney Test, Rand R. Wilcox Nov 2012

Comparing Two Independent Groups Via A Quantile Generalization Of The Wilcoxon-Mann-Whitney Test, Rand R. Wilcox

Journal of Modern Applied Statistical Methods

The Wilcoxon-Mann-Whitney test, as well as modern improvements, are based in part on an estimate of p = P(D < 0), where D = X−Y and X and Y are independent random variables; a common goal is to test H0: p = 0.5. This corresponds to testing H0: ξ0.5, where ξ0.5 is the 0.5 quantile of the distribution of D. If the distributions associated with X and Y do not differ, then D has a symmetric distribution about zero. In particular, ξq + ξ1-q = 0 for any q ≤ 0.5, where ξq is the qth quantile. Methods aimed at testing H0: p = 0.5 are generalized by …


Testing The Population Coefficient Of Variation, Shipra Banik, B. M. Golam Kibria, Dinesh Sharma Nov 2012

Testing The Population Coefficient Of Variation, Shipra Banik, B. M. Golam Kibria, Dinesh Sharma

Journal of Modern Applied Statistical Methods

The coefficient of variation (CV), which is used in many scientific areas, measures the variability of a population relative to its mean and standard deviation. Several methods exist for testing the population CV. This article compares a proposed bootstrap method to existing methods. A simulation study was conducted under both symmetric and skewed distributions to compare the performance of test statistics with respect to empirical size and power. Results indicate that some of the proposed methods are useful and can be recommended to practitioners.