Open Access. Powered by Scholars. Published by Universities.®

Social and Behavioral Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 55

Full-Text Articles in Social and Behavioral Sciences

Application Of Dynamic Poisson Models To Japanese Cancer Mortality Data, Shuichi Midorikawa, Etsuo Miyaoka, Bruce Smith Nov 2008

Application Of Dynamic Poisson Models To Japanese Cancer Mortality Data, Shuichi Midorikawa, Etsuo Miyaoka, Bruce Smith

Journal of Modern Applied Statistical Methods

A dynamic Poisson model is used with a Bayesian approach to modeling to predict cancer mortality. The complexity of the posterior distribution prohibits direct evaluation of the posterior, and so parameters are estimated by using a Markov Chain Monte Carlo method. The model is applied to analyze lung and stomach cancer data which have been collected in Japan.


Data Mining Ceo Compensation, Susan M. Adams, Atul Gupta, Dominique M. Haughton, John D. Leeth Nov 2008

Data Mining Ceo Compensation, Susan M. Adams, Atul Gupta, Dominique M. Haughton, John D. Leeth

Journal of Modern Applied Statistical Methods

The need to pre-specify expected interactions between variables is an issue in multiple regression. Theoretical and practical considerations make it impossible to pre-specify all possible interactions. The functional form of the dependent variable on the predictors is unknown in many cases. Two ways are described in which the data mining technique Multivariate Adaptive Regression Splines (MARS) can be utilized: first, to obtain possible improvements in model specification, and second, to test for the robustness of findings from a regression analysis. An empirical illustration is provided to show how MARS can be used for both purposes.


Estimating Explanatory Power In A Simple Regression Model Via Smoothers, Rand R. Wilcox Nov 2008

Estimating Explanatory Power In A Simple Regression Model Via Smoothers, Rand R. Wilcox

Journal of Modern Applied Statistical Methods

Consider the regression model Y = γ(X) + ε , where γ(X) is some conditional measure of location associated with Y , given X. Let Υ̂ be some estimate of Y, given X, and let τ2 (Y) be some measure of variation. Explanatory power is η2 = τ2 (Υ̂) /τ2(Y) . When γ(X) = β0 + β1X and τ2(Y) is the variance of Y , η2 = ρ2 , …


Type I Error Rates Of The Kenward-Roger F-Test For A Split-Plot Design With Missing Values And Non-Normal Data, Miguel A. Padilla, Youngkyoung Min, Guili Zhang Nov 2008

Type I Error Rates Of The Kenward-Roger F-Test For A Split-Plot Design With Missing Values And Non-Normal Data, Miguel A. Padilla, Youngkyoung Min, Guili Zhang

Journal of Modern Applied Statistical Methods

The Type I error of the Kenward-Roger (KR) F-test was assessed through a simulation study for a between- by within-subjects split-plot design with non-normal ignorable missing data. The KR-test for the between- and within-subjects main effect was robust under all simulation variables investigated and when the data were missing completely at random (MCAR). This continued to hold for the between-subjects main effect when data were missing at random (MAR). For the interaction, the KR F-test performed fairly well at controlling Type I under MCAR and the simulation variables investigated. However, under MAR, the KR F-test for the …


Comparing Factor Loadings In Exploratory Factor Analysis: A New Randomization Test, W. Holmes Finch, Brian F. French Nov 2008

Comparing Factor Loadings In Exploratory Factor Analysis: A New Randomization Test, W. Holmes Finch, Brian F. French

Journal of Modern Applied Statistical Methods

Factorial invariance testing requires a referent loading to be constrained equal across groups. This study introduces a randomization test for comparing group exploratory factor analysis loadings so as to identify an invariant referent. Results show that it maintains the Type I error rate while providing adequate power under most conditions.


A Randomization Method To Control The Type I Error Rates In Best Subset Regression, Yasser A. Shehata, Paul White Nov 2008

A Randomization Method To Control The Type I Error Rates In Best Subset Regression, Yasser A. Shehata, Paul White

Journal of Modern Applied Statistical Methods

A randomization method for the assessment of statistical significance for best subsets regression is given. The procedure takes into account the number of potential predictors and the inter-dependence between predictors. The approach corrects a non-trivial problem with Type I errors and can be used to assess individual variable significance.


Correlation Between The Sample Mean And Sample Variance, Ramalingam Shanmugam Nov 2008

Correlation Between The Sample Mean And Sample Variance, Ramalingam Shanmugam

Journal of Modern Applied Statistical Methods

This article obtains a general formula to find the correlation coefficient between the sample mean and variance. Several particular results for major non-normal distributions are extracted to help students in classroom, clients during statistical consulting service.


Constructing Confidence Intervals For Spearman’S Rank Correlation With Ordinal Data: A Simulation Study Comparing Analytic And Bootstrap Methods, John Ruscio Nov 2008

Constructing Confidence Intervals For Spearman’S Rank Correlation With Ordinal Data: A Simulation Study Comparing Analytic And Bootstrap Methods, John Ruscio

Journal of Modern Applied Statistical Methods

Research shows good probability coverage using analytic confidence intervals (CIs) for Spearman’s rho with continuous data, but poorer coverage with ordinal data. A simulation study examining the latter case replicated prior results and revealed that coverage of bootstrap CIs was usually as good or better than coverage of analytic CIs.


Two Dimension Marginal Distributions Of Crossing Time And Renewal Numbers Related To Two-Stage Erlang Processes, Mir Ghulam Hyder Talpur, Iffat Zamir, M. Masoom Ali Nov 2008

Two Dimension Marginal Distributions Of Crossing Time And Renewal Numbers Related To Two-Stage Erlang Processes, Mir Ghulam Hyder Talpur, Iffat Zamir, M. Masoom Ali

Journal of Modern Applied Statistical Methods

The two dimensional marginal transform, probability density and cumulative probability distribution functions for the random variables TξN (time taken by servers during vacations), ξN (number of vacations taken by servers) and Nη (number of customers or units arriving in the system) are derived by taking combinations of these random variables. One random variable is controlled at one time to determine the effect of the other two random variables simultaneously.


Analyzing Incomplete Categorical Data: Revisiting Maximum Likelihood Estimation (Mle) Procedure, Hoo Ling Ping, M. Ataharul Islam Nov 2008

Analyzing Incomplete Categorical Data: Revisiting Maximum Likelihood Estimation (Mle) Procedure, Hoo Ling Ping, M. Ataharul Islam

Journal of Modern Applied Statistical Methods

Incomplete data poses formidable difficulties in the application of statistical techniques and requires special procedures to handle. The most common ways to solve this problem are by ignoring, truncating, censoring or collapsing those data, but these may lead to inappropriate conclusions because those data might contain important information. Most of the research for estimating cell probabilities involving incomplete categorical data is based on the EM algorithm. A likelihood approach is employed for estimating cell probabilities for missing values and makes comparisons between maximum likelihood estimation (MLE) and the EM algorithm. The MLE can provide almost the same estimates as that …


Size-Biased Generalized Negative Binomial Distribution, Khurshid Ahmad Mir Nov 2008

Size-Biased Generalized Negative Binomial Distribution, Khurshid Ahmad Mir

Journal of Modern Applied Statistical Methods

A size biased generalized negative binomial distribution (SBGNBD) is defined and a recurrence relationship for the moments of SBGNBD is established. The Bayes’ estimator for a parametric function of one parameter when two other parameters of a known size-biased generalized negative binomial distribution is derived. Prior information on one parameter is given by a beta distribution and the parameters in the prior distribution are assigned by computer using Monte Carlo and R-software.


Variance Estimation In Time Series Regression Models, Samir Safi Nov 2008

Variance Estimation In Time Series Regression Models, Samir Safi

Journal of Modern Applied Statistical Methods

The effect of variance estimation of regression coefficients when disturbances are serially correlated in time series regression models is studied. Variance estimation enters into confidence interval estimation, hypotheses testing, spectrum estimation, and expressions for the estimated standard error of prediction. Using computer simulations, the robustness of various estimators, including Estimated Generalized Least Squares (EGLS) was considered. The estimates of variance of the coefficient estimators produced by computer packages were considered. Models were generated with a second order auto-correlated error structure, considering the robustness of estimators based upon misspecified order. Ordinary Least Squares (OLS) (order zero) estimates outperformed first order EGLS. …


Bootstrap Confidence Intervals And Coverage Probabilities Of Regression Parameter Estimates Using Trimmed Elemental Estimation, Matthew Hall, Matthew S. Mayo Nov 2008

Bootstrap Confidence Intervals And Coverage Probabilities Of Regression Parameter Estimates Using Trimmed Elemental Estimation, Matthew Hall, Matthew S. Mayo

Journal of Modern Applied Statistical Methods

Mayo and Gray introduced the leverage residual-weighted elemental (LRWE) classification of regression estimators and a new method of estimation called trimmed elemental estimation (TEE), showing the efficiency and robustness of TEE point estimates. Using bootstrap methods, properties of various trimmed elemental estimator interval estimates to allow for inference are examined and estimates with ordinary least squares (OLS) and least sum of absolute values (LAV) are compared. Confidence intervals and coverage probabilities for the estimators using a variety of error distributions, sample sizes, and number of parameters are examined. To reduce computational intensity, randomly selecting elemental subsets to calculate the parameter …


Robust Predictive Inference For Multivariate Linear Models With Elliptically Contoured Distribution Using Bayesian, Classical And Structural Approaches, B. M. Golam Kibria Nov 2008

Robust Predictive Inference For Multivariate Linear Models With Elliptically Contoured Distribution Using Bayesian, Classical And Structural Approaches, B. M. Golam Kibria

Journal of Modern Applied Statistical Methods

Predictive distributions of future response and future regression matrices under multivariate elliptically contoured distributions are discussed. Under the elliptically contoured response assumptions, these are identical to those obtained under matric normal or matric-t errors using structural, Bayesian with improper prior, or classical approaches. This gives inference robustness with respect to departure from the reference case of independent sampling from the matric normal or matric t to multivariate elliptically contoured distributions. The importance of the predictive distribution for skewed elliptical models is indicated; the elliptically contoured distribution, as well as matric t distribution, have significant applications in statistical practices.


Delete And Revise Procedures For Two-Stage Short-Run Control Charts, Matthew E. Elam Nov 2008

Delete And Revise Procedures For Two-Stage Short-Run Control Charts, Matthew E. Elam

Journal of Modern Applied Statistical Methods

This article investigates the effect different delete and revise procedures have on the performance of twostage short-run control charting methodology in the second stage of its two stage procedure. Five variables control chart combinations, six delete and revise procedures, and various out-of-control situations in both stages are considered.


A Methodology To Improve Pci Use In Industry, Milind A. Phadnis, Matthew E. Elam Nov 2008

A Methodology To Improve Pci Use In Industry, Milind A. Phadnis, Matthew E. Elam

Journal of Modern Applied Statistical Methods

This article presents the development of a methodology using decision trees to resolve issues in industry with using process capability indices (PCIs). The methodology forms the structure of a prototype decision support system (PDSS) for PCI selection, calculation, and interpretation. Download instructions for the PDSS are available at http://program.20m.com.


Construction Of Insurance Scoring System Using Regression Models, Noriszura Ismail, Abdul Aziz Jemain Nov 2008

Construction Of Insurance Scoring System Using Regression Models, Noriszura Ismail, Abdul Aziz Jemain

Journal of Modern Applied Statistical Methods

This study suggests the regression models of Lognormal, Normal and Gamma for constructing insurance scoring system. The main advantage of a scoring system is that it can be used by insurers to differentiate between high and low risks insureds, thus allowing the profitability of insureds to be predicted.


The Multinomial Regression Modeling Of The Cause-Of-Death Mortality Of The Oldest Old In The U.S., Dudley L. Poston Jr., Hosik Min Nov 2008

The Multinomial Regression Modeling Of The Cause-Of-Death Mortality Of The Oldest Old In The U.S., Dudley L. Poston Jr., Hosik Min

Journal of Modern Applied Statistical Methods

The statistical modeling of the causes of death of the oldest old (persons aged 80 and over) in the U.S. in 2001 was conducted in this article. Data were analyzed using a multinomial logistic regression model (MNLM) because multiple causes of death are coded on death certificates and the codes are nominal. The percentage distribution of the 10 major causes of death among the oldest old was first examined; we next estimated a multinomial logistic regression equation to predict the likelihood of elders dying of one of the causes of death compared to dying of an “other cause.” The independent …


Frequency Domain Modeling With Piecewise Constant Spectra, Erhard Reschenhofer Nov 2008

Frequency Domain Modeling With Piecewise Constant Spectra, Erhard Reschenhofer

Journal of Modern Applied Statistical Methods

Using piecewise constant functions as models for the spectral density of the differenced log real U.S. GDP it was found that these models have the capacity to compete with the spectral densities implied by ARMA models. According to AIC and BIC the piecewise constant spectral densities are superior to ARMA.


Non-Parametric Quantile Selection For Extreme Distributions, Wan Zawiah Wan Zin, Abdul Aziz Jemain Nov 2008

Non-Parametric Quantile Selection For Extreme Distributions, Wan Zawiah Wan Zin, Abdul Aziz Jemain

Journal of Modern Applied Statistical Methods

The objective is to select the best non-parametric quantile estimation method for extreme distributions. This serves as a starting point for further research in quantile application such as in parameter estimation using LQ-moments method. Thirteen methods of non-parametric quantile estimation were applied on six types of extreme distributions and their efficiencies compared. Monte Carlo methods were used to generate the results, which showed that the method of Weighted Kernel estimator of Type 1 was more efficient than the other methods in many cases.


Multi-Group Confirmatory Factor Analysis For Testing Measurement Invariance In Mixed Item Format Data, Kim H. Koh, Bruno D. Zumbo Nov 2008

Multi-Group Confirmatory Factor Analysis For Testing Measurement Invariance In Mixed Item Format Data, Kim H. Koh, Bruno D. Zumbo

Journal of Modern Applied Statistical Methods

This simulation study investigated the empirical Type I error rates of using the maximum likelihood estimation method and Pearson covariance matrix for multi-group confirmatory factor analysis (MGCFA) of full and strong measurement invariance hypotheses with mixed item format data that are ordinal in nature. The results indicate that mixed item formats and sample size combinations do not result in inflated empirical Type I error rates for rejecting the true measurement invariance hypotheses. Therefore, although the common methods are in a sense sub-optimal, they don’t lead to researchers claiming that measures are functioning differently across groups – i.e., a lack of …


An Optimum Allocation With A Family Of Estimators Using Auxiliary Information In Sample Survey, Gajendra K. Vishwakarma, Housila P. Singh Nov 2008

An Optimum Allocation With A Family Of Estimators Using Auxiliary Information In Sample Survey, Gajendra K. Vishwakarma, Housila P. Singh

Journal of Modern Applied Statistical Methods

The problem of obtaining optimum allocation using auxiliary information in stratified random sampling. An optimum allocation with a family of estimators is obtained and its efficiency is compared with that of Neyman allocation based on Srivastava (1971) class of estimators and the optimum allocation suggested by Zaidi et al., (1989). It is shown that the proposed allocation is better in the sense having smaller variance compared to other optimum allocation.


Adaptive Estimation Of Heteroscedastic Linear Regression Model Using Probability Weighted Moments, Faqir Muhammad, Muhammad Aslam, G.R. Pasha Nov 2008

Adaptive Estimation Of Heteroscedastic Linear Regression Model Using Probability Weighted Moments, Faqir Muhammad, Muhammad Aslam, G.R. Pasha

Journal of Modern Applied Statistical Methods

An adaptive estimator is presented by using probability weighted moments as weights rather than conventional estimates of variances for unknown heteroscedastic errors while estimating a heteroscedastic linear regression model. Empirical studies of the data generated by simulations for normal, uniform, and logistically distributed error terms support our proposed estimator to be quite efficient, especially for small samples.


Least Squares Percentage Regression, Chris Tofallis Nov 2008

Least Squares Percentage Regression, Chris Tofallis

Journal of Modern Applied Statistical Methods

In prediction, the percentage error is often felt to be more meaningful than the absolute error. We therefore extend the method of least squares to deal with percentage errors, for both simple and multiple regression. Exact expressions are derived for the coefficients, and we show how such models can be estimated using standard software. When the relative error is normally distributed, least squares percentage regression is shown to provide maximum likelihood estimates. The multiplicative error model is linked to least squares percentage regression in the same way that the standard additive error model is linked to ordinary least squares regression.


On Some Properties Of Quasi-Negative-Binomial Distribution And Its Applications, Anwar Hassan, Sheikh Bilal Nov 2008

On Some Properties Of Quasi-Negative-Binomial Distribution And Its Applications, Anwar Hassan, Sheikh Bilal

Journal of Modern Applied Statistical Methods

The quasi-negative-binomial distribution was applied to queuing theory for determining the distribution of total number of customers served before the queue vanishes under certain assumptions. Some structural properties (probability generating function, convolution, mode and recurrence relation) for the moments of quasi-negative-binomial distribution are discussed. The distribution’s characterization and its relation with other distributions were investigated. A computer program was developed using R to obtain ML estimates and the distribution was fitted to some observed sets of data to test its goodness of fit.


On Measuring The Relative Importance Of Explanatory Variables In A Logistic Regression , D. Roland Thomas, Pengcheng Zhu, Bruno D. Zumbo, Shantanu Dutta May 2008

On Measuring The Relative Importance Of Explanatory Variables In A Logistic Regression , D. Roland Thomas, Pengcheng Zhu, Bruno D. Zumbo, Shantanu Dutta

Journal of Modern Applied Statistical Methods

A search is described for valid methods of assessing the importance of explanatory variables in logistic regression, motivated by earlier work on the relationship between corporate governance variables and the issuance of restricted voting shares (RSF). The methods explored are adaptations of Pratt’s (1987) approach for measuring variable importance in simple linear regression, which is based on a special partition of R2. Pseudo-R2 measures for logistic regression are briefly reviewed, and two measures are selected which can be partitioned in a manner analogous to that used by Pratt. One of these is ultimately selected for the variable …


Using Connectionist Models To Evaluate Examinees’ Response Patterns To Achievement Tests, Mark J. Gierl, Ying Cui, Steve Hunka May 2008

Using Connectionist Models To Evaluate Examinees’ Response Patterns To Achievement Tests, Mark J. Gierl, Ying Cui, Steve Hunka

Journal of Modern Applied Statistical Methods

The attribute hierarchy method (AHM) applied to assessment engineering is described. It is a psychometric method for classifying examinees’ test item responses into a set of attribute mastery patterns associated with different components in a cognitive model of task performance. Attribute probabilities, computed using a neural network, can be estimated for each examinee thereby providing specific information about the examinee’s attribute-mastery level. The pattern recognition approach described in this study relies on an explicit cognitive model to produce the expected response patterns. The expected response patterns serve as the input to the neural network. The model also yields the cognitive …


Log-Linear Model To Assess Socioeconomic And Environmental Factors With Childhood Diarrhea Using Hospital Based Surveillance, Krishnan Rajendran, Thandavarayan Ramamurthy, Sujit Kumar Bhattacharya May 2008

Log-Linear Model To Assess Socioeconomic And Environmental Factors With Childhood Diarrhea Using Hospital Based Surveillance, Krishnan Rajendran, Thandavarayan Ramamurthy, Sujit Kumar Bhattacharya

Journal of Modern Applied Statistical Methods

Categorical outcomes with environment factors analyzed by log linear model are frequent in the environmental epidemiological literature. Epidemiological and socio-economical factors were obtained on 1,119 children below the age of 5 from Infectious Diseases Hospital (IDH) at the Kolkata, India. Significant associations of diarrhea were observed in the rural areas with family income, father’s occupation as a daily labor, literacy of parents, non-cemented floor and wall constructed of mud, and type of storage (wide mouthed earthen pot). The results of the study with specific Log linear model confirm environmental factors were important implications for childhood diarrhea in the rural community. …


On A Test Of Independence Via Quantiles That Is Sensitive To Curvature, Rand R. Wilcox May 2008

On A Test Of Independence Via Quantiles That Is Sensitive To Curvature, Rand R. Wilcox

Journal of Modern Applied Statistical Methods

Let (Yi ,Xi ) , i =1,..., n , be a random sample from some p+1 variate distribution where Xi is a vector having length p. Many methods for testing the hypothesis that Y is independent of X are relatively insensitive to a broad class of departures from independence. Power improvements focus on the median of Y or some other quantile and test the hypothesis that the regression surface is a horizontal plane versus some unknown form. A wild bootstrap method (Stute et al. 1998) can be used based on quantiles, but with small or moderate sample …


Selection Of Non-Regular Fractional Factorial Designs When Some Two-Factor Interactions Are Important, Weiming Ke, Rui Yao May 2008

Selection Of Non-Regular Fractional Factorial Designs When Some Two-Factor Interactions Are Important, Weiming Ke, Rui Yao

Journal of Modern Applied Statistical Methods

A new method is proposed for selecting the optimal non-regular fractional factorial designs in the situation when some two-factor interactions are potentially important. Searching for the best designs according to this method is discussed and some results for the Plackett-Burman design of 12 runs are presented.