Open Access. Powered by Scholars. Published by Universities.®
- Institution
- Keyword
-
- Monte Carlo simulation (4)
- Power (4)
- Reliability (3)
- Simulation (3)
- Statistical Methodology (3)
-
- Bootstrap (2)
- Confidence interval (2)
- Effect size (2)
- Efficiency (2)
- Exact test (2)
- Fortran (2)
- Gibbs sampling (2)
- Least absolute deviations (2)
- Likelihood ratio test (2)
- MCMC (2)
- Monte Carlo (2)
- Order statistics (2)
- Ordinary least squares (2)
- Repeated measures (2)
- SPSS (2)
- Statistical analysis (2)
- measurement error. (1)
- Annual percent change (APC) (1)
- AIC (1)
- AUC; Cox's proportional hazards model; Framingham risk score; ROC (1)
- Accelerated life test (1)
- Acheivement (1)
- Acheivement gap (1)
- Age-adjusted cancer rates (1)
- Aggregate variables (1)
- Publication
-
- Journal of Modern Applied Statistical Methods (64)
- Harvard University Biostatistics Working Paper Series (13)
- U.C. Berkeley Division of Biostatistics Working Paper Series (6)
- COBRA Preprint Series (3)
- Chongzhi Di (3)
-
- Johns Hopkins University, Dept. of Biostatistics Working Papers (2)
- UW Biostatistics Working Paper Series (2)
- Chris J. Lloyd (1)
- Department of Statistics: Dissertations, Theses, and Student Work (1)
- FIU Electronic Theses and Dissertations (1)
- Sunduz Keles (1)
- The University of Michigan Department of Biostatistics Working Paper Series (1)
- UPenn Biostatistics Working Papers (1)
- Publication Type
Articles 1 - 30 of 99
Full-Text Articles in Statistics and Probability
Pragmatic Estimation Of A Spatio-Temporal Air Quality Model With Irregular Monitoring Data, Paul D. Sampson, Adam A. Szpiro, Lianne Sheppard, Johan Lindström, Joel D. Kaufman
Pragmatic Estimation Of A Spatio-Temporal Air Quality Model With Irregular Monitoring Data, Paul D. Sampson, Adam A. Szpiro, Lianne Sheppard, Johan Lindström, Joel D. Kaufman
UW Biostatistics Working Paper Series
Statistical analyses of the health effects of air pollution have increasingly used GIS-based covariates for prediction of ambient air quality in “land-use” regression models. More recently these regression models have accounted for spatial correlation structure in combining monitoring data with land-use covariates. The current paper builds on these concepts to address spatio-temporal prediction of ambient concentrations of particulate matter with aerodynamic diameter less than 2.5 μm (PM2.5) on the basis of a model representing spatially varying seasonal trends and spatial correlation structures. Our hierarchical methodology provides a pragmatic approach that fully exploits regulatory and other supplemental monitoring data which jointly …
On The Behaviour Of Marginal And Conditional Akaike Information Criteria In Linear Mixed Models, Sonja Greven, Thomas Kneib
On The Behaviour Of Marginal And Conditional Akaike Information Criteria In Linear Mixed Models, Sonja Greven, Thomas Kneib
Johns Hopkins University, Dept. of Biostatistics Working Papers
In linear mixed models, model selection frequently includes the selection of random effects. Two versions of the Akaike information criterion (AIC) have been used, based either on the marginal or on the conditional distribution. We show that the marginal AIC is no longer an asymptotically unbiased estimator of the Akaike information, and in fact favours smaller models without random effects. For the conditional AIC, we show that ignoring estimation uncertainty in the random effects covariance matrix, as is common practice, induces a bias that leads to the selection of any random effect not predicted to be exactly zero. We derive …
Survival Analysis With Error-Prone Time-Varying Covariates: A Risk Set Calibration Approach, Xiaomei Liao, David M. Zucker, Yi Li, Donna Spiegelman
Survival Analysis With Error-Prone Time-Varying Covariates: A Risk Set Calibration Approach, Xiaomei Liao, David M. Zucker, Yi Li, Donna Spiegelman
Harvard University Biostatistics Working Paper Series
No abstract provided.
A Statistical Framework For The Analysis Of Chip-Seq Data, Pei Fen Kuan, Dongjun Chung, Guangjin Pan, James A. Thomson, Ron Stewart, Sunduz Keles
A Statistical Framework For The Analysis Of Chip-Seq Data, Pei Fen Kuan, Dongjun Chung, Guangjin Pan, James A. Thomson, Ron Stewart, Sunduz Keles
Sunduz Keles
Chromatin immunoprecipitation followed by sequencing (ChIP-Seq) has revolutionalized experiments for genome-wide profiling of DNA-binding proteins, histone modifications, and nucleosome occupancy. As the cost of sequencing is decreasing, many researchers are switching from microarray-based technologies (ChIP-chip) to ChIP-Seq for genome-wide study of transcriptional regulation. Despite its increasing and well-deserved popularity, there is little work that investigates and accounts for sources of biases in the ChIP-Seq technology. These biases typically arise from both the standard pre-processing protocol and the underlying DNA sequence of the generated data.
We study data from a naked DNA sequencing experiment, which sequences non-cross-linked DNA after deproteinizing and …
A New Class Of Minimum Power Divergence Estimators With Applications To Cancer Surveillance, Nirian Martin, Yi Li
A New Class Of Minimum Power Divergence Estimators With Applications To Cancer Surveillance, Nirian Martin, Yi Li
Harvard University Biostatistics Working Paper Series
No abstract provided.
Jmasm29: Dominance Analysis Of Independent Data (Fortran), Du Feng, Normal Cliff
Jmasm29: Dominance Analysis Of Independent Data (Fortran), Du Feng, Normal Cliff
Journal of Modern Applied Statistical Methods
A Fortran 77 program is provided for an ordinal dominance analysis of independent two-group comparisons. The program calculates the ordinal statistic, d, and statistical inferences about δ. The source codes and an executable file are available at http://www.depts.ttu.edu/hdfs/feng.php.
Markov Modeling Of Breast Cancer, Chunling Cong, Chris P. Tsokos
Markov Modeling Of Breast Cancer, Chunling Cong, Chris P. Tsokos
Journal of Modern Applied Statistical Methods
Previous work with respect to the treatments and relapse time for breast cancer patients is extended by applying a Markov chain to model three different types of breast cancer patients: alive without ever having relapse, alive with relapse, and deceased. It is shown that combined treatment of tamoxifen and radiation is more effective than single treatment of tamoxifen in preventing the recurrence of breast cancer. However, if the patient has already relapsed from breast cancer, single treatment of tamoxifen would be more appropriate with respect to survival time after relapse. Transition probabilities between three stages during different time periods, 2-year, …
Examples Of Computing Power For Zero-Inflated And Overdispersed Count Data, Suzanne R. Doyle
Examples Of Computing Power For Zero-Inflated And Overdispersed Count Data, Suzanne R. Doyle
Journal of Modern Applied Statistical Methods
Examples of zero-inflated Poisson and negative binomial regression models were used to demonstrate conditional power estimation, utilizing the method of an expanded data set derived from probability weights based on assumed regression parameter values. SAS code is provided to calculate power for models with a binary or continuous covariate associated with zero-inflation.
Assessing Trends: Monte Carlo Trials With Four Different Regression Methods, Daniel R. Thompson
Assessing Trends: Monte Carlo Trials With Four Different Regression Methods, Daniel R. Thompson
Journal of Modern Applied Statistical Methods
Ordinary Least Squares (OLS), Poisson, Negative Binomial, and Quasi-Poisson Regression methods were assessed for testing the statistical significance of a trend by performing 10,000 simulations. The Poisson method should be used when data follow a Poisson distribution. The other methods should be used when data follow a normal distribution.
Level Robust Methods Based On The Least Squares Regression Estimator, Marie Ng, Rand R. Wilcox
Level Robust Methods Based On The Least Squares Regression Estimator, Marie Ng, Rand R. Wilcox
Journal of Modern Applied Statistical Methods
Heteroscedastic consistent covariance matrix (HCCM) estimators provide ways for testing hypotheses about regression coefficients under heteroscedasticity. Recent studies have found that methods combining the HCCM-based test statistic with the wild bootstrap consistently perform better than non-bootstrap HCCM-based methods (Davidson & Flachaire, 2008; Flachaire, 2005; Godfrey, 2006). This finding is more closely examined by considering a broader range of situations which were not included in any of the previous studies. In addition, the latest version of HCCM, HC5 (Cribari-Neto, et al., 2007), is evaluated.
Application Of The Truncated Skew Laplace Probability Distribution In Maintenance System, Gokarna R. Aryal, Chris P. Tsokos
Application Of The Truncated Skew Laplace Probability Distribution In Maintenance System, Gokarna R. Aryal, Chris P. Tsokos
Journal of Modern Applied Statistical Methods
A random variable X is said to have the skew-Laplace probability distribution if its pdf is given by f(x) = 2g(x)G(λx), where g (.) and G (.), respectively, denote the pdf and the cdf of the Laplace distribution. When the skew Laplace distribution is truncated on the left at 0 it is called it the truncated skew Laplace (TSL) distribution. This article provides a comparison of TSL distribution with twoparameter gamma model and the hypoexponential model, and an application of the subject model in maintenance system is studied.
Least Error Sample Distribution Function, Vassili F. Pastushenko
Least Error Sample Distribution Function, Vassili F. Pastushenko
Journal of Modern Applied Statistical Methods
Email: The empirical distribution function (ecdf) is unbiased in the usual sense, but shows certain order bias. Pyke suggested discrete ecdf using expectations of order statistics. Piecewise constant optimal ecdf saves 200%/N of sample size N. Results are compared with linear interpolation for U(0, 1), which require up to sixfold shorter samples at the same accuracy.
Impact Of Rank-Based Normalizing Transformations On The Accuracy Of Test Scores, Shira R. Soloman, Shlomo S. Sawilowsky
Impact Of Rank-Based Normalizing Transformations On The Accuracy Of Test Scores, Shira R. Soloman, Shlomo S. Sawilowsky
Journal of Modern Applied Statistical Methods
The purpose of this article is to provide an empirical comparison of rank-based normalization methods for standardized test scores. A series of Monte Carlo simulations were performed to compare the Blom, Tukey, Van der Waerden and Rankit approximations in terms of achieving the T score’s specified mean and standard deviation and unit normal skewness and kurtosis. All four normalization methods were accurate on the mean but were variably inaccurate on the standard deviation. Overall, deviation from the target moments was pronounced for the even moments but slight for the odd moments. Rankit emerged as the most accurate method among all …
On Some Discrete Distributions And Their Applications With Real Life Data, Shipra Banik, B. M. Golam Kibria
On Some Discrete Distributions And Their Applications With Real Life Data, Shipra Banik, B. M. Golam Kibria
Journal of Modern Applied Statistical Methods
This article reviews some useful discrete models and compares their performance in terms of the high frequency of zeroes, which is observed in many discrete data (e.g., motor crash, earthquake, strike data, etc.). A simulation study is conducted to determine how commonly used discrete models (such as the binomial, Poisson, negative binomial, zero-inflated and zero-truncated models) behave if excess zeroes are present in the data. Results indicate that the negative binomial model and the ZIP model are better able to capture the effect of excess zeroes. Some real-life environmental data are used to illustrate the performance of the proposed models.
Relationship Between Internal Consistency And Goodness Of Fit Maximum Likelihood Factor Analysis With Varimax Rotation, Gibbs Y. Kanyongo, James B. Schreiber
Relationship Between Internal Consistency And Goodness Of Fit Maximum Likelihood Factor Analysis With Varimax Rotation, Gibbs Y. Kanyongo, James B. Schreiber
Journal of Modern Applied Statistical Methods
This study investigates how reliability (internal consistency) affects model-fitting in maximum likelihood exploratory factor analysis (EFA). This was accomplished through an examination of goodness of fit indices between the population and the sample matrices. Monte Carlo simulations were performed to create pseudo-populations with known parameters. Results indicated that the higher the internal consistency the worse the fit. It is postulated that the observations are similar to those from structural equation modeling where a good fit with low correlations can be observed and also the reverse with higher item correlations.
Estimating Model Complexity Of Feed-Forward Neural Networks, Douglas Landsittel
Estimating Model Complexity Of Feed-Forward Neural Networks, Douglas Landsittel
Journal of Modern Applied Statistical Methods
In a previous simulation study, the complexity of neural networks for limited cases of binary and normally-distributed variables based the null distribution of the likelihood ratio statistic and the corresponding chi-square distribution was characterized. This study expands on those results and presents a more general formulation for calculating degrees of freedom.
Detecting Lag-One Autocorrelation In Interrupted Time Series Experiments With Small Datasets, Clare Riviello, S. Natasha Beretvas
Detecting Lag-One Autocorrelation In Interrupted Time Series Experiments With Small Datasets, Clare Riviello, S. Natasha Beretvas
Journal of Modern Applied Statistical Methods
The power and type I error rates of eight indices for lag-one autocorrelation detection were assessed for interrupted time series experiments (ITSEs) with small numbers of data points. Performance of Huitema and McKean’s (2000) zHM statistic was modified and compared with the zHM, five information criteria and the Durbin-Watson statistic.
Closed Form Confidence Intervals For Small Sample Matched Proportions, James F. Reed Iii
Closed Form Confidence Intervals For Small Sample Matched Proportions, James F. Reed Iii
Journal of Modern Applied Statistical Methods
The behavior of the Wald-z, Wald-c, Quesenberry-Hurst, Wald-m and Agresti-Min methods was investigated for matched proportions confidence intervals. It was concluded that given the widespread use of the repeated-measure design, pretest-posttest design, matched-pairs design, and cross-over design, the textbook Wald-z method should be abandoned in favor of the Agresti-Min alternative.
Confidence Interval Estimation For Intraclass Correlation Coefficient Under Unequal Family Sizes, Madhusudan Bhandary, Koji Fujiwara
Confidence Interval Estimation For Intraclass Correlation Coefficient Under Unequal Family Sizes, Madhusudan Bhandary, Koji Fujiwara
Journal of Modern Applied Statistical Methods
Confidence intervals (based on the χ2 -distribution and (Z) standard normal distribution) for the intraclass correlation coefficient under unequal family sizes based on a single multinormal sample have been proposed. It has been found that the confidence interval based on the χ2 -distribution consistently and reliably produces better results in terms of shorter average interval length than the confidence interval based on the standard normal distribution: especially for larger sample sizes for various intraclass correlation coefficient values. The coverage probability of the interval based on the χ2 -distribution is competitive with the coverage probability of the interval …
Approximate Bayesian Confidence Intervals For The Mean Of A Gaussian Distribution Versus Bayesian Models, Vincent A. R. Camara
Approximate Bayesian Confidence Intervals For The Mean Of A Gaussian Distribution Versus Bayesian Models, Vincent A. R. Camara
Journal of Modern Applied Statistical Methods
This study obtained and compared confidence intervals for the mean of a Gaussian distribution. Considering the square error and the Higgins-Tsokos loss functions, approximate Bayesian confidence intervals for the mean of a normal population are derived. Using normal data and SAS software, the obtained approximate Bayesian confidence intervals were compared to a published Bayesian model. Whereas the published Bayesian method is sensitive to the choice of the hyper-parameters and does not always yield the best confidence intervals, it is shown that the proposed approximate Bayesian approach relies only on the observations and often performs better.
Semi-Parametric Of Sample Selection Model Using Fuzzy Concepts, L. Muhamad Safiih, A. A. Kamil, M. T. Abu Osman
Semi-Parametric Of Sample Selection Model Using Fuzzy Concepts, L. Muhamad Safiih, A. A. Kamil, M. T. Abu Osman
Journal of Modern Applied Statistical Methods
The sample selection model has been studied in the context of semi-parametric methods. With the deficiencies of the parametric model, such as inconsistent estimators, semi-parametric estimation methods provide better alternatives. This article focuses on the context of fuzzy concepts as a hybrid to the semiparametric sample selection model. The better approach when confronted with uncertainty and ambiguity is to use the tools provided by the theory of fuzzy sets, which are appropriate for modeling vague concepts. A fuzzy membership function for solving uncertainty data of a semi-parametric sample selection model is introduced as a solution to the problem.
On Type-Ii Progressively Hybrid Censoring, Debasis Kundu, Avijit Joarder, Hare Krishna
On Type-Ii Progressively Hybrid Censoring, Debasis Kundu, Avijit Joarder, Hare Krishna
Journal of Modern Applied Statistical Methods
The progressive Type-II censoring scheme has become quite popular. A drawback of a progressive censoring scheme is that the length of the experiment can be very large if the items are highly reliable. Recently, Kundu and Joarder (2006) introduced the Type-II progressively hybrid censored scheme and analyzed the data assuming that the lifetimes of the items are exponentially distributed. This article presents the analysis of Type-II progressively hybrid censored data when the lifetime distributions of the items follow Weibull distributions. Maximum likelihood estimators and approximate maximum likelihood estimators are developed for estimating the unknown parameters. Asymptotic confidence intervals based on …
Performance Ratings Of An Autocovariance Base Estimator (Abe) In The Estimation Of Garch Model Parameters When The Normality Assumption Is Invalid, Daniel Eni
Journal of Modern Applied Statistical Methods
The performance of an autocovariance base estimator (ABE) for GARCH models against that of the maximum likelihood estimator (MLE) if a distribution assumption is wrongly specified as normal was studied. This was accomplished by simulating time series data that fits a GARCH model using the Log normal and t-distributions with degrees of freedom of 5, 10 and 15. The simulated time series was considered as the true probability distribution, but normality was assumed in the process of parameter estimations. To track consistency, sample sizes of 200, 500, 1,000 and 1,200 were employed. The two methods were then used to analyze …
Bayesian Analysis Of Evidence From Studies Of Warfarin V Aspirin For Symptomatic Intracranial Stenosis, Vicki Hertzberg, Barney Stern, Karen Johnston
Bayesian Analysis Of Evidence From Studies Of Warfarin V Aspirin For Symptomatic Intracranial Stenosis, Vicki Hertzberg, Barney Stern, Karen Johnston
Journal of Modern Applied Statistical Methods
Bayesian analyses of symptomatic intracranial stenosis studies were conducted to compare the benefits of long-term therapy with warfarin to aspirin. The synthesis of evidence of effect from previous nonrandomized studies in monitoring a randomized clinical trial was of particular interest. Sequential Bayesian learning analysis was conducted and Bayesian hierarchical random effects models were used to incorporate variability between studies. The posterior point estimates for the risk rate ratio (RRR) were similar between analyses, although the interval estimates resulting from the hierarchical analyses are larger than the corresponding Bayesian learning analyses. This demonstrated the difference between these methods in accounting for …
A Maximum Test For The Analysis Of Ordered Categorical Data, Markus Neuhäeuser
A Maximum Test For The Analysis Of Ordered Categorical Data, Markus Neuhäeuser
Journal of Modern Applied Statistical Methods
Different scoring schemes are possible when performing exact tests using scores on ordered categorical data. The standard scheme is based on integer scores, but non-integer scores were proposed to increase power (Ivanova & Berger, 2001). However, different non-integer scores exist and the question arises as to which of the non-integer schemes should be chosen. To solve this problem, a maximum test is proposed. To be precise, the maximum of the competing statistics is used as the new test statistic, rather than arbitrarily choosing one single test statistic.
An Inductive Approach To Calculate The Mle For The Double Exponential Distribution, W. J. Hurley
An Inductive Approach To Calculate The Mle For The Double Exponential Distribution, W. J. Hurley
Journal of Modern Applied Statistical Methods
Norton (1984) presented a calculation of the MLE for the parameter of the double exponential distribution based on the calculus. An inductive approach is presented here.
New Effect Size Rules Of Thumb, Shlomo S. Sawilowsky
New Effect Size Rules Of Thumb, Shlomo S. Sawilowsky
Journal of Modern Applied Statistical Methods
Recommendations to expand Cohen’s (1988) rules of thumb for interpreting effect sizes are given to include very small, very large, and huge effect sizes. The reasons for the expansion, and implications for designing Monte Carlo studies, are discussed.
Multiple Search Paths And The General-To-Specific Methodology, Paul Turner
Multiple Search Paths And The General-To-Specific Methodology, Paul Turner
Journal of Modern Applied Statistical Methods
Increased interest in computer automation of the general-to-specific methodology has resulted from research by Hoover and Perez (1999) and Krolzig and Hendry (2001). This article presents simulation results for a multiple search path algorithm that has better properties than those generated by a single search path. The most noticeable improvements occur when the data contain unit roots.
Intermediate R Values For Use In The Fleishman Power Method, Julie M. Smith
Intermediate R Values For Use In The Fleishman Power Method, Julie M. Smith
Journal of Modern Applied Statistical Methods
Several intermediate r values are calculated at three different correlations for use in the Fleishman Power Method for generating correlated data from normal and non-normal populations.
Estimation Of The Standardized Mean Difference For Repeated Measures Designs, Lindsey J. Wolff Smith, S. Natasha Beretvas
Estimation Of The Standardized Mean Difference For Repeated Measures Designs, Lindsey J. Wolff Smith, S. Natasha Beretvas
Journal of Modern Applied Statistical Methods
This simulation study modified the repeated measures mean difference effect size, d=RM , for scenarios with unequal pre- and post-test score variances. Relative parameter and SE bias were calculated for dRM ≠ versus dRM = . Results consistently favored d≠RM over d=RM with worse positive parameter and negative SE bias identified for d=RM for increasingly heterogeneous variance conditions.