Open Access. Powered by Scholars. Published by Universities.®

Statistical Methodology Commons

Open Access. Powered by Scholars. Published by Universities.®

1,115 Full-Text Articles 1,508 Authors 603,045 Downloads 125 Institutions

All Articles in Statistical Methodology

Faceted Search

1,115 full-text articles. Page 32 of 38.

Proportional Mean Residual Life Model For Right-Censored Length-Biased Data, Gary KWUN CHUEN Chan, Ying Qing Chen, Chongzhi Di 2012 University of Washington

Proportional Mean Residual Life Model For Right-Censored Length-Biased Data, Gary Kwun Chuen Chan, Ying Qing Chen, Chongzhi Di

Chongzhi Di

To study disease association with risk factors in epidemiologic studies, cross-sectional sampling is often more focused and less costly for recruiting study subjects who have already experienced initiating events. For time-to-event outcome, however, such a sampling strategy may be length-biased. Coupled with censoring, analysis of length-biased data can be quite challenging, due to the so-called “induced informative censoring” in which the survival time and censoring time are correlated through a common backward recurrence time. We propose to use the proportional mean residual life model of Oakes and Dasu (1990) for analysis of censored length-biased survival data. Several nonstandard data structures, …


The Quotient Of The Beta-Weibull Distribution, Nonhle Channon Mdziniso 2012 Marshall University

The Quotient Of The Beta-Weibull Distribution, Nonhle Channon Mdziniso

Theses, Dissertations and Capstones

A new class of distributions recently developed involves the logit of the beta distribution. Among this class of distributions are, the beta-Normal (Eugene et al. [15]); beta-Gumbel (Nadarajah and Kotz [18]); beta-Exponential (Nadarajah and Kotz [19]); beta-Weibull (Famoye et al. [6]); beta-Rayleigh (Akinsete and Lowe [3]); beta-Laplace (Kozubowshi and Nadarajah [20]); and beta-Pareto (Akinsete et al. [4]), among a few others. Many useful statistical properties arising from these distributions and their applications to real life data have been discussed in literature. One approach by which a new statistical distribution is generated is by the transformation of random variables having known …


On The Skewness Of Order Statistics With Applications, Subhash C. Kochar, Maochao Xu 2012 Portland State University

On The Skewness Of Order Statistics With Applications, Subhash C. Kochar, Maochao Xu

Mathematics and Statistics Faculty Publications and Presentations

Order statistics from heterogenous samples have been extensively studied in the literature. However, most of the work focused on the effect of heterogeneity on the magnitude and dispersion of order statistics. In this paper, we study the skewness of order statistics from heterogeneous samples in the sense of star order. The main results extended the results in Kochar and Xu (2009, 2011). Examples and applications in statistical inference are highlighted.


Testing For Regime Swtiching: A Comment, Douglas Steigerwald, Andrew Carter 2011 University of California, Santa Barbara

Testing For Regime Swtiching: A Comment, Douglas Steigerwald, Andrew Carter

Douglas G. Steigerwald

An autoregressive model with Markov-regime switching is analyzed that reflects on the properties of the quasi-likelihood ratio test developed by Cho and White (2007). For such a model, we show that consistency of the quasi-maximum likelihood estimator for the population parameter values, on which consistency of the test is based, does not hold. We describe a condition that ensures consistency of the estimator and discuss the consistency of the test in the absence of consistency of the estimator.


Incorporating Network Structure In Integrative Analysis Of Cancer Prognosis Data, Shuangge Ma 2011 Yale University

Incorporating Network Structure In Integrative Analysis Of Cancer Prognosis Data, Shuangge Ma

Shuangge Ma

In high-throughput cancer genomic studies, markers identified from the analysis of single datasets may have unsatisfactory properties because of low sample sizes. Integrative analysis pools and analyzes raw data from multiple studies, and can effectively increase sample size and lead to improved marker identification results. In this study, we consider the integrative analysis of multiple high-throughput cancer prognosis studies. In the existing integrative analysis studies, the interplay among genes, which can be described using the network structure, has not been effectively accounted for. In network analysis, tightly-connected nodes (genes) are more likely to have related biological functions and similar regression …


Risk Factors Of Follicular Lymphoma, Shuangge Ma 2011 Yale University

Risk Factors Of Follicular Lymphoma, Shuangge Ma

Shuangge Ma

No abstract provided.


Health Insurance Coverage And Impact: A Survey In Three Cities In China, Shuangge Ma 2011 Yale University

Health Insurance Coverage And Impact: A Survey In Three Cities In China, Shuangge Ma

Shuangge Ma

No abstract provided.


Integrative Analysis Of Multiple Cancer Genomic Datasets Under The Heterogeneity Model, Shuangge Ma 2011 Yale University

Integrative Analysis Of Multiple Cancer Genomic Datasets Under The Heterogeneity Model, Shuangge Ma

Shuangge Ma

No abstract provided.


Health Insurance Coverage, Medical Expenditure And Coping Strategy: Evidence From Taiwan, Shuangge Ma 2011 Yale University

Health Insurance Coverage, Medical Expenditure And Coping Strategy: Evidence From Taiwan, Shuangge Ma

Shuangge Ma

No abstract provided.


Impact Of Illness And Medical Expenditure On Household Consumptions: A Survey In Western China, Shuangge Ma 2011 Yale University

Impact Of Illness And Medical Expenditure On Household Consumptions: A Survey In Western China, Shuangge Ma

Shuangge Ma

No abstract provided.


Identification Of Gene-Environment Interactions In Cancer Prognosis Studies Using Penalization, Shuangge Ma 2011 Yale University

Identification Of Gene-Environment Interactions In Cancer Prognosis Studies Using Penalization, Shuangge Ma

Shuangge Ma

High-throughput cancer studies have been extensively conducted, searching for genetic risk factors independently associated with prognosis beyond clinical and environmental risk factors. Many studies have shown that the gene-environment interactions may have important implications. Some of the existing methods, such as the commonly adopted single-marker analysis, may be limited in that they cannot accommodate the joint effects of a large number of genetic markers or use ineffective marker identification techniques. In this study, we analyze cancer prognosis studies, and adopt the AFT (accelerated failure time) model to describe survival. A weighted least squares approach, which has the lowest computational cost, …


Modeling Dependence Using Skew T Copulas: Bayesian Inference And Applications, Michael S. Smith, Quan Gan, Robert Kohn 2011 Melbourne Business School

Modeling Dependence Using Skew T Copulas: Bayesian Inference And Applications, Michael S. Smith, Quan Gan, Robert Kohn

Michael Stanley Smith

[THIS IS AN AUGUST 2010 REVISION THAT REPLACES ALL PREVIOUS VERSIONS.]

We construct a copula from the skew t distribution of Sahu, Dey & Branco (2003). This copula can capture asymmetric and extreme dependence between variables, and is one of the few copulas that can do so and still be used in high dimensions effectively. However, it is difficult to estimate the copula model by maximum likelihood when the multivariate dimension is high, or when some or all of the marginal distributions are discrete-valued, or when the parameters in the marginal distributions and copula are estimated jointly. We therefore propose …


Estimation Of Copula Models With Discrete Margins Via Bayesian Data Augmentation, Michael S. Smith, Mohamad A. Khaled 2011 Melbourne Business School

Estimation Of Copula Models With Discrete Margins Via Bayesian Data Augmentation, Michael S. Smith, Mohamad A. Khaled

Michael Stanley Smith

Estimation of copula models with discrete margins is known to be difficult beyond the bivariate case. We show how this can be achieved by augmenting the likelihood with latent variables, and computing inference using the resulting augmented posterior. To evaluate this we propose two efficient Markov chain Monte Carlo sampling schemes. One generates the latent variables as a block using a Metropolis-Hasting step with a proposal that is close to its target distribution, the other generates them one at a time. Our method applies to all parametric copulas where the conditional copula functions can be evaluated, not just elliptical copulas …


Identification And Efficient Estimation Of The Natural Direct Effect Among The Untreated, Samuel D. Lendle, Mark J. van der Laan 2011 University of California, Berkeley, School of Public Health - Division of Biostatistics

Identification And Efficient Estimation Of The Natural Direct Effect Among The Untreated, Samuel D. Lendle, Mark J. Van Der Laan

U.C. Berkeley Division of Biostatistics Working Paper Series

The natural direct effect (NDE), or the effect of an exposure on an outcome if an intermediate variable was set to the level it would have been in the absence of the exposure, is often of interest to investigators. In general, the statistical parameter associated with the NDE is difficult to estimate in the non-parametric model, particularly when the intermediate variable is continuous or high dimensional. In this paper we introduce a new causal parameter called the natural direct effect among the untreated, discus identifiability assumptions, and show that this new parameter is equivalent to the NDE in a randomized …


If And How Many 'Races'? The Application Of Mixture Modeling To World-Wide Human Craniometric Variation, Bridget Frances Beatrice Algee-Hewitt 2011 University of Tennessee, Knoxville

If And How Many 'Races'? The Application Of Mixture Modeling To World-Wide Human Craniometric Variation, Bridget Frances Beatrice Algee-Hewitt

Doctoral Dissertations

Studies in human cranial variation are extensive and widely discussed. While skeletal biologists continue to focus on questions of biological distance and population history, group-specific knowledge is being increasingly used for human identification in medico-legal contexts. The importance of this research has been often overshadowed by both philosophic and methodological concerns. Many analyses have been constrained in their scope by the limited availability of representative samples and readily criticized for adopting statistical techniques that require user-guidance and a priori information. A multi-part project is presented here that implements model-based clustering as an alternative approach for population studies using craniometric traits. …


A General Family Of Dual To Ratio-Cum-Product Estimator In Sample Surveys, Florentin Smarandache, Rajesh Singh, Mukesh Kumar, Pankaj Chauhan, Nirmala Sawan 2011 University of New Mexico

A General Family Of Dual To Ratio-Cum-Product Estimator In Sample Surveys, Florentin Smarandache, Rajesh Singh, Mukesh Kumar, Pankaj Chauhan, Nirmala Sawan

Branch Mathematics and Statistics Faculty and Staff Publications

This paper presents a family of dual to ratio-cum-product estimators for the finite population mean. Under simple random sampling without replacement (SRSWOR) scheme, expressions of the bias and mean-squared error (MSE) up to the first order of approximation are derived. We show that the proposed family is more efficient than usual unbiased estimator, ratio estimator, product estimator, Singh estimator (1967), Srivenkataramana (1980) and Bandyopadhyaya estimator (1980) and Singh et al. (2005) estimator. An empirical study is carried out to illustrate the performance of the constructed estimator over others.


Automating Construction And Selection Of A Neural Network Using Stochastic Optimization, Jason Lee Hurt 2011 University of Nevada, Las Vegas

Automating Construction And Selection Of A Neural Network Using Stochastic Optimization, Jason Lee Hurt

UNLV Theses, Dissertations, Professional Papers, and Capstones

An artificial neural network can be used to solve various statistical problems by approximating a function that provides a mapping from input to output data. No universal method exists for architecting an optimal neural network. Training one with a low error rate is often a manual process requiring the programmer to have specialized knowledge of the domain for the problem at hand.

A distributed architecture is proposed and implemented for generating a neural network capable of solving a particular problem without specialized knowledge of the problem domain. The only knowledge the application needs is a training set that the network …


Longitudinal High-Dimensional Data Analysis, Vadim Zipunnikov, Sonja Greven, Brian Caffo, Daniel S. Reich, Ciprian Crainiceanu 2011 Johns Hopkins Bloomberg School of Public Health, Department of Biostatistics

Longitudinal High-Dimensional Data Analysis, Vadim Zipunnikov, Sonja Greven, Brian Caffo, Daniel S. Reich, Ciprian Crainiceanu

Johns Hopkins University, Dept. of Biostatistics Working Papers

We develop a flexible framework for modeling high-dimensional functional and imaging data observed longitudinally. The approach decomposes the observed variability of high-dimensional observations measured at multiple visits into three additive components: a subject-specific functional random intercept that quantifies the cross-sectional variability, a subject-specific functional slope that quantifies the dynamic irreversible deformation over multiple visits, and a subject-visit specific functional deviation that quantifies exchangeable or reversible visit-to-visit changes. The proposed method is very fast, scalable to studies including ultra-high dimensional data, and can easily be adapted to and executed on modest computing infrastructures. The method is applied to the longitudinal analysis …


Assessing Association For Bivariate Survival Data With Interval Sampling: A Copula Model Approach With Application To Aids Study, Hong Zhu, Mei-Cheng Wang 2011 The Ohio State University

Assessing Association For Bivariate Survival Data With Interval Sampling: A Copula Model Approach With Application To Aids Study, Hong Zhu, Mei-Cheng Wang

Johns Hopkins University, Dept. of Biostatistics Working Papers

In disease surveillance systems or registries, bivariate survival data are typically collected under interval sampling. It refers to a situation when entry into a registry is at the time of the first failure event (e.g., HIV infection) within a calendar time interval, the time of the initiating event (e.g., birth) is retrospectively identified for all the cases in the registry, and subsequently the second failure event (e.g., death) is observed during the follow-up. Sampling bias is induced due to the selection process that the data are collected conditioning on the first failure event occurs within a time interval. Consequently, the …


Corrected Confidence Bands For Functional Data Using Principal Components, Jeff Goldsmith, Sonja Greven, Ciprian M. Crainiceanu 2011 Johns Hopkins Bloomberg School of Public Health, Department of Biostatistics

Corrected Confidence Bands For Functional Data Using Principal Components, Jeff Goldsmith, Sonja Greven, Ciprian M. Crainiceanu

Johns Hopkins University, Dept. of Biostatistics Working Papers

Functional principal components (FPC) analysis is widely used to decompose and express functional observations. Curve estimates implicitly condition on basis functions and other quantities derived from FPC decompositions; however these objects are unknown in practice. In this paper, we propose a method for obtaining correct curve estimates by accounting for uncertainty in FPC decompositions. Additionally, pointwise and simultaneous confidence intervals that account for both model- based and decomposition-based variability are constructed. Standard mixed-model representations of functional expansions are used to construct curve estimates and variances conditional on a specific decomposition. A bootstrap procedure is implemented to understand the uncertainty in …


Digital Commons powered by bepress