Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Statistical Methodology (9)
- Statistical Models (7)
- Categorical Data Analysis (4)
- Applied Statistics (3)
- Biostatistics (3)
-
- Medicine and Health Sciences (3)
- Computer Sciences (2)
- Environmental Public Health (2)
- Epidemiology (2)
- Microarrays (2)
- Public Health (2)
- Artificial Intelligence and Robotics (1)
- Civil Engineering (1)
- Civil and Environmental Engineering (1)
- Clinical Trials (1)
- Congenital, Hereditary, and Neonatal Diseases and Abnormalities (1)
- Diseases (1)
- Engineering (1)
- Environmental Health (1)
- Environmental Monitoring (1)
- Environmental Sciences (1)
- Geographic Information Sciences (1)
- Geography (1)
- International Public Health (1)
- Life Sciences (1)
- Longitudinal Data Analysis and Time Series (1)
- Maternal and Child Health (1)
- Keyword
-
- Sufficient dimension reduction (3)
- Central subspace (2)
- Independence (2)
- MANOVA (2)
- Multivariate Data (2)
-
- Sufficient variable selection (2)
- Air Force (1)
- Appalachia (1)
- Army (1)
- Asymptotic distribution (1)
- At-Fault (1)
- Bilaterally contaminated normal model (1)
- Categorical variable (1)
- Central expectile subspace (1)
- Central mean subspace (1)
- Central quantile subspace (1)
- Clustered data (1)
- Compound Estimation (1)
- Congenital anomalies (1)
- Crash Rate (1)
- Data Fusion (1)
- Demographics (1)
- Discriminant Cluster Analysis (1)
- Distance (1)
- EM algorithm (1)
- Empirical distribution (1)
- Environmental contaminants (1)
- Finite Mixture Models (1)
- Fourier transform (1)
- GIS (1)
Articles 1 - 18 of 18
Full-Text Articles in Multivariate Analysis
Novel Nonparametric Testing Approaches For Multivariate Growth Curve Data: Finite-Sample, Resampling And Rank-Based Methods, Ting Zeng
Theses and Dissertations--Statistics
Multivariate growth curve data naturally arise in various fields, for example, biomedical science, public health, agriculture, social science and so on. For data of this type, the classical approach is to conduct multivariate analysis of variance (MANOVA) based on Wilks' Lambda and other multivariate statistics, which require the assumptions of multivariate normality and homogeneity of within-cell covariance matrices. However, data being analyzed nowadays show marked departure from multivariate normal distribution and homoscedasticity. In this dissertation, we investigate nonparametric testing approaches for multivariate growth curve data from three aspects, i.e., finite-sample, resampling and rank-based methods.
The first project proposes an approximate …
Estimating And Testing Treatment Effects With Misclassified Multivariate Data, Zi Ye
Estimating And Testing Treatment Effects With Misclassified Multivariate Data, Zi Ye
Theses and Dissertations--Statistics
Clinical trials are often used to assess drug efficacy and safety. Participants are sometimes pre-stratified into different groups by diagnostic tools. However, these diagnostic tools are fallible. The traditional method ignores this problem and assumes the diagnostic devices are perfect. This assumption will lead to inefficient and biased estimators. In this era of personalized medicine and measurement-based care, the issues of bias and efficiency are of paramount importance. Despite the prominence, only few researches evaluated the treatment effect in the presence of misclassifications in some special cases and most others focus on assessing the accuracy of the diagnostic devices. In …
Evaluating The Incidence Of Melanoma And Lung Cancer Of Current And Former Active-Duty U.S. Military Who Were Deployed In Support Of Operation Enduring Freedom And Operation Iraqi Freedom, Brian Kovacic
Theses and Dissertations--Epidemiology and Biostatistics
The incidence of melanoma and lung cancer has been gradually increasing in the United States over the past three decades with the reputed causes due to etiological and environmental exposures, and tobacco usage. There has been concern that melanoma and lung cancer incidence among military personnel may be associated with deployment to environments with intense sun exposure and increased smoking rates due to post-traumatic stress disorder. The aim of this study was to examine associations between deployment in support of Operation Enduring Freedom (OEF) or Operation Iraqi Freedom (OIF), from 2001 through 2015, with subsequent melanoma and lung cancer incidence. …
Maternal Proximity To Mountaintop Removal Mining And Birth Defects In Appalachian Kentucky, 1997-2003, Daniel B. Cooper
Maternal Proximity To Mountaintop Removal Mining And Birth Defects In Appalachian Kentucky, 1997-2003, Daniel B. Cooper
Theses and Dissertations--Public Health (M.P.H. & Dr.P.H.)
Background: Extraction of coal through mountaintop removal mining (MTR) alters many dimensions of the landscape, and explosive blasts, exposed rock, and coal washing have the potential to pollute air and water with substances known to increase risk of developmental and birth anomalies. Previous research suggests that infants born to mothers living in MTR coal mining counties have higher prevalence of most types of birth defects.
Objectives: This study seeks to examine further the relationship between MTR activity and birth defects by employing individual level exposure estimation through precise satellite data of MTR activity in the Appalachian region and maternal residence …
Moment Kernels For T-Central Subspace, Weihang Ren
Moment Kernels For T-Central Subspace, Weihang Ren
Theses and Dissertations--Statistics
The T-central subspace allows one to perform sufficient dimension reduction for any statistical functional of interest. We propose a general estimator using a third moment kernel to estimate the T-central subspace. In particular, in this dissertation we develop sufficient dimension reduction methods for the central mean subspace via the regression mean function and central subspace via Fourier transform, central quantile subspace via quantile estimator and central expectile subsapce via expectile estima- tor. Theoretical results are established and simulation studies show the advantages of our proposed methods.
Nonparametric Analysis Of Clustered And Multivariate Data, Yue Cui
Nonparametric Analysis Of Clustered And Multivariate Data, Yue Cui
Theses and Dissertations--Statistics
In this dissertation, we investigate three distinct but interrelated problems for nonparametric analysis of clustered data and multivariate data in pre-post factorial design.
In the first project, we propose a nonparametric approach for one-sample clustered data in pre-post intervention design. In particular, we consider the situation where for some clusters all members are only observed at either pre or post intervention but not both. This type of clustered data is referred to us as partially complete clustered data. Unlike most of its parametric counterparts, we do not assume specific models for data distributions, intra-cluster dependence structure or variability, in effect …
Nonparametric Tests Of Lack Of Fit For Multivariate Data, Yan Xu
Nonparametric Tests Of Lack Of Fit For Multivariate Data, Yan Xu
Theses and Dissertations--Statistics
A common problem in regression analysis (linear or nonlinear) is assessing the lack-of-fit. Existing methods make parametric or semi-parametric assumptions to model the conditional mean or covariance matrices. In this dissertation, we propose fully nonparametric methods that make only additive error assumptions. Our nonparametric approach relies on ideas from nonparametric smoothing to reduce the test of association (lack-of-fit) problem into a nonparametric multivariate analysis of variance. A major problem that arises in this approach is that the key assumptions of independence and constant covariance matrix among the groups will be violated. As a result, the standard asymptotic theory is not …
Composite Nonparametric Tests In High Dimension, Alejandro G. Villasante Tezanos
Composite Nonparametric Tests In High Dimension, Alejandro G. Villasante Tezanos
Theses and Dissertations--Statistics
This dissertation focuses on the problem of making high-dimensional inference for two or more groups. High-dimensional means both the sample size (n) and dimension (p) tend to infinity, possibly at different rates. Classical approaches for group comparisons fail in the high-dimensional situation, in the sense that they have incorrect sizes and low powers. Much has been done in recent years to overcome these problems. However, these recent works make restrictive assumptions in terms of the number of treatments to be compared and/or the distribution of the data. This research aims to (1) propose and investigate refined …
Transforms In Sufficient Dimension Reduction And Their Applications In High Dimensional Data, Jiaying Weng
Transforms In Sufficient Dimension Reduction And Their Applications In High Dimensional Data, Jiaying Weng
Theses and Dissertations--Statistics
The big data era poses great challenges as well as opportunities for researchers to develop efficient statistical approaches to analyze massive data. Sufficient dimension reduction is such an important tool in modern data analysis and has received extensive attention in both academia and industry.
In this dissertation, we introduce inverse regression estimators using Fourier transforms, which is superior to the existing SDR methods in two folds, (1) it avoids the slicing of the response variable, (2) it can be readily extended to solve the high dimensional data problem. For the ultra-high dimensional problem, we investigate both eigenvalue decomposition and minimum …
A New Independence Measure And Its Applications In High Dimensional Data Analysis, Chenlu Ke
A New Independence Measure And Its Applications In High Dimensional Data Analysis, Chenlu Ke
Theses and Dissertations--Statistics
This dissertation has three consecutive topics. First, we propose a novel class of independence measures for testing independence between two random vectors based on the discrepancy between the conditional and the marginal characteristic functions. If one of the variables is categorical, our asymmetric index extends the typical ANOVA to a kernel ANOVA that can test a more general hypothesis of equal distributions among groups. The index is also applicable when both variables are continuous. Second, we develop a sufficient variable selection procedure based on the new measure in a large p small n setting. Our approach incorporates marginal information between …
Effect Of Socioeconomic And Demographic Factors On Kentucky Crashes, Aaron Berry Cambron
Effect Of Socioeconomic And Demographic Factors On Kentucky Crashes, Aaron Berry Cambron
Theses and Dissertations--Civil Engineering
The goal of this research was to examine the potential predictive ability of socioeconomic and demographic data for drivers on Kentucky crash occurrence. Identifying unique background characteristics of at-fault drivers that contribute to crash rates and crash severity may lead to improved and more specific interventions to reduce the negative impacts of motor vehicle crashes. The driver-residence zip code was used as a spatial unit to connect five years of Kentucky crash data with socioeconomic factors from the U.S. Census, such as income, employment, education, age, and others, along with terrain and vehicle age. At-fault driver crash counts, normalized over …
High Dimensional Multivariate Inference Under General Conditions, Xiaoli Kong
High Dimensional Multivariate Inference Under General Conditions, Xiaoli Kong
Theses and Dissertations--Statistics
In this dissertation, we investigate four distinct and interrelated problems for high-dimensional inference of mean vectors in multi-groups.
The first problem concerned is the profile analysis of high dimensional repeated measures. We introduce new test statistics and derive its asymptotic distribution under normality for equal as well as unequal covariance cases. Our derivations of the asymptotic distributions mimic that of Central Limit Theorem with some important peculiarities addressed with sufficient rigor. We also derive consistent and unbiased estimators of the asymptotic variances for equal and unequal covariance cases respectively.
The second problem considered is the accurate inference for high-dimensional repeated …
Using The Qbest Equation To Evaluate Ellagic Acid Safety Data: Generating A Qnoael With Confidence Levels From Disparate Literature, Cynthia Rose Dickerson
Using The Qbest Equation To Evaluate Ellagic Acid Safety Data: Generating A Qnoael With Confidence Levels From Disparate Literature, Cynthia Rose Dickerson
Theses and Dissertations--Pharmacy
QBEST, a novel statistical method, can be applied to the problem of estimating the No Observed Adverse Effect Level (NOAEL or QNOAEL) of a New Molecular Entity (NME) in order to anticipate a safe starting dose for beginning clinical trials. The NOAEL from QBEST (called the QNOAEL) can be calculated using multiple disparate studies in the literature and/or from the lab. The QNOAEL is similar in some ways to the Benchmark Dose Method (BMD) used widely in toxicological research, but is superior to the BMD in some ways. The QNOAEL simulation generates an intuitive curve that is comparable to the …
Informational Index And Its Applications In High Dimensional Data, Qingcong Yuan
Informational Index And Its Applications In High Dimensional Data, Qingcong Yuan
Theses and Dissertations--Statistics
We introduce a new class of measures for testing independence between two random vectors, which uses expected difference of conditional and marginal characteristic functions. By choosing a particular weight function in the class, we propose a new index for measuring independence and study its property. Two empirical versions are developed, their properties, asymptotics, connection with existing measures and applications are discussed. Implementation and Monte Carlo results are also presented.
We propose a two-stage sufficient variable selections method based on the new index to deal with large p small n data. The method does not require model specification and especially focuses …
Development In Normal Mixture And Mixture Of Experts Modeling, Meng Qi
Development In Normal Mixture And Mixture Of Experts Modeling, Meng Qi
Theses and Dissertations--Statistics
In this dissertation, first we consider the problem of testing homogeneity and order in a contaminated normal model, when the data is correlated under some known covariance structure. To address this problem, we developed a moment based homogeneity and order test, and design weights for test statistics to increase power for homogeneity test. We applied our test to microarray about Down’s syndrome. This dissertation also studies a singular Bayesian information criterion (sBIC) for a bivariate hierarchical mixture model with varying weights, and develops a new data dependent information criterion (sFLIC).We apply our model and criteria to birth- weight and gestational …
Normal Mixture And Contaminated Model With Nuisance Parameter And Applications, Qian Fan
Normal Mixture And Contaminated Model With Nuisance Parameter And Applications, Qian Fan
Theses and Dissertations--Statistics
This paper intend to find the proper hypothesis and test statistic for testing existence of bilaterally contamination when there exists nuisance parameter. The test statistic is based on method of moments estimators. Union-Intersection test is used for testing if the distribution of population can be implemented by a bilaterally contaminated normal model with unknown variance. This paper also developed a hierarchical normal mixture model (HNM) and applied it to birth weight data. EM algorithm is employed for parameter estimation and a singular Bayesian information criterion (sBIC) is applied to choose the number components. We also proposed a singular flexible information …
James-Stein Type Compound Estimation Of Multiple Mean Response Functions And Their Derivatives, Limin Feng
James-Stein Type Compound Estimation Of Multiple Mean Response Functions And Their Derivatives, Limin Feng
Theses and Dissertations--Statistics
Charnigo and Srinivasan originally developed compound estimators to nonparametrically estimate mean response functions and their derivatives simultaneously when there is one response variable and one covariate. The compound estimator maintains self consistency and almost optimal convergence rate. This dissertation studies, in part, compound estimation with multiple responses and/or covariates. An empirical comparison of compound estimation, local regression and spline smoothing is included, and near optimal convergence rates are established in the presence of multiple covariates.
James and Stein proposed an estimator of the mean vector of a p dimensional multivariate normal distribution, which produces a smaller risk than the maximum …
Mapping And Decomposing Scale-Dependent Soil Moisture Variability Within An Inner Bluegrass Landscape, Carla Landrum
Mapping And Decomposing Scale-Dependent Soil Moisture Variability Within An Inner Bluegrass Landscape, Carla Landrum
Theses and Dissertations--Plant and Soil Sciences
There is a shared desire among public and private sectors to make more reliable predictions, accurate mapping, and appropriate scaling of soil moisture and associated parameters across landscapes. A discrepancy often exists between the scale at which soil hydrologic properties are measured and the scale at which they are modeled for management purposes. Moreover, little is known about the relative importance of hydrologic modeling parameters as soil moisture fluctuates with time. More research is needed to establish which observation scales in space and time are optimal for managing soil moisture variation over large spatial extents and how these scales are …