Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 18 of 18

Full-Text Articles in Physical Sciences and Mathematics

Multifactor Dimensionality Reduction With P Risk Scores Per Person, Ye Li Jan 2018

Multifactor Dimensionality Reduction With P Risk Scores Per Person, Ye Li

Theses and Dissertations--Statistics

After reviewing Multifactor Dimensionality Reduction(MDR) and its extensions, an approach to obtain P(larger than 1) risk scores is proposed to predict the continuous outcome for each subject. We study the mean square error(MSE) of dimensionality reduced models fitted with sets of 2 risk scores and investigate the MSE for several special cases of the covariance matrix. A methodology is proposed to select a best set of P risk scores when P is specified a priori. Simulation studies based on true models of different dimensions(larger than 3) demonstrate that the selected set of P(larger than 1) risk scores outperforms the single …


Modeling And Mapping Location-Dependent Human Appearance, Zachary Bessinger Jan 2018

Modeling And Mapping Location-Dependent Human Appearance, Zachary Bessinger

Theses and Dissertations--Computer Science

Human appearance is highly variable and depends on individual preferences, such as fashion, facial expression, and makeup. These preferences depend on many factors including a person's sense of style, what they are doing, and the weather. These factors, in turn, are dependent upon geographic location and time. In our work, we build computational models to learn the relationship between human appearance, geographic location, and time. The primary contributions are a framework for collecting and processing geotagged imagery of people, a large dataset collected by our framework, and several generative and discriminative models that use our dataset to learn the relationship …


Occurrence And Attributes Of Two Echinoderm-Bearing Faunas From The Upper Mississippian (Chesterian; Lower Serpukhovian) Ramey Creek Member, Slade Formation, Eastern Kentucky, U.S.A., Ann Well Harris Jan 2018

Occurrence And Attributes Of Two Echinoderm-Bearing Faunas From The Upper Mississippian (Chesterian; Lower Serpukhovian) Ramey Creek Member, Slade Formation, Eastern Kentucky, U.S.A., Ann Well Harris

Theses and Dissertations--Earth and Environmental Sciences

Well-preserved echinoderm faunas are rare in the fossil record, and when uncovered, understanding their occurrence can be useful in interpreting other faunas. In this study, two such faunas of the same age from separate localities in the shallow-marine Ramey Creek Member of the Slade Formation in the Upper Mississippian (Chesterian) rocks of eastern Kentucky are examined. Of the more than 5,000 fossil specimens from both localities, only 9–34 percent were echinoderms from 3–5 classes. Nine non-echinoderm (8 invertebrate and one vertebrate) classes occurred at both localities, but of these, bryozoans, brachiopods and sponges dominated. To understand the attributes of both …


Aggressive Diuresis And Severity-Adjusted Length Of Hospital Stay In Acute Congestive Heart Failure Patients, Muhammad U. Butt Jan 2018

Aggressive Diuresis And Severity-Adjusted Length Of Hospital Stay In Acute Congestive Heart Failure Patients, Muhammad U. Butt

Theses and Dissertations--Clinical Research Design

To see if aggressive diuresis in first twenty four hours is associated with a comparable number of total days in the hospital as compared to non-aggressive diuresis. In this retrospective cohort study, we compared the length of hospital stay of consecutive patients admitted in one year based on their diuresis during the first twenty-four hours of hospitalization: aggressive diuresis (group 1) i.e. > 2400mL versus non-aggressive diuresis (group 2) i.e. ≤ 2400mL urine output. Patients were excluded if in cardiogenic shock, had creatinine level above 3 mg/dL on admission, or on dialysis. A total of 194 patients were enrolled (29 in …


Accounting For Spatial Autocorrelation In Modeling The Distribution Of Water Quality Variables, Lorrayne Miralha Jan 2018

Accounting For Spatial Autocorrelation In Modeling The Distribution Of Water Quality Variables, Lorrayne Miralha

Theses and Dissertations--Geography

Several studies in hydrology have reported differences in outcomes between models in which spatial autocorrelation (SAC) is accounted for and those in which SAC is not. However, the capacity to predict the magnitude of such differences is still ambiguous. In this thesis, I hypothesized that SAC, inherently possessed by a response variable, influences spatial modeling outcomes. I selected ten watersheds in the USA and analyzed them to determine whether water quality variables with higher Moran’s I values undergo greater increases in the coefficient of determination (R²) and greater decreases in residual SAC (rSAC) after spatial modeling. I compared non-spatial ordinary …


High Dimensional Multivariate Inference Under General Conditions, Xiaoli Kong Jan 2018

High Dimensional Multivariate Inference Under General Conditions, Xiaoli Kong

Theses and Dissertations--Statistics

In this dissertation, we investigate four distinct and interrelated problems for high-dimensional inference of mean vectors in multi-groups.

The first problem concerned is the profile analysis of high dimensional repeated measures. We introduce new test statistics and derive its asymptotic distribution under normality for equal as well as unequal covariance cases. Our derivations of the asymptotic distributions mimic that of Central Limit Theorem with some important peculiarities addressed with sufficient rigor. We also derive consistent and unbiased estimators of the asymptotic variances for equal and unequal covariance cases respectively.

The second problem considered is the accurate inference for high-dimensional repeated …


Bivariate Generalization Of The Time-To-Event Conditional Reassessment Method With A Novel Adaptive Randomization Method, Donglin Yan Jan 2018

Bivariate Generalization Of The Time-To-Event Conditional Reassessment Method With A Novel Adaptive Randomization Method, Donglin Yan

Theses and Dissertations--Epidemiology and Biostatistics

Phase I clinical trials in oncology aim to evaluate the toxicity risk of new therapies and identify a safe but also effective dose for future studies. Traditional Phase I trials of chemotherapies focus on estimating the maximum tolerated dose (MTD). The rationale for finding the MTD is that better therapeutic effects are expected at higher dose levels as long as the risk of severe toxicity is acceptable. With the advent of a new generation of cancer treatments such as the molecularly targeted agents (MTAs) and immunotherapies, higher dose levels no longer guarantee increased therapeutic effects, and the focus has shifted …


Improved Methods And Selecting Classification Types For Time-Dependent Covariates In The Marginal Analysis Of Longitudinal Data, I-Chen Chen Jan 2018

Improved Methods And Selecting Classification Types For Time-Dependent Covariates In The Marginal Analysis Of Longitudinal Data, I-Chen Chen

Theses and Dissertations--Epidemiology and Biostatistics

Generalized estimating equations (GEE) are popularly utilized for the marginal analysis of longitudinal data. In order to obtain consistent regression parameter estimates, these estimating equations must be unbiased. However, when certain types of time-dependent covariates are presented, these equations can be biased unless an independence working correlation structure is employed. Moreover, in this case regression parameter estimation can be very inefficient because not all valid moment conditions are incorporated within the corresponding estimating equations. Therefore, approaches using the generalized method of moments or quadratic inference functions have been proposed for utilizing all valid moment conditions. However, we have found that …


Using Prescription Drug Monitoring Data To Inform Population Level Analysis Of Opioid Analgesic Utilization, Huong T. T. Luu Jan 2018

Using Prescription Drug Monitoring Data To Inform Population Level Analysis Of Opioid Analgesic Utilization, Huong T. T. Luu

Theses and Dissertations--Epidemiology and Biostatistics

Increased opioid analgesic (OA) prescribing has been associated with increased risk of prescription opioid diversion, misuse, and abuse. States established prescription drug monitoring programs (PDMPs) to collect and analyze electronic records for dispensed controlled substances to reduce prescription drug abuse and diversion. PDMP data can be used by prescribers for tracking patient’s history of controlled substance prescribing to inform clinical decisions.

The studies in this dissertation are focused on the less utilized potential of the PDMP data to enhance public health surveillance to monitor OA prescribing and co-prescribing and association with opioid overdose mortality and morbidity. Longitudinal analysis of OA …


Effect Of Socioeconomic And Demographic Factors On Kentucky Crashes, Aaron Berry Cambron Jan 2018

Effect Of Socioeconomic And Demographic Factors On Kentucky Crashes, Aaron Berry Cambron

Theses and Dissertations--Civil Engineering

The goal of this research was to examine the potential predictive ability of socioeconomic and demographic data for drivers on Kentucky crash occurrence. Identifying unique background characteristics of at-fault drivers that contribute to crash rates and crash severity may lead to improved and more specific interventions to reduce the negative impacts of motor vehicle crashes. The driver-residence zip code was used as a spatial unit to connect five years of Kentucky crash data with socioeconomic factors from the U.S. Census, such as income, employment, education, age, and others, along with terrain and vehicle age. At-fault driver crash counts, normalized over …


Investigating The Role Of Prescription Drug Monitoring Programs In Reducing Rates Of Opioid-Related Poisonings, Nathan James Pauly Jan 2018

Investigating The Role Of Prescription Drug Monitoring Programs In Reducing Rates Of Opioid-Related Poisonings, Nathan James Pauly

Theses and Dissertations--Pharmacy

The United States is in the midst of an opioid epidemic. In addition to other system level interventions, almost all states have responded to the crisis by implementing prescription drug monitoring programs (PDMPs). PDMPs are state-level interventions that track the dispensing of Controlled Substances. Data generated at the time of medication dispensing is uploaded to a central data server that may be used to assist in identifying drug diversion, medication misuse, or potentially aberrant prescribing practices.

Prior studies assessing the impact of PDMPs on trends in opioid-related morbidity have often failed to take into account the wide heterogeneity of program …


Using The Qbest Equation To Evaluate Ellagic Acid Safety Data: Generating A Qnoael With Confidence Levels From Disparate Literature, Cynthia Rose Dickerson Jan 2018

Using The Qbest Equation To Evaluate Ellagic Acid Safety Data: Generating A Qnoael With Confidence Levels From Disparate Literature, Cynthia Rose Dickerson

Theses and Dissertations--Pharmacy

QBEST, a novel statistical method, can be applied to the problem of estimating the No Observed Adverse Effect Level (NOAEL or QNOAEL) of a New Molecular Entity (NME) in order to anticipate a safe starting dose for beginning clinical trials. The NOAEL from QBEST (called the QNOAEL) can be calculated using multiple disparate studies in the literature and/or from the lab. The QNOAEL is similar in some ways to the Benchmark Dose Method (BMD) used widely in toxicological research, but is superior to the BMD in some ways. The QNOAEL simulation generates an intuitive curve that is comparable to the …


The Family Of Conditional Penalized Methods With Their Application In Sufficient Variable Selection, Jin Xie Jan 2018

The Family Of Conditional Penalized Methods With Their Application In Sufficient Variable Selection, Jin Xie

Theses and Dissertations--Statistics

When scientists know in advance that some features (variables) are important in modeling a data, then these important features should be kept in the model. How can we utilize this prior information to effectively find other important features? This dissertation is to provide a solution, using such prior information. We propose the Conditional Adaptive Lasso (CAL) estimates to exploit this knowledge. By choosing a meaningful conditioning set, namely the prior information, CAL shows better performance in both variable selection and model estimation. We also propose Sufficient Conditional Adaptive Lasso Variable Screening (SCAL-VS) and Conditioning Set Sufficient Conditional Adaptive Lasso Variable …


Estimation In Partially Linear Models With Correlated Observations And Change-Point Models, Liangdong Fan Jan 2018

Estimation In Partially Linear Models With Correlated Observations And Change-Point Models, Liangdong Fan

Theses and Dissertations--Statistics

Methods of estimating parametric and nonparametric components, as well as properties of the corresponding estimators, have been examined in partially linear models by Wahba [1987], Green et al. [1985], Engle et al. [1986], Speckman [1988], Hu et al. [2004], Charnigo et al. [2015] among others. These models are appealing due to their flexibility and wide range of practical applications including the electricity usage study by Engle et al. [1986], gum disease study by Speckman [1988], etc., wherea parametric component explains linear trends and a nonparametric part captures nonlinear relationships.

The compound estimator (Charnigo et al. [2015]) has been used to …


Automated Tree-Level Forest Quantification Using Airborne Lidar, Hamid Hamraz Jan 2018

Automated Tree-Level Forest Quantification Using Airborne Lidar, Hamid Hamraz

Theses and Dissertations--Computer Science

Traditional forest management relies on a small field sample and interpretation of aerial photography that not only are costly to execute but also yield inaccurate estimates of the entire forest in question. Airborne light detection and ranging (LiDAR) is a remote sensing technology that records point clouds representing the 3D structure of a forest canopy and the terrain underneath. We present a method for segmenting individual trees from the LiDAR point clouds without making prior assumptions about tree crown shapes and sizes. We then present a method that vertically stratifies the point cloud to an overstory and multiple understory tree …


Accounting For Matching Uncertainty In Photographic Identification Studies Of Wild Animals, Amanda R. Ellis Jan 2018

Accounting For Matching Uncertainty In Photographic Identification Studies Of Wild Animals, Amanda R. Ellis

Theses and Dissertations--Statistics

I consider statistical modelling of data gathered by photographic identification in mark-recapture studies and propose a new method that incorporates the inherent uncertainty of photographic identification in the estimation of abundance, survival and recruitment. A hierarchical model is proposed which accepts scores assigned to pairs of photographs by pattern recognition algorithms as data and allows for uncertainty in matching photographs based on these scores. The new models incorporate latent capture histories that are treated as unknown random variables informed by the data, contrasting past models having the capture histories being fixed. The methods properly account for uncertainty in the matching …


Mixtures-Of-Regressions With Measurement Error, Xiaoqiong Fang Jan 2018

Mixtures-Of-Regressions With Measurement Error, Xiaoqiong Fang

Theses and Dissertations--Statistics

Finite Mixture model has been studied for a long time, however, traditional methods assume that the variables are measured without error. Mixtures-of-regression model with measurement error imposes challenges to the statisticians, since both the mixture structure and the existence of measurement error can lead to inconsistent estimate for the regression coefficients. In order to solve the inconsistency, We propose series of methods to estimate the mixture likelihood of the mixtures-of-regressions model when there is measurement error, both in the responses and predictors. Different estimators of the parameters are derived and compared with respect to their relative efficiencies. The simulation results …


Improved Standard Error Estimation For Maintaining The Validities Of Inference In Small-Sample Cluster Randomized Trials And Longitudinal Studies, Whitney Ford Tanner Jan 2018

Improved Standard Error Estimation For Maintaining The Validities Of Inference In Small-Sample Cluster Randomized Trials And Longitudinal Studies, Whitney Ford Tanner

Theses and Dissertations--Epidemiology and Biostatistics

Data arising from Cluster Randomized Trials (CRTs) and longitudinal studies are correlated and generalized estimating equations (GEE) are a popular analysis method for correlated data. Previous research has shown that analyses using GEE could result in liberal inference due to the use of the empirical sandwich covariance matrix estimator, which can yield negatively biased standard error estimates when the number of clusters or subjects is not large. Many techniques have been presented to correct this negative bias; However, use of these corrections can still result in biased standard error estimates and thus test sizes that are not consistently at their …