Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Full-Text Articles in Physical Sciences and Mathematics

Deep Adversarial Subspace Clustering, Pan Zhou, Yunqing Hou, Jiashi Feng Jun 2018

Research Collection School Of Computing and Information Systems

Most existing subspace clustering methods hinge on self-expression of handcrafted representations and are unaware of potential clustering errors. Thus they perform unsatisfactorily on real data with complex underlying subspaces. To solve this issue, we propose a novel deep adversarial subspace clustering (DASC) model, which learns more favorable sample representations by deep learning for subspace clustering, and more importantly introduces adversarial learning to supervise sample representation learning and subspace clustering. Specifically, DASC consists of a subspace clustering generator and a quality-verifying discriminator, which learn against each other. The generator produces subspace estimation and sample clustering. The discriminator evaluates current clustering performance …


Fitting A Complex Markov Chain Model For Firm And Market Productivity, Julia Ruth Valder May 2018

Theses and Dissertations

This thesis develops a methodology for estimating the parameters of a complex Markov chain model for firm productivity. The model consists of two Markov chains, one describing firm-level productivity and the other modeling the productivity of the whole market. Where applicable, the model can be used to support optimal decision-making problems for labor demand. The need for such a model is motivated and the economic background of this research is shown. A brief introduction to the concept of Markov chains and their application in this context is given. The simulated data that is being used for the estimation is …
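As a rough illustration of what estimating Markov chain parameters from simulated data can look like, the sketch below simulates a small discrete chain and recovers its transition matrix from transition counts (the standard maximum likelihood estimate for a fully observed chain). This is not the thesis's model or method: the two-state chain, the matrix `true_P`, and the function names are all assumptions made up for the example.

```python
import random


def simulate_chain(transition, start, n_steps, rng):
    """Simulate a path of a discrete Markov chain with the given transition matrix."""
    states = list(range(len(transition)))
    path = [start]
    for _ in range(n_steps):
        current = path[-1]
        # Draw the next state using the row of the current state as weights.
        path.append(rng.choices(states, weights=transition[current])[0])
    return path


def estimate_transition_matrix(path, n_states):
    """Estimate transition probabilities as normalized transition counts."""
    counts = [[0] * n_states for _ in range(n_states)]
    for a, b in zip(path, path[1:]):
        counts[a][b] += 1
    estimate = []
    for row in counts:
        total = sum(row)
        # A state that is never left has no information; report zeros for it.
        estimate.append([c / total if total else 0.0 for c in row])
    return estimate


# Hypothetical two-state chain (e.g. "low" = 0 and "high" = 1 productivity).
true_P = [[0.9, 0.1], [0.3, 0.7]]
rng = random.Random(0)
path = simulate_chain(true_P, start=0, n_steps=20000, rng=rng)
P_hat = estimate_transition_matrix(path, n_states=2)
print(P_hat)
```

With a long enough simulated path, the rows of `P_hat` should be close to the rows of `true_P`; a model with two coupled chains, as described in the abstract, would need a more elaborate likelihood than this count-based estimate.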


Sabermetrics - Statistical Modeling Of Run Creation And Prevention In Baseball, Parker Chernoff Mar 2018

FIU Electronic Theses and Dissertations

The focus of this thesis was to investigate which baseball metrics are most conducive to run creation and prevention. Stepwise regression and Liu estimation were used to formulate two models for the dependent variables and also used for cross validation. Finally, the predicted values were fed into the Pythagorean Expectation formula to predict a team’s most important goal: winning.

Each model fit strongly, and collinearity among offensive predictors was considered using variance inflation factors. Hits, walks, and home runs allowed, infield putouts, errors, defense-independent earned run average ratio, defensive efficiency ratio, saves, runners left on base, shutouts, and walks per …
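The Pythagorean Expectation mentioned in this abstract is commonly written as runs scored squared divided by the sum of runs scored squared and runs allowed squared. The sketch below shows that common form; the exponent of 2, the function name, and the season totals are assumptions for illustration, and the thesis may use a different exponent or variant.

```python
def pythagorean_expectation(runs_scored, runs_allowed, exponent=2.0):
    """Estimated winning fraction from runs scored and runs allowed."""
    if runs_scored < 0 or runs_allowed < 0:
        raise ValueError("run totals must be non-negative")
    if runs_scored == 0 and runs_allowed == 0:
        raise ValueError("at least one run total must be positive")
    scored = runs_scored ** exponent
    allowed = runs_allowed ** exponent
    return scored / (scored + allowed)


# Hypothetical season totals, e.g. predicted values from two regression models.
win_fraction = pythagorean_expectation(800, 700)
expected_wins = win_fraction * 162  # 162 games in a regular MLB season
print(round(win_fraction, 3), round(expected_wins, 1))
```

In a workflow like the one described, the two inputs would be the predicted runs created and runs allowed rather than observed totals.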


Strategies To Adjust For Response Bias In Clinical Trials: A Simulation Study, Victoria R. Swaidan Feb 2018

USF Tampa Graduate Theses and Dissertations

Background: Response bias can distort treatment effect estimates and inferences in clinical trials. Although prevention, quantification, and adjustment methods have been developed, current methods are not applicable when subject-level reliability is used as the measure of response bias. Thus, the objective of the current study is to develop, test, and recommend a series of bias correction strategies for use in these cases. Methods: Monte Carlo simulation and logistic regression modeling were used to develop the strategies, examining the collective impact of sample size (N), effect size (ES), reliability distribution, and response style on estimating the treatment effect size in a series …


Dimension Reduction For Classification With Many Covariates And Pathway Activity Level Estimation, Seungchul Baek Jan 2018

Theses and Dissertations

The development of science and technology has enabled the use of more covariates. As a result, it has become more difficult to identify dependencies among many covariates. Dimension reduction provides an efficient way to handle this issue by summarizing the effect of covariates via a few linear combinations of covariates. In this dissertation, two methodologies for real-life problems are provided by using dimension reduction equipped with semiparametric theory. The use of semiparametrics allows maximal flexibility of the model by leaving some features of the model completely unspecified, while we still enjoy the interpretability of the model through estimating the parameters …


Uncertainty Estimation Of Deep Neural Networks, Chao Chen Jan 2018

Theses and Dissertations

Standard neural networks trained with gradient descent and back-propagation have achieved great success in various applications. On one hand, point estimation of the network weights is prone to over-fitting and lacks important uncertainty information associated with the estimation. On the other hand, exact Bayesian neural network methods are intractable and inapplicable to real-world applications. To date, approximate methods have been actively under development for Bayesian neural networks, including but not limited to: stochastic variational methods, Monte Carlo dropout, and expectation propagation. Though these methods are applicable for current large networks, there are limits to these approaches with either underestimation …


Semiparametric Statistical Estimation And Inference With Latent Information, Qianqian Wang Jan 2018

Theses and Dissertations

In Chapter 1, we predicted disease risk by transformation models in the presence of missing subgroup identifiers. When a discrete covariate defining subgroup membership is missing for some of the subjects in a study, the distribution of the outcome follows a mixture distribution of the subgroup-specific distributions. Taking into account the uncertain distribution of the group membership and the covariates, we model the relation between the disease onset time and the covariates through transformation models in each sub-population, and develop a nonparametric maximum likelihood based estimation procedure, implemented through the EM algorithm, along with its inference procedure. We further propose methods to …