Open Access. Powered by Scholars. Published by Universities.®

Statistical Models Commons

Open Access. Powered by Scholars. Published by Universities.®

2016

Discipline
Institution
Keyword
Publication
Publication Type
File Type

Articles 1 - 30 of 52

Full-Text Articles in Statistical Models

Novel Models Of Visual Topographic Map Alignment In The Superior Colliculus., Ruben A Tikidji-Hamburyan, Tarek A El-Ghazawi, Jason W. Triplett Dec 2016

Novel Models Of Visual Topographic Map Alignment In The Superior Colliculus., Ruben A Tikidji-Hamburyan, Tarek A El-Ghazawi, Jason W. Triplett

Pediatrics Faculty Publications

The establishment of precise neuronal connectivity during development is critical for sensing the external environment and informing appropriate behavioral responses. In the visual system, many connections are organized topographically, which preserves the spatial order of the visual scene. The superior colliculus (SC) is a midbrain nucleus that integrates visual inputs from the retina and primary visual cortex (V1) to regulate goal-directed eye movements. In the SC, topographically organized inputs from the retina and V1 must be aligned to facilitate integration. Previously, we showed that retinal input instructs the alignment of V1 inputs in the SC in a manner dependent on …


A Traders Guide To The Predictive Universe- A Model For Predicting Oil Price Targets And Trading On Them, Jimmie Harold Lenz Dec 2016

A Traders Guide To The Predictive Universe- A Model For Predicting Oil Price Targets And Trading On Them, Jimmie Harold Lenz

Doctor of Business Administration Dissertations

At heart every trader loves volatility; this is where return on investment comes from, this is what drives the proverbial “positive alpha.” As a trader, understanding the probabilities related to the volatility of prices is key, however if you could also predict future prices with reliability the world would be your oyster. To this end, I have achieved three goals with this dissertation, to develop a model to predict future short term prices (direction and magnitude), to effectively test this by generating consistent profits utilizing a trading model developed for this purpose, and to write a paper that anyone with …


A Multi-Indexed Logistic Model For Time Series, Xiang Liu Dec 2016

A Multi-Indexed Logistic Model For Time Series, Xiang Liu

Electronic Theses and Dissertations

In this thesis, we explore a multi-indexed logistic regression (MILR) model, with particular emphasis given to its application to time series. MILR includes simple logistic regression (SLR) as a special case, and the hope is that it will in some instances also produce significantly better results. To motivate the development of MILR, we consider its application to the analysis of both simulated sine wave data and stock data. We looked at well-studied SLR and its application in the analysis of time series data. Using a more sophisticated representation of sequential data, we then detail the implementation of MILR. We compare …


Applications Of Credit Scoring Models, Mimi Mei Ling Chong Dec 2016

Applications Of Credit Scoring Models, Mimi Mei Ling Chong

Electronic Thesis and Dissertation Repository

The application of credit scoring on consumer lending is an automated, objective and consistent tool which helps lenders to provide quick loan decisions. In order to apply for a loan, applicants must provide their attributes by filling out an application form. Certain attributes are then selected as inputs to a credit scoring model which generates a credit score. The magnitude of this credit score is proved to be related to the credit quality of the loan applicant. As such, it is used to determine whether the loan will be granted, and also the amount of interest being charged. Currently, little …


Towards Deeper Understanding In Neuroimaging, Rex Devon Hjelm Nov 2016

Towards Deeper Understanding In Neuroimaging, Rex Devon Hjelm

Computer Science ETDs

Neuroimaging is a growing domain of research, with advances in machine learning having tremendous potential to expand understanding in neuroscience and improve public health. Deep neural networks have recently and rapidly achieved historic success in numerous domains, and as a consequence have completely redefined the landscape of automated learners, giving promise of significant advances in numerous domains of research. Despite recent advances and advantages over traditional machine learning methods, deep neural networks have yet to have permeated significantly into neuroscience studies, particularly as a tool for discovery. This dissertation presents well-established and novel tools for unsupervised learning which aid in …


The Impacts Of Telecommuting On The Time-Space Distribution Of Daily Activities, Mario Benito Rojas Iv Nov 2016

The Impacts Of Telecommuting On The Time-Space Distribution Of Daily Activities, Mario Benito Rojas Iv

FIU Electronic Theses and Dissertations

As major cities have aged, they have also met or exceeded their transportation infrastructure’s capacity. This has led to many negative impacts such as increased greenhouse gas emissions, delay, travel time, congestion, as well as decreased energy independence, standard of living for the cities’ inhabitants and the world as a whole. As a result, these cities will undoubtedly suffer and will struggle to meet the needs of their citizens. It is becoming more evident, and relevant, that the solution to today’s and tomorrow’s transportation problems will be overcome through the use of policy as well as innovative strategies, one of …


Stationary Points For Parametric Stochastic Frontier Models, William C. Horrace, Ian A. Wright Nov 2016

Stationary Points For Parametric Stochastic Frontier Models, William C. Horrace, Ian A. Wright

Center for Policy Research

The results of Waldman (1982) on the Normal-Half Normal stochastic frontier model are generalized using the theory of the Dirac delta (Dirac, 1930), and distribution-free conditions are established to ensure a stationary point in the likelihood as the variance of the inefficiency distribution goes to zero. Stability of the stationary point and "wrong skew" results are derived or simulated for common parametric assumptions on the model. Identification is discussed.


Advanced Data Analysis - Lecture Notes, Erik B. Erhardt, Edward J. Bedrick, Ronald M. Schrader Oct 2016

Advanced Data Analysis - Lecture Notes, Erik B. Erhardt, Edward J. Bedrick, Ronald M. Schrader

Open Textbooks

Lecture notes for Advanced Data Analysis (ADA1 Stat 427/527 and ADA2 Stat 428/528), Department of Mathematics and Statistics, University of New Mexico, Fall 2016-Spring 2017. Additional material including RMarkdown templates for in-class and homework exercises, datasets, R code, and video lectures are available on the course websites: https://statacumen.com/teaching/ada1 and https://statacumen.com/teaching/ada2 .

Contents

I ADA1: Software

  • 0 Introduction to R, Rstudio, and ggplot

II ADA1: Summaries and displays, and one-, two-, and many-way tests of means

  • 1 Summarizing and Displaying Data
  • 2 Estimation in One-Sample Problems
  • 3 Two-Sample Inferences
  • 4 Checking Assumptions
  • 5 One-Way Analysis of Variance

III ADA1: Nonparametric, categorical, …


Exploring New Models For Seatbelt Use In Survey Data, Mark K. Ledbetter, Norou Diawara, Bryan E. Porter Oct 2016

Exploring New Models For Seatbelt Use In Survey Data, Mark K. Ledbetter, Norou Diawara, Bryan E. Porter

Virginia Journal of Science

Problem: Several approaches to analyze seatbelt use have been proposed in the literature. Two methods that has not been explored are the use of unweighted and weighted logistic regression model and the use of item response theory (IRT) or the Rasch model. Since accurate methods to predict seatbelt use behavior based upon observed data must include a built-in design method and model, and overcome computation challenges, weighted and IRT method deem to be other options for an observational survey of seat belt use in the state of Virginia.

Method: The observed data from 136 sites within the Commonwealth …


The Reduced Form Of Litigation Models And The Plaintiff's Win Rate, Jonah B. Gelbach Sep 2016

The Reduced Form Of Litigation Models And The Plaintiff's Win Rate, Jonah B. Gelbach

All Faculty Scholarship

In this paper I introduce what I call the reduced form approach to studying the plaintiff's win rate in litigation selection models. A reduced form comprises a joint distribution of plaintiff's and defendant's beliefs concerning the probability that the plaintiff would win in the event a dispute were litigated; a conditional win rate function that tells us the actual probability of a plaintiff win in the event of litigation, given the parties' subjective beliefs; and a litigation rule that provides the probability that a case will be litigated given the two parties' beliefs. I show how models with very different-looking …


Addition To Pglr Chap 6, Joseph M. Hilbe Aug 2016

Addition To Pglr Chap 6, Joseph M. Hilbe

Joseph M Hilbe

Addition to Chapter 6 in Practical Guide to Logistic Regression. Added section on Bayesian logistic regression using Stata.


Passive Visual Analytics Of Social Media Data For Detection Of Unusual Events, Kush Rustagi, Junghoon Chae Aug 2016

Passive Visual Analytics Of Social Media Data For Detection Of Unusual Events, Kush Rustagi, Junghoon Chae

The Summer Undergraduate Research Fellowship (SURF) Symposium

Now that social media sites have gained substantial traction, huge amounts of un-analyzed valuable data are being generated. Posts containing images and text have spatiotemporal data attached as well, having immense value for increasing situational awareness of local events, providing insights for investigations and understanding the extent of incidents, their severity, and consequences, as well as their time-evolving nature. However, the large volume of unstructured social media data hinders exploration and examination. To analyze such social media data, the S.M.A.R.T system provides the analyst with an interactive visual spatiotemporal analysis and spatial decision support environment that assists in evacuation planning …


A Gene-Based Association Method For Mapping Traits Using Reference Transcriptome Data, Eric R. Gamazon, Heather Wheeler, Kaanan P. Shah, Sahar V. Mozaffari, Keston Aquino-Michaels, Robert J. Carroll, Anne E. Eyler, Joshua C. Denny, Dan L. Nicolae, Nancy J. Cox, Hae Kyung Im Aug 2016

A Gene-Based Association Method For Mapping Traits Using Reference Transcriptome Data, Eric R. Gamazon, Heather Wheeler, Kaanan P. Shah, Sahar V. Mozaffari, Keston Aquino-Michaels, Robert J. Carroll, Anne E. Eyler, Joshua C. Denny, Dan L. Nicolae, Nancy J. Cox, Hae Kyung Im

Heather Wheeler

Genome-wide association studies (GWAS) have identified thousands of variants robustly associated with complex traits. However, the biological mechanisms underlying these associations are, in general, not well understood. We propose a gene-based association method called PrediXcan that directly tests the molecular mechanisms through which genetic variation affects phenotype. The approach estimates the component of gene expression determined by an individual’s genetic profile and correlates ‘imputed’ gene expression with the phenotype under investigation to identify genes involved in the etiology of the phenotype. Genetically regulated gene expression is estimated using whole-genome tissue-dependent prediction models trained with reference transcriptome data sets. PrediXcan enjoys …


Newsvendor Models With Monte Carlo Sampling, Ijeoma W. Ekwegh Aug 2016

Newsvendor Models With Monte Carlo Sampling, Ijeoma W. Ekwegh

Electronic Theses and Dissertations

Newsvendor Models with Monte Carlo Sampling by Ijeoma Winifred Ekwegh The newsvendor model is used in solving inventory problems in which demand is random. In this thesis, we will focus on a method of using Monte Carlo sampling to estimate the order quantity that will either maximizes revenue or minimizes cost given that demand is uncertain. Given data, the Monte Carlo approach will be used in sampling data over scenarios and also estimating the probability density function. A bootstrapping process yields an empirical distribution for the order quantity that will maximize the expected profit. Finally, this method will be used …


The Influence Of The Electric Supply Industry On Economic Growth In Less Developed Countries, Edward Richard Bee Aug 2016

The Influence Of The Electric Supply Industry On Economic Growth In Less Developed Countries, Edward Richard Bee

Dissertations

This study measures the impact that electrical outages have on manufacturing production in 135 less developed countries using stochastic frontier analysis and data from World Bank’s Investment Climate surveys. Outages of electricity, for firms with and without backup power sources, are the most frequently cited constraint on manufacturing growth in these surveys.

Outages are shown to reduce output below the production frontier by almost five percent in Africa and by a lower percentage in South Asia, Southeast Asia and the Middle East and North Africa. Production response to outages is quadratic in form. Outages also increase labor cost, reduce exports …


Multilevel Models For Longitudinal Data, Aastha Khatiwada Aug 2016

Multilevel Models For Longitudinal Data, Aastha Khatiwada

Electronic Theses and Dissertations

Longitudinal data arise when individuals are measured several times during an ob- servation period and thus the data for each individual are not independent. There are several ways of analyzing longitudinal data when different treatments are com- pared. Multilevel models are used to analyze data that are clustered in some way. In this work, multilevel models are used to analyze longitudinal data from a case study. Results from other more commonly used methods are compared to multilevel models. Also, comparison in output between two software, SAS and R, is done. Finally a method consisting of fitting individual models for each …


Social Learning And Adoption Of New Behavior In A Virtual Agent Society, Benjamin D. Nye, Barry G. Silverman Jul 2016

Social Learning And Adoption Of New Behavior In A Virtual Agent Society, Benjamin D. Nye, Barry G. Silverman

Barry G Silverman

Social learning and adoption of new behavior govern the rise of a variety of behaviors: from actions as mundane as dance steps to those as dangerous as new ways to make IED detonators. However, agents in immersive virtual environments lack the ability to realistically simulate the spread of new behavior. To address this gap, a cognitive model was designed that represents the well-known socio-cognitive factors of attention, social influence, and motivation that influence learning and the adoption of a new behavior. To explore the effectiveness of this model, simulations modeled the spread of two competing memes in Hamariyah, an archetypal …


Social Learning And Adoption Of New Behavior In A Virtual Agent Society, Benjamin D. Nye, Barry G. Silverman Jul 2016

Social Learning And Adoption Of New Behavior In A Virtual Agent Society, Benjamin D. Nye, Barry G. Silverman

Barry G Silverman

Social learning and adoption of new behavior govern the rise of a variety of behaviors: from actions as mundane as dance steps to those as dangerous as new ways to make IED detonators. However, agents in immersive virtual environments lack the ability to realistically simulate the spread of new behavior. To address this gap, a cognitive model was designed that represents the well-known socio-cognitive factors of attention, social influence, and motivation that influence learning and the adoption of a new behavior. To explore the effectiveness of this model, simulations modeled the spread of two competing memes in Hamariyah, an archetypal …


The Influence Of Model Resolution On The Simulated Sensitivity Of North Atlantic Tropical Cyclone Maximum Intensity To Sea Surface Temperature, Sarah Strazzo, James Elsner, Timothy Larow, Hiroyuki Murakami, Michael Wehner, Ming Zhao Jul 2016

The Influence Of Model Resolution On The Simulated Sensitivity Of North Atlantic Tropical Cyclone Maximum Intensity To Sea Surface Temperature, Sarah Strazzo, James Elsner, Timothy Larow, Hiroyuki Murakami, Michael Wehner, Ming Zhao

Publications

No abstract provided.


Analysis Off Dependent Discrete Choices Using Gaussian Copula, Arjun Poddar Jul 2016

Analysis Off Dependent Discrete Choices Using Gaussian Copula, Arjun Poddar

Mathematics & Statistics Theses & Dissertations

A popular tool for analyzing product choices of consumers is the well-known conditional logit discrete choice model. Originally publicized by McFadden (1974), this model assumes that the random components of the underlying latent utility functions of the consumers follow independent Gumbel distributions. However, in practice the independence assumption may be violated and a more reasonable model should account for the dependence of the utilities. In this dissertation we use the Gaussian copula with compound symmetric and autoregressive of order one correlation matrices to construct a general multivariate model for the joint distribution of the utilities. The induced correlations on the …


Bayesian Nonparametric Approaches To Multiple Testing, Density Estimation, And Supervised Learning, William Cipolli Iii Jun 2016

Bayesian Nonparametric Approaches To Multiple Testing, Density Estimation, And Supervised Learning, William Cipolli Iii

Theses and Dissertations

This dissertation presents methods for several applications of Polya tree models. These novel nonparametric approaches to the problems of multiple testing, density estimation and supervised learning provide an alternative to other parametric and nonparametric models. In Chapter 2, the proposed approximate finite Polya tree multiple testing procedure is very successful in correctly classifying the observations with non-zero mean in a computationally efficient manner; this holds even when the non-zero means are simulated from a mean-zero distribution. Further, the model is capable of this for “interestingly different” observations in the cases where that is of interest. Chapter 3 proposes discrete, and …


Well I'Ll Be Damned - Insights Into Predictive Value Of Pedigree Information In Horse Racing, Timothy Baker Mr, Ming-Chien Sung, Johnnie Johnson Professor, Tiejun Ma Jun 2016

Well I'Ll Be Damned - Insights Into Predictive Value Of Pedigree Information In Horse Racing, Timothy Baker Mr, Ming-Chien Sung, Johnnie Johnson Professor, Tiejun Ma

International Conference on Gambling & Risk Taking

Fundamental form characteristics like how fast a horse ran at its last start, are widely used to help predict the outcome of horse racing events. The exception being in races where horses haven’t previously competed, such as Maiden races, where there is little or no publicly available past performance information. In these types of events bettors need only consider a simplified suite of factors however this is offset by a higher level of uncertainty. This paper examines the inherent information content embedded within a horse’s ancestry and the extent to which this information is discounted in the United Kingdom bookmaker …


Quantifying Transit Access In New York City: Formulating An Accessibility Index For Analyzing Spatial And Social Patterns Of Public Transportation, Maxwell S. Siegel May 2016

Quantifying Transit Access In New York City: Formulating An Accessibility Index For Analyzing Spatial And Social Patterns Of Public Transportation, Maxwell S. Siegel

Theses and Dissertations

This paper aims to analyze accessibility within New York City’s transportation system through creating unique accessibility indices. Indices are detailed and implemented using GIS, analyzing the distribution of transit need and access. Regression analyses are performed highlighting relationships between demographics and accessibility and recommendations for transit expansion are presented.


Examining Cost Functionality And Optimization: A Case Study On Testing The Reasonableness Of New Aircraft Using Historical Aircraft Data, Katherine Jozefiak May 2016

Examining Cost Functionality And Optimization: A Case Study On Testing The Reasonableness Of New Aircraft Using Historical Aircraft Data, Katherine Jozefiak

Arts & Sciences Electronic Theses and Dissertations

When pursuing business by competing for government contracts, proving the submitted price is reasonable is often required. This proof is called a test of reasonableness. This study analyzes data from historical aircraft programs in relation of a new aircraft program in order to demonstrate the estimated cost of the new program is reasonable. The purpose of this study is to investigate three questions. Is the new program cost reasonable using current industry and government parameters? Is it better to look at programs from a total cost perspective or break the total cost into subcategory levels? Finally, this study applies a …


Spot Volatility Estimation Of Ito Semimartingales Using Delta Sequences, Weixuan Gao May 2016

Spot Volatility Estimation Of Ito Semimartingales Using Delta Sequences, Weixuan Gao

Arts & Sciences Electronic Theses and Dissertations

This thesis studies a unifying class of nonparametric spot volatility estimators proposed by Mancini et. al.(2013). This method is based on delta sequences and is conceived to include many of the existing estimators in the field as special cases. The thesis first surveys the asymptotic theory of the proposed estimators under an infill asymptotic scheme and fixed time horizon, when the state variable follows a Brownian semimartingale. Then, some extensions to include jumps and financial microstructure noise in the observed price process are also presented. The main goal of the thesis is to assess the suitability of the proposed methods …


Automated Sea State Classification From Parameterization Of Survey Observations And Wave-Generated Displacement Data, Jason A. Teichman May 2016

Automated Sea State Classification From Parameterization Of Survey Observations And Wave-Generated Displacement Data, Jason A. Teichman

University of New Orleans Theses and Dissertations

Sea state is a subjective quantity whose accuracy depends on an observer’s ability to translate local wind waves into numerical scales. It provides an analytical tool for estimating the impact of the sea on data quality and operational safety. Tasks dependent on the characteristics of local sea surface conditions often require accurate and immediate assessment. An attempt to automate sea state classification using eleven years of ship motion and sea state observation data is made using parametric modeling of distribution-based confidence and tolerance intervals and a probabilistic model using sea state frequencies. Models utilizing distribution intervals are not able to …


Takens Theorem With Singular Spectrum Analysis Applied To Noisy Time Series, Thomas K. Torku May 2016

Takens Theorem With Singular Spectrum Analysis Applied To Noisy Time Series, Thomas K. Torku

Electronic Theses and Dissertations

The evolution of big data has led to financial time series becoming increasingly complex, noisy, non-stationary and nonlinear. Takens theorem can be used to analyze and forecast nonlinear time series, but even small amounts of noise can hopelessly corrupt a Takens approach. In contrast, Singular Spectrum Analysis is an excellent tool for both forecasting and noise reduction. Fortunately, it is possible to combine the Takens approach with Singular Spectrum analysis (SSA), and in fact, estimation of key parameters in Takens theorem is performed with Singular Spectrum Analysis. In this thesis, we combine the denoising abilities of SSA with the Takens …


Integration Of Multi-Platform High-Dimensional Omic Data, Xuebei An May 2016

Integration Of Multi-Platform High-Dimensional Omic Data, Xuebei An

Dissertations & Theses (Open Access)

The development of high-throughput biotechnologies have made data accessible from different platforms, including RNA sequencing, copy number variation, DNA methylation, protein lysate arrays, etc. The high-dimensional omic data derived from different technological platforms have been extensively used to facilitate comprehensive understanding of disease mechanisms and to determine personalized health treatments. Although vital to the progress of clinical research, the high dimensional multi-platform data impose new challenges for data analysis. Numerous studies have been proposed to integrate multi-platform omic data; however, few have efficiently and simultaneously addressed the problems that arise from high dimensionality and complex correlations.

In my dissertation, I …


Construction Of Pair-Wise Balanced Design, Rajarathinam Arunachalam, Mahalakshmi Sivasubramanian, Dilip Kumar Ghosh May 2016

Construction Of Pair-Wise Balanced Design, Rajarathinam Arunachalam, Mahalakshmi Sivasubramanian, Dilip Kumar Ghosh

Journal of Modern Applied Statistical Methods

A new procedure for construction of pair wise balanced design with equal replication and un-equal block sizes based on factorial design have been evolved. Numerical illustration also provided. It was found that the constructed pair wise balanced design was found to be universal optimal.


Simulation And Application Of Binary Logic Regression Models, Jobany J. Heredia Rico Apr 2016

Simulation And Application Of Binary Logic Regression Models, Jobany J. Heredia Rico

FIU Electronic Theses and Dissertations

Logic regression (LR) is a methodology to identify logic combinations of binary predictors in the form of intersections (and), unions (or) and negations (not) that are linearly associated with an outcome variable. Logic regression uses the predictors as inputs and enables us to identify important logic combinations of independent variables using a computationally efficient tree-based stochastic search algorithm, unlike the classical regression models, which only consider pre-determined conventional interactions (the “and” rules). In the thesis, we focused on LR with a binary outcome in a logistic regression framework. Simulation studies were conducted to examine the performance of LR under the …