Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics Commons

Open Access. Powered by Scholars. Published by Universities.®

Social and Behavioral Sciences

2018

Institution
Keyword
Publication
Publication Type

Articles 1 - 30 of 62

Full-Text Articles in Applied Statistics

A Proficient Two-Stage Stratified Randomized Response Strategy, Tanveer A. Tarray, Housila P. Singh Dec 2018

A Proficient Two-Stage Stratified Randomized Response Strategy, Tanveer A. Tarray, Housila P. Singh

Journal of Modern Applied Statistical Methods

A stratified randomized response model based on R. Singh, Singh, Mangat, and Tracy (1995) improved two-stage randomized response strategy is proposed. It has an optimal allocation and large gain in precision. Conditions are obtained under which the proposed model is more efficient than R. Singh et al. (1995) and H. P. Singh and Tarray (2015) models. Numerical illustrations are also given in support of the present study.


Extended Method For Several Dichotomous Covariates To Estimate The Instantaneous Risk Function Of The Aalen Additive Model, Luciane Teixeira Passos Giarola, Mario Javier Ferrua Vivanco, Marcelo Angelo Cirillo, Fortunato Silva Menezes Dec 2018

Extended Method For Several Dichotomous Covariates To Estimate The Instantaneous Risk Function Of The Aalen Additive Model, Luciane Teixeira Passos Giarola, Mario Javier Ferrua Vivanco, Marcelo Angelo Cirillo, Fortunato Silva Menezes

Journal of Modern Applied Statistical Methods

The instantaneous risk function of Aalen’s model is estimated considering dichotomous covariates, using parametric accumulated risk functions to smooth cumulative risk of Aalen by grouping the individuals into sets named parcels. This methodology can be used for data with dichotomous covariates.


Simple Unbalanced Ranked Set Sampling For Mean Estimation Of Response Variable Of Developmental Programs, Girish Chandra, Dinesh S. Bhoj, Rajiv Pandey Dec 2018

Simple Unbalanced Ranked Set Sampling For Mean Estimation Of Response Variable Of Developmental Programs, Girish Chandra, Dinesh S. Bhoj, Rajiv Pandey

Journal of Modern Applied Statistical Methods

An unbalanced ranked set sampling (RSS) procedure on the skewed survey variable is proposed to estimate the population mean of a response variable from the area of developmental programs which are generally implemented under different phases. It is based on the unbalanced RSS under linear impacts of the program and is compared with the estimators based on simple random sampling (SRS) and balanced RSS. It is shown that the relative precision of the proposed estimator is higher than those of the estimators based on SRS and balanced RSS for three chosen skewed distributions of survey variables.


Spatio-Temporal Reconstruction Of Remote Sensing Observations, Kamrul Khan Dec 2018

Spatio-Temporal Reconstruction Of Remote Sensing Observations, Kamrul Khan

Graduate Theses and Dissertations

The USDA Forest Service aims to use satellite imagery for monitoring and predicting changes in forest conditions over time within the country. We specifically focus on a 230, 400 hectares region in north-central Wisconsin between 2003 - 2012. The auxiliary data collected from the satellite imagery of this region are relatively dense in space and time and can be used to efficiently predict how the forest condition changed over that decade. However, these records have a significant proportion of missing values due to weather conditions and system failures. To fill in these missing values, we build spaciotemporal models based on …


The Impact Of Sample Size In Cross-Classified Multiple Membership Multilevel Models, Hyewon Chung, Jiseon Kim, Ryoungsun Park, Hyeonjeong Jean Nov 2018

The Impact Of Sample Size In Cross-Classified Multiple Membership Multilevel Models, Hyewon Chung, Jiseon Kim, Ryoungsun Park, Hyeonjeong Jean

Journal of Modern Applied Statistical Methods

A simulation study was conducted to examine parameter recovery in a cross-classified multiple membership multilevel model. No substantial relative bias was identified for the fixed effect or level-one variance component estimates. However, the level-two cross-classification multiple membership factor variance components were substantially biased with relatively fewer groups.


An Introduction To Psychological Statistics, Garett C. Foster, David Lane, David Scott, Mikki Hebl, Rudy Guerra, Dan Osherson, Heidi Zimmer Nov 2018

An Introduction To Psychological Statistics, Garett C. Foster, David Lane, David Scott, Mikki Hebl, Rudy Guerra, Dan Osherson, Heidi Zimmer

Open Educational Resources Collection

This work has been superseded by Introduction to Statistics in the Psychological Sciences available from https://irl.umsl.edu/oer/25/.

-

We are constantly bombarded by information, and finding a way to filter that information in an objective way is crucial to surviving this onslaught with your sanity intact. This is what statistics, and logic we use in it, enables us to do. Through the lens of statistics, we learn to find the signal hidden in the noise when it is there and to know when an apparent trend or pattern is really just randomness. The study of statistics involves math and relies …


Resource Assessment Report Temperate Demersal Elasmobranch Resource Of Western Australia, Matias Braccini, Nick Blay, S. A. Hesp, Brett Molony Nov 2018

Resource Assessment Report Temperate Demersal Elasmobranch Resource Of Western Australia, Matias Braccini, Nick Blay, S. A. Hesp, Brett Molony

Fisheries research reports

This document provides a cumulative description and assessment of the TDER and all of the fishing activities (i.e. fisheries / fishing sectors) affecting this resource in WA. Future Resource Assessment Reports will assess the Statewide Sharks and Rays Resource. The report is focused on the temperate indicator species (whiskery, gummy, dusky and sandbar sharks) used to assess the suites of demersal sharks and rays that comprise this resource. These species are primarily captured by demersal gillnets used in the TDGDLF that operate in the West Coast and South Coast Bioregions. For the North Coast bioregion, no commercial fishing for sharks …


Essays In Financial Economics: Announcement Effects In Fixed Income Markets, James J. Forest Oct 2018

Essays In Financial Economics: Announcement Effects In Fixed Income Markets, James J. Forest

Doctoral Dissertations

ABSTRACT ESSAYS IN FINANCIAL ECONOMICS: ANNOUNCEMENT EFFECTS IN FIXED INCOME MARKETS PHD IN FINANCE MAY 2018 JAMES J FOREST B.A., FRAMINGHAM STATE UNIVERSITY M.S., NORTHEASTERN UNIVERSITY Ph.D., UNIVERSITY OF MASSACHUSETTS – AMHERST Directed by: Professor Hossein B. Kazemi This dissertation demonstrates the use of empirical techniques for dealing with modeling issues that arise when analyzing announcement effects in fixed income markets. It describes empirical challenges in achieving unbiased and efficient parameter estimates and shows the importance of modelling a wide range of macroeconomic announcement effects to avoid omitted variable bias. Employing techniques common in Macroeconomics, financial market researchers are better …


Using Cyclical Components To Improve The Forecasts Of The Stock Market And Macroeconomic Variables, Kenneth R. Szulczyk, Shibley Sadique Oct 2018

Using Cyclical Components To Improve The Forecasts Of The Stock Market And Macroeconomic Variables, Kenneth R. Szulczyk, Shibley Sadique

Journal of Modern Applied Statistical Methods

Economic variables such as stock market indices, interest rates, and national output measures contain cyclical components. Forecasting methods excluding these cyclical components yield inaccurate out-of-sample forecasts. Accordingly, a three-stage procedure is developed to estimate a vector autoregression (VAR) with cyclical components. A Monte Carlo simulation shows the procedure estimates the parameters accurately. Subsequently, a VAR with cyclical components improves the root-mean-square error of out-of-sample forecasts by 50% for a stock market model with macroeconomic variables.


Comparison Of Multiple Imputation Methods For Categorical Survey Items With High Missing Rates: Application To The Family Life, Activity, Sun, Health And Eating (Flashe) Study, Benmei Liu, Erin Hennessy, April Oh, Laura A. Dwyer, Linda Nebeling Sep 2018

Comparison Of Multiple Imputation Methods For Categorical Survey Items With High Missing Rates: Application To The Family Life, Activity, Sun, Health And Eating (Flashe) Study, Benmei Liu, Erin Hennessy, April Oh, Laura A. Dwyer, Linda Nebeling

Journal of Modern Applied Statistical Methods

Two multiple imputation methods, the Sequential Regression Multivariate Imputation Algorithm and the Cox-Lannacchione Weighted Sequential Hotdeck, were examined and compared to impute highly missing categorical variables from the Family Life, Activity, Sun, Health and Eating (FLASHE) study. This paper describes the imputation approaches and results from the study.


Dealing With Sensitive Quantitative Variables: A Comparison Of Sampling Designs For The Procedure Of Gupta And Thornton, Carlos Narciso Bouza Herrera, Prayas Sharma Sep 2018

Dealing With Sensitive Quantitative Variables: A Comparison Of Sampling Designs For The Procedure Of Gupta And Thornton, Carlos Narciso Bouza Herrera, Prayas Sharma

Journal of Modern Applied Statistical Methods

The use of randomized response procedures allows diminishing the number of non-responses and increasing the accuracy of the responses. A new sampling strategy is developed where the reports are scrambled using the procedure of Gupta and Thornton. The estimator of the mean as well as the errors are developed for the Rao-Hartley-Cochran and Ranked Sets Sampling designs. The proposals are compared with the original model based on the use of simple random sampling.


Bayesian And Semi-Bayesian Estimation Of The Parameters Of Generalized Inverse Weibull Distribution, Kamaljit Kaur, Kalpana K. Mahajan, Sangeeta Arora Sep 2018

Bayesian And Semi-Bayesian Estimation Of The Parameters Of Generalized Inverse Weibull Distribution, Kamaljit Kaur, Kalpana K. Mahajan, Sangeeta Arora

Journal of Modern Applied Statistical Methods

Bayesian and semi-Bayesian estimators of parameters of the generalized inverse Weibull distribution are obtained using Jeffreys’ prior and informative prior under specific assumptions of loss function. Using simulation, the relative efficiency of the proposed estimators is obtained under different set-ups. A real life example is also given.


Yelp’S Review Filtering Algorithm, Yao Yao, Ivelin Angelov, Jack Rasmus-Vorrath, Mooyoung Lee, Daniel W. Engels Aug 2018

Yelp’S Review Filtering Algorithm, Yao Yao, Ivelin Angelov, Jack Rasmus-Vorrath, Mooyoung Lee, Daniel W. Engels

SMU Data Science Review

In this paper, we present an analysis of features influencing Yelp's proprietary review filtering algorithm. Classifying or misclassifying reviews as recommended or non-recommended affects average ratings, consumer decisions, and ultimately, business revenue. Our analysis involves systematically sampling and scraping Yelp restaurant reviews. Features are extracted from review metadata and engineered from metrics and scores generated using text classifiers and sentiment analysis. The coefficients of a multivariate logistic regression model were interpreted as quantifications of the relative importance of features in classifying reviews as recommended or non-recommended. The model classified review recommendations with an accuracy of 78%. We found that reviews …


Of Typicality And Predictive Distributions In Discriminant Function Analysis, Lyle W. Konigsberg, Susan R. Frankenberg Aug 2018

Of Typicality And Predictive Distributions In Discriminant Function Analysis, Lyle W. Konigsberg, Susan R. Frankenberg

Human Biology Open Access Pre-Prints

While discriminant function analysis is an inherently Bayesian method, researchers attempting to estimate ancestry in human skeletal samples often follow discriminant function analysis with the calculation of frequentist-based typicalities for assigning group membership. Such an approach is problematic in that it fails to account for admixture and for variation in why individuals may be classified as outliers, or non-members of particular groups. This paper presents an argument and methodology for employing a fully Bayesian approach in discriminant function analysis applied to cases of ancestry estimation. The approach requires adding the calculation, or estimation, of predictive distributions as the final step …


Generalized Non-Inferential Approach To Modeling Restricted Discrete Choice For The Case Of The Spatial Random Utility, Elena Labzina Aug 2018

Generalized Non-Inferential Approach To Modeling Restricted Discrete Choice For The Case Of The Spatial Random Utility, Elena Labzina

Arts & Sciences Electronic Theses and Dissertations

Multinomial logistic regression model (MNL) is a powerful and easily tractable way for measuring the probabilistic impact of input variables on individual categorical choices. Crucially, the standard MNL assumes that all subjects of the study have the same choice sets. In the meanwhile, especially in political science and economics, this condition is frequently violated. Probably, the most graphical example of varying choice sets (VCS) is partially contested elections. Furthermore, the MNL implicitly implies the Independence of the Irregular Alternatives (IIA) assumption by requiring i.i.d errors that contrasts the MNL and the multinomial probit (MNP) and mixed logit (MXL) models. In …


Pretrial Release And Failure-To-Appear In Mclean County, Il, Jonathan Monsma Jul 2018

Pretrial Release And Failure-To-Appear In Mclean County, Il, Jonathan Monsma

Stevenson Center for Community and Economic Development—Student Research

Actuarial risk assessment tools increasingly have been employed in jurisdictions across the U.S. to assist courts in the decision of whether someone charged with a crime should be detained or released prior to their trial. These tools should be continually monitored and researched by independent 3rd parties to ensure that these powerful tools are being administered properly and used in the most proficient way as to provide socially optimal results. McLean County, Illinois began using the Public Safety Assessment-CourtTM (PSA-Court or simply PSA) risk assessment tool beginning in 2016. This study culls data from the McLean County Jail …


A Distance Based Method For Solving Multi-Objective Optimization Problems, Murshid Kamal, Syed Aqib Jalil, Syed Mohd Muneeb, Irfan Ali Jul 2018

A Distance Based Method For Solving Multi-Objective Optimization Problems, Murshid Kamal, Syed Aqib Jalil, Syed Mohd Muneeb, Irfan Ali

Journal of Modern Applied Statistical Methods

A new model for the weighted method of goal programming is proposed based on minimizing the distances between ideal objectives to feasible objective space. It provides the best compromised solution for Multi Objective Linear Programming Problems (MOLPP). The proposed model tackles MOLPP by solving a series of single objective sub-problems, where the objectives are transformed into constraints. The compromise solution so obtained may be improved by defining priorities in terms of the weight. A criterion is also proposed for deciding the best compromise solution. Applications of the algorithm are discussed for transportation and assignment problems involving multiple and conflicting objectives. …


Goalie Analytics: Statistical Evaluation Of Context-Specific Goalie Performance Measures In The National Hockey League, Marc Naples, Logan Gage, Amy Nussbaum Jul 2018

Goalie Analytics: Statistical Evaluation Of Context-Specific Goalie Performance Measures In The National Hockey League, Marc Naples, Logan Gage, Amy Nussbaum

SMU Data Science Review

In this paper, we attempt to improve upon the classic formulation of save percentage in the NHL by controlling the context of the shots and use alternative measures than save percentage. In particular, we find save percentage to be both a weakly repeatable skill and predictor of future performance, and we seek other goalie performance calculations that are more robust. To do so, we use three primary tests to test intra-season consistency, intra-season predictability, and inter-season consistency, and extend the analysis to disentangle team effects on goalie statistics. We find that there are multiple ways to improve upon classic save …


Estimation Of Finite Population Mean By Using Minimum And Maximum Values In Stratified Random Sampling, Umer Daraz, Javid Shabbir, Hina Khan Jul 2018

Estimation Of Finite Population Mean By Using Minimum And Maximum Values In Stratified Random Sampling, Umer Daraz, Javid Shabbir, Hina Khan

Journal of Modern Applied Statistical Methods

In this paper we have suggested an improved class of ratio type estimators in estimating the finite population mean when information on minimum and maximum values of the auxiliary variable is known. The properties of the suggested class of estimators in terms of bias and mean square error are obtained up to first order of approximation. Two data sets are used for efficiency comparisons.


A Study On Modelling Spatial-Temporal Human Mobility Patterns For Improving Personalized Weather Warning, Yue Xu Jul 2018

A Study On Modelling Spatial-Temporal Human Mobility Patterns For Improving Personalized Weather Warning, Yue Xu

Masters Theses

Understanding human mobility patterns is important for severe weather warning since these patterns can help identify where people are in time and in space when flash floods, tornados, high winds and hurricanes are occurring or are predicted to occur. A GIS (Geographic Information Science) data model was proposed to describe the spatial-temporal human activity. Based on this model, a metric was designed to represent the spatial-temporal activity intensity of human mobility, and an index was generated to quantitatively describe the change in human activities. By analyzing high-resolution human mobility data, the paper verified that human daily mobility patterns could be …


A Bayesian Beta-Mixture Model For Nonparametric Irt (Bbm-Irt), Ethan A. Arenson, George Karabatsos Jul 2018

A Bayesian Beta-Mixture Model For Nonparametric Irt (Bbm-Irt), Ethan A. Arenson, George Karabatsos

Journal of Modern Applied Statistical Methods

Item response models typically assume that the item characteristic (step) curves follow a logistic or normal cumulative distribution function, which are strictly monotone functions of person test ability. Such assumptions can be overly-restrictive for real item response data. A simple and more flexible Bayesian nonparametric IRT model for dichotomous items is introduced, which constructs monotone item characteristic (step) curves by a finite mixture of beta distributions, which can support the entire space of monotone curves to any desired degree of accuracy. An adaptive random-walk Metropolis-Hastings algorithm is proposed to estimate the posterior distribution of the model parameters. The Bayesian IRT …


Robust Estimation And Inference On Current Status Data With Applications To Phase Iv Cancer Trial, Deo Kumar Srivastava, Liang Zhu, Melissa M. Hudson, Jianmin Pan, Shesh N. Rai Jul 2018

Robust Estimation And Inference On Current Status Data With Applications To Phase Iv Cancer Trial, Deo Kumar Srivastava, Liang Zhu, Melissa M. Hudson, Jianmin Pan, Shesh N. Rai

Journal of Modern Applied Statistical Methods

The use of piecewise exponential distributions was proposed by Rai et al. (2013) for analyzing cardiotoxicity data. Some parametric models are proposed, but the focus is on the Weibull distribution, which overcomes the limitation of piecewise exponential.


Australian Herring And West Australian Salmon Scientific Workshop Report, October 2017, Department Of Primary Industries And Regional Development, Western Australia Jul 2018

Australian Herring And West Australian Salmon Scientific Workshop Report, October 2017, Department Of Primary Industries And Regional Development, Western Australia

Fisheries research reports

No abstract provided.


Robust Heteroscedasticity Consistent Covariance Matrix Estimator Based On Robust Mahalanobis Distance And Diagnostic Robust Generalized Potential Weighting Methods In Linear Regression, M. Habshah, Muhammad Sani, Jayanthi Arasan Jun 2018

Robust Heteroscedasticity Consistent Covariance Matrix Estimator Based On Robust Mahalanobis Distance And Diagnostic Robust Generalized Potential Weighting Methods In Linear Regression, M. Habshah, Muhammad Sani, Jayanthi Arasan

Journal of Modern Applied Statistical Methods

The violation of the assumption of homoscedasticity and the presence of high leverage points (HLPs) are common in the use of regression models. The weighted least squares can provide the solution to heteroscedastic regression model if the heteroscedastic error structures are known. Based on Furno (1996), two robust weighting methods are proposed based on HLP detection measures (robust Mahalanobis distance based on minimum volume ellipsoid and diagnostic robust generalized potential based on index set equality (DRGP(ISE)) on robust heteroscedasticity consistent covariance matrix estimators. Results obtained from a simulation study and real data sets indicated the DRGP(ISE) method is superior.


Fitting The Rasch Model Under The Logistic Regression Framework To Reduce Estimation Bias, Tianshu Pan Jun 2018

Fitting The Rasch Model Under The Logistic Regression Framework To Reduce Estimation Bias, Tianshu Pan

Journal of Modern Applied Statistical Methods

This article showed how and why the Rasch model can be fitted under the logistic regression framework. Then a penalized maximum likelihood (Firth 1993) for logistic regression models can also be used to reduce ML biases when fitting the Rasch model. These conclusions are supported by a simulation study.


Internal Consistency Reliability In Measurement: Aggregate And Multilevel Approaches, Georgios Sideridis, Abdullah Saddaawi, Khaleel Al-Harbi Jun 2018

Internal Consistency Reliability In Measurement: Aggregate And Multilevel Approaches, Georgios Sideridis, Abdullah Saddaawi, Khaleel Al-Harbi

Journal of Modern Applied Statistical Methods

The purpose of the present paper was to evaluate the internal consistency reliability of the General Teacher Test assuming clustered and non-clustered data using commercial software (Mplus). Participants were 2,000 testees who were selected using random sampling from a larger pool of examinees (more than 65k). The measure involved four factors, namely: (a) planning for learning, (b) promoting learning, (c) supporting learning, and (d) professional responsibilities and was hypothesized to comprise a unidimensional instrument assessing generalized skills and competencies. Intra-class correlation coefficients and variance ratio statistics suggested the need to incorporate a clustering variable (i.e., university) when evaluating the factor …


Regressions Regularized By Correlations, Stan Lipovetsky Jun 2018

Regressions Regularized By Correlations, Stan Lipovetsky

Journal of Modern Applied Statistical Methods

The regularization of multiple regression by proportionality to correlations of predictors with dependent variable is applied to the least squares objective and normal equations to relax the exact equalities and to get a robust solution. This technique produces models not prone to multicollinearity and is very useful in practical applications.


Optimum Stratification In Bivariate Auxiliary Variables Under Neyman Allocation, Faizan Danish, S.E.H. Rizvi Jun 2018

Optimum Stratification In Bivariate Auxiliary Variables Under Neyman Allocation, Faizan Danish, S.E.H. Rizvi

Journal of Modern Applied Statistical Methods

In several situations complete data set of the study variable is unknown that becomes a stumbling block in various stratification techniques in order to obtain stratification points on two way stratification method. In this paper a technique has been proposed under Neyman allocation when the stratification is done oj the two auxiliary variable having one estimation variable under consideration. Due to complexities created by minimal equations approximate optimum strata boundaries has been obtained. Empirical study has been done to illustrate the proposed method when the auxiliary variables have standard Cauchy and power distributions.


An Explanatory Study On The Non-Parametric Multivariate T2 Control Chart, Abdolrasoul Mostajeran, Nasrolah Iranpanah, Rassoul Noorossana Jun 2018

An Explanatory Study On The Non-Parametric Multivariate T2 Control Chart, Abdolrasoul Mostajeran, Nasrolah Iranpanah, Rassoul Noorossana

Journal of Modern Applied Statistical Methods

Most control charts require the assumption of normal distribution for observations. When distribution is not normal, one can use non-parametric control charts such as sign control chart. A deficiency of such control charts could be the loss of information due to replacing an observation with its sign or rank. Furthermore, because the chart statistics of T2 are correlated, the T2 chart is not a desire performance. Non-parametric bootstrap algorithm could help to calculate control chart parameters using the original observations while no assumption regarding the distribution is needed. In this paper, first, a bootstrap multivariate control chart is …


Moment Generating Functions Of Complementary Exponential-Geometric Distribution Based On K-Th Lower Record Values, Devendra Kumar, Sanku Dey, Mansoor Rashid Malik, Fahad M. Al-Aboud Jun 2018

Moment Generating Functions Of Complementary Exponential-Geometric Distribution Based On K-Th Lower Record Values, Devendra Kumar, Sanku Dey, Mansoor Rashid Malik, Fahad M. Al-Aboud

Journal of Modern Applied Statistical Methods

The complementary exponential-geometric (CEG) distribution is a useful model for analyzing lifetime data. For this distribution, some recurrence relations satisfied by marginal and joint moment generating functions of k-th lower record values were established. They enable the computation of the means, variances, and covariances of k-th lower record values for all sample sizes in a simple and efficient recursive manner. Means, variances, and covariances of lower record values were tabulated from samples of sizes up to 10 for various values of the parameters.