Open Access. Powered by Scholars. Published by Universities.®

Multivariate Analysis Commons

Open Access. Powered by Scholars. Published by Universities.®

394 Full-Text Articles 623 Authors 207,880 Downloads 83 Institutions

All Articles in Multivariate Analysis

Faceted Search

394 full-text articles. Page 3 of 16.

A Geochemical And Statistical Investigation Of The Big Four Springs Region In Southern Missouri, Jordan Jasso Vega 2020 Missouri State University

A Geochemical And Statistical Investigation Of The Big Four Springs Region In Southern Missouri, Jordan Jasso Vega

MSU Graduate Theses

The Big Four Springs region hosts four major first-order magnitude springs in southern Missouri and northern Arkansas. These springs are Big Spring (Carter County, MO), Greer Spring (Oregon County, MO), Mammoth Spring (Fulton County, AR), and Hodgson Mill Spring (Ozark County, MO). Based on historic dye traces and hydrogeological investigations, these springs drain an area of approximately 1500 square miles and collectively discharge an average of 780 million gallons of water per day. The rocks from youngest to oldest that are found in Big Four Springs region are the Cotter and Jefferson City Dolomite (Ordovician), Roubidoux Formation (Ordovician), Gasconade Dolomite …


Empirical Modeling Of Used Nuclear Fuel Radiation Emissions For Safeguards Purposes, Amanda M. Bachmann 2020 University of Tennessee, Knoxville

Empirical Modeling Of Used Nuclear Fuel Radiation Emissions For Safeguards Purposes, Amanda M. Bachmann

Masters Theses

For nuclear nonproliferation safeguards, the ability to characterize used nuclear fuel (UNF) is a vital process. Fuel characterization allows for independent verification by inspectors of operator declarations of the special nuclear material flow and nuclear related activities within a facility, and an estimation of fissile material remaining in a fuel assembly. Current methods to verify this information rely heavily on non-destructive assay techniques, such as gamma spectroscopy and neutron detection measurements. While these measurements are effective tools for estimating a specific characteristic of the fuel, such as burnup or cooling time, they often require an accurate estimation of a select …


Improving The Quality And Design Of Retrospective Clinical Outcome Studies That Utilize Electronic Health Records, Oliwier Dziadkowiec, Jeffery Durbin, Vignesh Jayaraman Muralidharan, Megan Novak, Brendon Cornett 2020 HCA Healthcare Mountain MidAmerica and Continental Divisions

Improving The Quality And Design Of Retrospective Clinical Outcome Studies That Utilize Electronic Health Records, Oliwier Dziadkowiec, Jeffery Durbin, Vignesh Jayaraman Muralidharan, Megan Novak, Brendon Cornett

HCA Healthcare Journal of Medicine

Electronic health records (EHRs) are an excellent source for secondary data analysis. Studies based on EHR-derived data, if designed properly, can answer previously unanswerable clinical research questions. In this paper we will highlight the benefits of large retrospective studies from secondary sources such as EHRs, examine retrospective cohort and case-control study design challenges, as well as methodological and statistical adjustment that can be made to overcome some of the inherent design limitations, in order to increase the generalizability, validity and reliability of the results obtained from these studies.


Applications Of Portable Libs For Actinide Analysis, Ashwin P. Rao, John D. Auxier II, Dung Vu, Michael B. Shattan 2020 Air Force Institute of Technology

Applications Of Portable Libs For Actinide Analysis, Ashwin P. Rao, John D. Auxier Ii, Dung Vu, Michael B. Shattan

Faculty Publications

A portable LIBS device was used for rapid elemental impurity analysis of plutonium alloys. This device demonstrates the potential for fast, accurate in-situ chemical analysis and could significantly reduce the fabrication time of plutonium alloys.


Learning Networks With Categorical Data Using Distance Correlation, And A Novel Graph-Based Multivariate Test, Jian Tinker 2020 University of Arkansas, Fayetteville

Learning Networks With Categorical Data Using Distance Correlation, And A Novel Graph-Based Multivariate Test, Jian Tinker

Graduate Theses and Dissertations

We study the use of distance correlation for statistical inference on categorical data, especially the induction of probability networks. Szekely et al. first defined distance correlation for continuous variables in [42], and Zhang translated the concept into the categorical setting in [57] by defining dCor(X,Y) for categorical variables X = (x1,...,xI) and Y = (y1,...,yJ) where P(X=xi)=[pi]i and P(Y=yi)=[pi]j with the formula [Please open the document]

Part I of the dissertation covers the background we need to understand this formula, and prepares us to analyze the properties and performance of its applications.

Part II then presents the main results of …


Effect Of Predictor Dependence On Variable Selection For Linear And Log-Linear Regression, Apu Chandra Das 2020 University of Arkansas, Fayetteville

Effect Of Predictor Dependence On Variable Selection For Linear And Log-Linear Regression, Apu Chandra Das

Graduate Theses and Dissertations

We propose a Bayesian approach to the Dirichlet-Multinomial (DM) regression model, which uses horseshoe, Laplace, and horseshoe plus priors for shrinkage and selection. The Dirichlet-Multinomial model can be used to find the significant association between a set of available covariates and taxa for a microbiome sample. We incorporate the covariates in a log-linear regression framework. We design a simulation study to make a comparison among the performance of the three shrinkage priors in terms of estimation accuracy and the ability to detect true signals. Our results have clearly separated the performance of the three priors and indicated that the horseshoe …


Assessing Differential Item Functioning In The Perceived Stress Scale, Nana Amma Berko Asamoah 2020 University of Arkansas, Fayetteville

Assessing Differential Item Functioning In The Perceived Stress Scale, Nana Amma Berko Asamoah

Graduate Theses and Dissertations

When an item on a test functions differently for subgroups of respondents with respect to an exogenous variable (or covariate) after conditioning on the latent variable of interest, the item is said to exhibit Differential Item Functioning (DIF). The 10-item Perceived Stress Scale (PSS10) is administered to respondents via MTurk to quantify “perceived stress” and identify if items on the scale function differently for specific subgroups defined by age, sex, race, marital status, number of children, employment status and social media usage.

The purpose of this study was to compare traditional DIF detection approaches (Mantel-Haenszel, logistic regression, likelihood ratio test …


Chemostratigraphy Of Carbonate Gravity Flows Of The Wolfcamp Formation In Crockett County, Midland Basin, Texas, Alex Blizzard, Julie Bloxson 2020 Stephen F Austin State University

Chemostratigraphy Of Carbonate Gravity Flows Of The Wolfcamp Formation In Crockett County, Midland Basin, Texas, Alex Blizzard, Julie Bloxson

Electronic Theses and Dissertations

Sediment gravity flows into deep-water environments are important stratigraphic traps in lithologically diverse reservoirs generating multiple plays for hydrocarbon exploration. These highly heterogeneous deposits can be studied by utilizing chemostratigraphy and higher-order sequence stratigraphy; being an accurate method for reservoir characterization. Studying these gravity flows along a carbonate platform’s slope can further expand an understanding of the stratigraphy that is filling adjacent basins. The application of elemental analyses can support in identifying mineralogy that impact reservoir quality, especially when conventional testing cannot be applied.

This study utilizes five cores containing the Wolfcamp Formation from the southeastern slope of the Central …


Forecasting Daily Stock Market Return With Multiple Linear Regression, Shengxuan Chen 2020 Louisiana Tech University

Forecasting Daily Stock Market Return With Multiple Linear Regression, Shengxuan Chen

Mathematics Senior Capstone Papers

The purpose of this project is to use data mining and big data analytic techniques to forecast daily stock market return with multiple linear regression. Using mathematical and statistical models to analyze the stock market is important and challenging. The accuracy of the final results relies on the quality of the input data and the validity of the methodology. In the report, within 5-year period, the data regarding eleven financial and economical features are observed and recorded on each trading day. After preprocessing the raw data with statistical method, we use the multiple linear regression to predict the daily return …


Demand Forecasting In Wholesale Alcohol Distribution: An Ensemble Approach, Tanvi Arora, Rajat Chandna, Stacy Conant, Bivin Sadler, Robert Slater 2020 Southern Methodist University

Demand Forecasting In Wholesale Alcohol Distribution: An Ensemble Approach, Tanvi Arora, Rajat Chandna, Stacy Conant, Bivin Sadler, Robert Slater

SMU Data Science Review

In this paper, historical data from a wholesale alcoholic beverage distributor was used to forecast sales demand. Demand forecasting is a vital part of the sale and distribution of many goods. Accurate forecasting can be used to optimize inventory, improve cash ow, and enhance customer service. However, demand forecasting is a challenging task due to the many unknowns that can impact sales, such as the weather and the state of the economy. While many studies focus effort on modeling consumer demand and endpoint retail sales, this study focused on demand forecasting from the distributor perspective. An ensemble approach was applied …


Data-Driven Investment Decisions In P2p Lending: Strategies Of Integrating Credit Scoring And Profit Scoring, Yan Wang 2020 Kennesaw State University

Data-Driven Investment Decisions In P2p Lending: Strategies Of Integrating Credit Scoring And Profit Scoring, Yan Wang

Analytics and Data Science Dissertations

In this dissertation, we develop and discuss several loan evaluation methods to guide the investment decisions for peer-to-peer (P2P) lending. In evaluating loans, credit scoring and profit scoring are the two widely utilized approaches. Credit scoring aims at minimizing the risk while profit scoring aims at maximizing the profit. This dissertation addresses the strengths and weaknesses of each scoring method by integrating them in various ways in order to provide the optimal investment suggestions for different investors. Before developing the methods for loan evaluation at the individual level, we applied the state-of-the-art method called the Long Short Term Memory (LSTM) …


The Expanded View Of Individualism And Collectivism: One, Two, Or Four Dimensions?, Jennifer L. Priestley, Kamal Fatehi, Gita Taasoobshirazi 2020 Kennesaw State University

The Expanded View Of Individualism And Collectivism: One, Two, Or Four Dimensions?, Jennifer L. Priestley, Kamal Fatehi, Gita Taasoobshirazi

Faculty Publications

Recent research to analyze and discuss cultural differences has employed a combination of five major dimensions of individualism–collectivism, power distance, uncertainty avoidance, femininity– masculinity (gender role differentiation), and long-term orientation. Among these dimensions, individualism–collectivism has received the most attention. Chronologically, this cultural attribute has been regarded as one, then two, and more recently, four dimensions of horizontal and vertical individualism and collectivism. However, research on this issue has not been conclusive and some have argued against this expansion. The current study attempts to explain and clarify this discussion by using a shortened version of the scale developed by Singelis et …


Interdependence Across Foreign Exchange Rate Markets- A Mixed Copula Approach, Richard Adjei-Boateng 2020 Western Kentucky University

Interdependence Across Foreign Exchange Rate Markets- A Mixed Copula Approach, Richard Adjei-Boateng

Masters Theses & Specialist Projects

The purpose of this thesis is to study the dependence structure of exchange rate pairs using a mixture of copula as opposed to a single copula approach. Mixed copula models have the ability to generate dependence structures that do not belong to existing copula families. The flexibility in choosing component copulas in this mixture model aids the construction of a system that is simultaneously parsimonious and flexible enough to generate most dependence patterns in exchange rate data. Furthermore, the method of mixture copulas facilitates the separation of both the structure and degree of dependence, concepts that are respectively embodied in …


Bayesian Methods For The Assessment Of Reporting Errors For Data-Sparse Population-Periods With Applications To Estimating Mortality, Emily Peterson 2020 University of Massachusetts Amherst

Bayesian Methods For The Assessment Of Reporting Errors For Data-Sparse Population-Periods With Applications To Estimating Mortality, Emily Peterson

Doctoral Dissertations

Population level mortality data is often subject to substantial reporting errors due to misclassification of cause of death, misclassification of death status, or age reporting errors. Accuracy of error-prone data sources can be assessed by comparing such data to gold standard data for the same population-period. We present Bayesian methods for assessing the extent of reporting errors across different population-periods and generalizing those to settings where gold-standard data are lacking. Firstly, we investigate misclassification errors of maternal cause of death reporting in civil registration vital statistics data. We use a Bayesian hierarchical bivariate random-walk model to estimate country-year specific sensitivity …


Nanoindentation Characterization Of Elastic Properties Of Shales And Swelling Clay Minerals, Shengmin Luo 2020 University of Massachusetts Amherst

Nanoindentation Characterization Of Elastic Properties Of Shales And Swelling Clay Minerals, Shengmin Luo

Doctoral Dissertations

Oil and gas shales are a class of multiscale, multiphase, hybrid inorganic-organic sedimentary rocks that consist of a generally uniform, preferentially oriented clay matrix with randomly embedded silt and sand particles as solid inclusions. A thorough understanding of the mechanical properties of shales is crucial for the exploration and production of oil and gas in the unconventional shale reservoirs, but it can be a challenging task due to their nature of compositional heterogeneity and microstructural anisotropy. In efforts to better characterize the mechanical properties of shales across different length scales and to fundamentally understand the laws of upscaling from individual …


Development Of Gaussian Learning Algorithms For Early Detection Of Alzheimer's Disease, Chen Fang 2020 Florida International University

Development Of Gaussian Learning Algorithms For Early Detection Of Alzheimer's Disease, Chen Fang

FIU Electronic Theses and Dissertations

Alzheimer’s disease (AD) is the most common form of dementia affecting 10% of the population over the age of 65 and the growing costs in managing AD are estimated to be $259 billion, according to data reported in the 2017 by the Alzheimer's Association. Moreover, with cognitive decline, daily life of the affected persons and their families are severely impacted. Taking advantage of the diagnosis of AD and its prodromal stage of mild cognitive impairment (MCI), an early treatment may help patients preserve the quality of life and slow the progression of the disease, even though the underlying disease cannot …


Characterizing Uncertainty In Correlated Response Variables For Pareto Front Optimization, Peter A. Calhoun 2020 Air Force Institute of Technology

Characterizing Uncertainty In Correlated Response Variables For Pareto Front Optimization, Peter A. Calhoun

Theses and Dissertations

Current research provides a method to incorporate uncertainty into Pareto front optimization by simulating additional response surface model parameters according to a Multivariate Normal Distribution (MVN). This research shows that analogous to the univariate case, the MVN understates uncertainty, leading to overconfident conclusions when variance is not known and there are few observations (less than 25-30 per response). This research builds upon current methods using simulated response surface model parameters that are distributed according to an Multivariate t-Distribution (MVT), which can be shown to produce a more accurate inference when variance is not known. The MVT better addresses uncertainty in …


Quantitative Model For Setting Manufacturer's Suggested Retail Price, Peter Byrd, Jonathan Knowles, Dmitry Andreev, Jacob Turner, Brian Mente, LaRoux Wallace 2020 Southern Methodist University

Quantitative Model For Setting Manufacturer's Suggested Retail Price, Peter Byrd, Jonathan Knowles, Dmitry Andreev, Jacob Turner, Brian Mente, Laroux Wallace

SMU Data Science Review

In this paper, we present a quantitative approach to model the manufacturer’s suggested retail price (MSRP) for children’s doll- houses and establish relationships among key features that contribute most to establishing MSRP. Determination of the MSRP is a critical step in how consumers respond with their wallets when purchasing an item. KidKraft, a global leader in toys and juvenile products, sets MSRP subjectively using product experts. The process is arduous and time consuming requiring the focus of specialized resources and knowledge of the interaction between key attributes and their impact on consumer value. An accurate prediction of MSRP during the …


Zero-Inflated Longitudinal Mixture Model For Stochastic Radiographic Lung Compositional Change Following Radiotherapy Of Lung Cancer, Viviana A. Rodríguez Romero 2020 Virginia Commonwealth University

Zero-Inflated Longitudinal Mixture Model For Stochastic Radiographic Lung Compositional Change Following Radiotherapy Of Lung Cancer, Viviana A. Rodríguez Romero

Theses and Dissertations

Compositional data (CD) is mostly analyzed as relative data, using ratios of components, and log-ratio transformations to be able to use known multivariable statistical methods. Therefore, CD where some components equal zero represent a problem. Furthermore, when the data is measured longitudinally, observations are spatially related and appear to come from a mixture population, the analysis becomes highly complex. For this matter, a two-part model was proposed to deal with structural zeros in longitudinal CD using a mixed-effects model. Furthermore, the model has been extended to the case where the non-zero components of the vector might a two component mixture …


Moment Kernels For T-Central Subspace, Weihang Ren 2020 University of Kentucky

Moment Kernels For T-Central Subspace, Weihang Ren

Theses and Dissertations--Statistics

The T-central subspace allows one to perform sufficient dimension reduction for any statistical functional of interest. We propose a general estimator using a third moment kernel to estimate the T-central subspace. In particular, in this dissertation we develop sufficient dimension reduction methods for the central mean subspace via the regression mean function and central subspace via Fourier transform, central quantile subspace via quantile estimator and central expectile subsapce via expectile estima- tor. Theoretical results are established and simulation studies show the advantages of our proposed methods.


Digital Commons powered by bepress