Open Access. Powered by Scholars. Published by Universities.®

Multivariate Analysis Commons

Open Access. Powered by Scholars. Published by Universities.®

351 Full-Text Articles 570 Authors 116,029 Downloads 75 Institutions

All Articles in Multivariate Analysis

Faceted Search

351 full-text articles. Page 1 of 14.

Improving The Quality And Design Of Retrospective Clinical Outcome Studies That Utilize Electronic Health Records, Oliwier Dziadkowiec, Jeffery S. Durbin, Vignesh Jayaraman Muralidharan, Megan L. Novak, Brendon T. Cornett 2020 HCA Healthcare Mountain MidAmerica and Continental Divisions

Improving The Quality And Design Of Retrospective Clinical Outcome Studies That Utilize Electronic Health Records, Oliwier Dziadkowiec, Jeffery S. Durbin, Vignesh Jayaraman Muralidharan, Megan L. Novak, Brendon T. Cornett

HCA Healthcare Journal of Medicine

Electronic health records (EHRs) are an excellent source for secondary data analysis. Studies based on EHR-derived data, if designed properly, can answer previously unanswerable clinical research questions. In this paper we will highlight the benefits of large retrospective studies from secondary sources such as EHRs, examine retrospective cohort and case-control study design challenges, as well as methodological and statistical adjustment that can be made to overcome some of the inherent design limitations, in order to increase the generalizability, validity and reliability of the results obtained from these studies.


Demand Forecasting In Wholesale Alcohol Distribution: An Ensemble Approach, Tanvi Arora, Rajat Chandna, Stacy Conant, Bivin Sadler, Robert Slater 2020 Southern Methodist University

Demand Forecasting In Wholesale Alcohol Distribution: An Ensemble Approach, Tanvi Arora, Rajat Chandna, Stacy Conant, Bivin Sadler, Robert Slater

SMU Data Science Review

In this paper, historical data from a wholesale alcoholic beverage distributor was used to forecast sales demand. Demand forecasting is a vital part of the sale and distribution of many goods. Accurate forecasting can be used to optimize inventory, improve cash ow, and enhance customer service. However, demand forecasting is a challenging task due to the many unknowns that can impact sales, such as the weather and the state of the economy. While many studies focus effort on modeling consumer demand and endpoint retail sales, this study focused on demand forecasting from the distributor perspective. An ensemble approach was applied ...


The Expanded View Of Individualism And Collectivism: One, Two, Or Four Dimensions?, Jennifer L. Priestley, Kamal Fatehi, Gita Taasoobshirazi 2020 Kennesaw State University

The Expanded View Of Individualism And Collectivism: One, Two, Or Four Dimensions?, Jennifer L. Priestley, Kamal Fatehi, Gita Taasoobshirazi

Faculty Publications

Recent research to analyze and discuss cultural differences has employed a combination of five major dimensions of individualism–collectivism, power distance, uncertainty avoidance, femininity– masculinity (gender role differentiation), and long-term orientation. Among these dimensions, individualism–collectivism has received the most attention. Chronologically, this cultural attribute has been regarded as one, then two, and more recently, four dimensions of horizontal and vertical individualism and collectivism. However, research on this issue has not been conclusive and some have argued against this expansion. The current study attempts to explain and clarify this discussion by using a shortened version of the scale developed by ...


Interdependence Across Foreign Exchange Rate Markets- A Mixed Copula Approach, Richard Adjei-Boateng 2020 Western Kentucky University

Interdependence Across Foreign Exchange Rate Markets- A Mixed Copula Approach, Richard Adjei-Boateng

Masters Theses & Specialist Projects

The purpose of this thesis is to study the dependence structure of exchange rate pairs using a mixture of copula as opposed to a single copula approach. Mixed copula models have the ability to generate dependence structures that do not belong to existing copula families. The flexibility in choosing component copulas in this mixture model aids the construction of a system that is simultaneously parsimonious and flexible enough to generate most dependence patterns in exchange rate data. Furthermore, the method of mixture copulas facilitates the separation of both the structure and degree of dependence, concepts that are respectively embodied in ...


Nanoindentation Characterization Of Elastic Properties Of Shales And Swelling Clay Minerals, Shengmin Luo 2020 University of Massachusetts Amherst

Nanoindentation Characterization Of Elastic Properties Of Shales And Swelling Clay Minerals, Shengmin Luo

Doctoral Dissertations

Oil and gas shales are a class of multiscale, multiphase, hybrid inorganic-organic sedimentary rocks that consist of a generally uniform, preferentially oriented clay matrix with randomly embedded silt and sand particles as solid inclusions. A thorough understanding of the mechanical properties of shales is crucial for the exploration and production of oil and gas in the unconventional shale reservoirs, but it can be a challenging task due to their nature of compositional heterogeneity and microstructural anisotropy. In efforts to better characterize the mechanical properties of shales across different length scales and to fundamentally understand the laws of upscaling from individual ...


Characterizing Uncertainty In Correlated Response Variables For Pareto Front Optimization, Peter A. Calhoun 2020 Air Force Institute of Technology

Characterizing Uncertainty In Correlated Response Variables For Pareto Front Optimization, Peter A. Calhoun

Theses and Dissertations

Current research provides a method to incorporate uncertainty into Pareto front optimization by simulating additional response surface model parameters according to a Multivariate Normal Distribution (MVN). This research shows that analogous to the univariate case, the MVN understates uncertainty, leading to overconfident conclusions when variance is not known and there are few observations (less than 25-30 per response). This research builds upon current methods using simulated response surface model parameters that are distributed according to an Multivariate t-Distribution (MVT), which can be shown to produce a more accurate inference when variance is not known. The MVT better addresses uncertainty in ...


Quantitative Model For Setting Manufacturer's Suggested Retail Price, Peter Byrd, Jonathan Knowles, Dmitry Andreev, Jacob Turner, Brian Mente, LaRoux Wallace 2020 Southern Methodist University

Quantitative Model For Setting Manufacturer's Suggested Retail Price, Peter Byrd, Jonathan Knowles, Dmitry Andreev, Jacob Turner, Brian Mente, Laroux Wallace

SMU Data Science Review

In this paper, we present a quantitative approach to model the manufacturer’s suggested retail price (MSRP) for children’s doll- houses and establish relationships among key features that contribute most to establishing MSRP. Determination of the MSRP is a critical step in how consumers respond with their wallets when purchasing an item. KidKraft, a global leader in toys and juvenile products, sets MSRP subjectively using product experts. The process is arduous and time consuming requiring the focus of specialized resources and knowledge of the interaction between key attributes and their impact on consumer value. An accurate prediction of MSRP ...


An Assessment Of Convergence In The Feeding Morphology Of Xiphactinus Audax And Megalops Atlanticus Using Landmark-Based Geometric Morphometrics, Edward Chase Shelburne 2020 Fort Hays State University

An Assessment Of Convergence In The Feeding Morphology Of Xiphactinus Audax And Megalops Atlanticus Using Landmark-Based Geometric Morphometrics, Edward Chase Shelburne

Master's Theses

Convergence is an evolutionary phenomenon wherein distantly related organisms independently develop features or functional adaptations to overcome similar environmental constraints. Historically, convergence among organisms has been speculated or asserted with little rigorous or quantitative investigation. More recent advancements in systematics has allowed for the detection and study of convergence in a phylogenetic context, but this does little to elucidate convergent anatomical features in extinct taxa with poorly understood evolutionary histories. The purpose of this study is to investigate one potentially convergent system—the feeding structure of Xiphactinus audax (Teleostei: Ichthyodectiformes) and Megalops atlanticus (Teleostei: Elopiformes)—using a comparative anatomical approach ...


A Geochemical And Statistical Investigation Of The Big Four Springs Region In Southern Missouri, Jordan Jasso Vega 2020 Missouri State University

A Geochemical And Statistical Investigation Of The Big Four Springs Region In Southern Missouri, Jordan Jasso Vega

MSU Graduate Theses

The Big Four Springs region hosts four major first-order magnitude springs in southern Missouri and northern Arkansas. These springs are Big Spring (Carter County, MO), Greer Spring (Oregon County, MO), Mammoth Spring (Fulton County, AR), and Hodgson Mill Spring (Ozark County, MO). Based on historic dye traces and hydrogeological investigations, these springs drain an area of approximately 1500 square miles and collectively discharge an average of 780 million gallons of water per day. The rocks from youngest to oldest that are found in Big Four Springs region are the Cotter and Jefferson City Dolomite (Ordovician), Roubidoux Formation (Ordovician), Gasconade Dolomite ...


Theory Of Principal Components For Applications In Exploratory Crime Analysis And Clustering, Daniel Silva 2020 Minnesota State University, Mankato

Theory Of Principal Components For Applications In Exploratory Crime Analysis And Clustering, Daniel Silva

All Graduate Theses, Dissertations, and Other Capstone Projects

The purpose of this paper is to develop the theory of principal components analysis succinctly from the fundamentals of matrix algebra and multivariate statistics. Principal components analysis is sometimes used as a descriptive technique to explain the variance-covariance or correlation structure of a dataset. However, most often, it is used as a dimensionality reduction technique to visualize a high dimensional dataset in a lower dimensional space. Principal components analysis accomplishes this by using the first few principal components, provided that they account for a substantial proportion of variation in the original dataset. In the same way, the first few principal ...


Conformal Prediction Intervals For Neural Networks Using Cross Validation, Saeed Khaki 2020 Iowa State University

Conformal Prediction Intervals For Neural Networks Using Cross Validation, Saeed Khaki

Creative Components

Neural networks are among the most powerful nonlinear models used to address supervised learning problems. Similar to most machine learning algorithms, neural networks produce point predictions and do not provide any prediction interval which includes an unobserved response value with a specified probability. In this creative component, we propose the k-fold prediction interval method to construct prediction intervals for neural networks based on k-fold cross validation. Simulation studies and analysis of 10 real datasets are used to compare the finite-sample properties of the prediction intervals produced by the proposed method and the split conformal (SC) method. The results suggest that ...


Three Essays On Health Economics And Policy Evaluation, Shishir Shakya 2020 West Virginia University

Three Essays On Health Economics And Policy Evaluation, Shishir Shakya

Graduate Theses, Dissertations, and Problem Reports

This dissertation consists of three essays on the U.S. Health care policy. Each paragraph below refers to the three abstracts for the three chapters in this dissertation, respectively. I provide quantitative evidence on how much Prescription Drug Monitoring Programs (PDMPs) affects the retail opioid prescribing behaviors. Using the American Community Survey (ACS), I retrieve county-level high dimensional panel data set from 2010 to 2017. I employ three separate identification strategies: difference-in-difference, double selection post-LASSO, and spatial difference-in-difference. I compare how the retail opioid prescribing behaviors of counties, that are mandatory for prescribers to check the PDMP before prescribing controlled ...


Process Based Analysis Of Fluvial Stratigraphic Record: Middle Pennsylvanian Allegheny Formation, North-Central Wv, Oluwasegun O. Abatan 2020 West Virginia University

Process Based Analysis Of Fluvial Stratigraphic Record: Middle Pennsylvanian Allegheny Formation, North-Central Wv, Oluwasegun O. Abatan

Graduate Theses, Dissertations, and Problem Reports

Fluvial deposits represent some of the best hydrocarbon reservoirs, but the quality of fluvial reservoirs varies depending on the reservoir architecture, which is controlled by allogenic and autogenic processes. Allogenic controls, including paleoclimate, tectonics, and glacio-eustasy, have long been debated as dominant controls in the deposition of fluvial strata. However, recent research has questioned the validity of this cyclicity and may indicate major influence from autogenic controls. To further investigate allogenic controls on stratal order, I analyzed the facies architecture, geomorphology, paleohydrology, and the stratigraphic framework of the Middle Pennsylvanian Allegheny Formation (MPAF), a fluvial depositional system in the Appalachian ...


Generalized Matrix Decomposition Regression: Estimation And Inference For Two-Way Structured Data, Yue Wang, Ali Shojaie, Tim Randolph, Jing Ma 2019 University of Washington

Generalized Matrix Decomposition Regression: Estimation And Inference For Two-Way Structured Data, Yue Wang, Ali Shojaie, Tim Randolph, Jing Ma

UW Biostatistics Working Paper Series

Analysis of two-way structured data, i.e., data with structures among both variables and samples, is becoming increasingly common in ecology, biology and neuro-science. Classical dimension-reduction tools, such as the singular value decomposition (SVD), may perform poorly for two-way structured data. The generalized matrix decomposition (GMD, Allen et al., 2014) extends the SVD to two-way structured data and thus constructs singular vectors that account for both structures. While the GMD is a useful dimension-reduction tool for exploratory analysis of two-way structured data, it is unsupervised and cannot be used to assess the association between such data and an outcome of ...


Statistical Inference For Networks Of High-Dimensional Point Processes, Xu Wang, Mladen Kolar, Ali Shojaie 2019 University of Washington - Seattle Campus

Statistical Inference For Networks Of High-Dimensional Point Processes, Xu Wang, Mladen Kolar, Ali Shojaie

UW Biostatistics Working Paper Series

Fueled in part by recent applications in neuroscience, high-dimensional Hawkes process have become a popular tool for modeling the network of interactions among multivariate point process data. While evaluating the uncertainty of the network estimates is critical in scientific applications, existing methodological and theoretical work have only focused on estimation. To bridge this gap, this paper proposes a high-dimensional statistical inference procedure with theoretical guarantees for multivariate Hawkes process. Key to this inference procedure is a new concentration inequality on the first- and second-order statistics for integrated stochastic processes, which summarizes the entire history of the process. We apply this ...


Function Space Tensor Decomposition And Its Application In Sports Analytics, Justin Reising 2019 East Tennessee State University

Function Space Tensor Decomposition And Its Application In Sports Analytics, Justin Reising

Electronic Theses and Dissertations

Recent advancements in sports information and technology systems have ushered in a new age of applications of both supervised and unsupervised analytical techniques in the sports domain. These automated systems capture large volumes of data points about competitors during live competition. As a result, multi-relational analyses are gaining popularity in the field of Sports Analytics. We review two case studies of dimensionality reduction with Principal Component Analysis and latent factor analysis with Non-Negative Matrix Factorization applied in sports. Also, we provide a review of a framework for extending these techniques for higher order data structures. The primary scope of this ...


How To Read And Interpret The Results Of A Bayesian Network Meta-Analysis: A Short Tutorial, Dapeng Hu, Annette M. O’Connor, Charlotte B. Winder, Jan M. Sargeant, Chong Wang 2019 Iowa State University

How To Read And Interpret The Results Of A Bayesian Network Meta-Analysis: A Short Tutorial, Dapeng Hu, Annette M. O’Connor, Charlotte B. Winder, Jan M. Sargeant, Chong Wang

Veterinary Diagnostic and Production Animal Medicine Publications

In this manuscript we use realistic data to conduct a network meta-analysis using a Bayesian approach to analysis. The purpose of this manuscript is to explain, in lay terms, how to interpret the output of such an analysis. Many readers are familiar with the forest plot as an approach to presenting the results of a pairwise meta-analysis. However when presented with the results of network meta-analysis, which often does not include the forest plot, the output and results can be difficult to understand. Further, one of the advantages of Bayesian network meta-analyses is in the novel outputs such as treatment ...


#46 - America's Response To President Trump's Tweets, Amanda Friend 2019 University of West Georgia

#46 - America's Response To President Trump's Tweets, Amanda Friend

Georgia Undergraduate Research Conference (GURC)

Purpose: The purpose of the research throughout this study was to examine Trump’s tweets during the first six months he was in office. Due to Trump using Twitter as his main form of communication it is important for journalists and individuals to follow his tweets.

Research Questions: The analysis covers how many times people shared positive or negative tweets and if people shared more issue based tweets. This study emphasizes President Trump’s most popular tweets and how people responded to his first six months on Twitter.

Method: The tweets were coded with a key using content analysis to ...


Classification Of Coronary Artery Disease In Non-Diabetic Patients Using Artificial Neural Networks, Demond Handley 2019 Illinois State University

Classification Of Coronary Artery Disease In Non-Diabetic Patients Using Artificial Neural Networks, Demond Handley

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Identifying Risk Factors Related To Premature Birth Through Binary Logistic And Proportional Odds Ordinal Logistic Regression, Clayton Elwood 2019 Duquesne University

Identifying Risk Factors Related To Premature Birth Through Binary Logistic And Proportional Odds Ordinal Logistic Regression, Clayton Elwood

Electronic Theses and Dissertations

Premature birth has been identified as the single greatest cause of death worldwide in children under the age of five. This thesis will implement binary logistic regression and proportional odds ordinal logistic regression to predict different levels of premature birth and identify associated risk factors. The models will be built from the Center for Disease Control and Prevention's 2014 Vital Statistics Natality Birth Data containing nearly 4 million live births within the United States. Odds ratios and confidence intervals on risk factors were produced utilizing binary logistic regression.


Digital Commons powered by bepress