Open Access. Powered by Scholars. Published by Universities.®

Multivariate Analysis Commons

Open Access. Powered by Scholars. Published by Universities.®

528 Full-Text Articles 779 Authors 249,635 Downloads 98 Institutions

All Articles in Multivariate Analysis

Faceted Search

528 full-text articles. Page 1 of 19.

A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang 2024 Chapman University

A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang

Computational and Data Sciences (PhD) Dissertations

This research introduces an analytical improvement to the Multivariate Ljung-Box test that addresses significant deviations of the original test from the nominal Type I error rates under almost all scenarios. Prior attempts to mitigate this issue have been directed at modification of the test statistics or correction of the test distribution to achieve precise results in finite samples. In previous studies, focused on designing corrections to the univariate Ljung-Box, a method that specifically adjusts the test rejection region has been the most successful of attaining the best Type I error rates. We adopt the same approach for the more complex, …


Factors Predictive Of The Development Of Surgical Site Infection In Thyroidectomy, A Replication Study Of Myssiorek (2018), Kaitlyn M. Kenig 2024 University of Nebraska Medical Center

Factors Predictive Of The Development Of Surgical Site Infection In Thyroidectomy, A Replication Study Of Myssiorek (2018), Kaitlyn M. Kenig

Capstone Experience

The original study aimed to show that thyroidectomy does not result in surgical site infection (SSI) in most cases, and thus routine prescription of antibiotics is not necessary. The study looked to see what risk factors could predict the incidence of SSI. This would highlight those individuals who were at most risk of developing SSI, and then antibiotics would only be prescribed to these individuals instead of all or most individuals who undergo thyroidectomy.

This study used NSQIP data to look at incidence of SSI and look for risk factors that may be predictive of SSI. Only surgeries that were …


Accurate Estimation Of Ethanol Content In Fruit Juices Using Cielab Color Space And Chemometrics Via Smartphone-Based Digital Image Colorimetry, Chairul Ichsan, Yasir Amrulloh, Desti Erviana 2024 Department of Chemistry, Faculty of Science and Technology, Universitas Islam Negeri Raden Fatah Palembang, Palembang 30252, Indonesia

Accurate Estimation Of Ethanol Content In Fruit Juices Using Cielab Color Space And Chemometrics Via Smartphone-Based Digital Image Colorimetry, Chairul Ichsan, Yasir Amrulloh, Desti Erviana

Makara Journal of Science

This study aims to investigate the optimal color space and chemometric technique for digital image colorimetry to determine ethanol content (% v/v) in apple, orange, and grape juices, using potassium dichromate (K2Cr2O7) under acidic conditions. The accuracy of colorimetric–chemometric integration across various color spaces (RGB, HSV, CIELab, CMYK, CIELuv, CIEXYZ, and CIELch) was benchmarked against UV–Vis spectrophotometry using metrics such as coefficient of determination (R²), mean absolute percentage error (MAPE), and root–mean–squared error (RMSE). Various chemometric techniques (PLS, PCR, MLR, multivariable–SVR, and multivariable NN regression) were evaluated. Results demonstrate that combining the CIELab color …


Principal Component Analysis With Application To Credit Card Data, Eleanor Cain, Semhar Michael, Gary Hatfield 2024 South Dakota State University

Principal Component Analysis With Application To Credit Card Data, Eleanor Cain, Semhar Michael, Gary Hatfield

SDSU Data Science Symposium

Principal Component Analysis (PCA) is a type of dimension reduction technique used in data analysis to process the data before making a model. In general, dimension reduction allows analysts to make conclusions about large data sets by reducing the number of variables while retaining as much information as possible. Using the numerical variables from a data set, PCA aims to compute a smaller set of uncorrelated variables, called principal components, that account for a majority of the variability from the data. The purpose of this poster is to understand PCA as well as perform PCA on a large sample credit …


Session 6: Model-Based Clustering Analysis On The Spatial-Temporal And Intensity Patterns Of Tornadoes, Yana Melnykov, Yingying Zhang, Rong Zheng 2024 University of Alabama - Tuscaloosa

Session 6: Model-Based Clustering Analysis On The Spatial-Temporal And Intensity Patterns Of Tornadoes, Yana Melnykov, Yingying Zhang, Rong Zheng

SDSU Data Science Symposium

Tornadoes are one of the nature’s most violent windstorms that can occur all over the world except Antarctica. Previous scientific efforts were spent on studying this nature hazard from facets such as: genesis, dynamics, detection, forecasting, warning, measuring, and assessing. While we want to model the tornado datasets by using modern sophisticated statistical and computational techniques. The goal of the paper is developing novel finite mixture models and performing clustering analysis on the spatial-temporal and intensity patterns of the tornadoes. To analyze the tornado dataset, we firstly try a Gaussian distribution with the mean vector and variance-covariance matrix represented as …


Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe 2024 University of Central Florida

Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe

Data Science and Data Mining

This project estimates a regression model to predict the superconducting critical temperature based on variables extracted from the superconductor’s chemical formula. The regression model along with the stepwise variable selection gives a reasonable and good predictive model with a lower prediction error (MSE). Variables extracted based on atomic radius, valence, atomic mass and thermal conductivity appeared to have the most contribution to the predictive model.


Imputation Strategies For Different Categories Of Missing Data, Karthik Chalumuri 2024 University of New Hampshire, Durham

Imputation Strategies For Different Categories Of Missing Data, Karthik Chalumuri

Honors Theses and Capstones

Addressing missing data in research is crucial for ensuring the reliability and validity of study findings, yet it remains a significant challenge. This study investigates the impact of missing data on research outcomes and explores the underutilization of existing tools for managing missingness, potentially leading to gaps in critical information with tangible implications for decision-making processes (Dziura et al.).

Focusing on the different categories of missing data—Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR)—this research examines various imputation strategies tailored to each category. Specifically, we compare the efficacy of several model-based imputation methods, …


Measuring The Performance Of Sdgs In Provincial Level Using Regional Sustainable Development Index, Nurafiza Thamrin, Ika Yuni Wulansari, Puguh Bodro Irawan 2023 BPS - Statistics Solok Regency

Measuring The Performance Of Sdgs In Provincial Level Using Regional Sustainable Development Index, Nurafiza Thamrin, Ika Yuni Wulansari, Puguh Bodro Irawan

Journal of Environmental Science and Sustainable Development

Measuring the national and sub-national progress in achieving such globally adopted development agendas as Sustainable Development Goals (SDGs) is particularly challenging due to data availability and compatibility of indicators to measure SDGs, especially in Indonesia. This paper attempts to measure the performance of sustainable development at the regional level in Indonesia by newly constructing a multidimensional composite index called the Regional Sustainable Development Index (RSDI). RSDI comprises four dimensions, covering comprehensive economic, social, environmental, and governance indicators. By applying factor analysis, the paper assesses the uncertainty of RSDI and the sensitivity of its composing indicators, then further investigates the relationship …


Reducing Food Scarcity: The Benefits Of Urban Farming, S.A. Claudell, Emilio Mejia 2023 Brigham Young University

Reducing Food Scarcity: The Benefits Of Urban Farming, S.A. Claudell, Emilio Mejia

Journal of Nonprofit Innovation

Urban farming can enhance the lives of communities and help reduce food scarcity. This paper presents a conceptual prototype of an efficient urban farming community that can be scaled for a single apartment building or an entire community across all global geoeconomics regions, including densely populated cities and rural, developing towns and communities. When deployed in coordination with smart crop choices, local farm support, and efficient transportation then the result isn’t just sustainability, but also increasing fresh produce accessibility, optimizing nutritional value, eliminating the use of ‘forever chemicals’, reducing transportation costs, and fostering global environmental benefits.

Imagine Doris, who is …


Differentiation Of Human, Dog, And Cat Hair Fibers Using Dart Tofms And Machine Learning, Laura Ahumada, Erin R. McClure-Price, Chad Kwong, Edgard O. Espinoza, John Santerre 2023 Southern Methodist University

Differentiation Of Human, Dog, And Cat Hair Fibers Using Dart Tofms And Machine Learning, Laura Ahumada, Erin R. Mcclure-Price, Chad Kwong, Edgard O. Espinoza, John Santerre

SMU Data Science Review

Hair is found in over 90% of crime scenes and has long been analyzed as trace evidence. However, recent reviews of traditional hair fiber analysis techniques, primarily morphological examination, have cast doubt on its reliability. To address these concerns, this study employed machine learning algorithms, specifically Linear Discriminant Analysis (LDA) and Random Forest, on Direct Analysis in Real Time time-of-flight mass spectra collected from human, cat, and dog hair samples. The objective was to develop a chemistry- and statistics-based classification method for unbiased taxonomic identification of hair. The results of the study showed that LDA and Random Forest were highly …


Wavelet Compression As An Observational Operator In Data Assimilation Systems For Sea Surface Temperature, Bradley J. Sciacca 2023 University of New Orleans, New Orleans

Wavelet Compression As An Observational Operator In Data Assimilation Systems For Sea Surface Temperature, Bradley J. Sciacca

University of New Orleans Theses and Dissertations

The ocean remains severely under-observed, in part due to its sheer size. Containing nearly billion of water with most of the subsurface being invisible because water is extremely difficult to penetrate using electromagnetic radiation, as is typically used by satellite measuring instruments. For this reason, most observations of the ocean have very low spatial-temporal coverage to get a broad capture of the ocean’s features. However, recent “dense but patchy” data have increased the availability of high-resolution – low spatial coverage observations. These novel data sets have motivated research into multi-scale data assimilation methods. Here, we demonstrate a new assimilation approach …


Interactions Between Sediment Mechanical Structure And Infaunal Community Structure Following Physical Disturbance, William Cyrus Roger Clemo 2023 University of South Alabama

Interactions Between Sediment Mechanical Structure And Infaunal Community Structure Following Physical Disturbance, William Cyrus Roger Clemo

<strong> Theses and Dissertations </strong>

Shallow, river-influenced coastal sediments are important for global carbon storage and nutrient cycling and provide a habitat for diverse communities of invertebrates (infauna). Elevated bed shear stress from extreme storms can resuspend, transport, and deposit sediments, disrupting the cohesive structure of muds, and sorting and depositing sand eroded from beaches. These physical disruptions can also resuspend or smother infauna, decreasing abundances and changing community structure. Infaunal activities such as burrowing, tube construction, and feeding can impact sediment structure and stability. However, little is known about how physical disturbance impacts short and long-term sediment habitat suitability and whether disturbance-tolerant infauna influence …


The Private Pilot Check Ride: Applying The Spacing Effect Theory To Predict Time To Proficiency For The Practical Test, Michael Scott Harwin 2023 Florida Institute of Technology - Melbourne

The Private Pilot Check Ride: Applying The Spacing Effect Theory To Predict Time To Proficiency For The Practical Test, Michael Scott Harwin

Theses and Dissertations

This study examined the relationship between a set of targeted factors and the total flight time students needed to become ready to take the private pilot check ride. The study was grounded in Ebbinghaus’s (1885/1913/2013) forgetting curve theory and spacing effect, and Ausubel’s (1963) theory of meaningful learning. The research factors included (a) training time to proficiency, which represented the number of training days needed to become check-ride ready; (b) flight training program (Part 61 vs. Part 141); (c) organization offering the training program (2- or 4-year college/university vs. FBO); (d) scheduling policy (mandated vs. student-driven); and demographical variables, which …


Expansionary Fiscal Contraction Hypothesis: An Evidence From Pakistan, Aisha Irum 2023 Pakistan Institute of Development Economics (PIDE), Islamabad Researcher, NIPS

Expansionary Fiscal Contraction Hypothesis: An Evidence From Pakistan, Aisha Irum

CBER Conference

The fiscal sector in Pakistan has been facing mule-layered challenges over several years. One of the reasons is the stubborn and unproductive nature of its public expenditure, and the other one is the lower tax revenues. This issue of hovering fiscal deficit is mostly dealt with the tools of fiscal contraction/austerity which can have a potential impact on the private sector of the economy. Thus, the question which has been addressed in this study is whether the Expansionary Fiscal Contraction (EFC) hypothesis holds in case of Pakistan. Fiscal contraction episodes have been identified using growth in the growth rates of …


Nonparametric Derivative Estimation Using Penalized Splines: Theory And Application, Bright Antwi Boasiako 2023 University of Massachusetts Amherst

Nonparametric Derivative Estimation Using Penalized Splines: Theory And Application, Bright Antwi Boasiako

Doctoral Dissertations

This dissertation is in the field of Nonparametric Derivative Estimation using
Penalized Splines. It is conducted in two parts. In the first part, we study the L2
convergence rates of estimating derivatives of mean regression functions using penalized splines. In 1982, Stone provided the optimal rates of convergence for estimating derivatives of mean regression functions using nonparametric methods. Using these rates, Zhou et. al. in their 2000 paper showed that the MSE of derivative estimators based on regression splines approach zero at the optimal rate of convergence. Also, in 2019, Xiao showed that, under some general conditions, penalized spline estimators …


Parameter Estimation For Normally Distributed Grouped Data And Clustering Single-Cell Rna Sequencing Data Via The Expectation-Maximization Algorithm, Zahra Aghahosseinalishirazi 2023 Western University

Parameter Estimation For Normally Distributed Grouped Data And Clustering Single-Cell Rna Sequencing Data Via The Expectation-Maximization Algorithm, Zahra Aghahosseinalishirazi

Electronic Thesis and Dissertation Repository

The Expectation-Maximization (EM) algorithm is an iterative algorithm for finding the maximum likelihood estimates in problems involving missing data or latent variables. The EM algorithm can be applied to problems consisting of evidently incomplete data or missingness situations, such as truncated distributions, censored or grouped observations, and also to problems in which the missingness of the data is not natural or evident, such as mixed-effects models, mixture models, log-linear models, and latent variables. In Chapter 2 of this thesis, we apply the EM algorithm to grouped data, a problem in which incomplete data are evident. Nowadays, data confidentiality is of …


Traditional Vs Machine Learning Approaches: A Comparison Of Time Series Modeling Methods, Miguel E. Bonilla Jr., Jason McDonald, Tamas Toth, Bivin Sadler 2023 Southern Methodist University

Traditional Vs Machine Learning Approaches: A Comparison Of Time Series Modeling Methods, Miguel E. Bonilla Jr., Jason Mcdonald, Tamas Toth, Bivin Sadler

SMU Data Science Review

In recent years, various new Machine Learning and Deep Learning algorithms have been introduced, claiming to offer better performance than traditional statistical approaches when forecasting time series. Studies seeking evidence to support the usage of ML/DL over statistical approaches have been limited to comparing the forecasting performance of univariate, linear time series data. This research compares the performance of traditional statistical-based and ML/DL methods for forecasting multivariate and nonlinear time series.


Geometric Morphometric Analysis Of Modern Viperid Vertebrae Facilitates Identification Of Fossil Specimens, Lance D. Jessee 2023 East Tennessee State University

Geometric Morphometric Analysis Of Modern Viperid Vertebrae Facilitates Identification Of Fossil Specimens, Lance D. Jessee

Electronic Theses and Dissertations

Snake vertebrae are common in the fossil record, whereas cranial remains are generally fragile and rare. Consequently, vertebrae are the most commonly studied fossil element of snakes. However, identification of snake vertebrae can be problematic due to extensive variation. This study utilizes 2-D geometric morphometrics and canonical variates analysis to 1) reveal variation between genera and species and 2) classify vertebrae of modern and fossil eastern North American Agkistrodon and Crotalus. The results show that vertebrae of Agkistrodon and Crotalus can reliably be classified to genus and species using these methods. Based on the statistical analyses, four of the …


A Data-Driven Multi-Regime Approach For Predicting Real-Time Energy Consumption Of Industrial Machines., Abdulgani Kahraman 2023 University of Louisville

A Data-Driven Multi-Regime Approach For Predicting Real-Time Energy Consumption Of Industrial Machines., Abdulgani Kahraman

Electronic Theses and Dissertations

This thesis focuses on methods for improving energy consumption prediction performance in complex industrial machines. Working with real-world industrial machines brings several challenges, including data access, algorithmic bias, data privacy, and the interpretation of machine learning algorithms. To effectively manage energy consumption in the industrial sector, it is essential to develop a framework that enhances prediction performance, reduces energy costs, and mitigates air pollution in heavy industrial machine operations. This study aims to assist managers in making informed decisions and driving the transition towards green manufacturing. The energy consumption of industrial machinery is substantial, and the recent increase in CO2 …


A Multivariate Investigation Of The Motivational, Academic, And Well-Being Characteristics Of First-Generation And Continuing-Generation College Students, Christopher L. Thomas, Staci Zolkoski 2023 The University of Texas at Tyler

A Multivariate Investigation Of The Motivational, Academic, And Well-Being Characteristics Of First-Generation And Continuing-Generation College Students, Christopher L. Thomas, Staci Zolkoski

Journal of Research Initiatives

Prior research has noted differences in motivational, academic, and well-being factors between first-generation and continuing-education students. However, past investigations have primarily overlooked the interactive influence of protective and risk factors when comparing the characteristics of first-generation and continuing-education students. Thus, the current study adopted a multivariate approach to gain a more nuanced understanding of the influence of generational status on students' self-regulated learning capabilities, academic anxiety, sense of belonging, academic barriers, mental health concerns, and satisfaction with life. University students (N = 432, 67.46% Caucasian, 87.55% female, Age = 28.10 ± 9.46) completed the Cognitive Test Anxiety Scale-2nd …


Digital Commons powered by bepress