Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

Journal

Institution
Keyword
Publication Year
Publication

Articles 31 - 60 of 1623

Full-Text Articles in Physical Sciences and Mathematics

Exploring Parameter Sensitivity Analysis In Mathematical Modeling With Ordinary Differential Equations, Viktoria Savatorova Oct 2023

Exploring Parameter Sensitivity Analysis In Mathematical Modeling With Ordinary Differential Equations, Viktoria Savatorova

CODEE Journal

This paper presents an exploration into parameter sensitivity analysis in mathematical modeling using ordinary differential equations (ODEs). Taking the first steps in understanding local sensitivity analysis through the direct differential method and global sensitivity analysis using metrics like Pearson, Spearman, PRCC, and Sobol’, we provide readers with a basic understanding of parameter sensitivity analysis for mathematical modeling using ODEs. As an illustrative application, the system of differential equations modeling population dynamics of several fish species with harvest considerations is utilized. The results of employing local and global sensitivity analysis are compared, shedding light on the strengths and limitations of each …


All-Cause And Opioid-Related Mortality Compared Between Traumatic Spinal Cord Injury And The Us General Population, Jaden Whitehead, Beatrice Ugiliweneza Sep 2023

All-Cause And Opioid-Related Mortality Compared Between Traumatic Spinal Cord Injury And The Us General Population, Jaden Whitehead, Beatrice Ugiliweneza

The Cardinal Edge

Individuals with spinal cord injury (SCI) are susceptible to the misuse of opioids due to the introduction of these substances for pain management. There are very few studies examining the relationship between unintentional deaths caused by opioid usage following spinal cord injury. The objective of this study was to evaluate the trend of opioid-related mortality of individuals with spinal cord injury (SCI) over the years and compare these findings to the mortality rates due to opioid misuse in the general population. In this study, we used data provided by the National Spinal Cord Injury Model Systems (NSCIMS) for SCI 1999-2016 …


Reu-Deim Classification Of Hispanic Voters In Hispanic Groups Using Name And Zip Code Data In Palm Beach, Florida, Kamila Soto-Ortiz Sep 2023

Reu-Deim Classification Of Hispanic Voters In Hispanic Groups Using Name And Zip Code Data In Palm Beach, Florida, Kamila Soto-Ortiz

Beyond: Undergraduate Research Journal

When it comes to registering to vote, Hispanic voters can only register as “Hispanic” in the “Race/Ethnicity” category, causing difficulties when analyzing voting trends amongst the Hispanic community. Upon the recent idea that not all Hispanic Groups vote the same, the goal is to create a model that can possibly identify a voter’s Hispanic Group with the information provided on the public Florida voter file. This is accomplished using name and zip code data for all voters in Palm Beach, Florida. This paper will explore the model implemented, its findings and limitations. Palm Beach, Florida, is met with low confidence …


Asymptotic Results For Empirical Processes In Informative Model Of Random Censorship From Both Sides, Abdurakhim Abdushukurov, Dilshod Mansurov Sep 2023

Asymptotic Results For Empirical Processes In Informative Model Of Random Censorship From Both Sides, Abdurakhim Abdushukurov, Dilshod Mansurov

Bulletin of National University of Uzbekistan: Mathematics and Natural Sciences

In the paper, the empirical process in informative model of random censorship from both sides is investigated. For it, the limit Gaussian process with mean zero is founded. Under investigating of empirical process, the characterization properties of the considered informative model is used. The properties of the semiparametric estimator by using methods of numerical modeling are discussed.


Number Of Regions Created By Random Chords In The Circle, Shi Feng Sep 2023

Number Of Regions Created By Random Chords In The Circle, Shi Feng

Rose-Hulman Undergraduate Mathematics Journal

In this paper we discuss the number of regions in a unit circle after drawing n i.i.d. random chords in the circle according to a particular family of distribution. We find that as n goes to infinity, the distribution of the number of regions, properly shifted and scaled, converges to the standard normal distribution and the error can be bounded by Stein's method for proving Central Limit Theorem.


Prediction Of Factors For Patients With Hypertension And Dyslipidemia Using Multilayer Feedforward Neural Networks And Ordered Logistic Regression Analysis: A Robust Hybrid Methodology, Wan Muhamad Amir W Ahmad, Mohamad Nasarudin Bin Adnan, Norhayati Yusop, Hazik Bin Shahzad, Farah Muna Mohamad Ghazali, Nor Azlida Aleng, Nor Farid Mohd Noor Aug 2023

Prediction Of Factors For Patients With Hypertension And Dyslipidemia Using Multilayer Feedforward Neural Networks And Ordered Logistic Regression Analysis: A Robust Hybrid Methodology, Wan Muhamad Amir W Ahmad, Mohamad Nasarudin Bin Adnan, Norhayati Yusop, Hazik Bin Shahzad, Farah Muna Mohamad Ghazali, Nor Azlida Aleng, Nor Farid Mohd Noor

Makara Journal of Health Research

Background: Hypertension is characterized by abnormally high arterial blood pressure and is a public health problem with a high prevalence of 20%–30% worldwide. This research combined multiple logistic regression (MLR) and multilayer feedforward neural networks to construct and validate a model for evaluating the factors linked with hypertension in patients with dyslipidemia.

Methods: A total of 1000 data entries from Hospital Universiti Sains Malaysia and advanced computational statistical modeling methodologies were used to evaluate seven traits associated with hypertension. R-Studio software was utilized. Each sample's statistics were calculated using a hybrid model that included bootstrapping.

Results: Variable …


Healthy Lifestyle Behaviors And Sociodemographic Characteristics Among Medical Students In Indonesia During The New Normal Era: A Cross-Sectional Study, Sharren Shera Vionnetta, Tommy Nugroho Tanumihardja, Kevin Kristian Aug 2023

Healthy Lifestyle Behaviors And Sociodemographic Characteristics Among Medical Students In Indonesia During The New Normal Era: A Cross-Sectional Study, Sharren Shera Vionnetta, Tommy Nugroho Tanumihardja, Kevin Kristian

Kesmas

This study aimed to identify medical students’ healthy lifestyle behaviors during the new normal era and to determine its relationship with sociodemographic factors, bearing in mind that, as future physicians and health role models, medical students play an important role in adopting and promoting healthy lifestyle behaviors to reduce the risk of future health problems as well as optimize communities’ health status. This cross-sectional study was conducted at the School of Medicine and Health Sciences of Universitas Katolik Indonesia Atma Jaya, with 111 medical students selected through stratified random sampling. Data were collected using sociodemographic characteristics (sex, residence, year of …


Making The Error Bar Overlap Myth A Reality: Comparative Confidence Intervals, Frank S. Corotto Aug 2023

Making The Error Bar Overlap Myth A Reality: Comparative Confidence Intervals, Frank S. Corotto

Georgia Journal of Science

Many interpret error bars to mean that if they do not overlap the difference is statistically “significant”. This overlap rule is really an overlap myth; the rule does not hold true for any conventional type of error bar. There are rules of thumb for estimating P values, but it would be better to show error bars for which the overlap rule holds true. Here I explain how to calculate comparative confidence intervals which, when plotted as error bars, let us judge significance based on overlap or separation. Others have published on these intervals (the mathematical basis goes back to John …


Using Geographic Information To Explore Player-Specific Movement And Its Effects On Play Success In The Nfl, Hayley Horn, Eric Laigaie, Alexander Lopez, Shravan Reddy Aug 2023

Using Geographic Information To Explore Player-Specific Movement And Its Effects On Play Success In The Nfl, Hayley Horn, Eric Laigaie, Alexander Lopez, Shravan Reddy

SMU Data Science Review

American Football is a billion-dollar industry in the United States. The analytical aspect of the sport is an ever-growing domain, with open-source competitions like the NFL Big Data Bowl accelerating this growth. With the amount of player movement during each play, tracking data can prove valuable in many areas of football analytics. While concussion detection, catch recognition, and completion percentage prediction are all existing use cases for this data, player-specific movement attributes, such as speed and agility, may be helpful in predicting play success. This research calculates player-specific speed and agility attributes from tracking data and supplements them with descriptive …


Traditional Vs Machine Learning Approaches: A Comparison Of Time Series Modeling Methods, Miguel E. Bonilla Jr., Jason Mcdonald, Tamas Toth, Bivin Sadler Aug 2023

Traditional Vs Machine Learning Approaches: A Comparison Of Time Series Modeling Methods, Miguel E. Bonilla Jr., Jason Mcdonald, Tamas Toth, Bivin Sadler

SMU Data Science Review

In recent years, various new Machine Learning and Deep Learning algorithms have been introduced, claiming to offer better performance than traditional statistical approaches when forecasting time series. Studies seeking evidence to support the usage of ML/DL over statistical approaches have been limited to comparing the forecasting performance of univariate, linear time series data. This research compares the performance of traditional statistical-based and ML/DL methods for forecasting multivariate and nonlinear time series.


A Hybrid Ensemble Of Learning Models, Bivin Sadler, Dhruba Dey, Duy Nguyen, Tavin Weeda Aug 2023

A Hybrid Ensemble Of Learning Models, Bivin Sadler, Dhruba Dey, Duy Nguyen, Tavin Weeda

SMU Data Science Review

Statistical models in time series forecasting have long been challenged to be superseded by the advent of deep learning models. This research proposes a new hybrid ensemble of forecasting models that combines the strengths of several strong candidates from these two model types. The proposed ensemble aims to improve the accuracy of forecasts and reduce computational complexity by leveraging the strengths of each candidate model.


The "Benfordness" Of Bach Music, Chadrack Bantange, Darby Burgett, Luke Haws, Sybil Prince Nelson Aug 2023

The "Benfordness" Of Bach Music, Chadrack Bantange, Darby Burgett, Luke Haws, Sybil Prince Nelson

Journal of Humanistic Mathematics

In this paper we analyze the distribution of musical note frequencies in Hertz to see whether they follow the logarithmic Benford distribution. Our results show that the music of Johann Sebastian Bach and Johann Christian Bach is Benford distributed while the computer-generated music is not. We also find that computer-generated music is statistically less Benford distributed than human- composed music.


Math And Democracy, Kimberly A. Roth, Erika L. Ward Aug 2023

Math And Democracy, Kimberly A. Roth, Erika L. Ward

Journal of Humanistic Mathematics

Math and Democracy is a math class containing topics such as voting theory, weighted voting, apportionment, and gerrymandering. It was first designed by Erika Ward for math master’s students, mostly educators, but then adapted separately by both Erika Ward and Kim Roth for a general audience of undergraduates. The course contains materials that can be explored in mathematics classes from those for non-majors through graduate students. As such, it serves students from all majors and allows for discussion of fairness, racial justice, and politics while exploring mathematics that non-major students might not otherwise encounter. This article serves as a guide …


Editorial, Al Asyary Jul 2023

Editorial, Al Asyary

Kesmas

No abstract provided.


A Multivariate Investigation Of The Motivational, Academic, And Well-Being Characteristics Of First-Generation And Continuing-Generation College Students, Christopher L. Thomas, Staci Zolkoski Jul 2023

A Multivariate Investigation Of The Motivational, Academic, And Well-Being Characteristics Of First-Generation And Continuing-Generation College Students, Christopher L. Thomas, Staci Zolkoski

Journal of Research Initiatives

Prior research has noted differences in motivational, academic, and well-being factors between first-generation and continuing-education students. However, past investigations have primarily overlooked the interactive influence of protective and risk factors when comparing the characteristics of first-generation and continuing-education students. Thus, the current study adopted a multivariate approach to gain a more nuanced understanding of the influence of generational status on students' self-regulated learning capabilities, academic anxiety, sense of belonging, academic barriers, mental health concerns, and satisfaction with life. University students (N = 432, 67.46% Caucasian, 87.55% female, Age = 28.10 ± 9.46) completed the Cognitive Test Anxiety Scale-2nd …


Limit Theorems For Weakly Dependent Random Variables With Values In Stable Type P Banach Spaces, Olimjon Sharipov, Utkir Kobilov Jun 2023

Limit Theorems For Weakly Dependent Random Variables With Values In Stable Type P Banach Spaces, Olimjon Sharipov, Utkir Kobilov

Bulletin of National University of Uzbekistan: Mathematics and Natural Sciences

We consider stable type p Banach spaces. We extend results known for independent random variables to the mixing random variables.

In particular we prove moment in equalities, low of large numbers and almost sure convergence of the series in the case of mixing random variables.


A Characterization Of Complex-Valued Random Variables With Rotationally-Invariant Moments, Michael L. Maiello Jun 2023

A Characterization Of Complex-Valued Random Variables With Rotationally-Invariant Moments, Michael L. Maiello

Rose-Hulman Undergraduate Mathematics Journal

A complex-valued random variable Z is rotationally invariant if the moments of Z are the same as the moments of W=e^{i*theta}Z. In the first part of the article, we characterize such random variables, in terms of "vanishing unbalanced moments," moment and cumulant generating functions, and polar decomposition. In the second part, we consider random variables whose moments are not necessarily finite, but which have a density. In this setting, we prove two characterizations that are equivalent to rotational invariance, one involving polar decomposition, and the other involving entropy. If a random variable has both a density and moments which determine …


(R2051) Analysis Of Map/Ph1, Ph2/2 Queueing Model With Working Breakdown, Repairs, Optional Service, And Balking, G. Ayyappan, G. Archana Jun 2023

(R2051) Analysis Of Map/Ph1, Ph2/2 Queueing Model With Working Breakdown, Repairs, Optional Service, And Balking, G. Ayyappan, G. Archana

Applications and Applied Mathematics: An International Journal (AAM)

In this paper, a classical queueing system with two types of heterogeneous servers has been considered. The Markovian Arrival Process (MAP) is used for the customer arrival, while phase type distribution (PH) is applicable for the offering of service to customers as well as the repair time of servers. Optional service are provided by the servers to the unsatisfied customers. The server-2 may get breakdown during the busy period of any type of service. Though the server- 2 got breakdown, server-2 has a capacity to provide the service at a slower rate to the current customer who is receiving service …


(R1975) Map/Ph(1), Ph(2)/2 Queue With Multiple Vacation, Optional Service, Consultations And Interruptions, G. Ayyappan, S. Sankeetha Jun 2023

(R1975) Map/Ph(1), Ph(2)/2 Queue With Multiple Vacation, Optional Service, Consultations And Interruptions, G. Ayyappan, S. Sankeetha

Applications and Applied Mathematics: An International Journal (AAM)

Two types of services are explored in this paper: regular server and main server, both of which provide both regular and optional services. Customers arrive using the Markovian Arrival Process (MAP), and service time is allocated based on phase type. The regular server uses the main server as a resource. Customers’ service at the primary server is disrupted as a result. When the queue size is empty, the main server can take several vacations. This system has been represented as a QBD Process that investigates steady state with the use of matrix analytic techniques, employing finite-dimensional block matrices. Our model’s …


(R2027) A New Class Of Pareto Distribution: Estimation And Its Applications, Anitta Susan Aniyan, Dais George Jun 2023

(R2027) A New Class Of Pareto Distribution: Estimation And Its Applications, Anitta Susan Aniyan, Dais George

Applications and Applied Mathematics: An International Journal (AAM)

The classical Pareto distribution is a positively skewed and right heavy-tailed lifetime distribution having a lot many applications in various fields of science and social science. In this work, via logarithmic trans-formed method, a new three parameter lifetime distribution, an extension of classical Pareto distribution is generated. The different structural properties of the new distribution are studied. The model parameters are estimated by the method of maximum likelihood and Bayesian procedure. When all the three parameters of the distribution are unknown, the Bayes estimators cannot be obtained in a closed form and hence, the Lindley’s approximation under squared error loss …


(R2053) Analysis Of Map/Ph/1 Queueing Model Subject To Two-Stage Vacation Policy With Imperfect Service, Setup Time, Breakdown, Delay Time, Phase Type Repair And Reneging Customer, N. Arulmozhi Jun 2023

(R2053) Analysis Of Map/Ph/1 Queueing Model Subject To Two-Stage Vacation Policy With Imperfect Service, Setup Time, Breakdown, Delay Time, Phase Type Repair And Reneging Customer, N. Arulmozhi

Applications and Applied Mathematics: An International Journal (AAM)

In this paper, we study a continuous-time single server queueing system with an infinite system of capacity, a two-stage vacation policy with imperfect service, setup, breakdown, delay time, phase-type of repair and customer reneging. The Markovian Arrival Process is used for the arrival of a customer and the phase-type distribution is used when offering service. This encompasses the policy of two vacations: a single working vacation and multiple vacations. Using the Matrix-Analytic Method to approach the system generates an invariant probability vector for this model. Henceforth, the busy period, waiting time distribution and cost analysis are the additional findings. The …


(R2025) Improving The Lda Linear Discriminant Analysis Method By Eliminating Redundant Variables For The Diagnosis Of Covid-19 Patients, Kianoush Fathi Vajargah, Hamid Mottaghi Golshan, Fazel Badakhshan Farahabadi Jun 2023

(R2025) Improving The Lda Linear Discriminant Analysis Method By Eliminating Redundant Variables For The Diagnosis Of Covid-19 Patients, Kianoush Fathi Vajargah, Hamid Mottaghi Golshan, Fazel Badakhshan Farahabadi

Applications and Applied Mathematics: An International Journal (AAM)

Nowadays, with the increase in data production speed, the process of data analysis has faced many problems because this big data is often accompanied by plug-in data and redundant data. Therefore, the use of dimensional methods in the pre-data analysis stage is necessary. In data mining, dimensional reduction is one of the most important steps in data pre-processing. Principal component analysis (PCA) and linear discriminant analysis (LDA) are often used to reduce dimensions in data mining. The LDA method is a monitored and controlled method but the PCA is not controlled method. When the number of samples in classes is …


Dimensions Of Vaccination Attitudes In Nigeria: A Study Of The Impacts Of Covid-19 Vaccine Risk Perception And Acceptance, Abiodun Musbau Lawal, Babatola Dominic Olawa, Ikenna Maximillian Odoh, Ayodeji Olorunfemi Olawole, Olubukola Ajayi, Judith Chineye Azikiwe, Israel Oluwatosin Ayodele, Emmanuel Kolawole Odusina, Thomas Attah, Ezekiel Adeyemi Odedokun, Stephen Ishola Babatunde, Teslim Alabi Oladejo, Confidence Chioma Otoghile, Saheed Abiola Saka May 2023

Dimensions Of Vaccination Attitudes In Nigeria: A Study Of The Impacts Of Covid-19 Vaccine Risk Perception And Acceptance, Abiodun Musbau Lawal, Babatola Dominic Olawa, Ikenna Maximillian Odoh, Ayodeji Olorunfemi Olawole, Olubukola Ajayi, Judith Chineye Azikiwe, Israel Oluwatosin Ayodele, Emmanuel Kolawole Odusina, Thomas Attah, Ezekiel Adeyemi Odedokun, Stephen Ishola Babatunde, Teslim Alabi Oladejo, Confidence Chioma Otoghile, Saheed Abiola Saka

Kesmas

Nigeria has been affected by the COVID-19 pandemic, and vaccination is a key strategy. However, the country faces vaccination hesitancy, poor risk perception, and low acceptance. This study aimed to assess the direct and interactive impacts of COVID-19 vaccine risk perception and acceptability on COVID-19 vaccination attitudes in the general Nigerian population. In a cross-sectional approach, participants completed a structured questionnaire including demographics, COVID-19 vaccine risk perception, acceptance, and vaccination attitude from April 2-30, 2021. The sample included 1,026 participants from different ethnicities across four regions (Southwest, South, Southeast, and North Central) in Nigeria, which were selected using the convenience …


Extremely Hot Ambient Temperature And Injury-Related Mortality, Mien T N Nguyen, Man V M Nguyen, Huong V T Le, Hoai Viet Nguyen, Vu Anh Nguyen, Ngoan Tran Le May 2023

Extremely Hot Ambient Temperature And Injury-Related Mortality, Mien T N Nguyen, Man V M Nguyen, Huong V T Le, Hoai Viet Nguyen, Vu Anh Nguyen, Ngoan Tran Le

Kesmas

This pilot study aimed to evaluate the effects of extremely hot ambient temperatures on the total number of fatal injuries. Data were collected from a population-based mortality registry of Thanh Hoa, a province in the North Central region of Vietnam. This study qualified the distributed lag non-linear model and calculated the RR and 95% CI adjusted for long-term trend and absolute humidity. For the entire study population with 3,949 registered deaths due to injuries collected during 2005-2007, after the onset of extremely hot ambient temperatures, an increased risk of death was observed on the 9th day RR (95% CI) = …


Effects Of Using An Application For Postpartum Contraceptive Use In Family Planning Counseling During Pregnancy, Lia Nurcahyani, Dyah Widiyastuti, Arief Tarmansyah Iman, Yanti Cahyati, Yeni Fitrianingsih May 2023

Effects Of Using An Application For Postpartum Contraceptive Use In Family Planning Counseling During Pregnancy, Lia Nurcahyani, Dyah Widiyastuti, Arief Tarmansyah Iman, Yanti Cahyati, Yeni Fitrianingsih

Kesmas

A decision-making tool for family planning flipchart is used for contraceptive counseling, but the use of this flipchart is suboptimal. In this study, primary study resulted in innovative decision-making tools for family planning applications. “Si KB Pintar” was also developed, a tool that women can use to discuss contraceptives with their husbands after family planning counseling. This study analyzed the effectiveness of family planning counseling during pregnancy by applying postpartum contraceptive use. Analytical quantitative quasi-experimental methods were used with a control group design. The sampling method was two-stage sampling. In the first stage, from 22 primary health care (PHC) in …


Assessment Of Rabies Control Attitudes During The Covid-19 Pandemic Through Partial Least Square-Structural Equation Modeling, Sang Gede Purnama, Ni Wayan Arya Utami, Made Subrata, Putu Erma Pradnyani, Karang Agustina, Ibn Swacita May 2023

Assessment Of Rabies Control Attitudes During The Covid-19 Pandemic Through Partial Least Square-Structural Equation Modeling, Sang Gede Purnama, Ni Wayan Arya Utami, Made Subrata, Putu Erma Pradnyani, Karang Agustina, Ibn Swacita

Kesmas

The COVID-19 pandemic disrupts rabies control activities in the community. A new approach is needed to control rabies during the COVID-19 pandemic through digital health interventions by conducting digital surveillance and education. This study aimed to determine key attitude indicators in controlling rabies during the COVID-19 pandemic. A cross-sectional study on 166 participants in Denpasar City with a total of 31 indicators measuring five variables: perceptions of the benefits of rabies control (6 indicators), perceptions of rabies risk (6 indicators), perceptions of obstacles to rabies control (5 indicators), perceptions of the need for technology (7 indicators), and attitudes toward rabies …


Public Acceptance Of Guidance And Regulations For Space Flight Participation, Cory Trunkhill, Robert Joslin, Joseph Keebler May 2023

Public Acceptance Of Guidance And Regulations For Space Flight Participation, Cory Trunkhill, Robert Joslin, Joseph Keebler

Journal of Aviation Technology and Engineering

Space flight participants are not professional astronauts and not subject to the rules and guidance covering space flight crewmembers. Ordinal logistic regression of survey data was utilized to explore public acceptance of current medical screening recommendations and regulations for safety risk and implied liability for civil space flight participation. Independent variables constituted participant demographic representations while dependent variables represented current Federal Aviation Administration guidance and regulations. Odds ratios were derived based on the demographic categories to interpret likelihood of acceptance for the criteria. Significant likely acceptance of guidance and regulations was found for five of twelve demographic variables influencing public …


Two Sample Statistical Test For Location Parameters, Narinder Kumar, Arun Kumar Apr 2023

Two Sample Statistical Test For Location Parameters, Narinder Kumar, Arun Kumar

Journal of Modern Applied Statistical Methods

A class of distribution-free tests for the homogeneity of location parameters is proposed and compared with different competitors in terms of Pitman asymptotic relative efficiency. A numerical example is provided and a simulation study is made to check the performance of the tests.


Bridging The Chasm Between Fundamental, Momentum, And Quantitative Investing, Allen Hoskins, Jeff Reed, Robert Slater Apr 2023

Bridging The Chasm Between Fundamental, Momentum, And Quantitative Investing, Allen Hoskins, Jeff Reed, Robert Slater

SMU Data Science Review

A chasm exists between the active public equity investment management industry's fundamental, momentum, and quantitative styles. In this study, the researchers explore ways to bridge this gap by leveraging domain knowledge, fundamental analysis, momentum, crowdsourcing, and data science methods. This research also seeks to test the developed tools and strategies during the volatile time period of 2020 and 2021.


Comparison Of Sampling Methods For Predicting Wine Quality Based On Physicochemical Properties, Robert Burigo, Scott Frazier, Eli Kravez, Nibhrat Lohia Apr 2023

Comparison Of Sampling Methods For Predicting Wine Quality Based On Physicochemical Properties, Robert Burigo, Scott Frazier, Eli Kravez, Nibhrat Lohia

SMU Data Science Review

Using the physicochemical properties of wine to predict quality has been done in numerous studies. Given the nature of these properties, the data is inherently skewed. Previous works have focused on handful of sampling techniques to balance the data. This research compares multiple sampling techniques in predicting the target with limited data. For this purpose, an ensemble model is used to evaluate the different techniques. There was no evidence found in this research to conclude that there are specific oversampling methods that improve random forest classifier for a multi-class problem.