Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

PDF

Journal

Regression

Discipline
Institution
Publication Year
Publication

Articles 1 - 30 of 32

Full-Text Articles in Physical Sciences and Mathematics

The Conviction Of Miss Prediction, Dane C. Joseph Jan 2024

The Conviction Of Miss Prediction, Dane C. Joseph

Journal of Humanistic Mathematics

Miss Prediction is questioned in a court of law over her involvement in the mischaracterization of linear models when they were inappropriate.


The Regression Of The Flood In Virginia, James C. Rakestraw, Jim Melnick Dec 2023

The Regression Of The Flood In Virginia, James C. Rakestraw, Jim Melnick

Proceedings of the International Conference on Creationism

The geology, tectonics, and hydraulics of the regression of the Flood formed much of the geomorphology of Virginia. Opportunities to view and study geology and geomorphology are available through visiting parks, traveling on public roads, and viewing geographic information system (GIS) resources.

Virginia is part of the North American Plate. A series of “blocks” of basement rocks within the plate underlie the geomorphological provinces of Virginia. These “blocks” form a series of steps between the Atlantic Ocean Basin and the Blue Ridge. The “Fall Line” found in Virginia is a fault between two blocks of basement rock. The basement rocks …


Predicting Insulin Pump Therapy Settings, Riccardo L. Ferraro, David Grijalva, Alex Trahan Sep 2022

Predicting Insulin Pump Therapy Settings, Riccardo L. Ferraro, David Grijalva, Alex Trahan

SMU Data Science Review

Millions of people live with diabetes worldwide [7]. To mitigate some of the many symptoms associated with diabetes, an estimated 350,000 people in the United States rely on insulin pumps [17]. For many of these people, how effectively their insulin pump performs is the difference between sleeping through the night and a life threatening emergency treatment at a hospital. Three programmed insulin pump therapy settings governing effective insulin pump function are: Basal Rate (BR), Insulin Sensitivity Factor (ISF), and Carbohydrate Ratio (ICR). For many people using insulin pumps, these therapy settings are often not correct, given their physiological needs. While …


Comparison Of Deep Learning And Regression-Based Mppt Algorithms In Pv Systems, Murat Sali̇m Karabi̇naoğlu, Beki̇r Çakir, Mustafa Engi̇n Başoğlu, Abdülvehhab Kazdaloğlu, Azi̇z Güneroğlu Sep 2022

Comparison Of Deep Learning And Regression-Based Mppt Algorithms In Pv Systems, Murat Sali̇m Karabi̇naoğlu, Beki̇r Çakir, Mustafa Engi̇n Başoğlu, Abdülvehhab Kazdaloğlu, Azi̇z Güneroğlu

Turkish Journal of Electrical Engineering and Computer Sciences

Solar energy systems (SES) and photovoltaic (PV) modules should be operated at the maximum power point (MPP) to achieve the highest efficiency in the energy generation processes. Maximum power point tracking (MPPT) applications using conventional methods may not be able to follow the global MPP (GMPP) of the PV system under changing atmospheric conditions and they could oscillate around the local MPP. In this study, a machine learning and deep learning (DL) based long short-term memory (LSTM) model is proposed as an innovative solution for MPPT. Contrary to the traditional MPPT applications using current and voltage sensors, the output resistance …


A Machine Learning Approach To Revenue Generation Within The Professional Hair Care Industry, Alexander K. Sepenu, Linda Eliasen Jun 2022

A Machine Learning Approach To Revenue Generation Within The Professional Hair Care Industry, Alexander K. Sepenu, Linda Eliasen

SMU Data Science Review

The cosmetic and beauty industry continues to grow and evolve to satisfy its patrons. In the United States, the industry is heavily science-driven, innovative, and fast-paced, suggesting that to remain productive and profitable, companies must seek smart alternatives to their current modus operandi or risk losing out on this multi-billion-dollar industry to fierce competition. In this paper, the authors seek to utilize machine learning models such as clustering and regression to improve the efficiency of current sales and customer segmentation models to help HairCo (pseudonym for confidentiality), a professional hair products manufacturer, strategize their marketing and sales efforts for revenue …


Forecasting Tv Ratings Of Turkish Television Series Using A Two-Level Machinelearning Framework, Büşranur Akgül, Tayfun Küçükyilmaz Mar 2022

Forecasting Tv Ratings Of Turkish Television Series Using A Two-Level Machinelearning Framework, Büşranur Akgül, Tayfun Küçükyilmaz

Turkish Journal of Electrical Engineering and Computer Sciences

TV rating is a numeric estimate of the popularity of television programs. Forecasting TV ratings is considered an important asset for investment planning of media due to its potential of reducing the risks of future ventures. The aim of this study is to develop a machine learning model capable of efficiently forecasting the TV ratings of Turkish TV series in a practical manner. To this end, two prediction models were proposed for forecasting the TV ratings of television series, facilitating an extensive set of features. A contribution of this study is the inclusion of social media-based features using search trends …


(R1239) A New Type Ii Half Logistic-G Family Of Distributions With Properties, Regression Models, System Reliability And Applications, Emrah Altun, Morad Alizadeh, Haitham M. Yousof, Mahdi Rasekhi, G. G. Hamedani Dec 2021

(R1239) A New Type Ii Half Logistic-G Family Of Distributions With Properties, Regression Models, System Reliability And Applications, Emrah Altun, Morad Alizadeh, Haitham M. Yousof, Mahdi Rasekhi, G. G. Hamedani

Applications and Applied Mathematics: An International Journal (AAM)

This study proposes a new family of distributions based on the half logistic distribution. With the new family, the baseline distributions gain flexibility through additional shape parameters. The important statistical properties of the proposed family are derived. A new generalization of the Weibull distribution is used to introduce a location-scale regression model for the censored response variable. The utility of the introduced models is demonstrated in survival analysis and estimation of the system reliability. Three data sets are analyzed. According to the empirical results, it is observed that the proposed family gives better results than other existing models.


Investigating Model Solution Correctness For Parameter Uncertainty In Both Objective Function And Constraints, Shangyao Yan, Sin-Siang Wang, Chun-Yi Wang Jul 2021

Investigating Model Solution Correctness For Parameter Uncertainty In Both Objective Function And Constraints, Shangyao Yan, Sin-Siang Wang, Chun-Yi Wang

Journal of Marine Science and Technology

To resolve engineering management problems encountered in the real world, optimization models are usually formulated with some parameter assumptions. Parameter uncertainty, which may arise due to changes in the environment or human error, may thus be incorporated into the objective function and the constraints. However, to simplify the modeling, the values of these parameters are usually set or projected as deterministic values. It is no wonder that the modelling results based on these inaccurate parameters are neither correct nor reliable. Thus, it is important to examine the correctness of the model results in relation to parameter uncertainty. This study aims …


Cover Song Identification - A Novel Stem-Based Approach To Improve Song-To-Song Similarity Measurements, Lavonnia Newman, Dhyan Shah, Chandler Vaughn, Faizan Javed Sep 2020

Cover Song Identification - A Novel Stem-Based Approach To Improve Song-To-Song Similarity Measurements, Lavonnia Newman, Dhyan Shah, Chandler Vaughn, Faizan Javed

SMU Data Science Review

Music is incorporated into our daily lives whether intentional or unintentional. It evokes responses and behavior so much so there is an entire study dedicated to the psychology of music. Music creates the mood for dancing, exercising, creative thought or even relaxation. It is a powerful tool that can be used in various venues and through advertisements to influence and guide human reactions. Music is also often "borrowed" in the industry today. The practices of sampling and remixing music in the digital age have made cover song identification an active area of research. While most of this research is focused …


3d Skull Surface Completion Method Based On Multi-Exemplars, Reziwanguli Xiamixiding, Guohua Geng, Gulisong Nasierding, Qingqiong Deng, Dilinuer Keyimu, Zulipiya Maimaitiming, Wanrong Zhao, Zheng Lei Aug 2020

3d Skull Surface Completion Method Based On Multi-Exemplars, Reziwanguli Xiamixiding, Guohua Geng, Gulisong Nasierding, Qingqiong Deng, Dilinuer Keyimu, Zulipiya Maimaitiming, Wanrong Zhao, Zheng Lei

Journal of System Simulation

Abstract: In order to repair the damaged skulls, a skull completion method based on multiple exemplars was proposed. A 3D skull model database was constructed, and all the exemplars within the database were registered with a standard skull model. Each exemplar was then divided into a missing part and a remaining part according to the given damaged skull. After that, the relationship between the missing part and the remaining part was obtained by a regression algorithm. This relationship was used to calculate the missing part of the given skull, and a complete skull could be obtained by merging the two …


Introduction To Research Statistical Analysis: An Overview Of The Basics, Christian Vandever Apr 2020

Introduction To Research Statistical Analysis: An Overview Of The Basics, Christian Vandever

HCA Healthcare Journal of Medicine

This article covers many statistical ideas essential to research statistical analysis. Sample size is explained through the concepts of statistical significance level and power. Variable types and definitions are included to clarify necessities for how the analysis will be interpreted. Categorical and quantitative variable types are defined, as well as response and predictor variables. Statistical tests described include t-tests, ANOVA and chi-square tests. Multiple regression is also explored for both logistic and linear regression. Finally, the most common statistics produced by these methods are explored.


An Exploration Of Link Functions Used In Ordinal Regression, Thomas J. Smith, David A. Walker, Cornelius M. Mckenna Apr 2020

An Exploration Of Link Functions Used In Ordinal Regression, Thomas J. Smith, David A. Walker, Cornelius M. Mckenna

Journal of Modern Applied Statistical Methods

The purpose of this study is to examine issues involved with choice of a link function in generalized linear models with ordinal outcomes, including distributional appropriateness, link specificity, and palindromic invariance are discussed and an exemplar analysis provided using the Pew Research Center 25th anniversary of the Web Omnibus Survey data. Simulated data are used to compare the relative palindromic invariance of four distinct indices of determination/discrimination, including a newly proposed index by Smith et al. (2017).


Modeling Compaction Parameters Using Support Vector And Decision Treeregression Algorithms, Abdurrahman Özbeyaz, Mehmet Söylemez Jan 2020

Modeling Compaction Parameters Using Support Vector And Decision Treeregression Algorithms, Abdurrahman Özbeyaz, Mehmet Söylemez

Turkish Journal of Electrical Engineering and Computer Sciences

Shortening the periods of compaction tests can be possible by analyzing the data obtained from previous laboratory tests with regression methods. The regression analysis applied to current data reduces the cost of experiments, saves time, and gives estimated outputs. In this study, the MLS-SVR, KB-SVR, and DTR algorithms were employed for the first time for the estimation of soil compaction parameters. The performances of these regression algorithms in estimating maximum dry unit weight (MDD) and optimum water content (OMC) were compared. Furthermore, the soil properties (fine-grained soil, sand, gravel, specific gravity, liquid limit, and plastic limit) were employed as inputs …


Visualization And Machine Learning Techniques For Nasa’S Em-1 Big Data Problem, Antonio P. Garza Iii, Jose Quinonez, Misael Santana, Nibhrat Lohia May 2019

Visualization And Machine Learning Techniques For Nasa’S Em-1 Big Data Problem, Antonio P. Garza Iii, Jose Quinonez, Misael Santana, Nibhrat Lohia

SMU Data Science Review

In this paper, we help NASA solve three Exploration Mission-1 (EM-1) challenges: data storage, computation time, and visualization of complex data. NASA is studying one year of trajectory data to determine available launch opportunities (about 90TBs of data). We improve data storage by introducing a cloud-based solution that provides elasticity and server upgrades. This migration will save $120k in infrastructure costs every four years, and potentially avoid schedule slips. Additionally, it increases computational efficiency by 125%. We further enhance computation via machine learning techniques that use the classic orbital elements to predict valid trajectories. Our machine learning model decreases trajectory …


Logistic Regression: An Inferential Method For Identifying The Best Predictors, Rand Wilcox Mar 2019

Logistic Regression: An Inferential Method For Identifying The Best Predictors, Rand Wilcox

Journal of Modern Applied Statistical Methods

When dealing with a logistic regression model, there is a simple method for estimating the strength of the association between the jth covariate and the dependent variable when all covariates are entered into the model. There is the issue of determining whether the jth independent variable has a stronger or weaker association than the kth independent variable. This note describes a method for dealing with this issue that was found to perform reasonably well in simulations.


Automated Elimination Of Eog Artifacts In Sleep Eeg Using Regression Method, Mehmet Dursun, Seral Özşen, Sali̇h Güneş, Bayram Akdemi̇r, Şebnem Yosunkaya Jan 2019

Automated Elimination Of Eog Artifacts In Sleep Eeg Using Regression Method, Mehmet Dursun, Seral Özşen, Sali̇h Güneş, Bayram Akdemi̇r, Şebnem Yosunkaya

Turkish Journal of Electrical Engineering and Computer Sciences

Sleep electroencephalogram (EEG) signal is an important clinical tool for automatic sleep staging process. Sleep EEG signal is effected by artifacts and other biological signal sources, such as electrooculogram (EOG) and electromyogram (EMG), and since it is effected, its clinical utility reduces. Therefore, eliminating EOG artifacts from sleep EEG signal is a major challenge for automatic sleep staging. We have studied the effects of EOG signals on sleep EEG and tried to remove them from the EEG signals by using regression method. The EEG and EOG recordings of seven subjects were obtained from the Sleep Research Laboratory of Meram Medicine …


Yelp’S Review Filtering Algorithm, Yao Yao, Ivelin Angelov, Jack Rasmus-Vorrath, Mooyoung Lee, Daniel W. Engels Aug 2018

Yelp’S Review Filtering Algorithm, Yao Yao, Ivelin Angelov, Jack Rasmus-Vorrath, Mooyoung Lee, Daniel W. Engels

SMU Data Science Review

In this paper, we present an analysis of features influencing Yelp's proprietary review filtering algorithm. Classifying or misclassifying reviews as recommended or non-recommended affects average ratings, consumer decisions, and ultimately, business revenue. Our analysis involves systematically sampling and scraping Yelp restaurant reviews. Features are extracted from review metadata and engineered from metrics and scores generated using text classifiers and sentiment analysis. The coefficients of a multivariate logistic regression model were interpreted as quantifications of the relative importance of features in classifying reviews as recommended or non-recommended. The model classified review recommendations with an accuracy of 78%. We found that reviews …


Fuel Flow Reduction Impact Analysis Of Drag Reducing Film Applied To Aircraft Wings, Damon Resnick, Chris Donlan, Nimish Sakalle, Cody Pinkerman Jul 2018

Fuel Flow Reduction Impact Analysis Of Drag Reducing Film Applied To Aircraft Wings, Damon Resnick, Chris Donlan, Nimish Sakalle, Cody Pinkerman

SMU Data Science Review

In this paper, we present an analysis of flight data in order to determine whether the application of the Edge Aerodynamix Conformal Vortex Generator (CVG), applied to the wings of aircraft, reduces fuel flow during cruising conditions of flight. The CVG is a special treatment and film applied to the wings of an aircraft to protect the wings and reduce the non-laminar flow of air around the wings during flight. It is thought that by reducing the non-laminar flow or vortices around and directly behind the wings that an aircraft will move more smoothly through the air and provide a …


Prediction Of Gross Calorific Value Of Coal Based On Proximate Analysis Using Multiple Linear Regression And Artificial Neural Networks, Mustafa Açikkar, Osman Si̇vri̇kaya Jan 2018

Prediction Of Gross Calorific Value Of Coal Based On Proximate Analysis Using Multiple Linear Regression And Artificial Neural Networks, Mustafa Açikkar, Osman Si̇vri̇kaya

Turkish Journal of Electrical Engineering and Computer Sciences

Gross calorific value (GCV) of coal was predicted by using as-received basis proximate analysis data. Two main objectives of the study were to develop prediction models for GCV using proximate analysis variables and to reveal the distinct predictors of GCV. Multiple linear regression (MLR) and artifcial neural network (ANN) (multilayer perceptron MLP, general regression neural network GRNN, and radial basis function neural network RBFNN) methods were applied to the developed 11 models created by different combinations of the predictor variables. By conducting 10-fold cross-validation, the prediction accuracy of the models has been tested by using $ R^2 $, $ RMSE …


Palynological And Petroleum Geochemical Assessment Of The Lower Oligocene Mezardere Formation, Thrace Basin, Nw Turkey, Kadi̇r Gürgey, Zühtü Bati Jan 2018

Palynological And Petroleum Geochemical Assessment Of The Lower Oligocene Mezardere Formation, Thrace Basin, Nw Turkey, Kadi̇r Gürgey, Zühtü Bati

Turkish Journal of Earth Sciences

The Oligocene clastic sequence of the Mezardere Formation (MF) with laterally variable organic richness has long been known as a proven source of gas with minor oil accumulations across the Thrace Basin of northwest Turkey. However, based on well data for the thick MF, neither detailed work in relation to age dating and stratigraphy nor a close linkage between the depositional facies/ environments, organic richness/organic proxies, and cyclicity has been established yet. In the present study, the MF was informally subdivided into Lower MF (LMF) and Upper MF (UMF) based on the distinct differences in palynological and geochemical data. Based …


Effective Estimation Strategy Of Finite Population Variance Using Multi-Auxiliary Variables In Double Sampling, Reba Maji, G. N. Singh, Arnab Bandyopadhyay May 2017

Effective Estimation Strategy Of Finite Population Variance Using Multi-Auxiliary Variables In Double Sampling, Reba Maji, G. N. Singh, Arnab Bandyopadhyay

Journal of Modern Applied Statistical Methods

Estimation of population variance in two-phase (double) sampling is considered using information on multiple auxiliary variables. An unbiased estimator is proposed and its properties are studied under two different structures. The superiority of the suggested estimator over some contemporary estimators of population variance was established through empirical studies from a natural and an artificially generated dataset.


A 10-Point Approximating Subdivision Scheme Based On Least Squares Technique, Ghulam Mustafa, Muhammad T. Iqbal Dec 2016

A 10-Point Approximating Subdivision Scheme Based On Least Squares Technique, Ghulam Mustafa, Muhammad T. Iqbal

Applications and Applied Mathematics: An International Journal (AAM)

In this paper, a 10-point approximating subdivision scheme is presented. Least squares technique for fitting the polynomial of degree 9 to data is used to develop this scheme. The proposed strategy can be used to generate a family of schemes. The important characteristics of the scheme are also discussed. Graphical efficiency of the scheme is shown by applying it on different types of data.


Efficient And Unbiased Estimation Procedure Of Population Mean In Two-Phase Sampling, Reba Maji, Arnab Bandyopadhyay, G. N. Singh Nov 2016

Efficient And Unbiased Estimation Procedure Of Population Mean In Two-Phase Sampling, Reba Maji, Arnab Bandyopadhyay, G. N. Singh

Journal of Modern Applied Statistical Methods

In this paper, an unbiased regression-ratio type estimator has been developed for estimating the population mean using two auxiliary variables in double sampling. Its properties are studied under two different cases. Empirical studies and graphical simulation have been done to demonstrate the efficiency of the proposed estimator over other estimators.


A Spatial Analytical Framework For Examining Road Traffic Crashes, Grace O. Korter May 2016

A Spatial Analytical Framework For Examining Road Traffic Crashes, Grace O. Korter

Journal of Modern Applied Statistical Methods

A number of different modeling techniques have been used to examine road traffic crashes for analytic and predictive purposes. Map-based spatial analysis is introduced. Applications are given which show the power in a combination of existing exploratory and statistical methods.


Contrails: Causal Inference Using Propensity Scores, Dean S. Barron Nov 2015

Contrails: Causal Inference Using Propensity Scores, Dean S. Barron

Journal of Modern Applied Statistical Methods

Contrails are clouds caused by airplane exhausts, which geologists contend decrease daily temperature ranges on Earth. Following the 2001 World Trade Center attack, cancelled domestic flights triggered the first absence of contrails in decades. Resultant exceptional data capacitated causal inference analysis by propensity score matching. Estimated contrail effect was 6.8981°F.


A Likelihood Ratio Test Approach To Profile Monitoring In Tourism Industry, R. Noorossana, H. Izadbakhsh, M. R. Nayebpour Dec 2014

A Likelihood Ratio Test Approach To Profile Monitoring In Tourism Industry, R. Noorossana, H. Izadbakhsh, M. R. Nayebpour

Applications and Applied Mathematics: An International Journal (AAM)

A new statistical profile monitoring technique to monitor and detect changes in logistic profiles with an application in the tourism industry is presented in this paper. In the statistical process control literature, profile is usually referred to as a relationship between a response variable and one or more explanatory variables. In the tourism case study presented in this paper, time is considered as the explanatory variable and tourism satisfaction as the response variable. The Likelihood ratio test is used as a vehicle to detect any changes in the satisfaction profile in phase II of profile monitoring. The performance of the …


A Comparative Review Of Regression Ensembles On Drug Design Datasets, Mehmet Fati̇h Amasyali, Kadri̇ Okan Ersoy Jan 2013

A Comparative Review Of Regression Ensembles On Drug Design Datasets, Mehmet Fati̇h Amasyali, Kadri̇ Okan Ersoy

Turkish Journal of Electrical Engineering and Computer Sciences

Drug design datasets are usually known as hard-modeled, having a large number of features and a small number of samples. Regression types of problems are common in the drug design area. Committee machines (ensembles) have become popular in machine learning because of their good performance. In this study, the dynamics of ensembles used in regression-related drug design problems are investigated with a drug design dataset collection. The study tries to determine the most successful ensemble algorithm, the base algorithm--ensemble pair having the best/worst results, the best successful single algorithm, and the similarities of algorithms according to their performances. We also …


Improved Estimator In The Presence Of Multicollinearity, Ghadban Khalaf May 2012

Improved Estimator In The Presence Of Multicollinearity, Ghadban Khalaf

Journal of Modern Applied Statistical Methods

The performances of two biased estimators for the general linear regression model under conditions of collinearity are examined and a new proposed ridge parameter is introduced. Using Mean Square Error (MSE) and Monte Carlo simulation, the resulting estimator’s performance is evaluated and compared with the Ordinary Least Square (OLS) estimator and the Hoerl and Kennard (1970a) estimator. Results of the simulation study indicate that, with respect to MSE criteria, in all cases investigated the proposed estimator outperforms both the OLS and the Hoerl and Kennard estimators.


Number Of Replications Required In Monte Carlo Simulation Studies: A Synthesis Of Four Studies, Daniel J. Mundform, Jay Schaffer, Myoung-Jin Kim, Dale Shaw, Ampai Thongteeraparp, Pornsin Supawan May 2011

Number Of Replications Required In Monte Carlo Simulation Studies: A Synthesis Of Four Studies, Daniel J. Mundform, Jay Schaffer, Myoung-Jin Kim, Dale Shaw, Ampai Thongteeraparp, Pornsin Supawan

Journal of Modern Applied Statistical Methods

Monte Carlo simulations are used extensively to study the performance of statistical tests and control charts. Researchers have used various numbers of replications, but rarely provide justification for their choice. Currently, no empirically-based recommendations regarding the required number of replications exist. Twenty-two studies were re-analyzed to determine empirically-based recommendations.


Review Of Super Crunchers By Ian Ayers, Eric Gaze Jun 2009

Review Of Super Crunchers By Ian Ayers, Eric Gaze

Numeracy

Ayers, I. Super Crunchers: Why Thinking-by-Numbers Is the New Way to be Smart. (Bantam Dell Publishing Group, 2007). 272 pp. Hard cover $25 ISBN 978-0-553-80540-6.

Super Crunchers tells the story of how analyzing data is changing the ways in which decisions are made. We in the National Numeracy Network make a case for the importance of quantitative literacy by referring to how much quantitative information is now available to each of us: “a world awash in numbers.” Ian Ayres zeroes in on the people who are making a living crunching all of these data. From the seemingly innocuous (how …