Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

2019

Journal

Statistics and Probability

Articles 1 - 30 of 85

Full-Text Articles in Physical Sciences and Mathematics

Identifying Customer Churn In After-Market Operations Using Machine Learning Algorithms, Vitaly Briker, Richard Farrow, William Trevino, Brent Allen Dec 2019

SMU Data Science Review

This paper presents a comparative study on machine learning methods as they are applied to product associations, future purchase predictions, and predictions of customer churn in aftermarket operations. Association rules are used to help identify patterns across products and find correlations in customer purchase behaviour. Studying customer behaviour as it pertains to Recency, Frequency, and Monetary Value (RFM) helps inform customer segmentation and identifies customers with propensity to churn. Lastly, Flowserve’s customer purchase history enables the establishment of churn thresholds for each customer group and assists in constructing a model to predict future churners. The aim of this model is …
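As a rough illustration of the RFM idea mentioned in this abstract, the sketch below computes recency (days since last purchase), frequency (number of purchases), and monetary value (total spend) per customer. The record layout, the sample data, and the `rfm` helper are assumptions made for illustration; they are not taken from the paper.

```python
from datetime import date

# Hypothetical purchase records: (customer_id, purchase_date, amount).
purchases = [
    ("A", date(2019, 1, 10), 120.0),
    ("A", date(2019, 6, 5), 80.0),
    ("B", date(2018, 11, 20), 300.0),
]

def rfm(records, as_of):
    """Return {customer: (recency_days, frequency, monetary)}."""
    table = {}
    for cust, day, amount in records:
        last, freq, total = table.get(cust, (None, 0, 0.0))
        # Keep the most recent purchase date seen so far.
        last = day if last is None or day > last else last
        table[cust] = (last, freq + 1, total + amount)
    return {
        cust: ((as_of - last).days, freq, total)
        for cust, (last, freq, total) in table.items()
    }

scores = rfm(purchases, as_of=date(2019, 12, 1))
```

A churn threshold could then be expressed as a cutoff on the recency value for each customer group, though how the authors set such thresholds is not stated in the truncated abstract.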


Personalized Detection Of Anxiety Provoking News Events Using Semantic Network Analysis, Jacquelyn Cheun PhD, Luay Dajani, Quentin B. Thomas Dec 2019

SMU Data Science Review

In the age of hyper-connectivity, 24/7 news cycles, and instant news alerts via social media, mental health researchers don't have a way to automatically detect news content which is associated with triggering anxiety or depression in mental health patients. Using the Associated Press news wire, a semantic network was built with 1,056 news articles containing over 500,000 connections across multiple topics to provide a personalized algorithm which detects problematic news content for a given reader. We make use of Semantic Network Analysis to surface the relationship between news article text and anxiety in readers who struggle with mental health disorders. …


Achieving Optimal Horizontal Drill Operations, Daniel J. Serna, James Vasquez, Donald Markley Dec 2019

SMU Data Science Review

In this paper, we present a novel method of predicting the onset of a slide event in horizontal drilling operations. Horizontal drilling operations attempt to create a well through a subsurface as quickly as possible by rotating a drill through the subsurface. A slide event occurs when the drill begins to inefficiently rotate through the subsurface, resulting in a significantly reduced rate of penetration. Slide events can be prevented, or significantly reduced in their impact, when their onset is accurately predicted. We present a method of accurately predicting the onset of slide events with a time-series based predictive model that …


A Simulation Study On The Size And Power Properties Of Some Ridge Regression Tests, B. M. Golam Kibria, Shipra Banik Dec 2019

Applications and Applied Mathematics: An International Journal (AAM)

Ridge regression techniques have been extensively used to solve the multicollinearity problem for both linear and non-linear regression models since their inception. This paper studied different ridge regression t-type tests of the individual coefficients of a linear regression model. A simulation study has been conducted to evaluate the performance of the proposed tests with respect to their sizes and powers under different settings of the linear regression model. Our simulation results demonstrated that most of the proposed tests have sizes close to the 5% nominal level and all tests except tAKS, tkM2 and tkM9 have considerable gain in powers over …


Alpha-Skew Generalized Normal Distribution And Its Applications, Eisa Mahmoudi, Hamideh Jafari, Rahmat S. Meshkat Dec 2019

Applications and Applied Mathematics: An International Journal (AAM)

The main objective of this paper is to introduce a new family of distributions that is flexible enough to fit both unimodal and bimodal shapes. This new family, called alpha-skew generalized normal (ASGN), skews symmetric distributions; this paper focuses on the generalized normal distribution. Here, some properties of this new distribution including cumulative distribution function, survival function, hazard rate function and moments are derived. To estimate the model parameters, the maximum likelihood estimators and the asymptotic distribution of the estimators are discussed. The observed information matrix is derived. Finally, the flexibility of the new distribution, as well as …


An (S - 1; S) Inventory System With Negative Arrivals And Multiple Vacations, Kathiresan Jothivel, Anbazhagan Neelamegam Dec 2019

Applications and Applied Mathematics: An International Journal (AAM)

In this paper, we consider a continuous review one-to-one ordering policy inventory system with multiple vacations and negative customers. The maximum storage capacity is S. The customers arrive according to a Poisson process with a finite waiting hall. There are two types of customers: ordinary and negative. An ordinary customer, on arrival, joins the queue, whereas a negative customer does not join the queue and takes away any one of the waiting customers. When the waiting hall is full, the arriving primary customer is considered to be lost. The service time and lead time are assumed to have independent exponential distributions. …


A Further Result On The Aging Properties Of An Extended Additive Hazard Model, Morteza Raeisi, Gholamhossein Yari Dec 2019

Applications and Applied Mathematics: An International Journal (AAM)

The passing of time is an important factor for covariates in the additive and proportional hazard models. According to this idea, the extended additive hazard model (EAHM) is introduced by considering the time-varying effects of covariates, and several properties of this model related to reliability analysis are investigated. In this paper, we obtain a further result for the EAHM with respect to the aging properties.


Series Of Divergence Measures Of Type K, Information Inequalities And Particular Cases, R. N. Saraswat, Ajay Tak Dec 2019

Applications and Applied Mathematics: An International Journal (AAM)

Information and divergence measures deal with the study of problems concerning information processing, information storage, information retrieval and decision making. The purpose of this paper is to find a new series of divergence measures and their applications, and to discuss the mathematical tools for finding convexity of the functions. Applications of convex functions in information theory and the relationship between new and well-known divergence measures are discussed. Also, some new bounds have been established for divergence measures using new f divergence measures and their properties.


Analysis Of Two Stage M[X1],M[X2]/G1,G2/1 Retrial G-Queue With Discretionary Priority Services, Working Breakdown, Bernoulli Vacation, Preferred And Impatient Units, G. Ayyappan, B. Somasundaram Dec 2019

Applications and Applied Mathematics: An International Journal (AAM)

In this paper, we study an M[X1], M[X2]/G1, G2/1 retrial queueing system with discretionary priority services. There are two stages of service for the ordinary units. During the first stage of service of an ordinary unit, arriving priority units have the option to interrupt the service, but during the second stage of service they cannot interrupt. When ordinary units enter the system, they may get the service even if the server is busy with the first stage of service of an ordinary unit or may enter into the orbit or leave …


Analysis Of Batch Arrival Single And Bulk Service Queue With Multiple Vacation Closedown And Repair, T. Deepa, A. Azhagappan Dec 2019

Applications and Applied Mathematics: An International Journal (AAM)

In this paper, we analyze a batch arrival single and bulk service queueing model with multiple vacation, closedown and repair. The single server provides single service if the queue size is ‘< a’ and bulk service if the queue size is ‘≥ a’. After completing the service (single or bulk), the server may break down with probability ξ and then it will be sent for repair. When the system becomes empty or the server is ready to serve after the repair but no one is waiting, the server resumes closedown and then goes for a multiple vacation of random length. Using the supplementary variable technique, the steady-state probability generating function (PGF) of …


Economic Design Of Acceptance Sampling Plans For Truncated Life Tests Using Three-Parameter Lindley Distribution, Amer Ibrahim Al-Omari, Enrico Ciavolino, Amjad D. Al-Nasser Nov 2019

Journal of Modern Applied Statistical Methods

A single acceptance sampling plan for the three-parameter Lindley distribution under a truncated life test is developed. For various consumer’s confidence levels, acceptance numbers, and values of the ratio of the experimental time to the specified average lifetime, the minimum sample sizes necessary to assert a certain average lifetime are calculated. The operating characteristic (OC) function values as well as the associated producer’s risks are also provided. A numerical example is presented to illustrate the suggested acceptance sampling plans.


Population Health Management, Data And Technology, Helena Ladd, Cody Hepp, Anna Mccloud, Hannah Granger, Mary Ellen Hethcox, Samuel Calabrese Oct 2019

Pharmacy and Wellness Review

No abstract provided.


Phase IV Clinical Trials: Postmarketing Surveillance Of Prescription Drugs, Morgan Belling, Jacquline Nunner, Jessica Stemen Oct 2019

Pharmacy and Wellness Review

No abstract provided.


An Animal-Assisted Intervention Study In The Nursing Home: Lessons Learned, Lonneke G. J. A. Schuurmans, Inge Noback, Jos M. G. A. Schols, Marie-Jose Enders-Slegers Sep 2019

People and Animals: The International Journal of Research and Practice

AAI studies in the nursing home pose a specific set of challenges. In this article the practical and ethical issues encountered during a Dutch psychogeriatric nursing home AAI study are addressed with the aim of sharing our experiences for future researchers as well as AAI practitioners in general.

In our study we compared three groups of clients with dementia who participated in group sessions of either visiting dog teams, visiting FurReal Friend robot animals, or visiting students (control group) and monitored the effect on social interaction and neuropsychiatric symptoms through video analysis and questionnaires. We encountered the following four categories …


Fake News And STEM, Vikki French Sep 2019

The Liminal: Interdisciplinary Journal of Technology in Education

Based on over ten years of teaching mathematics, statistics and science in universities, community colleges, and for-profit universities, I have witnessed how Fake News is part of these disciplines and how students can easily be misled into accepting pseudoscience. This is a report of my findings.


The Estimation Of Missing Values In Rectangular Lattice Designs, Emmanuel Ogochukwu Ossai, Abimibola Victoria Oladugba Sep 2019

Journal of Modern Applied Statistical Methods

Algebraic expressions for estimating missing data when one or more observation(s) are missing in Rectangular lattice designs with repetition were derived using the method of minimizing the residual sum of squares. Results showed that the estimated value(s) closely approximated the actual value(s).


Predicting Wind Turbine Blade Erosion Using Machine Learning, Casey Martinez, Festus Asare Yeboah, Scott Herford, Matt Brzezinski, Viswanath Puttagunta Aug 2019

SMU Data Science Review

Using time-series data and turbine blade inspection assessments, we present a classification model in order to predict remaining turbine blade life in wind turbines. Capturing the kinetic energy of wind requires complex mechanical systems, which require sophisticated maintenance and planning strategies. There are many traditional approaches to monitoring the internal gearbox and generator, but the condition of turbine blades can be difficult to measure and access. Accurate and cost-effective estimates of turbine blade life cycles will drive optimal investments in repairs and improve overall performance. These measures will drive down costs as well as provide cheap and clean electricity …


Machine Learning In Support Of Electric Distribution Asset Failure Prediction, Robert D. Flamenbaum, Thomas Pompo, Christopher Havenstein, Jade Thiemsuwan Aug 2019

SMU Data Science Review

In this paper, we present novel approaches to predicting asset failure in the electric distribution system. Failures in overhead power lines and their associated equipment, in particular, pose significant financial and environmental threats to electric utilities. Electric device failure furthermore poses a burden on customers and can pose serious risk to life and livelihood. Working with asset data acquired from an electric utility in Southern California, and incorporating environmental and geospatial data from around the region, we applied a Random Forest methodology to predict which overhead distribution lines are most vulnerable to failure. Our results provide evidence …


Identifying Undervalued Players In Fantasy Football, Christopher D. Morgan, Caroll Rodriguez, Korey Macvittie, Robert Slater, Daniel W. Engels Aug 2019

SMU Data Science Review

In this paper we present a model to predict player performance in fantasy football. In particular, identifying high-performance players can prove to be a difficult problem, as there are on occasion players capable of high performance whose past metrics give no indication of this capacity. These "sleepers" are often undervalued, and the acquisition of such players can have notable impact on a fantasy football team's overall performance. We constructed a regression model that accounts for players' past performance and athletic metrics to predict their future performance. The model we built performs favorably in predicting athlete performance in relation to other …


Machine Learning Predicts Aperiodic Laboratory Earthquakes, Olha Tanyuk, Daniel Davieau, Charles South, Daniel W. Engels Aug 2019

SMU Data Science Review

In this paper we find a pattern of aperiodic seismic signals that precede earthquakes at any time in a laboratory earthquake’s cycle using a small window of time. We use a data set that comes from a classic laboratory experiment having several stick-slip displacements (earthquakes), a type of experiment which has been studied as a simulation of seismologic faults for decades. This data exhibits similar behavior to natural earthquakes, so the same approach may work in predicting their timing. Here we show that by applying a random forest machine learning technique to the acoustic signal emitted by a laboratory …


Longitudinal Analysis With Modes Of Operation For AES, Dana Geislinger, Cory Thigpen, Daniel W. Engels Aug 2019

SMU Data Science Review

In this paper, we present an empirical evaluation of the randomness of the ciphertext blocks generated by the Advanced Encryption Standard (AES) cipher in Counter (CTR) mode and in Cipher Block Chaining (CBC) mode. Vulnerabilities have been found in the AES cipher that may lead to a reduction in the randomness of the generated ciphertext blocks that can result in a practical attack on the cipher. We evaluate the randomness of the AES ciphertext using the standard key length and NIST randomness tests. We evaluate the randomness through a longitudinal analysis on 200 billion ciphertext blocks using logistic regression and …


Mathematics Versus Statistics, Mindy B. Capaldi Jul 2019

Journal of Humanistic Mathematics

Mathematics and statistics are both important and useful subjects, but the former has maintained prominence in the American education system. On the other hand, statistics is more prevalent in daily life and is an increasingly marketable subject to know. This article gives a personal history of one mathematician’s bumpy road to learning and teaching statistics. Additionally, arguments for how and why to include statistics in the K-12 and college curricula are provided.


Choose Your Own Adventure: An Analysis Of Interactive Gamebooks Using Graph Theory, D'Andre Adams, Daniela Beckelhymer, Alison Marr Jul 2019

Journal of Humanistic Mathematics

"BEWARE and WARNING! This book is different from other books. You and YOU ALONE are in charge of what happens in this story." This is the captivating introduction to every book in the interactive novel series, Choose Your Own Adventure (CYOA). Our project uses the mathematical field of graph theory to analyze forty books from the CYOA book series for ages 9-12. We first began by drawing the digraphs of each book. Then we analyzed these digraphs by collecting structural data such as longest path length (i.e. longest story length) and number of vertices with outdegree zero (i.e. number …
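The two structural measures named in this abstract can be sketched on a toy example. The `book` digraph below is hypothetical (it is not one of the forty books analyzed), and the longest-path helper assumes the digraph has no cycles, which may not hold for every gamebook.

```python
from functools import lru_cache

# Hypothetical toy gamebook: each key is a page, each value lists the pages
# the reader can choose to turn to next. Pages with no choices are endings.
book = {
    1: [2, 3],
    2: [4, 5],
    3: [5],
    4: [],
    5: [6],
    6: [],
}

def count_endings(graph):
    """Number of vertices with outdegree zero."""
    return sum(1 for choices in graph.values() if not choices)

def longest_path_length(graph, start):
    """Number of edges on the longest path from start (assumes no cycles)."""
    @lru_cache(maxsize=None)
    def depth(page):
        # An ending has depth 0; otherwise take the deepest choice plus one edge.
        return max((1 + depth(nxt) for nxt in graph[page]), default=0)
    return depth(start)

endings = count_endings(book)
longest = longest_path_length(book, 1)
```

Here path length is counted in edges (page turns); counting vertices instead would add one to the result.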


Graphicacy For Numeracy: Review Of Fundamentals Of Data Visualization: A Primer On Making Informative And Compelling Figures By Claus O. Wilke (2019), Christy M. Bebeau Jul 2019

Numeracy

Wilke, Claus O. 2019. Fundamentals of Data Visualization: A Primer on Making Informative and Compelling Figures. (Sebastopol, CA: O’Reilly Media, Inc.). 390 pp. ISBN 978-1-492-03108-6. First edition. First release: 03-15-2019.

Claus O. Wilke has authored an excellent reference about producing and understanding static figures, figures used online, in print, and for presentations. His book is neither a statistics nor programming text, but familiarity with basic statistical concepts is helpful. Written in three parts, the book presents both the math and artistic design aspects of telling a story through figures. Wilke makes extensive use of examples, labels them good, bad, …


Taking Multiple Regression Analysis To Task: A Review Of Mindware: Tools For Smart Thinking, By Richard Nisbett (2015), Jason Makansi Jul 2019

Numeracy

Richard Nisbett. 2015. Mindware: Tools for Smart Thinking. (New York, NY: Farrar, Straus and Giroux). 336 pp. ISBN: 9780374536244

Nisbett, a psychologist, may not achieve his stated goal of teaching readers to “effortlessly” extend their common sense when it comes to quantitative analysis applied to everyday issues, but his critique of multiple regression analysis (MRA) in the middle chapters of Mindware is worth attention from, and contemplation by, the QL/QR and Numeracy community. While in at least one other source, Nisbett’s critique has been called a “crusade” against MRA, what he really advocates is that it not be used as …


Using Meta-Analysis To Assess Affective Outcomes In A Multi-Course QR Module Intervention, James Friedrich, Kelley D. Strawn Jul 2019

Numeracy

When quantitative reasoning (QR) interventions share a common hypothesis or goal, a promising approach for evaluation involves integrating separate analyses through the use of meta-analysis. This paper reports an assessment of a module-based QR intervention distributed across 20 courses at a single institution. Topics and participating courses were diverse, including arts & humanities, quantitative behavioral sciences, and natural sciences & mathematics groupings, but all addressed the shared affective goals of reducing student QR self-doubt and increasing appreciation for QR value and utility. With a local framework to guide module development, we assess these outcomes using reliable self-report measures in a pre-post …


Cancerous Male And Female Gene Expression, Clarissa Farmer, E. Shannon Tass Jun 2019

Journal of Undergraduate Research

Genetic diagnosing is becoming more popular, as well as more and more accurate. However, many genetic diseases have complex genetic effects and are still not fully understood. Transthyretin Amyloidosis (ATTR; also known as familial or hereditary amyloidosis) is a terminal genetic disease. It is caused by unstable transthyretin proteins that fold improperly, and then deteriorate. The fragmented proteins are deposited outside of the cell and build up in the tissues over time, forming insoluble oligomers. The oligomers continue to grow into Amyloid fibrils, which adversely affect many organs in the body, eventually causing their failure. In order to accurately diagnose, …


Cluster Analysis Via Random Partition Distributions, Brandon Carter, Dr. David B. Dahl Jun 2019

Journal of Undergraduate Research

Cluster analysis is an important exploratory data analysis technique used in a wide variety of fields. Cluster analysis seeks to discover a natural grouping of the data, where items in the same cluster or group are more similar than items from different clusters. Through our research, we developed a novel method for cluster analysis which takes pairwise distance information as input. Our new method improves upon traditional cluster analysis methods which also take pairwise distance information as input, such as hierarchical clustering. Our method, cluster analysis via random partition distributions (CaviarPD), is based on probability distributions and therefore allows the …


Data Mining And Machine Learning To Improve Northern Florida’s Foster Care System, Daniel Oldham, Nathan Foster, Mihhail Berezovski Jun 2019

Beyond: Undergraduate Research Journal

The purpose of this research project is to use statistical analysis, data mining, and machine learning techniques to determine identifiable factors in child welfare service records that could lead to a child entering the foster care system multiple times. This would allow us to accurately predict a case’s outcome based on these factors. We were provided with eight years of data in the form of multiple spreadsheets from Partnership for Strong Families (PSF), a child welfare services organization based in Gainesville, Florida, which is contracted by the Florida Department for Children and Families (DCF). This data contained a …


Measure Of Departure From Marginal Average Point-Symmetry For Two-Way Contingency Tables, Kiyotaka Iki, Sadao Tomizawa Jun 2019

Journal of Modern Applied Statistical Methods

For the analysis of two-way contingency tables with ordered categories, Yamamoto, Tahata, Suzuki, and Tomizawa (2011) considered a measure to represent the degree of departure from marginal point-symmetry. The maximum value of the measure cannot distinguish two kinds of marginal complete asymmetry with respect to the midpoint. A measure is proposed which can distinguish two kinds of marginal asymmetry with respect to the midpoint. A large-sample confidence interval for the proposed measure is also given.