Open Access. Powered by Scholars. Published by Universities.®

Mathematics Commons

Statistics and Probability

2018

Articles 1 - 30 of 104

Full-Text Articles in Mathematics

Nonparametric Collective Spectral Density Estimation With An Application To Clustering The Brain Signals, Mehdi Maadooliat, Ying Sun, Tianbo Chen Dec 2018

Mathematics, Statistics and Computer Science Faculty Research and Publications

In this paper, we develop a method for the simultaneous estimation of spectral density functions (SDFs) for a collection of stationary time series that share some common features. Due to the similarities among the SDFs, the log‐SDF can be represented using a common set of basis functions. The basis shared by the collection of the log‐SDFs is estimated as a low‐dimensional manifold of a large space spanned by a prespecified rich basis. A collective estimation approach pools information and borrows strength across the SDFs to achieve better estimation efficiency. Moreover, each estimated spectral density has a concise representation using the …


Power In Pairs: Assessing The Statistical Value Of Paired Samples In Tests For Differential Expression, John R. Stevens, Jennifer S. Herrick, Roger K. Wolff, Martha L. Slattery Dec 2018

Mathematics and Statistics Faculty Publications

Background: When genomics researchers design a high-throughput study to test for differential expression, some biological systems and research questions provide opportunities to use paired samples from subjects, and researchers can plan for a certain proportion of subjects to have paired samples. We consider the effect of this paired samples proportion on the statistical power of the study, using characteristics of both count (RNA-Seq) and continuous (microarray) expression data from a colorectal cancer study.

Results: We demonstrate that a higher proportion of subjects with paired samples yields higher statistical power, for various total numbers of samples, and for various strengths of …


Almost Periodic Functions In Quantum Calculus, Martin Bohner, Jaqueline Godoy Mesquita Dec 2018

Mathematics and Statistics Faculty Research & Creative Works

In this article, we introduce the concepts of Bochner and Bohr almost periodic functions in quantum calculus and show that both concepts are equivalent. Also, we present a correspondence between almost periodic functions defined in quantum calculus and ℕ₀, proving several important properties for this class of functions. We investigate the existence of almost periodic solutions of linear and nonlinear q-difference equations. Finally, we provide some examples of almost periodic functions in quantum calculus.


The Strong Law Of Large Numbers For U-Statistics Under Random Censorship, Jan Höft Dec 2018

Theses and Dissertations

We introduce a semi-parametric U-statistics estimator for randomly right censored data. We will study the strong law of large numbers for this estimator under proper assumptions about the conditional expectation of the censoring indicator with respect to the observed life times. Moreover we will conduct simulation studies, where the semi-parametric estimator is compared to a U-statistic based on the Kaplan-Meier product limit estimator in terms of bias, variance and mean squared error, under different censoring models.


The Compensation For Few Clusters In Clustered Randomized Trials With Binary Outcomes, Lily Stalter Nov 2018

Mathematics & Statistics ETDs

Cluster randomized trials are increasingly popular in epidemiological and medical research. When analyzing the data from such studies it is imperative that the hierarchical structure of the data be taken into account. Multilevel logistic regression is used to analyze clustered data with binary outcomes. Previous literature shows that a greater number of clusters is more important than a large number of subjects per cluster. This paper investigates if it is possible to compensate for the increased bias found for parameter estimates when the number of clusters is decreased. A simulation study was conducted where the absolute percent relative bias for …


An Introduction To Psychological Statistics, Garett C. Foster, David Lane, David Scott, Mikki Hebl, Rudy Guerra, Dan Osherson, Heidi Zimmer Nov 2018

Open Educational Resources Collection

This work has been superseded by Introduction to Statistics in the Psychological Sciences available from https://irl.umsl.edu/oer/25/.

We are constantly bombarded by information, and finding a way to filter that information in an objective way is crucial to surviving this onslaught with your sanity intact. This is what statistics, and logic we use in it, enables us to do. Through the lens of statistics, we learn to find the signal hidden in the noise when it is there and to know when an apparent trend or pattern is really just randomness. The study of statistics involves math and relies …


U.S. College Students’ Social Network Characteristics And Perceived Social Exclusion: A Comparison Between Drinkers And Nondrinkers Based On Pastmonth Alcohol Use, Sara G. Balestrieri, Graham T. Diguiseppi, Matthew Meisel, Melissa A. Clark, Miles Q. Ott, Nancy P. Barnett Oct 2018

Statistical and Data Sciences: Faculty Publications

There is a general perception on college campuses that alcohol use is normative. However, nondrinking students account for 40% of the U.S. college population. With much of the literature focusing on intervening among drinkers, there has been less of a focus on understanding the nondrinker college experience. The current study has two aims: to describe the social network differences between nondrinkers and drinkers in a college setting, and to assess perceived social exclusion among nondrinkers. METHOD: First-year U.S. college students (n = 1,342; 55.3% female; 47.7% non-Hispanic White) were participants in a larger study examining a social network of one college …


Smartphone-Based Prenatal Education For Parents With Preterm Birth Risk Factors, U. Olivia Kim, K. Barnekow, Sheikh Iqbal Ahamed, S. Dreier, C. Jones, M. Taylor, Md Kamrul Hasan, M. A. Basir Oct 2018

Mathematics, Statistics and Computer Science Faculty Research and Publications

Objective

To develop an educational mobile application (app) for expectant parents diagnosed with risk factors for premature birth.

Methods

Parent and medical advisory panels delineated the vision for the app. The app helps prepare for preterm birth. For pilot testing, obstetricians offered the app between 18–22 weeks gestational age to English speaking parents with risk factors for preterm birth. After 4 weeks of use, each participant completed a questionnaire. The software tracked topics accessed and duration of use.

Results

For pilot testing, 31 participants were recruited and 28 completed the questionnaire. After app utilization, participants reported heightened awareness of preterm …


Real-Time Dengue Forecasting In Thailand: A Comparison Of Penalized Regression Approaches Using Internet Search Data, Caroline Kusiak Oct 2018

Masters Theses

Dengue fever affects over 390 million people annually worldwide and is of particular concern in Southeast Asia where it is one of the leading causes of hospitalization. Modeling trends in dengue occurrence can provide valuable information to Public Health officials; however, many challenges arise depending on the data available. In Thailand, reporting of dengue cases is often delayed by more than 6 weeks, and a small fraction of cases may not be reported until over 11 months after they occurred. This study shows that incorporating data on Google Search trends can improve disease predictions in settings with severely …


Quantitative Validation Of Simulated Sea Ice Displacements, Bryan R. McCormick Oct 2018

Mathematics & Statistics ETDs

Accurate simulations of Arctic sea ice are important for forecasting as well as for understanding the global climate. However, quantitative measures for simulation displacements are underutilized. We present five such measures proposed as being useful in the validation of simulated sea ice displacements. Using drifting buoy and satellite measurements of sea ice motion as observation, we apply the metrics in a comparison of observed displacements and predicted displacements from the Arctic sea ice simulation MPM_ice. We find the metric scores are useful for comparing simulations and observations. The metrics also brought to light problems in the simulation MPM_ice, demonstrating their …


A Further Extension Of The Extended Riemann-Liouville Fractional Derivative Operator, Martin Bohner, Gauhar Rahman, Shahid Mubeen, Kottakkaran Sooppy Nisar Sep 2018

Mathematics and Statistics Faculty Research & Creative Works

The main objective of this paper is to establish the extension of an extended fractional derivative operator by using an extended beta function recently defined by Parmar et al. by considering the Bessel functions in its kernel. We also give some results related to the newly defined fractional operator, such as Mellin transform and relations to extended hypergeometric and Appell's function via generating functions.


Semicontinuity Of Betweenness Functions, Paul Bankston, Aisling McCluskey, Richard J. Smith Sep 2018

Mathematics, Statistics and Computer Science Faculty Research and Publications

A ternary relational structure ⟨X, [⋅,⋅,⋅]⟩, interpreting a notion of betweenness, gives rise to the family of intervals, with interval [a,b] being defined as the set of elements of X between a and b. Under very reasonable circumstances, X is also equipped with some topological structure, in such a way that each interval is a closed nonempty subset of X. The question then arises as to the continuity behavior, within the hyperspace context, of the betweenness function {x,y} ↦ [x,y]. We investigate two broad scenarios: the first involves metric spaces and Menger's betweenness interpretation; the second deals with continua and the subcontinuum interpretation.


Yelp’s Review Filtering Algorithm, Yao Yao, Ivelin Angelov, Jack Rasmus-Vorrath, Mooyoung Lee, Daniel W. Engels Aug 2018

SMU Data Science Review

In this paper, we present an analysis of features influencing Yelp's proprietary review filtering algorithm. Classifying or misclassifying reviews as recommended or non-recommended affects average ratings, consumer decisions, and ultimately, business revenue. Our analysis involves systematically sampling and scraping Yelp restaurant reviews. Features are extracted from review metadata and engineered from metrics and scores generated using text classifiers and sentiment analysis. The coefficients of a multivariate logistic regression model were interpreted as quantifications of the relative importance of features in classifying reviews as recommended or non-recommended. The model classified review recommendations with an accuracy of 78%. We found that reviews …


Optimization For LNG Terminals Routing In North China, Shuting Wang Aug 2018

World Maritime University Dissertations

No abstract provided.


Study On The Efficiency Of China’s Main River Ports Based On DEA Model, Yunwu Cao Aug 2018

World Maritime University Dissertations

No abstract provided.


A Math Research Project Inspired By Twin Motherhood, Tiffany N. Kolba Aug 2018

Tiffany N Kolba

The phenomenon of twins, triplets, quadruplets, and other higher order multiples has fascinated humans for centuries and has even captured the attention of mathematicians who have sought to model the probabilities of multiple births. However, there has not been extensive research into the phenomenon of polyovulation, which is one of the biological mechanisms that produces multiple births. In this paper, I describe how my own experience becoming a mother to twins led me on a quest to better understand the scientific processes going on inside my own body and motivated me to conduct research on polyovulation frequencies. An overview of …


The Transmuted Geometric-Quadratic Hazard Rate Distribution: Development, Properties, Characterizations And Applications, Fiaz Ahmad Bhatti, Gholamhossein Hamedani, Mustafa Ç. Korkmaz, Munir Ahmad Aug 2018

Mathematics, Statistics and Computer Science Faculty Research and Publications

We propose a five-parameter transmuted geometric quadratic hazard rate (TG-QHR) distribution derived from a mixture of quadratic hazard rate (QHR), geometric and transmuted distributions via the application of the transmuted geometric-G (TG-G) family of Afify et al. (Pak J Statist 32(2), 139-160, 2016). Some of its structural properties are studied. Moments, incomplete moments, inequality measures, residual life functions and some other properties are theoretically taken up. The TG-QHR distribution is characterized via different techniques. Estimates of the parameters for TG-QHR distribution are obtained using maximum likelihood method. The simulation studies are performed on the basis of graphical results to illustrate the performance …


Automatic Knowledge Extraction From OCR Documents Using Hierarchical Document Analysis, Mohammad Masum, Sai Kosaraju, Tanju Bayramoglu, Girish Modgil, Mingon Kang Aug 2018

Published and Grey Literature from PhD Candidates

Industries can improve their business efficiency by analyzing and extracting relevant knowledge from large numbers of documents. Knowledge extraction manually from large volume of documents is labor intensive, unscalable and challenging. Consequently, there have been a number of attempts to develop intelligent systems to automatically extract relevant knowledge from OCR documents. Moreover, the automatic system can improve the capability of search engine by providing application-specific domain knowledge. However, extracting the efficient information from OCR documents is challenging due to highly unstructured format. In this paper, we propose an efficient framework for a knowledge extraction system that takes keywords based queries …


Confidence Intervals For The Area Under The Receiver Operating Characteristic Curve In The Presence Of Ignorable Missing Data, Hunyong Cho, Gregory J. Matthews, Ofer Harel Aug 2018

Mathematics and Statistics: Faculty Publications and Other Works

Receiver operating characteristic curves are widely used as a measure of accuracy of diagnostic tests and can be summarised using the area under the receiver operating characteristic curve (AUC). Often, it is useful to construct a confidence interval for the AUC; however, because there are a number of different proposed methods to measure variance of the AUC, there are thus many different resulting methods for constructing these intervals. In this article, we compare different methods of constructing Wald‐type confidence interval in the presence of missing data where the missingness mechanism is ignorable. We find that constructing confidence intervals using multiple …


Wald Confidence Intervals For A Single Poisson Parameter And Binomial Misclassification Parameter When The Data Is Subject To Misclassification, Nishantha Janith Chandrasena Poddiwala Hewage Aug 2018

Electronic Theses and Dissertations

This thesis is based on a Poisson model that uses both error-free data and error-prone data subject to misclassification in the form of false-negative and false-positive counts. We present maximum likelihood estimators (MLEs), Fisher's Information, and Wald statistics for Poisson rate parameter and the two misclassification parameters. Next, we invert the Wald statistics to get asymptotic confidence intervals for Poisson rate parameter and false-negative rate parameter. The coverage and width properties for various sample size and parameter configurations are studied via a simulation study. Finally, we apply the MLEs and confidence intervals to one real data set and another realistic …


Empirical Bayesian Approach To Testing Multiple Hypotheses With Separate Priors For Left And Right Alternatives, Naveen K. Bansal, Mehdi Maadooliat, Steven J. Schrodi Aug 2018

Mathematics, Statistics and Computer Science Faculty Research and Publications

We consider a multiple hypotheses problem with directional alternatives in a decision theoretic framework. We obtain an empirical Bayes rule subject to a constraint on mixed directional false discovery rate (mdFDRα) under the semiparametric setting where the distribution of the test statistic is parametric, but the prior distribution is nonparametric. We proposed separate priors for the left tail and right tail alternatives as it may be required for many applications. The proposed Bayes rule is compared through simulation against rules proposed by Benjamini and Yekutieli and Efron. We illustrate the proposed methodology for two sets of …


The Expected Number Of Patterns In A Random Generated Permutation On [n] = {1,2,...,n}, Evelyn Fokuoh Aug 2018

Electronic Theses and Dissertations

Previous work by Flaxman (2004) and Biers-Ariel et al. (2018) focused on the number of distinct words embedded in a string of words of length n. In this thesis, we will extend this work to permutations, focusing on the maximum number of distinct permutations contained in a permutation on [n] = {1,2,...,n} and on the expected number of distinct permutations contained in a random permutation on [n]. We further considered the problem where repetition of subsequences are as a result of the occurrence of (Type A and/or Type B) replications. Our method of enumerating the Type A replications causes double …


Dynamics Of Paramagnetic And Ferromagnetic Ellipsoidal Particles In Shear Flow Under A Uniform Magnetic Field, Christopher A. Sobecki, Jie Zhang, Yanzhi Zhang, Cheng Wang Aug 2018

Mathematics and Statistics Faculty Research & Creative Works

We investigate the two-dimensional dynamic motion of magnetic particles of ellipsoidal shapes in shear flow under the influence of a uniform magnetic field. In the first part, we present a theoretical analysis of the rotational dynamics of the particles in simple shear flow. By considering paramagnetic and ferromagnetic particles, we study the effects of the direction and strength of the magnetic field on the particle rotation. The critical magnetic-field strength, at which particle rotation is impeded, is determined. In a weak-field regime (i.e., below the critical strength) where the particles execute complete rotations, the symmetry property of the rotational velocity …


A Math Research Project Inspired By Twin Motherhood, Tiffany N. Kolba Jul 2018

Journal of Humanistic Mathematics

The phenomenon of twins, triplets, quadruplets, and other higher order multiples has fascinated humans for centuries and has even captured the attention of mathematicians who have sought to model the probabilities of multiple births. However, there has not been extensive research into the phenomenon of polyovulation, which is one of the biological mechanisms that produces multiple births. In this paper, I describe how my own experience becoming a mother to twins led me on a quest to better understand the scientific processes going on inside my own body and motivated me to conduct research on polyovulation frequencies. An overview of …


Excess Versions Of The Minkowski And Hölder Inequalities, Iosif Pinelis Jul 2018

Iosif Pinelis

No abstract provided.


Asymptotic Behavior Of The Random Logistic Model And Of Parallel Bayesian Logspline Density Estimators, Konstandinos Kotsiopoulos Jul 2018

Doctoral Dissertations

This dissertation is comprised of two separate projects. The first concerns a Markov chain called the Random Logistic Model. For r in (0,4] and x in [0,1], the logistic map f_r(x) = rx(1 - x) defines, for positive integer t, the dynamical system x_r(t + 1) = f_r(x_r(t)) on [0,1], where x_r(1) = x. The interplay between this dynamical system and the Markov chain x_{r,N}(t), defined by perturbing the logistic map by truncated Gaussian noise scaled by N^(-1/2), where N → ∞, is studied. A natural question is …
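
As a rough illustration of the objects named in this abstract, the sketch below iterates the deterministic logistic map and a noisy counterpart. The abstract is truncated and does not say how the Gaussian noise is truncated, so the rejection step that keeps each state in [0,1], the `sigma` parameter, and all function names are assumptions made only for this example, not the dissertation's construction.

```python
import random


def logistic_map(r, x):
    """One step of the logistic map f_r(x) = r x (1 - x)."""
    return r * x * (1.0 - x)


def deterministic_orbit(r, x0, steps):
    """Return [x(1), ..., x(steps)] with x(1) = x0 and x(t + 1) = f_r(x(t))."""
    orbit = [x0]
    for _ in range(steps - 1):
        orbit.append(logistic_map(r, orbit[-1]))
    return orbit


def noisy_orbit(r, x0, steps, n, sigma=1.0, seed=0):
    """Perturb each step by Gaussian noise scaled by n**-0.5.

    Assumption: "truncated" is modelled by redrawing the noise until the
    new state lies in [0, 1]; the source does not specify the mechanism.
    """
    rng = random.Random(seed)
    scale = sigma / n ** 0.5
    orbit = [x0]
    for _ in range(steps - 1):
        mean = logistic_map(r, orbit[-1])  # lies in [0, 1] for r in (0, 4]
        while True:
            candidate = mean + scale * rng.gauss(0.0, 1.0)
            if 0.0 <= candidate <= 1.0:
                break
        orbit.append(candidate)
    return orbit


if __name__ == "__main__":
    print(deterministic_orbit(3.7, 0.3, 5))
    print(noisy_orbit(3.7, 0.3, 5, n=10_000))
```

As `n` grows the noise scale shrinks, so over a fixed number of steps the noisy orbit stays closer to the deterministic one; the long-run comparison is the kind of question the abstract raises.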


Cancerin: A Computational Pipeline To Infer Cancer-Associated ceRNA Interaction Networks, Duc Do, Serdar Bozdag Jul 2018

Mathematics, Statistics and Computer Science Faculty Research and Publications

MicroRNAs (miRNAs) inhibit expression of target genes by binding to their RNA transcripts. It has been recently shown that RNA transcripts targeted by the same miRNA could “compete” for the miRNA molecules and thereby indirectly regulate each other. Experimental evidence has suggested that the aberration of such miRNA-mediated interaction between RNAs—called competing endogenous RNA (ceRNA) interaction—can play important roles in tumorigenesis. Given the difficulty of deciphering context-specific miRNA binding, and the existence of various gene regulatory factors such as DNA methylation and copy number alteration, inferring context-specific ceRNA interactions accurately is a computationally challenging task. Here we propose a computational …


Mathematical Models, Patty Wagner, Marnie Phipps Jul 2018

Mathematics Grants Collections

This Grants Collection for Mathematical Models was created under a Round Nine ALG Textbook Transformation Grant.

Affordable Learning Georgia Grants Collections are intended to provide faculty with the frameworks to quickly implement or revise the same materials as a Textbook Transformation Grants team, along with the aims and lessons learned from project teams during the implementation process.

Documents are in .pdf format, with a separate .docx (Word) version available for download. Each collection contains the following materials:

  • Linked Syllabus
  • Initial Proposal
  • Final Report


A Note On Sum, Difference, Product And Ratio Of Kumaraswamy Random Variables, Avishek Mallick, Indranil Ghosh, Gholamhossein G. Hamedani Jul 2018

Mathematics, Statistics and Computer Science Faculty Research and Publications

Explicit expressions for the densities of S = X1 + X2, D = X1 − X2, P = X1X2 and R = X1/X2 are derived when X1 and X2 are independent or sub-independent Kumaraswamy random variables. The expressions appear to involve the incomplete gamma functions. Some possible real life scenarios are mentioned in which such quantities might be of interest.
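
The paper derives these densities in closed form; as a complementary and much cruder check, the quantities can be simulated. The sketch below draws independent Kumaraswamy variates by inverting the standard CDF F(x) = 1 − (1 − x^a)^b on (0,1) and forms the sum, difference, product and ratio. It is a Monte Carlo illustration, not the authors' method; the function names and parameter values are assumptions, and only the independent case is covered.

```python
import random


def sample_kumaraswamy(a, b, rng):
    """Draw one Kumaraswamy(a, b) variate by inverting F(x) = 1 - (1 - x**a)**b."""
    u = rng.random()
    return (1.0 - (1.0 - u) ** (1.0 / b)) ** (1.0 / a)


def simulate(a1, b1, a2, b2, n, seed=0):
    """Simulate n independent pairs (X1, X2) and return draws of S, D, P, R."""
    rng = random.Random(seed)
    draws = {"S": [], "D": [], "P": [], "R": []}
    while len(draws["S"]) < n:
        x1 = sample_kumaraswamy(a1, b1, rng)
        x2 = sample_kumaraswamy(a2, b2, rng)
        if x2 == 0.0:
            continue  # skip the (practically never occurring) zero denominator
        draws["S"].append(x1 + x2)  # sum
        draws["D"].append(x1 - x2)  # difference
        draws["P"].append(x1 * x2)  # product
        draws["R"].append(x1 / x2)  # ratio
    return draws


if __name__ == "__main__":
    out = simulate(2.0, 3.0, 2.0, 3.0, n=10_000)
    for name, values in out.items():
        print(name, sum(values) / len(values))
```

Histograms of the simulated draws could be compared against the paper's explicit densities; with a = b = 1 the Kumaraswamy distribution reduces to Uniform(0,1), which gives a simple sanity check.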


Reality Versus Grant Application Research “Plans”, Linda Burhansstipanov, Linda U. Krebs, Daniel Petereit, Mark Dignan, Sheikh Iqbal Ahamed, Michele Sargent, Krisin Cina, Kimberly Crawford, Doris Thibeault, Simone Bordeaux, Shalini Kanekar, Golam Mushih Tanimul Ahsan, Drew Williams, Ivor D. Addo Jul 2018

Mathematics, Statistics and Computer Science Faculty Research and Publications

This article describes the implementation of the American Indian mHealth Smoking Dependence Study focusing on the differences between what was written in the grant application compared to what happened in reality. The study was designed to evaluate a multicomponent intervention involving 256 participants randomly assigned to one of 15 groups. Participants received either a minimal or an intense level of four intervention components: (1) nicotine replacement therapy, (2) precessation counseling, (3) cessation counseling, and (4) mHealth text messaging. The project team met via biweekly webinars as well as one to two in-person meetings per year throughout the study. The project …