Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics

PDF

2019

Institution
Keyword
Publication
Publication Type

Articles 1 - 30 of 141

Full-Text Articles in Physical Sciences and Mathematics

Personalized Detection Of Anxiety Provoking News Events Using Semantic Network Analysis, Jacquelyn Cheun Phd, Luay Dajani, Quentin B. Thomas Dec 2019

Personalized Detection Of Anxiety Provoking News Events Using Semantic Network Analysis, Jacquelyn Cheun Phd, Luay Dajani, Quentin B. Thomas

SMU Data Science Review

In the age of hyper-connectivity, 24/7 news cycles, and instant news alerts via social media, mental health researchers don't have a way to automatically detect news content which is associated with triggering anxiety or depression in mental health patients. Using the Associated Press news wire, a semantic network was built with 1,056 news articles containing over 500,000 connections across multiple topics to provide a personalized algorithm which detects problematic news content for a given reader. We make use of Semantic Network Analysis to surface the relationship between news article text and anxiety in readers who struggle with mental health disorders. …


Statistical Analysis Of Social Network Change, Teresa Danielle Schmidt Dec 2019

Statistical Analysis Of Social Network Change, Teresa Danielle Schmidt

Dissertations and Theses

This project explores two statistical methods that infer social network structures and statistically test those structures for change over time: regression-based differential network analysis (R-DNA) and information theory-based differential analysis (I-DNA). R-DNA is adapted from bioinformatics and I-DNA employs reconstructability analysis.

This project applies both R-DNA and I-DNA to analyze Medicaid claims data from one-year periods before (May 2011- Apr 2012) and after (Jan 2013-Dec 2013) the formation of the Health Share of Oregon Coordinated Care Organization (CCO). The formation of CCOs was legislated by the state of Oregon in 2012 with the triple aim of improving health outcomes, reducing …


Function Space Tensor Decomposition And Its Application In Sports Analytics, Justin Reising Dec 2019

Function Space Tensor Decomposition And Its Application In Sports Analytics, Justin Reising

Electronic Theses and Dissertations

Recent advancements in sports information and technology systems have ushered in a new age of applications of both supervised and unsupervised analytical techniques in the sports domain. These automated systems capture large volumes of data points about competitors during live competition. As a result, multi-relational analyses are gaining popularity in the field of Sports Analytics. We review two case studies of dimensionality reduction with Principal Component Analysis and latent factor analysis with Non-Negative Matrix Factorization applied in sports. Also, we provide a review of a framework for extending these techniques for higher order data structures. The primary scope of this …


Evaluation Of Relationship Between Lead-Dust Loading, Lead-Dust Concentration, And Total Dust Loading Metrics Across Multiple Data Sets, Charles Bevington Dec 2019

Evaluation Of Relationship Between Lead-Dust Loading, Lead-Dust Concentration, And Total Dust Loading Metrics Across Multiple Data Sets, Charles Bevington

Capstone Experience

Lead-dust monitoring studies report values as either lead-dust loadings µg/ft2 or as lead-dust concentrations µg/g. It is rare for studies to report both metrics. When only lead-dust loading values are present, professionals require an approach to estimate lead-dust concentration values. A literature search identified five studies that contained raw data for both lead-dust loading and lead-dust concentration. An additional thirty-two studies had summary-statistics available for both lead-dust loading and lead-dust concentration. Studies with raw-data were used to develop an empirically-based loading to concentration statistical relationship. Raw data sets were critically evaluated to determine whether elimination or …


Communications And Methodologies In Crime Geography: Contemporary Approaches To Disseminating Criminal Incidence And Research, Mitchell Ogden Dec 2019

Communications And Methodologies In Crime Geography: Contemporary Approaches To Disseminating Criminal Incidence And Research, Mitchell Ogden

Electronic Theses and Dissertations

Many tools exist to assist law enforcement agencies in mitigating criminal activity. For centuries, academics used statistics in the study of crime and criminals, and more recently, police departments make use of spatial statistics and geographic information systems in that pursuit. Clustering and hot spot methods of analysis are popular in this application for their relative simplicity of interpretation and ease of process. With recent advancements in geospatial technology, it is easier than ever to publicly share data through visual communication tools like web applications and dashboards. Sharing data and results of analyses boosts transparency and the public image of …


Habitat Associations And Reproduction Of Fishes On The Northwestern Gulf Of Mexico Shelf Edge, Elizabeth Marie Keller Nov 2019

Habitat Associations And Reproduction Of Fishes On The Northwestern Gulf Of Mexico Shelf Edge, Elizabeth Marie Keller

LSU Doctoral Dissertations

Several of the northwestern Gulf of Mexico (GOM) shelf-edge banks provide critical hard bottom habitat for coral and fish communities, supporting a wide diversity of ecologically and economically important species. These sites may be fish aggregation and spawning sites and provide important habitat for fish growth and reproduction. Already designated as habitat areas of particular concern, many of these banks are also under consideration for inclusion in the expansion of the Flower Garden Banks National Marine Sanctuary. This project aimed to gain a more comprehensive understanding of the communities and fish species on shelf-edge banks by way of gonad histology, …


Economic Design Of Acceptance Sampling Plans For Truncated Life Tests Using Three-Parameter Lindley Distribution, Amer Ibrahim Al-Omari, Enrico Ciavolino, Amjad D. Al-Nasser Nov 2019

Economic Design Of Acceptance Sampling Plans For Truncated Life Tests Using Three-Parameter Lindley Distribution, Amer Ibrahim Al-Omari, Enrico Ciavolino, Amjad D. Al-Nasser

Journal of Modern Applied Statistical Methods

A single acceptance sampling plan for the three-parameter Lindley distribution under a truncated life test is developed. For various consumer’s confidence levels, acceptance numbers, and values of the ratio of the experimental time to the specified average lifetime, the minimum sample size important to assert a certain average lifetime are calculated. The operating characteristic (OC) function values as well as the associated producer’s risks are also provided. A numerical example is presented to illustrate the suggested acceptance sampling plans.


The Graphs That Have Antivoltages Using Groups Of Small Order, Vaidy Sivaraman, Dan Slilaty Nov 2019

The Graphs That Have Antivoltages Using Groups Of Small Order, Vaidy Sivaraman, Dan Slilaty

Mathematics and Statistics Faculty Publications

Given a group Γ of order at most six, we characterize the graphs that have Γ-antivoltages and also determine the list of minor-minimal graphs that have no Γ-antivoltage. Our characterizations yield polynomial-time recognition algorithms for such graphs.


Quantifying Distribution In Carbon Uptake Across A Global Measurement Network Of Terrestrial Ecosystems, John Zobitz, Madeline Oswood Oct 2019

Quantifying Distribution In Carbon Uptake Across A Global Measurement Network Of Terrestrial Ecosystems, John Zobitz, Madeline Oswood

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


A Study On Discrete And Discrete Fractional Pharmacokinetics-Pharmacodynamics Models For Tumor Growth And Anti-Cancer Effects, Ferhan Atici, Ngoc Nguyen Oct 2019

A Study On Discrete And Discrete Fractional Pharmacokinetics-Pharmacodynamics Models For Tumor Growth And Anti-Cancer Effects, Ferhan Atici, Ngoc Nguyen

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Classification Of Coronary Artery Disease In Non-Diabetic Patients Using Artificial Neural Networks, Demond Handley Oct 2019

Classification Of Coronary Artery Disease In Non-Diabetic Patients Using Artificial Neural Networks, Demond Handley

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Statistical Modeling And Characterization Of Induced Seismicity Within The Western Canada Sedimentary Basin, Sid Kothari Oct 2019

Statistical Modeling And Characterization Of Induced Seismicity Within The Western Canada Sedimentary Basin, Sid Kothari

Electronic Thesis and Dissertation Repository

In western Canada, there has been an increase in seismic activity linked to anthropogenic energy-related operations including conventional hydrocarbon production, wastewater fluid injection and more recently hydraulic fracturing (HF). Statistical modeling and characterization of the space, time and magnitude distributions of the seismicity clusters is vital for a better understanding of induced earthquake processes and development of predictive models. In this work, a statistical analysis of the seismicity in the Western Canada Sedimentary Basin was performed across past and present time periods by utilizing a compiled earthquake catalogue for Alberta and eastern British Columbia. Specifically, the frequency-magnitude statistics were analyzed …


A “How-To” Manual For Doing Standard Statistics In R, Elizabeth Newton Oct 2019

A “How-To” Manual For Doing Standard Statistics In R, Elizabeth Newton

OER Textbooks

This “How To….” Manual is intended to assist the new user in implementing standard statistical methods, both parametric and non-parametric, using R statistical software. Its focus is on R implementation, not statistical theory. It includes the R commands, with examples, for the following: proportion tests, t-tests, ANOVA, variance tests, several correlation measures and regression models, Mann-Whitney-Wilcoxon tests, Kruskal-Wallis tests, chi-squared tests, multiple pairwise comparisons and effect sizes. Basic graphical methods are also illustrated.

[See note on 2024 update below.]


Generating Electromagnetic Schell-Model Sources Using Complex Screens With Spatially Varying Auto- And Cross-Correlation Functions, Milo W. Hyde Iv Sep 2019

Generating Electromagnetic Schell-Model Sources Using Complex Screens With Spatially Varying Auto- And Cross-Correlation Functions, Milo W. Hyde Iv

Faculty Publications

We present a method to generate any physically realizable electromagnetic Schell-model source. Our technique can be directly implemented on existing vector-beam generators that utilize spatial light modulators for coherence control, beam shaping, and relative phasing. This work significantly extends published research on the subject, where control over the partially coherent source’s cross-spectral density matrix was limited. We begin by presenting the statistical optics theory necessary to derive and implement our method. We then apply our technique, both analytically and in simulation, to produce two electromagnetic Schell-model sources from the literature. We demonstrate control over the full cross-spectral density matrices of …


Utilization Of Statistics For Provision Of Business Information: Implementation Of Α-Sutte Indicator On Provision Of Stock Movement Prediction Information, Nuning Kurniasih, Ansari Saleh Ahmar, Nanik Kurniawati Sep 2019

Utilization Of Statistics For Provision Of Business Information: Implementation Of Α-Sutte Indicator On Provision Of Stock Movement Prediction Information, Nuning Kurniasih, Ansari Saleh Ahmar, Nanik Kurniawati

Library Philosophy and Practice (e-journal)

The Current information services are dealing with big data that is freely accessible. Companies providing information services and products need to develop creativity and innovation to maintain their existence. In this paper, we offer that information specialist can add value to information. The added value is given through an analysis of information that is relevant to user needs. The Research and Development Method can be used to develop a framework for service information products and services, and bridge the gap between the theories studied in higher education and the needs of the industry. α-Sutte Indicator can be used to predict …


Student Insights Report, Fall 2019, The Center For Student Analytics Sep 2019

Student Insights Report, Fall 2019, The Center For Student Analytics

Publications

For the past three years, the staff of the Center for Student Analytics have worked to discover and expose meaningful, data-informed insights into what helps students succeed at Utah State University. The following pages highlight 20 of the most useful insights we found provided here in small sets that will be useful to students, faculty, staff, university leadership, parents, and even prospective students. As you explore this report, we encourage you to see the student data as a window into USU itself. While big data helps us understand how individual students are performing, it tells us a great deal more …


The Estimation Of Missing Values In Rectangular Lattice Designs, Emmanuel Ogochukwu Ossai, Abimibola Victoria Oladugba Sep 2019

The Estimation Of Missing Values In Rectangular Lattice Designs, Emmanuel Ogochukwu Ossai, Abimibola Victoria Oladugba

Journal of Modern Applied Statistical Methods

Algebraic expressions for estimating missing data when one or more observation(s) are missing in Rectangular lattice designs with repetition were derived using the method of minimizing the residual sum of squares. Results showed that the estimated value(s) were significantly approximate to that of the actual value(s).


Rplidar A2 Accuracy, Ramiro O. Garcia Sep 2019

Rplidar A2 Accuracy, Ramiro O. Garcia

STAR Program Research Presentations

Traffic is not only a source of frustration but also a leading cause of death for people under 35 years of age. Recent research has focused on how driver assistance technology can be used to mitigate traffic fatalities and create more enjoyable commutes. In addition, self-driving vehicles can reduce fuel consumption the amount by 5% and increases the number of cars on the highway. To achieve this we need to research reliable sensors. This summer I research Rplidar A2 sensor which hopefully will be responsible for recording distance to the preceding car and helping prevent Insider Attacks or Misbehaviors of …


Predicting Wind Turbine Blade Erosion Using Machine Learning, Casey Martinez, Festus Asare Yeboah, Scott Herford, Matt Brzezinski, Viswanath Puttagunta Aug 2019

Predicting Wind Turbine Blade Erosion Using Machine Learning, Casey Martinez, Festus Asare Yeboah, Scott Herford, Matt Brzezinski, Viswanath Puttagunta

SMU Data Science Review

Using time-series data and turbine blade inspection assessments, we present a classification model in order to predict remaining turbine blade life in wind turbines. Capturing the kinetic energy of wind requires complex mechanical systems, which require sophisticated maintenance and planning strategies. There are many traditional approaches to monitoring the internal gearbox and generator, but the condition of turbine blades can be difficult to measure and access. Accurate and cost- effective estimates of turbine blade life cycles will drive optimal investments in repairs and improve overall performance. These measures will drive down costs as well as provide cheap and clean electricity …


Texture-Based Deep Neural Network For Histopathology Cancer Whole Slide Image (Wsi) Classification, Nelson Zange Tsaku Aug 2019

Texture-Based Deep Neural Network For Histopathology Cancer Whole Slide Image (Wsi) Classification, Nelson Zange Tsaku

Master of Science in Computer Science Theses

Automatic histopathological Whole Slide Image (WSI) analysis for cancer classification has been highlighted along with the advancements in microscopic imaging techniques. However, manual examination and diagnosis with WSIs is time-consuming and tiresome. Recently, deep convolutional neural networks have succeeded in histopathological image analysis. In this paper, we propose a novel cancer texture-based deep neural network (CAT-Net) that learns scalable texture features from histopathological WSIs. The innovation of CAT-Net is twofold: (1) capturing invariant spatial patterns by dilated convolutional layers and (2) Reducing model complexity while improving performance. Moreover, CAT-Net can provide discriminative texture patterns formed on cancerous regions of histopathological …


Identifying Risk Factors Related To Premature Birth Through Binary Logistic And Proportional Odds Ordinal Logistic Regression, Clayton Elwood Aug 2019

Identifying Risk Factors Related To Premature Birth Through Binary Logistic And Proportional Odds Ordinal Logistic Regression, Clayton Elwood

Electronic Theses and Dissertations

Premature birth has been identified as the single greatest cause of death worldwide in children under the age of five. This thesis will implement binary logistic regression and proportional odds ordinal logistic regression to predict different levels of premature birth and identify associated risk factors. The models will be built from the Center for Disease Control and Prevention's 2014 Vital Statistics Natality Birth Data containing nearly 4 million live births within the United States. Odds ratios and confidence intervals on risk factors were produced utilizing binary logistic regression.


Garch Modeling Of Value At Risk And Expected Shortfall Using Bayesian Model Averaging, Ismail Kheir Aug 2019

Garch Modeling Of Value At Risk And Expected Shortfall Using Bayesian Model Averaging, Ismail Kheir

Theses and Dissertations

This thesis conducts Value at Risk (VaR) and Expected Shortfall (ES) estimation using GARCH modeling and Bayesian Model Averaging (BMA). BMA considers multiple models weighted by some information criterion. Through BMA, this thesis finds that VaR and ES estimates can be improved through enhanced modeling of the data generation process.


Spatio-Temporal Analysis Of Tree Ring Chronology And Precipitation, Ruizhe Yin Aug 2019

Spatio-Temporal Analysis Of Tree Ring Chronology And Precipitation, Ruizhe Yin

Graduate Theses and Dissertations

Tree ring chronology data is known to reflect regional climate due to the strong impact of rainfall and temperature. Therefore, tree ring data can be used to reconstruct historical climate in order to understand how climate changed in the past and make prediction about the future behavior of the climate. For simplicity, this research only considers the influence of precipitation on tree ring growth within the New England area. A total of 94 measurement sites are used to record tree ring width over 881 years and corresponding precipitation data are given at some locations for 121 years. We developed a …


Probabilistic Models For Order-Picking Operations With Multiple In-The-Aisle Pick Positions, Jingming Liu Aug 2019

Probabilistic Models For Order-Picking Operations With Multiple In-The-Aisle Pick Positions, Jingming Liu

Graduate Theses and Dissertations

The development of probability density functions (pdfs) for travel time of a narrow aisle lift truck (NALT) and an automated storage and retrieval (AS/R) machine is the focus of the dissertation. The multiple in-the-aisle pick positions (MIAPP) order picking system can be modeled as an M/G/1 queueing problem in which storage and retrieval requests are the customers and the vehicle (NALT or AS/R machine) is the server. Service time is the sum of travel time and the deterministic time to pick up and deposit a pallet (TPD).

Our first contribution is the development of travel time pdfs for retrieval operations …


Optimal Design For A Causal Structure, Zaher Kmail Aug 2019

Optimal Design For A Causal Structure, Zaher Kmail

Department of Statistics: Dissertations, Theses, and Student Work

Linear models and mixed models are important statistical tools. But in many natural phenomena, there is more than one endogenous variable involved and these variables are related in a sophisticated way. Structural Equation Modeling (SEM) is often used to model the complex relationships between the endogenous and exogenous variables. It was first implemented in research to estimate the strength and direction of direct and indirect effects among variables and to measure the relative magnitude of each causal factor.

Historically, traditional optimal design theory focuses on univariate linear, nonlinear, and mixed models. There is no current literature on the subject of …


Choose Your Own Adventure: An Analysis Of Interactive Gamebooks Using Graph Theory, D'Andre Adams, Daniela Beckelhymer, Alison Marr Jul 2019

Choose Your Own Adventure: An Analysis Of Interactive Gamebooks Using Graph Theory, D'Andre Adams, Daniela Beckelhymer, Alison Marr

Journal of Humanistic Mathematics

"BEWARE and WARNING! This book is different from other books. You and YOU ALONE are in charge of what happens in this story." This is the captivating introduction to every book in the interactive novel series, Choose Your Own Adventure (CYOA). Our project uses the mathematical field of graph theory to analyze forty books from the CYOA book series for ages 9-12. We first began by drawing the digraphs of each book. Then we analyzed these digraphs by collecting structural data such as longest path length (i.e. longest story length) and number of vertices with outdegree zero (i.e. number …


Constraining The Oxygen Values Of The Late Cretaceous Western Interior Seaway Using Marine Bivalves, Camille H. Dwyer Jul 2019

Constraining The Oxygen Values Of The Late Cretaceous Western Interior Seaway Using Marine Bivalves, Camille H. Dwyer

Earth and Planetary Sciences ETDs

The Western Interior Seaway (WIS) remains an oceanographic enigma, including its circulation, similarity to the open ocean, and the fidelity of geochemical proxies to reconstruct paleoenvironments. Across the late Campanian and early Maastrichtian I test whether: 1) the WIS had unique δ18OVPDB compared to other marine settings, 2) increasing oceanographic restriction changed the stable isotope composition, and 3) biases, e.g., taxonomy or diagenesis, influenced stable isotope compositions. Results indicate distinct δ18OVPDB in the WIS compared to other marine settings. δ18OVPDB values were stable through time, suggesting insignificant oceanographic restriction and a …


Stability Of Single-Parent Gene Expression Complementation In Maize Hybrids Upon Water Deficit Stress, Caroline Marcon, Anja Paschold, Waqas Ahmed Malik, Andrew Lithio, Jutta A. Baldauf, Lena Altrogge, Nina Opitz, Christa Lanz, Heiko Schoof, Dan Nettleton, Hans-Peter Piepho, Frank Hochholdinger Jul 2019

Stability Of Single-Parent Gene Expression Complementation In Maize Hybrids Upon Water Deficit Stress, Caroline Marcon, Anja Paschold, Waqas Ahmed Malik, Andrew Lithio, Jutta A. Baldauf, Lena Altrogge, Nina Opitz, Christa Lanz, Heiko Schoof, Dan Nettleton, Hans-Peter Piepho, Frank Hochholdinger

Dan Nettleton

Heterosis is the superior performance of F1 hybrids compared with their homozygous, genetically distinct parents. In this study, we monitored the transcriptomic divergence of the maize (Zea mays) inbred lines B73 and Mo17 and their reciprocal F1 hybrid progeny in primary roots under control and water deficit conditions simulated by polyethylene glycol treatment. Single-parent expression (SPE) of genes is an extreme instance of gene expression complementation, in which genes are active in only one of two parents but are expressed in both reciprocal hybrids. In this study, 1,997 genes only expressed in B73 and 2,024 genes …


Genomic Neighborhoods For Arabidopsisretrotransposons: A Role For Targeted Integration In The Distribution Of The Metaviridae, Brooke D. Peterson-Burch, Dan Nettleton, Daniel F. Voytas Jul 2019

Genomic Neighborhoods For Arabidopsisretrotransposons: A Role For Targeted Integration In The Distribution Of The Metaviridae, Brooke D. Peterson-Burch, Dan Nettleton, Daniel F. Voytas

Dan Nettleton

Background: Retrotransposons are an abundant component of eukaryotic genomes. The high quality of the Arabidopsis thaliana genome sequence makes it possible to comprehensively characterize retroelement populations and explore factors that contribute to their genomic distribution.

Results: We identified the full complement of A. thaliana long terminal repeat (LTR) retroelements using RetroMap, a software tool that iteratively searches genome sequences for reverse transcriptases and then defines retroelement insertions. Relative ages of full-length elements were estimated by assessing sequence divergence between LTRs: the Pseudoviridae were significantly younger than the Metaviridae. All retroelement insertions were mapped onto the genome sequence and their distribution …


Allocative Poisson Factorization For Computational Social Science, Aaron Schein Jul 2019

Allocative Poisson Factorization For Computational Social Science, Aaron Schein

Doctoral Dissertations

Social science data often comes in the form of high-dimensional discrete data such as categorical survey responses, social interaction records, or text. These data sets exhibit high degrees of sparsity, missingness, overdispersion, and burstiness, all of which present challenges to traditional statistical modeling techniques. The framework of Poisson factorization (PF) has emerged in recent years as a natural way to model high-dimensional discrete data sets. This framework assumes that each observed count in a data set is a Poisson random variable $y ~ Pois(\mu)$ whose rate parameter $\mu$ is a function of shared model parameters. This thesis examines a specific …