Machine Learning Pipeline For Exoplanet Classification, 2019 Southern Methodist University
Machine Learning Pipeline For Exoplanet Classification, George Clayton Sturrock, Brychan Manry, Sohail Rafiqi
SMU Data Science Review
Planet identification has typically been a tasked performed exclusively by teams of astronomers and astrophysicists using methods and tools accessible only to those with years of academic education and training. NASA’s Exoplanet Exploration program has introduced modern satellites capable of capturing a vast array of data regarding celestial objects of interest to assist with researching these objects. The availability of satellite data has opened up the task of planet identification to individuals capable of writing and interpreting machine learning models. In this study, several classification models and datasets are utilized to assign a probability of an observation being an exoplanet. …
Leveraging Natural Language Processing Applications And Microblogging Platform For Increased Transparency In Crisis Areas, 2019 Southern Methodist University
Leveraging Natural Language Processing Applications And Microblogging Platform For Increased Transparency In Crisis Areas, Ernesto Carrera-Ruvalcaba, Johnson Ekedum, Austin Hancock, Ben Brock
SMU Data Science Review
Through microblogging applications, such as Twitter, people actively document their lives even in times of natural disasters such as hurricanes and earthquakes. While first responders and crisis-teams are able to help people who call 911, or arrive at a designated shelter, there are vast amounts of information being exchanged online via Twitter that provide real-time, location-based alerts that are going unnoticed. To effectively use this information, the Tweets must be verified for authenticity and categorized to ensure that the proper authorities can be alerted. In this paper, we create a Crisis Message Corpus from geotagged Tweets occurring during 7 hurricanes …
Predictive Distributions Via Filtered Historical Simulation For Financial Risk Management, 2019 Utah State University
Predictive Distributions Via Filtered Historical Simulation For Financial Risk Management, Tyson Clark
All Graduate Plan B and other Reports, Spring 1920 to Spring 2023
Filtered historical simulation with an underlying GARCH process can be used as a valuable tool in VaR analysis, as it derives risk estimates that are sensitive to the distributional properties of the historical data of the produced predictive density. I examine the applications to risk analysis that filtered historical simulation can provide, as well as an interpretation of the predictive density as a poor man’s Bayesian posterior distribution. The predictive density allows us to make associated probabilistic statements regarding the results for VaR analysis, giving greater measurement of risk and the ability to maintain the optimal level of risk per …
Generalizations Of The Arcsine Distribution, 2019 East Tennessee State University
Generalizations Of The Arcsine Distribution, Rebecca Rasnick
Electronic Theses and Dissertations
The arcsine distribution looks at the fraction of time one player is winning in a fair coin toss game and has been studied for over a hundred years. There has been little further work on how the distribution changes when the coin tosses are not fair or when a player has already won the initial coin tosses or, equivalently, starts with a lead. This thesis will first cover a proof of the arcsine distribution. Then, we explore how the distribution changes when the coin the is unfair. Finally, we will explore the distribution when one person has won the first …
Ergodicity For The 3d Stochastic Navier-Stokes Equations Perturbed By Lévy Noise, 2019 Air Force Institute of Technology
Ergodicity For The 3d Stochastic Navier-Stokes Equations Perturbed By Lévy Noise, Manil T. Mohan, K. Sakthivel, Sivaguru S. Sritharan
Faculty Publications
In this work we construct a Markov family of martingale solutions for 3D stochastic Navier–Stokes equations (SNSE) perturbed by Lévy noise with periodic boundary conditions. Using the Kolmogorov equations of integrodifferential type associated with the SNSE perturbed by Lévy noise, we construct a transition semigroup and establish the existence of a unique invariant measure. We also show that it is ergodic and strongly mixing.
Abstract © Wiley.
Dynamic Attribute-Level Best Worst Discrete Choice Experiments, 2019 Old Dominion University
Dynamic Attribute-Level Best Worst Discrete Choice Experiments, Amanda Working, Mohammed Alqawba, Norou Diawara
Mathematics & Statistics Faculty Publications
Dynamic modelling of decision maker choice behavior of best and worst in discrete choice experiments (DCEs) has numerous applications. Such models are proposed under utility function of decision maker and are used in many areas including social sciences, health economics, transportation research, and health systems research. After reviewing references on the study of such experiments, we present example in DCE with emphasis on time dependent best-worst choice and discrimination between choice attributes. Numerical examples of the dynamic DCEs are simulated, and the associated expected utilities over time of the choice models are derived using Markov decision processes. The estimates are …
Best Probable Subset: A New Method For Reducing Data Dimensionality In Linear Regression, 2019 Florida International University
Best Probable Subset: A New Method For Reducing Data Dimensionality In Linear Regression, Elieser Nodarse
FIU Electronic Theses and Dissertations
Regression is a statistical technique for modeling the relationship between a dependent variable Y and two or more predictor variables, also known as regressors. In the broad field of regression, there exists a special case in which the relationship between the dependent variable and the regressor(s) is linear. This is known as linear regression.
The purpose of this paper is to create a useful method that effectively selects a subset of regressors when dealing with high dimensional data and/or collinearity in linear regression. As the name depicts it, high dimensional data occurs when the number of predictor variables is far …
Dice Mythbusters, 2019 Western Kentucky University
Dice Mythbusters, C. Warren Campbell, William P. Dolan
Student Research Conference Select Presentations
All dice are unfair because they cannot be manufactured with absolute precision. However, some dice are more unfair than others. Each year hundreds of millions of dice are sold worldwide. Dice commonly used in role playing games are 4-sided (D4), 6-sided (D6), 8-sided (D8), 10-sided (D10), 12-sided (D12), and 20-sided (D20). Most of these are manufactured using plastic mold injection and rock tumbler methods. This method can result in dimensional inaccuracies in the dice and sometimes density inhomogeneities. In 3000-roll tests of eleven D20 dice only three tested fair. In a running chi square test it was shown that for …
Optimal Conditional Expectation At The Video Poker Game Jacks Or Better, 2019 University of Utah
Optimal Conditional Expectation At The Video Poker Game Jacks Or Better, Stewart N. Ethier, John J. Kim, Jiyeon Lee
UNLV Gaming Research & Review Journal
There are 134,459 distinct initial hands at the video poker game Jacks or Better, taking suit exchangeability into account. A computer program can determine the optimal strategy (i.e., which cards to hold) for each such hand, but a complete list of these strategies would require a book-length manuscript. Instead, a hand-rank table, which fits on a single page and reproduces the optimal strategy perfectly, was found for Jacks or Better as early as the mid 1990s. Is there a systematic way to derive such a hand-rank table? We show that there is indeed, and it involves finding the exact optimal …
Unified Methods For Feature Selection In Large-Scale Genomic Studies With Censored Survival Outcomes, 2019 Temple University
Unified Methods For Feature Selection In Large-Scale Genomic Studies With Censored Survival Outcomes, Lauren Spirko-Burns, Karthik Devarajan
COBRA Preprint Series
One of the major goals in large-scale genomic studies is to identify genes with a prognostic impact on time-to-event outcomes which provide insight into the disease's process. With rapid developments in high-throughput genomic technologies in the past two decades, the scientific community is able to monitor the expression levels of tens of thousands of genes and proteins resulting in enormous data sets where the number of genomic features is far greater than the number of subjects. Methods based on univariate Cox regression are often used to select genomic features related to survival outcome; however, the Cox model assumes proportional hazards …
Surprise Vs. Probability As A Metric For Proof, 2019 U.S. Court of Federal Claims
Surprise Vs. Probability As A Metric For Proof, Edward K. Cheng, Matthew Ginther
Edward Cheng
In this Symposium issue celebrating his career, Professor Michael Risinger in Leveraging Surprise proposes using "the fundamental emotion of surprise" as a way of measuring belief for purposes of legal proof. More specifically, Professor Risinger argues that we should not conceive of the burden of proof in terms of probabilities such as 51%, 95%, or even "beyond a reasonable doubt." Rather, the legal system should reference the threshold using "words of estimative surprise" -asking jurors how surprised they would be if the fact in question were not true. Toward this goal (and being averse to cardinality), he suggests categories such …
Non Parametric Test For Testing Exponentiality Against Exponential Better Than Used In Laplace Transform Order, 2019 Al-Azhar University
Non Parametric Test For Testing Exponentiality Against Exponential Better Than Used In Laplace Transform Order, Mahmoud Mansour, M A W Mahmoud Prof.
Basic Science Engineering
In this paper, the test statistic for testing exponentiality against exponential better than used in Laplace transform order (EBUL) based on the Laplace transform technique is proposed. Pitman’s asymptotic efficiency of our test is calculated and compared with other tests. The percentiles of this test are tabulated. The powers of the test are estimated for famously used distributions in aging problems. In the case of censored data, our test is applied and the percentiles are also calculated and tabulated. Finally, real examples in different areas are utilized as practical applications for the proposed test.
One-Dimensional Excited Random Walk With Unboundedly Many Excitations Per Site, 2019 The Graduate Center, City University of New York
One-Dimensional Excited Random Walk With Unboundedly Many Excitations Per Site, Omar Chakhtoun
Dissertations, Theses, and Capstone Projects
We study a discrete time excited random walk on the integers lattice requiring a tail decay estimate on the number of excitations per site and extend the existing framework, methods, and results to a wider class of excited random walks.
We give criteria for recurrence versus transience, ballisticity versus zero linear speed, completely classify limit laws in the transient regime, and establish a functional limit laws in the recurrence regime.
Infinite Sums, Products, And Urn Models, 2019 University of Windsor
Infinite Sums, Products, And Urn Models, Yiyan Ni
Major Papers
This paper considers an urn and its evolution in discrete time steps. The
urn initially has two different colored balls(blue and red). We discuss different
cases where k blue balls (k = 1, 2, 3, ... ) will be added (or removed) at every
step if a blue ball is withdrawn, based on the goal of eventually withdrawing a
red ball P(R eventually). We compute the probability of eventually withdrawing
a red ball with two different methods–one using infinite sums and other using
infinite products. One advantage of this is that we can obtain P(R eventually) in
a complex but …
Statistical Inference For The Transformed Rayleigh Lomax Distribution With Progressive Type-Ii Right Censorship, 2019 Bowling Green State University
Statistical Inference For The Transformed Rayleigh Lomax Distribution With Progressive Type-Ii Right Censorship, Amani Alghami, Wei Ning, Arjun K. Gupta
Mathematics and Statistics Faculty Publications
In this paper, we study the transformed Rayleigh Lomax (Trans-RL) distribution which belongs to a certain family of two parameters lifetime distributions given by Wang et al (2010). Confidence intervals and inverse estimators of the Trans-RL parameters are derived in terms of order statistics. A simulation study is conducted to report the coverage probabilities, the average biases and the average relative mean square errors for the maximum likelihood, L-moments and inverse estimators. We compare the performance of these methods under different schemes of progressively Type-II right censoring. Finally, an illustrative example is provided to demonstrate the proposed methods.
Modeling Stochastically Intransitive Relationships In Paired Comparison Data, 2019 Southern Methodist University
Modeling Stochastically Intransitive Relationships In Paired Comparison Data, Ryan Patrick Alexander Mcshane
Statistical Science Theses and Dissertations
If the Warriors beat the Rockets and the Rockets beat the Spurs, does that mean that the Warriors are better than the Spurs? Sophisticated fans would argue that the Warriors are better by the transitive property, but could Spurs fans make a legitimate argument that their team is better despite this chain of evidence?
We first explore the nature of intransitive (rock-scissors-paper) relationships with a graph theoretic approach to the method of paired comparisons framework popularized by Kendall and Smith (1940). Then, we focus on the setting where all pairs of items, teams, players, or objects have been compared to …
High Dimensional Outlier Detection, 2019 University of Montana
High Dimensional Outlier Detection, Omid Khormali
Graduate Student Theses, Dissertations, & Professional Papers
In statistics and data science, outliers are data points that differ greatly from other observations in a data set. They are important attributes of the data because they can dramatically influence patterns and relationships manifested by non-outliers. It is therefore very important to detect and adequately deal with outliers. Recently, a novel algorithm, the ROMA algorithm, has been proposed [11]. In this paper, we propose a modification of the ROMA algorithm that reduces its computational complexity from $O(n^2 m)$ to $O((n/(2^m-o(1)))^2 m)$ where $n$ is the number of data points and $m$ is the dimension of the space. And as …
Counting And Coloring Sudoku Graphs, 2019 Portland State University
Counting And Coloring Sudoku Graphs, Kyle Oddson
Mathematics and Statistics Dissertations, Theses, and Final Project Papers
A sudoku puzzle is most commonly a 9 × 9 grid of 3 × 3 boxes wherein the puzzle player writes the numbers 1 - 9 with no repetition in any row, column, or box. We generalize the notion of the n2 × n2 sudoku grid for all n ϵ Z ≥2 and codify the empty sudoku board as a graph. In the main section of this paper we prove that sudoku boards and sudoku graphs exist for all such n we prove the equivalence of [3]'s construction using unions and products of graphs to the definition of …
Snap Scholar: The User Experience Of Engaging With Academic Research Through A Tappable Stories Medium, 2019 Claremont Colleges
Snap Scholar: The User Experience Of Engaging With Academic Research Through A Tappable Stories Medium, Ieva Burk
CMC Senior Theses
With the shift to learn and consume information through our mobile devices, most academic research is still only presented in long-form text. The Stanford Scholar Initiative has explored the segment of content creation and consumption of academic research through video. However, there has been another popular shift in presenting information from various social media platforms and media outlets in the past few years. Snapchat and Instagram have introduced the concept of tappable “Stories” that have gained popularity in the realm of content consumption.
To accelerate the growth of the creation of these research talks, I propose an alternative to video: …
The Dark Sky Character Of Archaeological Landscapes: Cultural Meaning And Conservation Strategies, 2019 Technological University Dublin
The Dark Sky Character Of Archaeological Landscapes: Cultural Meaning And Conservation Strategies, Frank Prendergast
Book/Book Chapter
This paper presents the first ever study of light pollution at selected Irish prehistoric archaeological landscapes. The concepts of cosmology and landscape are first briefly described and followed by a summary of early human settlement of the island. Building on this, the extant corpus of early prehistoric megalithic burial tombs is illustrated to show their contrasting distribution patterns and typology. Analysis of tomb locations using nearest-neighbour statistical methods reveals evidence of intentional clustering. Further geo-statistical analysis identifies the geographical locations and the density ranking of these nucleated clusters - a feature especially evident in the passage tomb tradition on this …