Open Access. Powered by Scholars. Published by Universities.®

Statistical Methodology Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 7 of 7

Full-Text Articles in Statistical Methodology

Using Short Bursts To Optimize Redistricting In Georgia, Vedika Vishweshwar Jan 2022

Using Short Bursts To Optimize Redistricting In Georgia, Vedika Vishweshwar

CMC Senior Theses

Identifying extreme outliers in large state spaces is a difficult prob-
lem. I consider this problem in the context of finding political district-
ing plans that maximize the number of districts in which the majority
of the population is from a minority group, such as African Americans.
Since the set of all possible districting plans is enormous and unfeasi-
ble to examine in practice, this paper proposes a sampling method to
find these outlying plans. Specifically, this paper experiments with short
bursts in the context of minority voting rights in Georgia. Short bursts
are a type of Markov Chain in …


An Evaluation Of Knot Placement Strategies For Spline Regression, William Klein Jan 2021

An Evaluation Of Knot Placement Strategies For Spline Regression, William Klein

CMC Senior Theses

Regression splines have an established value for producing quality fit at a relatively low-degree polynomial. This paper explores the implications of adopting new methods for knot selection in tandem with established methodology from the current literature. Structural features of generated datasets, as well as residuals collected from sequential iterative models are used to augment the equidistant knot selection process. From analyzing a simulated dataset and an application onto the Racial Animus dataset, I find that a B-spline basis paired with equally-spaced knots remains the best choice when data are evenly distributed, even when structural features of a dataset are known …


How Machine Learning And Probability Concepts Can Improve Nba Player Evaluation, Harrison Miller Jan 2020

How Machine Learning And Probability Concepts Can Improve Nba Player Evaluation, Harrison Miller

CMC Senior Theses

In this paper I will be breaking down a scholarly article, written by Sameer K. Deshpande and Shane T. Jensen, that proposed a new method to evaluate NBA players. The NBA is the highest level professional basketball league in America and stands for the National Basketball Association. They proposed to build a model that would result in how NBA players impact their teams chances of winning a game, using machine learning and probability concepts. I preface that by diving into these concepts and their mathematical backgrounds. These concepts include building a linear model using ordinary least squares method, the bias …


The Paradox Of Big Data, Gary N. Smith Jan 2019

The Paradox Of Big Data, Gary N. Smith

Pomona Economics

Data-mining is often used to discover patterns in Big Data. It is tempting believe that because an unearthed pattern is unusual it must be meaningful, but patterns are inevitable in Big Data and usually meaningless. The paradox of Big Data is that data mining is most seductive when there are a large number of variables, but a large number of variables exacerbates the perils of data mining.


Machine Learning On Statistical Manifold, Bo Zhang Jan 2017

Machine Learning On Statistical Manifold, Bo Zhang

HMC Senior Theses

This senior thesis project explores and generalizes some fundamental machine learning algorithms from the Euclidean space to the statistical manifold, an abstract space in which each point is a probability distribution. In this thesis, we adapt the optimal separating hyperplane, the k-means clustering method, and the hierarchical clustering method for classifying and clustering probability distributions. In these modifications, we use the statistical distances as a measure of the dissimilarity between objects. We describe a situation where the clustering of probability distributions is needed and useful. We present many interesting and promising empirical clustering results, which demonstrate the statistical-distance-based clustering algorithms …


What's In A Name? A Critical Review Of Definitions Of Quantitative Literacy, Numeracy, And Quantitative Reasoning, Gizem Karaali, Edwin H Villafane Hernandez '18, Jeremy Alexander Taylor '18 Jan 2016

What's In A Name? A Critical Review Of Definitions Of Quantitative Literacy, Numeracy, And Quantitative Reasoning, Gizem Karaali, Edwin H Villafane Hernandez '18, Jeremy Alexander Taylor '18

Pomona Faculty Publications and Research

This article aims to bring together various threads in the eclectic literature that make up the scholarship around the theme of Quantitative Literacy. In investigating the meanings of terms like "quantitative literacy," "quantitative reasoning," and "numeracy," we seek common ground, common themes, common goals and aspirations of a community of practitioners. A decade ago, these terms were relatively new in the public sphere; today policy makers and accrediting agencies are routinely inserting them into general education conversations. Having good, representative, and perhaps even compact and easily digestible definitions of these terms might come in handy in public relations contexts as …


Musical Missteps: The Severity Of The Sophomore Slump In The Music Industry, Shane M. Zackery May 2014

Musical Missteps: The Severity Of The Sophomore Slump In The Music Industry, Shane M. Zackery

Scripps Senior Theses

This study looks at alternative models of follow-up album success in order to determine if there is a relationship between the decrease in Metascore ratings (assigned by Metacritic.com) between the first and second album for a musician or band and the 1) music genre or 2) the number of years between the first and second album release. The results support the dominant thought, which suggests that neither belonging to a certain genre of music nor waiting more or less time to drop the second album makes an artist more susceptible to the Sophomore Slump. This finding is important because it …