Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Articles 1 - 21 of 21

Full-Text Articles in Physical Sciences and Mathematics

Analyzing Baseball Data With R, Claudia Sison Jun 2017

Statistics

No abstract provided.


A Study Of The Parametric And Nonparametric Linear-Circular Correlation Coefficient, Robin Tu Jun 2015

Statistics

Circular statistics are specialized statistical methods that deal specifically with directional data. Angular data require specialized techniques because angles wrap around: they are defined modulo 2π (in radians) or modulo 360 (in degrees).

Correlation, typically measured by Pearson’s correlation coefficient, is a measure of association between two linear random variables x and y. This paper explores the parametric and nonparametric linear-circular correlation coefficients, where correlation is no longer between two linear variables x and y but between a linear random variable x and a circular random variable θ.

A simulation …
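The abstract does not state a formula, so the following is only a minimal sketch of one commonly cited parametric form of the linear-circular coefficient, built from the Pearson correlations of x with cos θ and sin θ. It may differ from the estimator studied in the paper, and the function names are illustrative.

```python
import math

def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b))
    var_a = sum((ai - mean_a) ** 2 for ai in a)
    var_b = sum((bi - mean_b) ** 2 for bi in b)
    return cov / math.sqrt(var_a * var_b)

def linear_circular_r2(x, theta):
    """Squared linear-circular correlation between x and angles theta (radians).

    Equals the squared multiple correlation of x on (cos theta, sin theta),
    so it lies in [0, 1] and is unchanged when any angle is shifted by 2*pi.
    """
    c = [math.cos(t) for t in theta]
    s = [math.sin(t) for t in theta]
    r_xc = pearson(x, c)  # correlation of x with cos(theta)
    r_xs = pearson(x, s)  # correlation of x with sin(theta)
    r_cs = pearson(c, s)  # correlation of cos(theta) with sin(theta)
    return (r_xc**2 + r_xs**2 - 2 * r_xc * r_xs * r_cs) / (1 - r_cs**2)
```

Because the angle enters only through its cosine and sine, the wrap-around at 2π causes no discontinuity, which is the reason an ordinary Pearson correlation on the raw angle is not appropriate here.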


Statistical Consulting - Senior Project, Cary Hernandez Jun 2015

Statistics

No abstract provided.


#Twittercritic: Sentiment Analysis Of Tweets To Predict TV Ratings, Isabel Litton Jun 2015

Statistics

Twitter has rapidly become one of the most popular sites on the Internet. It functions not just as a microblogging service, but as a crowdsourcing tool for listening, promotion, insight, and much more. From the perspective of TV networks, tweets capture the real-time reactions of viewers, making them an ideal indicator of a show’s ratings. This paper predicts Internet Movie Database (IMDB) television ratings by text mining Twitter data.

Tweets for five television shows were downloaded over a period of several months utilizing a SAS macro. Television show data, such as rating, show title, episode title, and more were …


Viewing The Moon In Infrared, Kyle Beekman Dec 2014

Statistics

Man has been fascinated by the heavens since ancient times, yet there is still so much that we don’t know. This project was created by Dr. Gary Hughes with the goal of obtaining information about the moon and other objects in the vicinity of the Earth. The project was mostly experimental in nature and there was no specific goal at the outset of the project. In the end, the project focused on the moon and meteors that traveled through the Earth’s upper atmosphere. Throughout the month of August, students traveled to the Mount Barcroft Research Station in the Eastern Sierras to …


Analyzing Alcohol Behavior In San Luis Obispo, Ariana Montes Dec 2014

Statistics

No abstract provided.


Simulating Influenza Transmission With Network Data, Henry V. Bongiovi Jun 2014

Statistics

Simulating Influenza Transmission with Real Network Data

Henry Bongiovi BS Statistics, California Polytechnic State University, San Luis Obispo

bongiovihenry@gmail.com

Keywords: Network Data, Simulation, Education, Influenza, Epidemic

Disease has been humanity’s archrival since the dawn of our existence. As such, we have been trying our best to understand its spread and proliferation. One of the most common diseases, influenza, is also one of the most complex. Understanding the complexities of its spread would greatly improve our ability to combat it and other diseases like it. Using R in conjunction with the package statnet, I have created a simulation of …


A Comparison Of Prenatal Alcohol, Tobacco, And Other Drug Use Between San Luis Obispo County And Ventura County, Dana M. Williamson May 2014

Statistics

Prenatal substance abuse is a growing issue in America. It can lead to fetal alcohol spectrum disorder; long-term growth, behavior, and executive functioning problems; and a predisposition for drug use in the child.

This project summarizes the statistical analyses comparing alcohol, tobacco, and other drug use by pregnant women between San Luis Obispo County and Ventura County. The main goal of these analyses is to determine if there is a difference between San Luis Obispo County and Ventura County. This is an interesting comparison because these counties are neighboring counties, and past data have shown that the rate …


NBA Salaries: Assessing True Player Value, Michael Ghirardo Jun 2013

Statistics

This paper analyzes and calculates an advanced NBA statistic that is becoming more and more widely used in the NBA. The adjusted plus-minus (APM) statistic measures a player’s contribution, independent of all other players on the court. The most appealing aspect of the APM is that it only attempts to capture how a team’s scoring margin changes with a particular player on and off the court. Scoring margin in basketball greatly affects winning percentage, so it only makes sense that players with high APMs will increase their team’s scoring margin and, therefore, help win games. The APM statistic is not …
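To illustrate the on-court versus off-court comparison, here is a minimal sketch of the raw (unadjusted) on/off scoring margin. It is not the APM itself: the adjusted statistic additionally accounts for the other players on the court, usually through a regression, and the abstract does not give that model. The stint layout and player labels are hypothetical.

```python
def on_off_margin(stints, player):
    """Team scoring margin per 100 possessions with `player` on court minus off court.

    Each stint is a tuple:
    (players_on_court, team_points, opponent_points, possessions).
    """
    def margin_per_100(selected):
        margin = sum(team - opp for _, team, opp, _ in selected)
        possessions = sum(poss for _, _, _, poss in selected)
        return 100.0 * margin / possessions

    on = [s for s in stints if player in s[0]]
    off = [s for s in stints if player not in s[0]]
    if not on or not off:
        raise ValueError("player needs stints both on and off the court")
    return margin_per_100(on) - margin_per_100(off)

# Hypothetical stints for one team (made-up numbers, for illustration only).
example_stints = [
    ({"A", "B"}, 12, 8, 10),
    ({"A", "C"}, 9, 10, 10),
    ({"B", "C"}, 5, 11, 10),
]
print(on_off_margin(example_stints, "A"))
```

A raw on/off figure credits a player for the quality of the teammates and opponents who share the floor, which is the confounding an adjusted statistic is meant to remove.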


Empirical Assessment Of The Future Performance Of The S&P 500 Losers, Nicholas Powers Jun 2013

Statistics

In early 2013, the Wall Street Journal published an article by Andrew Bary that explored a trend in the previous 3 years of the S&P 500. The article pointed out that the average returns for the top 10 percentage decliners for 2009, 2010, and 2011 outperformed the S&P 500 for the first two weeks of the next year. These top 10 percentage decliners, or losers, appeared to perform well enough to bet on. This study looks to see if there is statistical evidence that the losers outperformed the S&P 500.


Pedestrian Detection Using Image Blending, Hannah Haggerty Jun 2013

Statistics

No abstract provided.


Investigation Of A Pregnancy Lifestyle Intervention Using Mediation Analysis And A Power Analysis Simulation, Kelsey Grantham Jan 2013

Statistics

No abstract provided.


Analysis Of Alcohol Use Among Pregnant Women In San Luis Obispo County, Samantha Law Dec 2012

Statistics

Drinking alcohol during pregnancy is harmful to the fetus and can lead to serious alcohol-related developmental birth defects. Utilizing prenatal screening, such as the 4P’s Plus© screening tool, during a woman’s first prenatal doctor’s visit can help educate women and reduce continued alcohol use during pregnancy. Currently the CDC reports that 1 in 13 women in the US drink alcohol while pregnant, compared to local reports that 1 in 3 women in San Luis Obispo County continue to drink alcohol during pregnancy. A primary concern for many local county health care experts and organizations is to raise awareness that …


An Analysis Of Risk Reduction Choices In DCIS Breast Cancer Patients, Lauren Soltesz Dec 2012

Statistics

The main focus of this paper was to evaluate possible demographic and clinical characteristics associated with a woman’s choice of breast conserving surgery (BCS), unilateral mastectomy (ULM), or bilateral risk reduction mastectomy (BRRM). The cohort consisted of patients presenting to the City of Hope National Medical Center with ductal carcinoma in situ breast cancer who elected to have cancer directed surgery (N=305). Analyses to examine associations of patient characteristics with type of surgery were conducted using a multinomial logistic regression. Results showed that older women were more likely to choose breast conserving surgery over bilateral risk reduction mastectomy than younger …


Comparative Analysis Of Dispersion Parameter Estimates In Loglinear Modeling: Applied To E-Commerce Sales And Customer Data, Scott Davis Sep 2012

Statistics

When loglinear models are applied to count data, the issue of over-dispersion often arises. Moment and maximum likelihood estimation methods for handling over-dispersion are widely used because they allow for model checking tools such as Chi-square, F, and likelihood ratio tests. This paper compares two R functions that each use one method: glm.nb uses MLE, and glm.poisson.disp uses MME. The Index of Dissimilarity and visual model selection (ECDF plots) are also incorporated. These are applied to sales data using product and customer information compiled over the last five years that was generously provided by an e-commerce company.
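As a rough illustration of what over-dispersion means, the sketch below computes a moment-style (Pearson) dispersion estimate for the simplest possible case, an intercept-only Poisson model. This is an assumption-laden check, not the algorithm used by glm.nb or glm.poisson.disp, and the function name is illustrative.

```python
def pearson_dispersion(counts):
    """Pearson chi-square divided by residual degrees of freedom.

    For an intercept-only Poisson model the fitted mean is the sample mean,
    so this reduces to (sample variance) / (sample mean). Values well above 1
    suggest over-dispersion relative to the Poisson assumption.
    """
    n = len(counts)
    mean = sum(counts) / n
    chi_square = sum((y - mean) ** 2 / mean for y in counts)
    return chi_square / (n - 1)  # one parameter (the intercept) was estimated
```

Under a Poisson model the variance equals the mean, so the ratio should be near 1; a model with covariates would replace the sample mean with each observation's fitted value and subtract the number of fitted parameters from n.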


Analysis Of Dietary Patterns Over Freshman Year Of College, Chelsea Lofland Jun 2012

Statistics

This analysis is an investigation of changes in Cal Poly students’ eating habits over freshman year. The motivation behind this was an interest in college students’ lifestyles; college is the first time most students live on their own and it can be an important maturation period. College is stressful, exciting, liberating, and terrifying all at the same time. This distinctive life experience, along with my desire to handle big and messy data, led me to this research question.

The response variable analyzed was food consumption and the explanatory variables were: sex, race, quarter, food group, stress, exercise, BMI, sleep quality …


Analyzing Multiple Independent Spatial Point Processes, Neal Grantham May 2012

Statistics

No abstract provided.


Using The R Library Rpanel For GUI-Based Simulations In Introductory Statistics Courses, Ryan M. Allison May 2012

Statistics

As a student, I noticed that the statistical package R (http://www.r-project.org) would offer several benefits in the classroom. One benefit of the package is its free and open-source nature. This would be a great benefit for instructors and students alike since, unlike other statistical packages, it costs nothing to use. Because of this, students could continue using the program after their statistics courses and into their professional careers. It would be good to expose students while they are in school to a tool that professionals use in industry. R also has powerful …


Like Mother Like Child: An Investigation Of Mother Characteristics And Child Temperaments, Tempus Fugitt Dec 2010

Statistics

Much research has gone into what biological and social factors raise the risk of children developing cognitive, social, or behavioral problems. This project looks at which characteristics of the mother are significantly associated with different temperaments in the child, which may predict problems the child develops later in life. These characteristics include mother’s age, her education level, household income, parenting attitudes, involvement with the child, and drug and alcohol use. Data came from the Fragile Families and Child Wellbeing Study conducted by the Office of Population Research at Princeton University. Cumulative logistic regression was used to analyze the …


Statistical Analysis Of Texas Holdem Poker, Daniel Bragonier Jun 2010

Statistics

Gathered lifetime online poker data for Mike Linn. Attempted to analyze the data to obtain information to maximize profit. Techniques included univariate analysis, regression analysis, ANOVA, logistic regression, and outlier analysis. After the analysis, nothing of supreme importance or substance was found. Encountered issues with too much power. Results led to plenty of statistical significance, but little practical significance. Results showed that the data did not provide all the answers that were being sought, but there was some value in examining the data in a strict statistical manner.
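The "too much power" issue can be sketched with a one-sample z-test: with enough observations, even a negligible effect produces a tiny p-value. The effect size and sample sizes below are hypothetical and are not taken from the project's data.

```python
import math

def two_sided_p_value(effect, sd, n):
    """Two-sided p-value of a one-sample z-test for a mean difference `effect`."""
    z = abs(effect) / (sd / math.sqrt(n))  # standard error shrinks like 1/sqrt(n)
    return math.erfc(z / math.sqrt(2))     # equals 2 * (1 - Phi(z))

# Hypothetical: the same tiny effect (1% of a standard deviation) at two sample sizes.
for n in (100, 1_000_000):
    print(n, two_sided_p_value(effect=0.01, sd=1.0, n=n))
```

The effect is identical in both runs; only the sample size changes, which is why a significant result on a very large data set still needs to be judged by its practical magnitude.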


Applied Statistics: Experience & Certification In Quality Assurance, Huey D. Dodson Feb 2010

Statistics

My senior project can be broken down into two parts. The first part, without which the second could not be pursued, involved a 13-week internship at a produce processing facility where I took part in several projects varying in scope and type. The second part was to acquire certification as a Quality Process Analyst from the American Society for Quality.

This document is structured to represent the dichotomous nature of my project; the first section is dedicated to my internship experience, and the second dedicated to the certification examination preparation and completion.