Open Access. Powered by Scholars. Published by Universities.®

Data Science Commons

Articles 1 - 30 of 30

Full-Text Articles in Data Science

Session 6: Model-Based Clustering Analysis On The Spatial-Temporal And Intensity Patterns Of Tornadoes, Yana Melnykov, Yingying Zhang, Rong Zheng Feb 2024

SDSU Data Science Symposium

Tornadoes are among nature’s most violent windstorms and can occur all over the world except Antarctica. Previous scientific efforts have studied this natural hazard from facets such as genesis, dynamics, detection, forecasting, warning, measuring, and assessing. In contrast, we want to model the tornado datasets using modern statistical and computational techniques. The goal of the paper is to develop novel finite mixture models and perform clustering analysis on the spatial-temporal and intensity patterns of tornadoes. To analyze the tornado dataset, we first try a Gaussian distribution with the mean vector and variance-covariance matrix represented as …


Data Quality Checks: Implementation With Popular Data Collection Crowdsourcing Platforms, James Down, Gregory Balkcom, Kristine Duncan, Ngan (An) Truong, Andrew Lewis Nov 2023

Symposium of Student Scholars

The use of online crowdsourcing platforms for data collection has increased over the past two decades in the field of public health due to the ease of use, the cost-saving benefits, the speed of the data collection process, and the accessibility of a potentially representative population. Although these platforms offer many advantages to researchers, significant drawbacks exist, such as poor data quality, that threaten the reliability and validity of the study. Previous studies have examined data quality concerns, but differences in results arise due to variations in study designs, disciplinary contexts, and the platforms being investigated. Therefore, this study …


Mathematical Modeling Of The Impact Of Lobbying On Climate Policy, Andrew Jacoby, Claire Hannah, James Hutchinson, Jasmine Narehood, Aditi Ghosh, Padmanabhan Seshaiyer Nov 2023

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Reducing Uncertainty In Sea-Level Rise Prediction: A Spatial-Variability-Aware Approach, Subhankar Ghosh, Shuai An, Arun Sharma, Jayant Gupta, Shashi Shekhar, Aneesh Subramanian Oct 2023

I-GUIDE Forum

Given multi-model ensemble climate projections, the goal is to accurately and reliably predict future sea-level rise while lowering the uncertainty. This problem is important because sea-level rise affects millions of people in coastal communities and beyond due to climate change's impacts on polar ice sheets and the ocean. This problem is challenging due to spatial variability and unknowns such as possible tipping points (e.g., collapse of Greenland or West Antarctic ice-shelf), climate feedback loops (e.g., clouds, permafrost thawing), future policy decisions, and human actions. Most existing climate modeling approaches use the same set of weights globally, during either regression or …


Statistical Methods To Generate Artificial Slot Floor Data For The Advancement Of Casino Related Research, Courtney Bonner, Anastasia (Stasi) D. Baran, Jason D. Fiege, Saman Muthukumarana May 2023

International Conference on Gambling & Risk Taking

A common difficulty when researching gambling topics is the availability of high-quality data sets for development and testing. Due to the high level of secrecy within the gambling industry, data obtained for research purposes are often prohibitively obfuscated, incomplete, or aggregated. Although these data have allowed for advancement in academic work, they leave both researchers and readers wondering what would be possible if more detailed data sets were available. To mitigate the paucity of data available to researchers, we present a Markov chain-based statistical process for producing artificial event data for a simulated …


Analytical Approach For Monitoring The Behavior Of Patients With Pancreatic Adenocarcinoma At Different Stages As A Function Of Time, Aditya Chakaborty Dr, Chris P. Tsokos Dr May 2023

Biology and Medicine Through Mathematics Conference

No abstract provided.


Employee Attrition: Analyzing Factors Influencing Job Satisfaction Of IBM Data Scientists, Graham Nash Apr 2023

Symposium of Student Scholars

Employee attrition is a relevant issue that every business employer must consider when gauging the effectiveness of their employees. Whether or not an employee chooses to leave their job can depend on a multitude of factors. As a result, employers need to develop methods by which they can measure attrition by assessing several qualities of their employees. Factors like their age, years with the company, which department they work in, their level of education, their job role, and even their marital status are all considered by employers to assist in predicting employee attrition. This project will be analyzing a …


A Chairperson's Guide To Managing Time And Stress, Christian K. Hansen Mar 2023

Academic Chairpersons Conference Proceedings

In this interactive workshop, we discuss time and stress management specifically from the perspective of a department chairperson responsible for leading an academic department through numerous internal and external challenges. The focus will be on practical strategies for effective use of time, not only at a personal level, but also at a department-wide level.


Investigation Of Key Factors To Earthquake Insurance Take-Up Rates In Quebec And British Columbia Households And Prediction Model Building, Yongcheng Jiang Aug 2022

Undergraduate Student Research Internships Conference

Maintaining an adequate earthquake insurance take-up rate could protect the insurance industry from systemic failure. Past research has shown that British Columbia and Quebec have significant differences in earthquake insurance take-up rate. This report investigates key factors from the structure (default options and various types) of the insurance plan and personal characteristics along with socioeconomic/demographic profiles that affect the demand for earthquake protection in the form of insurance. The report also provides a prediction model for earthquake insurance take-up rate. The results show an importance ranking of key factors of earthquake insurance take-up; the three most important are …


Financial Literacy: Self-Evaluation And Reality, Yangsijia Wang Aug 2022

Undergraduate Student Research Internships Conference

This study addresses financial literacy. Using a data source containing clients' demographic information and self-evaluation, change in account value, and trade records, three major questions were investigated: first, whether a client's demographic traits are related to his/her self-evaluation of financial knowledge level; second, whether trading behaviour differs among clients who self-identified as belonging to different financial knowledge groups; and third, whether people who self-identified as financially knowledgeable have better investment results. Data manipulation was done using SQL and R. Exploratory analysis including multiple types of plots and proportion tables was used to derive the …


Session 5: Equipment Finance Credit Risk Modeling - A Case Study In Creative Model Development & Nimble Data Engineering, Edward Krueger, Landon Thompson, Josh Moore Feb 2022

SDSU Data Science Symposium

This presentation will focus first on providing an overview of Channel and the Risk Analytics team that performed this case study. Given that context, we’ll then dive into our approach for building the modeling development data set, techniques and tools used to develop and implement the model into a production environment, and some of the challenges faced upon launch. Then, the presentation will pivot to the data engineering pipeline. During this portion, we will explore the application process and what happens to the data we collect. This will include how we extract & store the data along with how it …


Mathematical Modeling And Analysis Of COVID-19 Epidemic With Vaccination, Caitlin Seibel, Tina Huang, Jackson Reisman, Erika Johanna Martinez Salinas, Viswanathan Arunachalam, Moatlhodi Kgosimore, Anuj Mubayi, Padmanabhan Seshaiyer, Allen Bone Sehunelo Nov 2021

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Anti-Vaxxers: Parents Fighting Science, Katie West Aug 2021

Symposium of Student Scholars

Immunizing children helps protect the health of our community, especially those people who cannot be immunized. Yet, since 1996 after a study was released that linked autism to vaccinations, there has been a trend of parents refusing to vaccinate their children. What are the demographics of the parents who believe their children are better off without vaccines? By knowing where these parents live and what decisions they make for their children’s education, counties and medical professionals can provide education and address their concerns.

My research involves data on 116,141 kindergarten classes from 2000-2015 in California. The two vaccine exemption options …


Opioid Abuse: Are Doctors Creating The Problem?, Nguyen Tran Aug 2021

Symposium of Student Scholars

Opioid abuse and overdose are serious health problems in the United States. Current research has concentrated on the treatment and prevention of opioid abuse. Using data from the Controlled Substance Utilization Review and Evaluation System (CURES) for California zip codes, my research focuses on the causes of opioid overdose by considering the relationships between the following variables within each zip code: population size, average number of prescriptions per doctor, percentage of people who receive opioid prescriptions, percentage of people receiving the same prescription drug from 3 or more doctors, average number of opioid pills per prescription and number of people …


Market Research: How To Keep And Gain Customers, Chris Mccall Aug 2021

Symposium of Student Scholars

Customer-centered market research is essential to the creation and management of successful marketing campaigns. A company that understands their customers will be able to provide those customers with products and services that fit their needs better than the competition, and ultimately increase profits. My research focuses on a database containing customer information for a telecommunications company called Telco. Within this research, I will focus on a number of customer attributes including demographics, services provided, payment methods, contract lengths, monthly charges, and tenure with the company. Considering how these attributes relate to one another will give me a better understanding of …


Food Deserts: Hungry For Answers, Lawren Cumberbatch Aug 2021

Symposium of Student Scholars

In 2010, the United States Department of Agriculture (USDA) reported that 23.5 million people in the United States live in food deserts. As defined by the USDA, a “food desert” is a neighborhood that lacks healthy food sources. This can be measured by distance to a store, number of stores in an area, individual-level resources such as family income or vehicle availability, and neighborhood-level resources such as availability of public transportation. Past research provides evidence that food deserts are especially likely to occur in communities heavily populated by minorities. As a Black Indian pre-med student aiming to join the world …


Determining Malignancy: Can Mammogram Results Help Predict The Diagnosis Of Breast Tumors?, Taylor Behrens Aug 2021

Symposium of Student Scholars

Even with advancements in treatment and preventative care, breast cancer remains an epidemic claiming more than 40,000 American male and female lives each year. The mammogram dataset that I am analyzing was initially compiled in the early 1990s by a team from the University of Wisconsin-Madison. Past research diagnoses breast cancer from fine-needle aspirates. My research focuses on whether we can determine breast cancer diagnoses without the use of invasive procedures and, in particular, whether we can predict breast cancer based on mammogram data. Do measures of gray-scale texture, radius, concavity, perimeter, compactness, area, and smoothness of …


Accidental Overdoses: Insights To Aid In Prevention, Annabel Nganga Aug 2021

Symposium of Student Scholars

Having lost a friend six years ago to an accidental cocaine overdose, I am very passionate about spreading awareness of accidental drug overdoses that have affected thousands of families countrywide. According to past research, deaths resulting from opiates specifically have been on the rise, and a significant number of deaths in the United States among those under fifty years of age are caused by drug overdoses. Data exist indicating which states have more overdoses. The data set I will be using includes variables on race, sex, age, the drug on which the person overdosed, location of the overdose, ultimate cause of death and year …


Death By Police: When “Protecting And Serving” Goes Wrong, Hesper Mallis Aug 2021

Symposium of Student Scholars

The recent cases of law enforcement using lethal force in the United States have gained massive public attention. My dataset is from the Mapping Police Violence website. The website’s focus was to create a heat map to display where police killings occurred most frequently. The website has a dataset with information on 7,664 deaths of suspects. The variables in the dataset include age, sex and race of the suspect; geographic location; alleged threat level; alleged weapon; cause of death; and criminal charges against the officer. In addition, the variables include whether the individual had a mental illness, was armed or …


Are There Predictors Of A Running Back’s Success?, Joshua Price Aug 2021

Symposium of Student Scholars

People who analyze football have concentrated in the past on a running back’s 40-yard dash, shuffle, broad jump, vertical jump, and bench press measures. My research will test if the following variables can predict a running back’s success in the NFL: height, weight, conference, offensive line ranking for their team, the running back’s total yards for the season, their average yards for each attempt, the number of times the running back has entered the end zone for a touchdown that season, the running back’s average time behind the line of scrimmage (TLOS), the percentage of times the running back …


Sources And Aftermaths Of Pipeline Related Leaks And Spills, Justin Smith Aug 2021

Symposium of Student Scholars

The escape of oil and other hazardous materials has been shown to pollute and destroy ecosystems. As an aspiring chemist, I am adamant about the secure handling and transportation of oil and other hazardous materials. In the past, researchers have concentrated on oil’s high viscosity, which physically smothers wildlife, affecting their ability to continue critical functions such as respiration, feeding, and thermoregulation. My research focuses on the source of these oil spills, as well as natural gas leaks, for the purpose of risk assessment. In addition, I compare recovery efforts based on the cause of the leak/spill, the …


On The Front Lines Of Fire: How Do We Save Their Lives?, Cathrine Jatta Aug 2021

Symposium of Student Scholars

The National Institute for Occupational Safety and Health (NIOSH) reports that the United States depends on about 1.1 million firefighters to protect its citizens and property from fire. NIOSH adds that approximately 336,000 are career firefighters; 812,000 are volunteers; and 80 to 100 die in the line of duty each year. NIOSH investigates each fatality individually for the cause and prevention. In contrast, my research will look at a complete dataset of 2005 firefighter fatalities and see if any of the following variables may predict firefighter death: age, cause of death, property type, type of duty (e.g. on-duty, training), and …


Cervical Cancer: Are There Ways To Reduce The Risks?, Madelyn Dorn Aug 2021

Symposium of Student Scholars

History has shown us that when caught early, cervical cancer is curable. Past research has found that the sexually transmitted diseases (STDs) herpes and human papillomavirus (HPV) have been associated with cervical cancer. In contrast, my dataset on 859 women has many more STDs and lifestyle choices compiled on 36 variables. The diagnoses in the dataset are many: cervical condylomatosis, vaginal condylomatosis, vulvo-perineal condylomatosis, syphilis, pelvic inflammatory disease, genital herpes, molluscum contagiosum, acquired immune deficiency syndrome (AIDS), human immunodeficiency virus (HIV), hepatitis B, HPV, and cervical cancer. In addition to the demographic variable on age, there are many lifestyle choice …


Marijuana Arrests In Toronto Canada: A Look Into The Canadian Criminal Justice System, Steven Tully Aug 2021

Symposium of Student Scholars

Marijuana-related drug offenses made up fifty-eight percent of all Controlled Drugs and Substances Act offenses in Canada in 2016. On October 17, 2018, Canada legalized marijuana. As part of the efforts to legalize marijuana, descriptive statistics of single variables, like the age of the arrestees and the number of people arrested per year, were reported by the Toronto Star newspaper. The dataset analyzed in this research predates the legalization of marijuana and was collected from 1997 to 2002 on 5,226 individuals arrested in Toronto, Canada for simple possession of small quantities of marijuana. When an offender was arrested for …


Who Is Next? Evaluating Factors That May Contribute To Heart Failure, Davon Broadwater Aug 2021

Symposium of Student Scholars

Cardiovascular diseases are the number one cause of death globally, and for African Americans those risks are even higher. As an African American university student studying Biology, I am passionate about researching the diseases that affect my race. Current research states that behavioral factors such as obesity, tobacco use, unhealthy diet, and harmful use of alcohol should be avoided. I have chosen to research predictors of what helps patients survive if they already have heart failure. Heart failure develops gradually, where the heart becomes weaker over time and has trouble pumping blood to nourish the cells in the body. Data …


Eradicating Zebra Mussels: What Works?, Elijah Davies Aug 2021

Symposium of Student Scholars

The invasion of U.S. lakes and rivers by the zebra mussel (Dreissena polymorpha) has caused catastrophic harm to local ecosystems, where the mussels reproduce and outcompete native mussel species, and to pipes leading into water sources, where they bind to surfaces and reproduce until the pipes clog. In addition, recreation areas must be closed because the sharp shells make them unusable. In the past, research has focused on individual molluscicides and their eradication of zebra mussels, as well as their effect on native flora and fauna. My research will contrast …


Bias In Police Shootings: Is It Just An Opinion?, Phuong Ho Aug 2021

Symposium of Student Scholars

The claims of racism have drawn public attention toward police brutality and its impact on minorities. Is this just an opinion or is there any statistical evidence? Recent studies from The Atlantic have investigated the average age and ethnicity of victims from police killings in 2015-2016. As an Asian-American, I am motivated to examine the issue of police killings among races and other demographics to find any bias that is present. Using the dataset of 2,204 victims of police killings (2015-2016) collected by The Guardian, I will examine the following variables for bias: age, cause of death, armed/unarmed, race/ethnicity, and …


Do Environmental Toxins Predict Violent Crimes?, Tyler Stahl Aug 2021

Symposium of Student Scholars

Do chemical pollutants that persist in the environment and bioaccumulate in the body affect human health and behavior? Could these Persistent, Bioaccumulative, and Toxic (PBT) chemicals play a role in the cause of violent crimes due to deterioration of mental and cognitive functions? In the past, mercury, a PBT chemical, has been shown in salmon to be associated with aggression. Could similar aggression occur in humans exposed to mercury through a toxic spill? Two sources of data are utilized in this analysis. The Environmental Protection Agency’s (EPA) Annual Toxic Release Inventory publishes data on toxic releases into the environment and …


Reporting Of Eating Disorder Deaths, Katherine Mobley, Amy Hord May 2021

Symposium of Student Scholars

Those affected by eating disorders experience disturbances in eating behaviors which are often related to underlying psychiatric disorders such as anxiety, depression, or obsessive-compulsive disorder (Parekh, 2017; Drieberg et al., 1998, p. 53). The duplicitous nature of the disorder makes it difficult to diagnose, and the toll it takes on an individual’s physical health makes its mortality rate the second highest among psychiatric disorders (Guinhut et al., 2021, p. 130). Even if the correct education and resources are accessible to certain individuals, negative stigmatization about the disorder can make sufferers unlikely to seek help (Becker et al., 2010). Findings from analysis of …


A Study Of Sentiment Of COVID-19 Related Tweets In The USA, Jack Luu, Rosangela Follmann Nov 2020

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.