Open Access. Powered by Scholars. Published by Universities.®

Categorical Data Analysis Commons

Open Access. Powered by Scholars. Published by Universities.®

341 Full-Text Articles 538 Authors 86,862 Downloads 77 Institutions

All Articles in Categorical Data Analysis

Faceted Search

341 full-text articles. Page 1 of 14.

First-Year Computer Science Students: Pathways And Perceptions In Introductory Computer Science Courses, Christina A. LeBlanc 2020 University of Maine

First-Year Computer Science Students: Pathways And Perceptions In Introductory Computer Science Courses, Christina A. Leblanc

Electronic Theses and Dissertations

This study examined student perceptions and experiences of an introductory Computer Science course at the University of Maine; COS 125: Introduction to Problem Solving Using Computer Programs. It also explored the pathways that students pursue after taking COS 125, depending on their success in the course, and their motivation to persist. Through characterizing student populations and their performance in their first semester in the Computer Science program, they can be placed into one of three categories that explain their path; a “continuer” (passed COS 125 and decided to stay in the major), a “persister” (did not pass COS 125 and ...


Act Scores Across Minnesota's Congressional Districts, Katie Moynihan 2020 Concordia University St. Paul

Act Scores Across Minnesota's Congressional Districts, Katie Moynihan

Research and Scholarship Symposium Posters

Data analysis was conducted to test factors which could affect the ACT scores of Minnesota high school students. Average composite scores across the state’s eight congressional districts were evaluated. Factors studied include family income, parental education, diversity, district location, graduation class size, and graduation rate. Methodology and results will be discussed.


How Data Is Changing The World Of Healthcare, Cameron Marous 2020 Ohio Northern University

How Data Is Changing The World Of Healthcare, Cameron Marous

Honors Capstone Enhancement Presentations

No abstract provided.


Using Alteryx Designer In Audit, Nolan Asiala 2020 Grand Valley State University

Using Alteryx Designer In Audit, Nolan Asiala

Honors Projects

My senior project was built around data analysis and how it relates to the auditing profession. Initially, I was planning on attending a data analytics competition, but that was canceled due to the events of COVID-19. This project utilized the Alteryx Designer program to demonstrate how it can be used during an audit engagement. By creating a workflow in Alteryx Designer, a report from a client can be cleaned and reformatted into a working dataset. My project includes two Excel files, a Microsoft Word document that serves as a brief introduction to the program, and a video describing the workflow ...


Evaluation Of Text Mining Techniques Using Twitter Data For Hurricane Disaster Resilience, Joshua Eason, Sathish Kumar 2020 Creighton University

Evaluation Of Text Mining Techniques Using Twitter Data For Hurricane Disaster Resilience, Joshua Eason, Sathish Kumar

SDSU Data Science Symposium

Data obtained from social media microblogging websites such as Twitter provide the unique ability to collect and analyze conversations of the public in order to gain perspective on the thoughts and feelings of the general public. Sentiment and volume analysis techniques were applied to the dataset in order to gain an understanding of the amount and level of sentiment associated with certain disaster-related tweets, including a topical analysis of specific terms. This study showed that disaster-type events such as a hurricane can cause some strong negative sentiment in the period of time directly preceding the event, but ultimately returns quickly ...


Informal Professional Development On Twitter: Exploring The Online Communities Of Mathematics Educators, Jaymie Ruddock 2020 Southern Methodist University

Informal Professional Development On Twitter: Exploring The Online Communities Of Mathematics Educators, Jaymie Ruddock

SMU Journal of Undergraduate Research

Professional development in its most traditional form is a classroom setting with a lecturer and an overwhelming amount of information. It is no surprise, then, that informal professional development away from institutions and on the teacher's own terms is a growing phenomenon due to an increased presence of educators on social media. These communities of educators use hashtags to broadcast to each other, with general hashtags such as #edchat having the broadest audience. However, many math educators usethe hashtags #ITeachMath and #MTBoS, communities I was interested in learning more about. I built a python script that used Tweepy to ...


Mapping Relationships And Positions Of Objects In Images Using Mask And Bounding Box Data, Jaime M. Villanueva Jr, Anantharam Subramanian, Vishal Ahir, Andrew Pollock 2020 Southern Methodist University

Mapping Relationships And Positions Of Objects In Images Using Mask And Bounding Box Data, Jaime M. Villanueva Jr, Anantharam Subramanian, Vishal Ahir, Andrew Pollock

SMU Data Science Review

In this paper we present novel methods for automatically annotating images with relationship and position tags that are derived using mask and bounding box data. A Mask Region-based Convolutional Neural Network (Mask R-CNN) is used as the foundation for the ob- ject detection process. The relationships are found by manipulating the bounding box and mask segmentation outputs of a Mask R-CNN. The absolute positions, the positions of the objects relative to the image, and the relative positions, the positions of objects relative to the other objects, are then associated with the images as annotations that are out- put in order ...


Accuracy Of Avs Life Expectancy Reports, Ariya Aghababa 2020 The University of Akron

Accuracy Of Avs Life Expectancy Reports, Ariya Aghababa

Williams Honors College, Honors Research Projects

Use insurance company data to predict the trends in life insurance life expectancy reports. Also, use the data to predict what impairments could potentially decrease or increase an insured's life expectancy based on reports created by various Actuaries at life settlement companies.


Inference Of Heterogeneity In Meta-Analysis Of Rare Binary Events And Rss-Structured Cluster Randomized Studies, Chiyu Zhang 2019 Southern Methodist University

Inference Of Heterogeneity In Meta-Analysis Of Rare Binary Events And Rss-Structured Cluster Randomized Studies, Chiyu Zhang

Statistical Science Theses and Dissertations

This dissertation contains two topics: (1) A Comparative Study of Statistical Methods for Quantifying and Testing Between-study Heterogeneity in Meta-analysis with Focus on Rare Binary Events; (2) Estimation of Variances in Cluster Randomized Designs Using Ranked Set Sampling.

Meta-analysis, the statistical procedure for combining results from multiple studies, has been widely used in medical research to evaluate intervention efficacy and safety. In many practical situations, the variation of treatment effects among the collected studies, often measured by the heterogeneity parameter, may exist and can greatly affect the inference about effect sizes. Comparative studies have been done for only one or ...


Achieving Optimal Horizontal Drill Operations, Daniel J. Serna, James Vasquez, Donald Markley 2019 Southern Methodist University

Achieving Optimal Horizontal Drill Operations, Daniel J. Serna, James Vasquez, Donald Markley

SMU Data Science Review

In this paper, we present a novel method of predicting the onset of a slide event in horizontal drilling operations. Horizontal drilling operations attempt to create a well through a subsurface as quickly as possible by rotating a drill through the subsurface. A slide event occurs when the drill begins to inefficiently rotate through the subsurface, resulting in a significantly reduced rate of penetration. Slide events can be prevented, or significantly reduced in their impact, when their onset is accurately predicted. We present a method of accurately predicting the onset of slide events with a time-series based predictive model that ...


#46 - America's Response To President Trump's Tweets, Amanda Friend 2019 University of West Georgia

#46 - America's Response To President Trump's Tweets, Amanda Friend

Georgia Undergraduate Research Conference (GURC)

Purpose: The purpose of the research throughout this study was to examine Trump’s tweets during the first six months he was in office. Due to Trump using Twitter as his main form of communication it is important for journalists and individuals to follow his tweets.

Research Questions: The analysis covers how many times people shared positive or negative tweets and if people shared more issue based tweets. This study emphasizes President Trump’s most popular tweets and how people responded to his first six months on Twitter.

Method: The tweets were coded with a key using content analysis to ...


Reduced Bias For Respondent Driven Sampling: Accounting For Non-Uniform Edge Sampling Probabilities In People Who Inject Drugs In Mauritius, Miles Q. Ott, Krista J. Gile, Matthew T. Harrison, Lisa G. Johnston, Joseph W. Hogan 2019 Smith College

Reduced Bias For Respondent Driven Sampling: Accounting For Non-Uniform Edge Sampling Probabilities In People Who Inject Drugs In Mauritius, Miles Q. Ott, Krista J. Gile, Matthew T. Harrison, Lisa G. Johnston, Joseph W. Hogan

Statistical and Data Sciences: Faculty Publications

People who inject drugs are an important population to study in order to reduce transmission of blood-borne illnesses including HIV and Hepatitis. In this paper we estimate the HIV and Hepatitis C prevalence among people who inject drugs, as well as the proportion of people who inject drugs who are female in Mauritius. Respondent driven sampling (RDS), a widely adopted link-tracing sampling design used to collect samples from hard-to-reach human populations, was used to collect this sample. The random walk approximation underlying many common RDS estimators assumes that each social relation (edge) in the underlying social network has an equal ...


Integrating Data Science Into A General Education Information Technology Course: An Approach To Developing Data Savvy Undergraduates, Malcolm Haynes, Joshua Groen, Eric Sturzinger, Danny Zhu, Justin Shafer, Timothy McGee 2019 United States Military Academy

Integrating Data Science Into A General Education Information Technology Course: An Approach To Developing Data Savvy Undergraduates, Malcolm Haynes, Joshua Groen, Eric Sturzinger, Danny Zhu, Justin Shafer, Timothy Mcgee

West Point Research Papers

The National Academies recommend academic institutions foster a basic understanding of data science in all undergraduates. However, data science education is not currently a graduation requirement at most colleges and universities. As a result, many graduates lack even basic knowledge of data science. To address the shortfall, academic institutions should incorporate introductory data science into general education courses. A general education IT course provides a unique opportunity to integrate data science education. Modules covering databases, spreadsheets, and presentation software, already present in many survey IT courses, teach concepts and skills needed for data science. As a result, a survey IT ...


Allocative Poisson Factorization For Computational Social Science, Aaron Schein 2019 University of Massachusetts Amherst

Allocative Poisson Factorization For Computational Social Science, Aaron Schein

Doctoral Dissertations

Social science data often comes in the form of high-dimensional discrete data such as categorical survey responses, social interaction records, or text. These data sets exhibit high degrees of sparsity, missingness, overdispersion, and burstiness, all of which present challenges to traditional statistical modeling techniques. The framework of Poisson factorization (PF) has emerged in recent years as a natural way to model high-dimensional discrete data sets. This framework assumes that each observed count in a data set is a Poisson random variable $y ~ Pois(\mu)$ whose rate parameter $\mu$ is a function of shared model parameters. This thesis examines a specific ...


Exploiting Mobility For Predictive Urban Analytics & Operations, Kasthuri JAYARAJAH 2019 Singapore Management University

Exploiting Mobility For Predictive Urban Analytics & Operations, Kasthuri Jayarajah

Dissertations and Theses Collection (Open Access)

As cities worldwide invest heavily in smart city infrastructure, it invites opportunities for a next wave of urban analytics. Unlike its predecessors, urban analytics applications and services can now be real-time and proactive -- they can (a) leverage situational data from large deployments of connected sensors, (b) capture attributes of a variety of entities that make up the urban fabric (e.g., people and their social relationships, transport nodes, utilities, etc.), and (c) use predictive insights to both proactively optimize urban operations (e.g., HVAC systems in smart buildings, buses in the transportation network, crowd-workers, etc.) and promote smarter policy decisions ...


Field Drilling Data Cleaning And Preparation For Data Analytics Applications, Daniel Cardoso Braga 2019 Louisiana State University

Field Drilling Data Cleaning And Preparation For Data Analytics Applications, Daniel Cardoso Braga

LSU Master's Theses

Throughout the history of oil well drilling, service providers have been continuously striving to improve performance and reduce total drilling costs to operating companies. Despite constant improvement in tools, products, and processes, data science has not played a large part in oil well drilling. With the implementation of data science in the energy sector, companies have come to see significant value in efficiently processing the massive amounts of data produced by the multitude of internet of thing (IOT) sensors at the rig. The scope of this project is to combine academia and industry experience to analyze data from 13 different ...


Responding To Some Challenges Posed By The Re-Identification Of Anonymized Personal Data, Herman T. Tavani, Frances S. Grodzinsky 2019 Rivier University

Responding To Some Challenges Posed By The Re-Identification Of Anonymized Personal Data, Herman T. Tavani, Frances S. Grodzinsky

Computer Ethics - Philosophical Enquiry (CEPE) Proceedings

In this paper, we examine a cluster of ethical controversies generated by the re-identification of anonymized personal data in the context of big data analytics, with particular attention to the implications for personal privacy. Our paper is organized into two main parts. Part One examines some ethical problems involving re-identification of personally identifiable information (PII) in large data sets. Part Two begins with a brief description of Moor and Weckert’s Dynamic Ethics (DE) and Nissenbaum’s Contextual Integrity (CI) Frameworks. We then investigate whether these frameworks, used together, can provide us with a more robust scheme for analyzing privacy ...


Market Research On Student Concert Attendance At Bgsu's College Of Musical Arts, Mary Solomon 2019 Bowling Green State University

Market Research On Student Concert Attendance At Bgsu's College Of Musical Arts, Mary Solomon

Honors Projects

Bowling Green State University boasts a well established College of Musical Arts which holds concerts performed by esteemed faculty, prestigious guest artists, and students. The school hosts these events in Kobacker Hall and Bryan Recital Hall which can accommodate up to 800 and 250 audience members, respectively. However, performances in Kobacker hall only fill one- fourth of the 800 seats, on average. Why is this so? This project aims to investigate the factors that influence students’ decisions to attend concerts at the College of Musical Arts (CMA). By methodology of survey research and statistical analysis, this project will look into ...


Black Swamp Pub And Bistro Analysis, Sara Aniol 2019 Bowling Green State University

Black Swamp Pub And Bistro Analysis, Sara Aniol

Honors Projects

The Black Swamp Pub and Bistro is a full-service restaurant located in the Union on the Bowling Green State University Campus. We mainly do sit-down service, but we also do take-out orders and have a full bar with draft beers as well as mixed drinks. Our menu tends to change a lot, with new additions as well as some of the items being deleted. My goal of this project is to try to give some insight on the patterns that are too big to see with day-to-day operations as well as give some recommendations for the future that is backed ...


Examining The Relationship Between Pre-Collegiate Educational Experiences And Religious Affiliation, Erica Augustyniak 2019 Bowling Green State University

Examining The Relationship Between Pre-Collegiate Educational Experiences And Religious Affiliation, Erica Augustyniak

Honors Projects

This paper explores the relationship between the type of school students experienced before college and how that schooling affected the students’ religious affiliation. The specific types of schools examined are public and private schools with private schools being further divided into religious and non-religious private schools. I explore the differences in religious importance among several groups including students who attended Catholic schools and those who did not, students who attended religious schools for varying lengths of time (low, medium, high, and no involvement), and students who had a choice in the schools they attended and those who did not. I ...


Digital Commons powered by bepress