Open Access. Powered by Scholars. Published by Universities.®

Data Science Commons

Open Access. Powered by Scholars. Published by Universities.®

795 Full-Text Articles 1,798 Authors 113,441 Downloads 143 Institutions

All Articles in Data Science

Faceted Search

795 full-text articles. Page 1 of 40.

Exploring The Effectiveness Of Multiple-Exemplar Training For Visual Analysis Of Ab-Design Graphs, Verena S. Bethke 2022 The Graduate Center, City University of New York

Exploring The Effectiveness Of Multiple-Exemplar Training For Visual Analysis Of Ab-Design Graphs, Verena S. Bethke

Dissertations, Theses, and Capstone Projects

In behavior analysis, data are usually analyzed using visual analysis of the graphed data. There are a wide range of methods used to visually analyze data, from a basic ‘textbook’ style approach to the use of visual aids, decision-rubrics, and computer-based approaches. In the literature, there have been some comparisons of the efficacy of different approaches. Visual analysis as a behavior can be taught using a variety of methods, independent of how the skill itself is to be performed. Teaching methods include lecture, online instruction, and equivalence-based instruction. There is not much research on the teaching of visual analysis specifically ...


Inferring Dynamics Of Biological Systems, Tracey G. Oellerich 2022 George Mason University

Inferring Dynamics Of Biological Systems, Tracey G. Oellerich

Biology and Medicine Through Mathematics Conference

No abstract provided.


Trophish: Building A Global Database Of Freshwater Trophic Interactions, Jacob M. Ridgway 2022 University of South Dakota

Trophish: Building A Global Database Of Freshwater Trophic Interactions, Jacob M. Ridgway

Honors Thesis

Freshwater management and research frequently use the trophic data of freshwater fishes. Despite this fact, it is difficult to perform a simple search of dietary information for any one fish species. FishBase represents, to our knowledge, the largest compilation of freshwater dietary information to date. However, it excludes a large portion of the ecological literature due to its development taking place prior to the creation of most modern scientific search engines. Our project (TroPhish) is building upon FishBase by digitizing approximately 130 years of data from the fish predation literature. Data from the primary and grey (e.g. theses, dissertations ...


Hypergaming For Cyber: Strategy For Gaming A Wicked Problem, Joshua A. Sipper 2022 Air University

Hypergaming For Cyber: Strategy For Gaming A Wicked Problem, Joshua A. Sipper

Military Cyber Affairs

Cyber as a domain and battlespace coincides with the defined attributes of a “wicked problem” with complexity and inter-domain interactions to spare. Since its elevation to domain status, cyber has continued to defy many attempts to explain its reach, importance, and fundamental definition. Corresponding to these intricacies, cyber also presents many interlaced attributes with other information related capabilities (IRCs), namely electromagnetic warfare (EW), information operations (IO), and intelligence, surveillance, and reconnaissance (ISR), within an information warfare (IW) construct that serves to add to its multifaceted nature. In this cyber analysis, the concept of hypergaming will be defined and discussed in ...


Intraday Algorithmic Trading Using Momentum And Long Short-Term Memory Network Strategies, Andrew R. Whitinger II 2022 East Tennessee State University

Intraday Algorithmic Trading Using Momentum And Long Short-Term Memory Network Strategies, Andrew R. Whitinger Ii

Undergraduate Honors Theses

Intraday stock trading is an infamously difficult and risky strategy. Momentum and reversal strategies and long short-term memory (LSTM) neural networks have been shown to be effective for selecting stocks to buy and sell over time periods of multiple days. To explore whether these strategies can be effective for intraday trading, their implementations were simulated using intraday price data for stocks in the S&P 500 index, collected at 1-second intervals between February 11, 2021 and March 9, 2021 inclusive. The study tested 160 variations of momentum and reversal strategies for profitability in long, short, and market-neutral portfolios, totaling 480 ...


Identifying Text File Similarities In Forensic Disk Images Using Fuzzy Logic, Mindy M. Wongsa 2022 University of South Alabama

Identifying Text File Similarities In Forensic Disk Images Using Fuzzy Logic, Mindy M. Wongsa

Theses and Dissertations

Digital storage is evolving with the growth of technology. Individuals and corporations can access large amounts of digital storage, leaving digital forensics investigators with large amounts of data to collect and analyze in their forensic investigation cases. In addition, analyzing forensic disk images that contain hundreds of thousands of files can cause a problem with time since the investigators’ workloads can vary based on how many cases they are assigned. Fuzzy logic provides a pattern recognition system that could assist in identifying patterns in data. The purpose of this study was to determine if fuzzy logic could reliably aid in ...


How Blockchain Solutions Enable Better Decision Making Through Blockchain Analytics, Sammy Ter Haar 2022 University of Arkansas, Fayetteville

How Blockchain Solutions Enable Better Decision Making Through Blockchain Analytics, Sammy Ter Haar

Information Systems Undergraduate Honors Theses

Since the founding of computers, data scientists have been able to engineer devices that increase individuals’ opportunities to communicate with each other. In the 1990s, the internet took over with many people not understanding its utility. Flash forward 30 years, and we cannot live without our connection to the internet. The internet of information is what we called early adopters with individuals posting blogs for others to read, this was known as Web 1.0. As we progress, platforms became social allowing individuals in different areas to communicate and engage with each other, this was known as Web 2.0 ...


Attempting To Predict The Unpredictable: March Madness, Coleton Kanzmeier 2022 University of Nebraska at Omaha

Attempting To Predict The Unpredictable: March Madness, Coleton Kanzmeier

Theses/Capstones/Creative Projects

Each year, millions upon millions of individuals fill out at least one if not hundreds of March Madness brackets. People test their luck every year, whether for fun, with friends or family, or to even win some money. Some people rely on their basketball knowledge whereas others know it is called March Madness for a reason and take a shot in the dark. Others have even tried using statistics to give them an edge. I intend to follow a similar approach, using statistics to my advantage. The end goal is to predict this year’s, 2022, March Madness bracket. To ...


Data And Algorithmic Modeling Approaches To Count Data, Andraya Hack 2022 Murray State University

Data And Algorithmic Modeling Approaches To Count Data, Andraya Hack

Honors College Theses

Various techniques are used to create predictions based on count data. This type of data takes the form of a non-negative integers such as the number of claims an insurance policy holder may make. These predictions can allow people to prepare for likely outcomes. Thus, it is important to know how accurate the predictions are. Traditional statistical approaches for predicting count data include Poisson regression as well as negative binomial regression. Both methods also have a zero-inflated version that can be used when the data has an overabundance of zeros. Another procedure is to use computer algorithms, also known as ...


College Of Education Filemaker Extraction And End-User Database Development, Andrew Tran 2022 California State University, San Bernardino

College Of Education Filemaker Extraction And End-User Database Development, Andrew Tran

Electronic Theses, Projects, and Dissertations

The College of Education (CoE) at the California State University San Bernardino (CSUSB) developed a system to keep track of both state and national accreditation requirements using FileMaker 5, a database system. This accreditation data is crucial for reporting and record-keeping for the CSU Chancellor’s Office as well as the State of California. However, the database system was developed several decades ago, and software support has long since been dropped, causing the CoE’s legacy accreditation data to be at risk of being lost should the software or hardware suffer permanent failure. The purpose of this project was to ...


Developing Critical Thinking Military Officers, Thor Martinsen 2022 Naval Postgraduate School

Developing Critical Thinking Military Officers, Thor Martinsen

Mathematica Militaris

Critical thinking is frequently identified as an important trait for military officers. This paper examines critical thinking from a historical, pedagogical, and warfighting perspective. The author uses his experience teaching mathematical reasoning at the Naval Postgraduate School to provide helpful advice for educators charged with teaching deductive and inductive reasoning. The paper argues that critical thinking should be taught early in an officer's career. It emphasizes a systematic and Socratic instructional approach along with the importance of equipping students with the necessary tools to evaluate problem-solving techniques and critique their associated solutions. Finally, the paper discusses Augmented Intelligence and ...


Beyond Hcahps: Analysis Of Patients’ Comments Provides An Expanded View Of Their Hospital Experiences, Andrew S. Gallan, Rakesh Niraj, Awanindra Singh 2022 Florida Atlantic University

Beyond Hcahps: Analysis Of Patients’ Comments Provides An Expanded View Of Their Hospital Experiences, Andrew S. Gallan, Rakesh Niraj, Awanindra Singh

Patient Experience Journal

An important concern for health care professionals is that standardized patient surveys may not fully capture all the topics that are important to patients. As a result, health care professionals may not have a complete picture of what their patients experience. The purpose of this research is to utilize a state-of-the-art Natural Language Processing technique to make sense of patients’ solicited, unstructured comments to gain a deeper and broader understanding of their experiences in the hospital. We analyzed a large dataset of inpatient survey responses (48,592 patients generating 65,998 comments) by a patient experience survey vendor for an ...


An Exploratory Data Analysis On Covid-19 And Its Effects On Crime In New York City, Lanlie Nguyen 2022 Bowling Green State University

An Exploratory Data Analysis On Covid-19 And Its Effects On Crime In New York City, Lanlie Nguyen

Honors Projects

The purpose of this study was to analyze the effects of the COVID-19 pandemic and how it has affected the crime rates present in New York City over the years of 2019 and 2020. There is limited criminal research that investigate the connection to pandemics, and how it can be used to reduce crime rates in similar situations. The goal of this study is to reduce crime rates and provide possible policy implications.

This project analyzes the crime rate trends present before and during the COVID-19 pandemic, and compares it to the number of COVID-19 cases. Analysis of the statewide ...


Topological Data Analysis With Mapper, Gretchen Langenbahn 2022 Bowling Green State University

Topological Data Analysis With Mapper, Gretchen Langenbahn

Honors Projects

This project is an introduction and overview of Mapper. Mapper is a method of high dimensional data visualization. Data visualization is a very important part of data analysis as it allows for further interpretation and exploration of data. Visualization of high dimensional data sets can be challenging as each variable is a new dimension that must be represented on a 2D, or at most 3D, graph. Mapper allows for high dimensional visualization by using Topological methods to study the relationships between points. This project goes over two different data set: the Iris data set, and a high dimensional data set ...


Exploring Music Genres: A Study Of Optimal Differentiation By Feature, Rebecca Stetler 2022 Bowling Green State University

Exploring Music Genres: A Study Of Optimal Differentiation By Feature, Rebecca Stetler

Honors Projects

This study explores the presence of optimal differentiation in music at the feature level by genre. Popularity prediction models are constructed and used to identify influential features in predicting popularity in each genre. These influential features are then assessed for optimal differentiation of the most popular songs from all songs in the genre.


Chattanooga Crime Over Time: An Analysis Of Police Incident Open Data, Logan Bateman 2022 Southern Adventist University

Chattanooga Crime Over Time: An Analysis Of Police Incident Open Data, Logan Bateman

Campus Research Day

The police and citizens of Chattanooga may want to know where the most crime occurs, what time of day is crime or police incidents most likely to occur over time. This information can help them understand the crime hotspots in the area. This research work presents a dashboard built upon open data in attempt to bring understanding and insights to the police and citizens about police incidents from the city of Chattanooga over the past five years.


*Interactive Earthquake Visualization With Open Data, Matous Hybl 2022 Southern Adventist University

*Interactive Earthquake Visualization With Open Data, Matous Hybl

Campus Research Day

Because earthquakes claim thousands of lives and billions of dollars yearly, there is a great need to recognize patterns in seismic data. While some tools for analysis exist, most geological software is expensive and open earthquake visualizations are limited. In this project, we provide accessible earthquake visualizations aimed to encourage geologists, and science enthusiasts in general, to explore open data using accessible, yet powerful, tools.


Detection Of 3d Genome Folding At Multiple Scales, Betul Akgol-Oksuz 2022 UMass Chan Medical School

Detection Of 3d Genome Folding At Multiple Scales, Betul Akgol-Oksuz

Morningside Graduate School of Biomedical Sciences Dissertations and Theses

Understanding 3D genome structure is crucial to learn how chromatin folds and how genes are regulated through the spatial organization of regulatory elements. Various technologies have been developed to investigate genome architecture. These technologies include ligation-based 3C Methodologies such as Hi-C and Micro-C, ligation-based pull-down methods like Proximity Ligation-Assisted ChIP-seq (PLAC Seq) and Paired-end tag sequencing (ChIA PET), and ligation-free methods like Split-Pool Recognition of Interactions by Tag Extension (SPRITE) and Genome Architecture Mapping (GAM). Although these technologies have provided great insight into chromatin organization, a systematic evaluation of these technologies is lacking. Among these technologies, Hi-C has been one ...


A Web User Interface Image Processing Tool For Classifying The Extent Of Dementia Across Alzheimer’S, sathvik prasad palyam, Robin Ghosh 2022 Arkansas Tech University

A Web User Interface Image Processing Tool For Classifying The Extent Of Dementia Across Alzheimer’S, Sathvik Prasad Palyam, Robin Ghosh

ATU Research Symposium

Alzheimer's disease (AD) is the most common form of dementia. This project used four image specifications to classify the dementia stages in each patient applying the CNN algorithm. Employing the CNN-based in silico model, the authors successfully classified and predicted the different AD stages and got around 97.19% accuracy. Later, a web interface tool was developed to educate doctors or researchers to check the patients' dementia level based on the MRI brain images and suggest symptoms that strengthen the predicted level of AI. A user uploads the brain scan, which is sent to the backend server, where the ...


How Students Use The Services Available From Lindenwood Universities Library, Jennifer Sailor 2022 Lindenwood University

How Students Use The Services Available From Lindenwood Universities Library, Jennifer Sailor

Student Academic Showcase

Student Assessment Scholars took on the Library Services stakeholder proposal. Their goal was to find how students use the service available from Lindenwood Universities Library. Through evidence, it was found that students have a positive sentiment towards the Lindenwood’s Library Services. That the Students want longer hours, better marketing, and find one of the best services being the building itself. Lindenwood Universities Library Services are similar to the other schools in the athletic conference. Lindenwood University houses a Maker Lab, Career Services, and technology rentals like the other universities, but Lindenwood students were not aware of them. Overall, students ...


Digital Commons powered by bepress