Open Access. Powered by Scholars. Published by Universities.®

Data Science Commons

Open Access. Powered by Scholars. Published by Universities.®

1,318 Full-Text Articles 2,623 Authors 273,342 Downloads 180 Institutions

All Articles in Data Science

Faceted Search

1,318 full-text articles. Page 6 of 65.

Data-Optimized Spatial Field Predictions For Robotic Adaptive Sampling: A Gaussian Process Approach, Zachary Nathan 2023 Dartmouth College

Data-Optimized Spatial Field Predictions For Robotic Adaptive Sampling: A Gaussian Process Approach, Zachary Nathan

Computer Science Senior Theses

We introduce a framework that combines Gaussian Process models, robotic sensor measurements, and sampling data to predict spatial fields. In this context, a spatial field refers to the distribution of a variable throughout a specific area, such as temperature or pH variations over the surface of a lake. Whereas existing methods tend to analyze only the particular field(s) of interest, our approach optimizes predictions through the effective use of all available data. We validated our framework on several datasets, showing that errors can decline by up to two-thirds through the inclusion of additional colocated measurements. In support of adaptive sampling, …


Trustworthy Machine Learning Through The Lens Of Privacy And Security, Thi Kim Phung Lai 2023 New Jersey Institute of Technology

Trustworthy Machine Learning Through The Lens Of Privacy And Security, Thi Kim Phung Lai

Dissertations

Nowadays, machine learning (ML) becomes ubiquitous and it is transforming society. However, there are still many incidents caused by ML-based systems when ML is deployed in real-world scenarios. Therefore, to allow wide adoption of ML in the real world, especially in critical applications such as healthcare, finance, etc., it is crucial to develop ML models that are not only accurate but also trustworthy (e.g., explainable, privacy-preserving, secure, and robust). Achieving trustworthy ML with different machine learning paradigms (e.g., deep learning, centralized learning, federated learning, etc.), and application domains (e.g., computer vision, natural language, human study, malware systems, etc.) is challenging, …


Ai Approaches To Understand Human Deceptions, Perceptions, And Perspectives In Social Media, Chih-Yuan Li 2023 New Jersey Institute of Technology

Ai Approaches To Understand Human Deceptions, Perceptions, And Perspectives In Social Media, Chih-Yuan Li

Dissertations

Social media platforms have created virtual space for sharing user generated information, connecting, and interacting among users. However, there are research and societal challenges: 1) The users are generating and sharing the disinformation 2) It is difficult to understand citizens' perceptions or opinions expressed on wide variety of topics; and 3) There are overloaded information and echo chamber problems without overall understanding of the different perspectives taken by different people or groups.

This dissertation addresses these three research challenges with advanced AI and Machine Learning approaches. To address the fake news, as deceptions on the facts, this dissertation presents Machine …


Deep Hybrid Modeling Of Neuronal Dynamics Using Generative Adversarial Networks, Soheil Saghafi 2023 New Jersey Institute of Technology

Deep Hybrid Modeling Of Neuronal Dynamics Using Generative Adversarial Networks, Soheil Saghafi

Dissertations

Mechanistic modeling and machine learning methods are powerful techniques for approximating biological systems and making accurate predictions from data. However, when used in isolation these approaches suffer from distinct shortcomings: model and parameter uncertainty limit mechanistic modeling, whereas machine learning methods disregard the underlying biophysical mechanisms. This dissertation constructs Deep Hybrid Models that address these shortcomings by combining deep learning with mechanistic modeling. In particular, this dissertation uses Generative Adversarial Networks (GANs) to provide an inverse mapping of data to mechanistic models and identifies the distributions of mechanistic model parameters coherent to the data.

Chapter 1 provides background information on …


A Survey On Online Matching And Ad Allocation, Ryan Lee 2023 New Jersey Institute of Technology

A Survey On Online Matching And Ad Allocation, Ryan Lee

Theses

One of the classical problems in graph theory is matching. Given an undirected graph, find a matching which is a set of edges without common vertices. In 1990s, Richard Karp, Umesh Vazirani, and Vijay Vazirani would be the first computer scientists to use matchings for online algorithms [8]. In our domain, an online algorithm operates in the online setting where a bipartite graph is given. On one side of the graph there is a set of advertisers and on the other side we have a set of impressions. During the online phase, multiple impressions will arrive and the objective of …


Towards Generalizable Machine Learning Models For Computer-Aided Diagnosis In Medicine, Yiyang Wang 2023 DePaul University

Towards Generalizable Machine Learning Models For Computer-Aided Diagnosis In Medicine, Yiyang Wang

College of Computing and Digital Media Dissertations

Hidden stratification represents a phenomenon in which a training dataset contains unlabeled (hidden) subsets of cases that may affect machine learning model performance. Machine learning models that ignore the hidden stratification phenomenon--despite promising overall performance measured as accuracy and sensitivity--often fail at predicting the low prevalence cases, but those cases remain important. In the medical domain, patients with diseases are often less common than healthy patients, and a misdiagnosis of a patient with a disease can have significant clinical impacts. Therefore, to build a robust and trustworthy CAD system and a reliable treatment effect prediction model, we cannot only pursue …


"Church On My Couch": Predicting The Future Impact Of Online Ministry Based On The Impact During Covid-19, Samukeliso Mabarani, Sikhumbuzo Dube 2023 Adventist University of Africa

"Church On My Couch": Predicting The Future Impact Of Online Ministry Based On The Impact During Covid-19, Samukeliso Mabarani, Sikhumbuzo Dube

Adventist Human-Subject Researchers Association

With “everything from home” as the new norm, “how does the use of digital platforms impact Adventist education, community engagement, and spiritual outreach?” Using a quantitative approach, we draw insights from online ministry during Covid-19 and use the insights to predict the future impact of online ministry statistically.


Covid-19 In Casinos: Analysis Of Covid-19 Contamination And Spread With Economic Impact Assessment, Anastasia (Stasi) D. Baran, Jason D. Fiege 2023 nQube Data Science Inc.

Covid-19 In Casinos: Analysis Of Covid-19 Contamination And Spread With Economic Impact Assessment, Anastasia (Stasi) D. Baran, Jason D. Fiege

International Conference on Gambling & Risk Taking

Abstract:

The COVID-19 pandemic caused tremendous disruption for casinos, with the virus causing various lengths of shutdowns, capacity restrictions, and social distancing strategies such as machine removals or section closures. Although most of the world has now eased off these measures, it is important to review lessons learned to understand, and better prepare for similar circumstances in the future. We present Monte Carlo slot floor simulation software customized to simulate players spreading COVID-19 on the slot floor. We simulate the amount of touch surface contamination; the number of potential surface contact exposure events per day, and a proximity exposures statistic …


Payments Data In Gambling Research, Kasra Ghaharian, Mana Azizsoltani 2023 University of Nevada, Las Vegas

Payments Data In Gambling Research, Kasra Ghaharian, Mana Azizsoltani

International Conference on Gambling & Risk Taking

A considerable body of gambling-related research has leveraged gamblers' behavioral tracking data to address a broad set of research questions. These data have typically comprised of gamblers' betting-related behaviors including, for example, the frequency and volume of betting. The analysis of gamblers' payment-related behavioral data is far less common, but provides a fruitful avenue gambling-related research.

In this presentation we discuss a selection of potential research opportunities that payments transaction data presents. We supplement this discussion with specific analyses that have been performed by our research group. We also discuss knowledge gaps and areas for future research.


Statistical Methods To Generate Artificial Slot Floor Data For The Advancement Of Casino Related Research, Courtney Bonner, Anastasia (Stasi) D. Baran, Jason D. Fiege, Saman Muthukumarana 2023 nQube Data Science Inc.

Statistical Methods To Generate Artificial Slot Floor Data For The Advancement Of Casino Related Research, Courtney Bonner, Anastasia (Stasi) D. Baran, Jason D. Fiege, Saman Muthukumarana

International Conference on Gambling & Risk Taking

Abstract:

A common difficulty when researching gambling topics is the availability of high-quality data sets for development and testing. Due to the high level of secrecy within the gambling industry, if data is obtained for research purposes it is often prohibitively obfuscated, incomplete, or aggregated. Although these data have allowed for advancement in academic work, it leaves both the researchers and readers left wondering about what would be possible if more detailed data sets were available. To mitigate the paucity of data available to researchers, we present a Markov chain-based statistical process for producing artificial event data for a simulated …


The Rocket: Analyzing Rtp (Return To Player), Payoff Distribution And Player Behavior In Crash Games, Mikhail M. Sher, Robert Haywood Scott III, Jonathan A. Daigle 2023 Monmouth University

The Rocket: Analyzing Rtp (Return To Player), Payoff Distribution And Player Behavior In Crash Games, Mikhail M. Sher, Robert Haywood Scott Iii, Jonathan A. Daigle

International Conference on Gambling & Risk Taking

Abstract

Rocket is a crash game developed by DraftKings, an American publicly traded online casino, sports betting and fantasy sports company. DraftKings Rocket is a game played with a rising rocket. Players must exit the rocket at any point before the rocket crashes. In that case they receive the payoff in accordance to the multiplier of their exit point. If the rocket crashes before the player bails, player’s payoff is 0 (and they lose their bet).

The game boasts an unprecedented 97% RTP (Return to Player). For comparison, Atlantic City casino slots typically have a 91-92% RTP, while Vegas casino …


The Locals Casino As A Social Network – Can An Interconnected Community Of Players Detect Differences In Hold?, Jason D. Fiege, Anastasia (Stasi) D. Baran 2023 nQube Data Science Inc.

The Locals Casino As A Social Network – Can An Interconnected Community Of Players Detect Differences In Hold?, Jason D. Fiege, Anastasia (Stasi) D. Baran

International Conference on Gambling & Risk Taking

Abstract

It is difficult for individual players to detect differences in theoretical hold between slot machines without playing an unrealistically large number of games. This difficulty occurs because the fractional loss incurred by a player converges only slowly to the theoretical hold in the presence of volatility designed into slot pay tables. Nevertheless, many operators believe that players can detect changes in hold or differences compared to competition, especially in a locals casino market, and therefore resist increasing holds. Instead of investigating whether individual players can detect differences in hold, we ask whether a population of casino regulars who share …


Algorithmic Bias: Causes And Effects On Marginalized Communities, Katrina M. Baha 2023 University of San Diego

Algorithmic Bias: Causes And Effects On Marginalized Communities, Katrina M. Baha

Undergraduate Honors Theses

Individuals from marginalized backgrounds face different healthcare outcomes due to algorithmic bias in the technological healthcare industry. Algorithmic biases, which are the biases that arise from the set of steps used to solve or analyze a problem, are evident when people from marginalized communities use healthcare technology. For example, many pulse oximeters, which are the medical devices used to measure oxygen saturation in the blood, are not able to accurately read people who have darker skin tones. Thus, people with darker skin tones are not able to receive proper health care due to their pulse oximetry data being inaccurate. This …


Special Education: Inclusion And Exclusion In The K-12 U.S. Educational System, Erik Brault 2023 University of San Diego

Special Education: Inclusion And Exclusion In The K-12 U.S. Educational System, Erik Brault

Dissertations

The U.S. Department of Education defines students with disabilities as those having a physical or mental impairment that substantially limits one or more life activities. Previous research has found that students with disabilities placed in inclusive environments perform better academically and socially compared to students with disabilities who are placed in segregated environments. Yet, we know that inclusion in K-12 general education classrooms across the country is not consistently implemented.

The purpose of this study was to better understand the effects, if any, of general education high school teachers’ personal and professional experiences and knowledge on their attitudes toward educating …


Utilizing New Technologies To Measure Therapy Effectiveness For Mental And Physical Health, Jonathan Ossie 2023 University of San Diego

Utilizing New Technologies To Measure Therapy Effectiveness For Mental And Physical Health, Jonathan Ossie

Dissertations

Mental health is quickly becoming a major policy concern, with recent data reporting increasing and disproportionately worse mental health outcomes, including anxiety, depression, increased substance abuse, and elevated suicidal ideation. One specific population that is especially high risk for these issues is the military community because military conflict, deployment stressors, and combat exposure contribute to the risk of mental health problems.

Although several pharmacological approaches have been employed to combat this epidemic, their efficacy is mixed at best, which has led to novel nonpharmacological approaches. One such approach is Operation Surf, a nonprofit that provides nature-based programs advocating the restorative …


Linear Regression With Regularization On The Genetic Architecture Of Maize Flowering Time, Roland Fiagbe 2023 University of Central Florida

Linear Regression With Regularization On The Genetic Architecture Of Maize Flowering Time, Roland Fiagbe

Data Science and Data Mining

Over a century, the maize crop has been one of the most important crop species that is targeted for genetic investigations and experiments. One of the major experiments that have been a topic of interest is crossing inbred lines to produce better offspring through a process called heterosis. Crossing the inbred lines create numerous SNP markers that determine the time to male flowering. This project seeks to explore the SNP markers to select the most relevant ones for predicting time to male flowering using linear regression with regularization methods due to the fact that p > n in our dataset. Various …


Tempers Rising: The Effect Of Heat On Spite, Jake C. Cosgrove 2023 University of San Francisco

Tempers Rising: The Effect Of Heat On Spite, Jake C. Cosgrove

Master's Theses

The relationship between heat and harmful outcomes is well documented, with research connecting various adverse economic outcomes to the climate. In the presence of increasing global warming and climate change, understanding why the climate leads to negative economic outcomes is essential for forming peaceful institutions of the future. We study how behavioral economic outcomes change in the presence of heat through a lab experiment involving 1,110 observations conducted in five different countries. This paper specifically focuses on the social preference outcome of spite. We find that increased time exposure to the treatment effect of heat is required to elicit an …


Analytical Approach For Monitoring The Behavior Of Patients With Pancreatic Adenocarcinoma At Different Stages As A Function Of Time, Aditya Chakaborty Dr, Chris P. Tsokos Dr 2023 Eastern Virginia Medical School

Analytical Approach For Monitoring The Behavior Of Patients With Pancreatic Adenocarcinoma At Different Stages As A Function Of Time, Aditya Chakaborty Dr, Chris P. Tsokos Dr

Biology and Medicine Through Mathematics Conference

No abstract provided.


Automated Delineation Of Visual Area Boundaries And Eccentricities By A Cnn Using Functional, Anatomical, And Diffusion-Weighted Mri Data, Noah C. Benson, Bogeng Song, Toshikazu Miyata, Hiromasa Takemura, Jonathan Winawer 2023 University of Washington

Automated Delineation Of Visual Area Boundaries And Eccentricities By A Cnn Using Functional, Anatomical, And Diffusion-Weighted Mri Data, Noah C. Benson, Bogeng Song, Toshikazu Miyata, Hiromasa Takemura, Jonathan Winawer

MODVIS Workshop

Delineating visual field maps and iso-eccentricities from fMRI data is an important but time-consuming task for many neuroimaging studies on the human visual cortex because the traditional methods of doing so using retinotopic mapping experiments require substantial expertise as well as scanner, computer, and human time. Automated methods based on gray-matter anatomy or a combination of anatomy and functional mapping can reduce these requirements but are less accurate than experts. Convolutional Neural Networks (CNNs) are powerful tools for automated medical image segmentation. We hypothesize that CNNs can define visual area boundaries with high accuracy. We trained U-Net CNNs with ResNet18 …


Toward A Manifold Encoding Neural Responses, Luciano Dyballa, Andra M. Rudzite, Mahmood S. Hoseini, Mishek Thapa, Michael P. Stryker, Greg D. Field, Steven W. Zucker 2023 Yale University

Toward A Manifold Encoding Neural Responses, Luciano Dyballa, Andra M. Rudzite, Mahmood S. Hoseini, Mishek Thapa, Michael P. Stryker, Greg D. Field, Steven W. Zucker

MODVIS Workshop

Understanding circuit properties from physiological data presents two challenges: (i) recordings do not reveal connectivity, and (ii) stimuli only exercise circuits to a limited extent. We address these challenges for the mouse visual system with a novel neural manifold obtained using unsupervised algorithms. Each point in our manifold is a neuron; nearby neurons respond similarly in time to similar parts of a stimulus ensemble. This ensemble includes drifting gratings and flows, i.e., patterns resembling what a mouse would “see” running through fields.

Regarding (i), our manifold differs from the standard practice in computational neuroscience: embedding trials in neural coordinates. Topology …


Digital Commons powered by bepress