Open Access. Powered by Scholars. Published by Universities.®

Data Science Commons

Open Access. Powered by Scholars. Published by Universities.®

1,514 Full-Text Articles 3,028 Authors 435,013 Downloads 190 Institutions

All Articles in Data Science

Faceted Search

1,514 full-text articles. Page 7 of 75.

Les Expositions Turnus, Une Page D’Histoire Transnationale Des Beaux-Arts En Suisse À La Fin Du Xixe Siècle. Et Comment Découvrir Les Humanités Numériques, Béatrice Joyeux-Prunel 2023 Université de Genève, Switzerland

Les Expositions Turnus, Une Page D’Histoire Transnationale Des Beaux-Arts En Suisse À La Fin Du Xixe Siècle. Et Comment Découvrir Les Humanités Numériques, Béatrice Joyeux-Prunel

Artl@s Bulletin

Cet article présente le travail de la classe d’introduction aux humanités numériques de l’Université de Genève sur les expositions Turnus en Suisse à partir des années 1840. Près de 50 catalogues ont été retranscrits, décrits et structurés à l’aide de scripts Python, puis géolocalisés. Les données ont été ajoutées à BasArt, le répertoire mondial de catalogues d’expositions d’Artl@s (https://artlas.huma-num.fr/map). Elles permettent de mieux comprendre les premières années de ces expositions et leurs dynamiques locales, fédérales et internationales. Le Turnus fut une plaque tournante pour les artistes suisses, voire un tremplin vers le marché européen de l’art.


High-Performance Computing In Covariant Loop Quantum Gravity, Pietropaolo Frisoni 2023 The University of Western Ontario

High-Performance Computing In Covariant Loop Quantum Gravity, Pietropaolo Frisoni

Electronic Thesis and Dissertation Repository

This Ph.D. thesis presents a compilation of the scientific papers I published over the last three years during my Ph.D. in loop quantum gravity (LQG). First, we comprehensively introduce spinfoam calculations with a practical pedagogical paper. We highlight LQG's unique features and mathematical formalism and emphasize the computational complexities associated with its calculations. The subsequent articles delve into specific aspects of employing high-performance computing (HPC) in LQG research. We discuss the results obtained by applying numerical methods to studying spinfoams' infrared divergences, or ``bubbles''. This research direction is crucial to define the continuum limit of LQG properly. We investigate the …


Development Of An App For The Kalamazoo Nature Center, Ernest Au 2023 Western Michigan University

Development Of An App For The Kalamazoo Nature Center, Ernest Au

Honors Theses

Kalamazoo Nature Center (KNC), which has been recognized by its peers as one of the top nature centers in the country, is home to over 14 miles of hiking trails winding through woods, wetlands, and prairies. There are numerous places/plots in KNC that have an interesting and impressive history besides being home to a variety of animals and hundreds of wildflowers and other plant life. To improve the visitor’s experience at KNC, we will design a software app via the senior capstone project at the department of Computer Science at WMU. As the first step towards establishing a reference model …


Teaching Reproducibility To First Year College Students: Reflections From An Introductory Data Science Course, Brennan L. Bean 2023 Utah State University

Teaching Reproducibility To First Year College Students: Reflections From An Introductory Data Science Course, Brennan L. Bean

Journal on Empowering Teaching Excellence

Access the online Pressbooks version of this article here.

Modern technology threatens traditional modes of classroom assessment by providing students with automated ways to write essays and take exams. At the same time, modern technology continues to expand the accessibility of computational tools that promise to increase the potential scope and quality of class projects. This paper presents a case study where students are asked to complete a “reproducible” final project in an introductory data science course using the R programming language. A reproducible project is one where an instructor can easily regenerate the results and conclusions from the submitted …


Convolution And Autoencoders Applied To Nonlinear Differential Equations, Noah Borquaye 2023 East Tennessee State University

Convolution And Autoencoders Applied To Nonlinear Differential Equations, Noah Borquaye

Electronic Theses and Dissertations

Autoencoders, a type of artificial neural network, have gained recognition by researchers in various fields, especially machine learning due to their vast applications in data representations from inputs. Recently researchers have explored the possibility to extend the application of autoencoders to solve nonlinear differential equations. Algorithms and methods employed in an autoencoder framework include sparse identification of nonlinear dynamics (SINDy), dynamic mode decomposition (DMD), Koopman operator theory and singular value decomposition (SVD). These approaches use matrix multiplication to represent linear transformation. However, machine learning algorithms often use convolution to represent linear transformations. In our work, we modify these approaches to …


Random Variable Spaces: Mathematical Properties And An Extension To Programming Computable Functions, Mohammed Kurd-Misto 2023 Chapman University

Random Variable Spaces: Mathematical Properties And An Extension To Programming Computable Functions, Mohammed Kurd-Misto

Computational and Data Sciences (PhD) Dissertations

This dissertation aims to extend the boundaries of Programming Computable Functions (PCF) by introducing a novel collection of categories referred to as Random Variable Spaces. Originating as a generalization of Quasi-Borel Spaces, Random Variable Spaces are rigorously defined as categories where objects are sets paired with a collection of random variables from an underlying measurable space. These spaces offer a theoretical foundation for extending PCF to natively handle stochastic elements.

The dissertation is structured into seven chapters that provide a multi-disciplinary background, from PCF and Measure Theory to Category Theory with special attention to Monads and the Giry Monad. The …


Generalized Differentiable Neural Architecture Search With Performance And Stability Improvements, Emily J. Herron 2023 University of Tennessee, Knoxville

Generalized Differentiable Neural Architecture Search With Performance And Stability Improvements, Emily J. Herron

Doctoral Dissertations

This work introduces improvements to the stability and generalizability of Cyclic DARTS (CDARTS). CDARTS is a Differentiable Architecture Search (DARTS)-based approach to neural architecture search (NAS) that uses a cyclic feedback mechanism to train search and evaluation networks concurrently, thereby optimizing the search process by enforcing that the networks produce similar outputs. However, the dissimilarity between the loss functions used by the evaluation networks during the search and retraining phases results in a search-phase evaluation network, a sub-optimal proxy for the final evaluation network utilized during retraining. ICDARTS, a revised algorithm that reformulates the search phase loss functions to ensure …


Wavelet Compression As An Observational Operator In Data Assimilation Systems For Sea Surface Temperature, Bradley J. Sciacca 2023 University of New Orleans, New Orleans

Wavelet Compression As An Observational Operator In Data Assimilation Systems For Sea Surface Temperature, Bradley J. Sciacca

University of New Orleans Theses and Dissertations

The ocean remains severely under-observed, in part due to its sheer size. Containing nearly billion of water with most of the subsurface being invisible because water is extremely difficult to penetrate using electromagnetic radiation, as is typically used by satellite measuring instruments. For this reason, most observations of the ocean have very low spatial-temporal coverage to get a broad capture of the ocean’s features. However, recent “dense but patchy” data have increased the availability of high-resolution – low spatial coverage observations. These novel data sets have motivated research into multi-scale data assimilation methods. Here, we demonstrate a new assimilation approach …


Exact Models, Heuristics, And Supervised Learning Approaches For Vehicle Routing Problems, Zefeng Lyu 2023 University of Tennessee, Knoxville

Exact Models, Heuristics, And Supervised Learning Approaches For Vehicle Routing Problems, Zefeng Lyu

Doctoral Dissertations

This dissertation presents contributions to the field of vehicle routing problems by utilizing exact methods, heuristic approaches, and the integration of machine learning with traditional algorithms. The research is organized into three main chapters, each dedicated to a specific routing problem and a unique methodology. The first chapter addresses the Pickup and Delivery Problem with Transshipments and Time Windows, a variant that permits product transfers between vehicles to enhance logistics flexibility and reduce costs. To solve this problem, we propose an efficient mixed-integer linear programming model that has been shown to outperform existing ones. The second chapter discusses a practical …


Making Data Meaningful: Stakeholder Perceptions On Data Visualization And Data Management Practices Within A Multi-Tiered System Of Supports (Mtss), Domenick Saia 2023 National Louis University

Making Data Meaningful: Stakeholder Perceptions On Data Visualization And Data Management Practices Within A Multi-Tiered System Of Supports (Mtss), Domenick Saia

Dissertations

Data-driven decision-making and collaboration are core pillars of a multi-tiered system of supports (MTSS); however, timely and accessible data use, as well as data literacy and visualization literacy skills, are challenges school leaders and educators face related to implementing such frameworks. I hypothesized efficient data management systems and data visualization tools enable school teams to predict student learning outcomes, readily communicate, and better understand student data. The purpose of this study design was to highlight a need for more efficient data structures that allow school stakeholders to balance their roles within an MTSS framework more effectively. The context of this …


Exploration And Statistical Modeling Of Profit, Caleb Gibson 2023 East Tennessee State University

Exploration And Statistical Modeling Of Profit, Caleb Gibson

Undergraduate Honors Theses

For any company involved in sales, maximization of profit is the driving force that guides all decision-making. Many factors can influence how profitable a company can be, including external factors like changes in inflation or consumer demand or internal factors like pricing and product cost. Understanding specific trends in one's own internal data, a company can readily identify problem areas or potential growth opportunities to help increase profitability.

In this discussion, we use an extensive data set to examine how a company might analyze their own data to identify potential changes the company might investigate to drive better performance. Based …


Predictive Model For Cfpb Consumer Complaints, Vyshnavi Nalluri 2023 California State University, San Bernardino

Predictive Model For Cfpb Consumer Complaints, Vyshnavi Nalluri

Electronic Theses, Projects, and Dissertations

Within the dynamic and highly competitive financial industry, the timely and efficient resolution of customer complaints stands as a central challenge, particularly in the intricate domain of mortgage services. The traditional processes for handling these complaints have long been recognized as laborious and resource-intensive, a situation that financial institutions, including the esteemed Wells Fargo, are keen to improve.

Currently, the industry largely relies on basic data analytics for identifying trends in customer complaints. However, this approach has its limitations, especially when dealing with complaints within the mortgage services domain. In response to this challenge, this research advocates the adoption of …


The Psychological Science Accelerator's Covid-19 Rapid-Response Dataset, Erin M. BUCHANAN, Andree HARTANTO 2023 Harrisburg University of Science and Technology

The Psychological Science Accelerator's Covid-19 Rapid-Response Dataset, Erin M. Buchanan, Andree Hartanto

Research Collection School of Social Sciences

In response to the COVID-19 pandemic, the Psychological Science Accelerator coordinated three large-scale psychological studies to examine the effects of loss-gain framing, cognitive reappraisals, and autonomy framing manipulations on behavioral intentions and affective measures. The data collected (April to October 2020) included specific measures for each experimental study, a general questionnaire examining health prevention behaviors and COVID-19 experience, geographical and cultural context characterization, and demographic information for each participant. Each participant started the study with the same general questions and then was randomized to complete either one longer experiment or two shorter experiments. Data were provided by 73,223 participants with …


A Bridge Between Graph Neural Networks And Transformers: Positional Encodings As Node Embeddings, Bright Kwaku Manu 2023 East Tennessee State University

A Bridge Between Graph Neural Networks And Transformers: Positional Encodings As Node Embeddings, Bright Kwaku Manu

Electronic Theses and Dissertations

Graph Neural Networks and Transformers are very powerful frameworks for learning machine learning tasks. While they were evolved separately in diverse fields, current research has revealed some similarities and links between them. This work focuses on bridging the gap between GNNs and Transformers by offering a uniform framework that highlights their similarities and distinctions. We perform positional encodings and identify key properties that make the positional encodings node embeddings. We found that the properties of expressiveness, efficiency and interpretability were achieved in the process. We saw that it is possible to use positional encodings as node embeddings, which can be …


Generative Adversarial Game With Tailored Quantum Feature Maps For Enhanced Classification, Anais Sandra Nguemto Guiawa 2023 University of Tennessee, Knoxville

Generative Adversarial Game With Tailored Quantum Feature Maps For Enhanced Classification, Anais Sandra Nguemto Guiawa

Doctoral Dissertations

In the burgeoning field of quantum machine learning, the fusion of quantum computing and machine learning methodologies has sparked immense interest, particularly with the emergence of noisy intermediate-scale quantum (NISQ) devices. These devices hold the promise of achieving quantum advantage, but they grapple with limitations like constrained qubit counts, limited connectivity, operational noise, and a restricted set of operations. These challenges necessitate a strategic and deliberate approach to crafting effective quantum machine learning algorithms.

This dissertation revolves around an exploration of these challenges, presenting innovative strategies that tailor quantum algorithms and processes to seamlessly integrate with commercial quantum platforms. A …


General Population Projection Model With Census Population Data, Takenori Tsuruga 2023 California State University, San Bernardino

General Population Projection Model With Census Population Data, Takenori Tsuruga

Electronic Theses, Projects, and Dissertations

The US Census Bureau offers a wide range of data, and within this array, the American Community Survey 5-Year Estimate (ACS5) serves as a valuable resource for understanding the US population. This project embarks on an exploration of Machine Learning and the Software Development process with the goal of generating effective population projections from ACS5 data. The project aims to provide methods to make predictions for every city and town in the US, encompassing their total population and population divided into 5-year age groups. It's worth noting that while the generation of these projections is grounded in the generalized statistical …


Review Classification Using Natural Language Processing And Deep Learning, Brian Nazareth 2023 California State University – San Bernardino

Review Classification Using Natural Language Processing And Deep Learning, Brian Nazareth

Electronic Theses, Projects, and Dissertations

Sentiment Analysis is an ongoing research in the field of Natural Language Processing (NLP). In this project, I will evaluate my testing against an Amazon Reviews Dataset, which contains more than 100 thousand reviews from customers. This project classifies the reviews using three methods – using a sentiment score by comparing the words of the reviews based on every positive and negative word that appears in the text with the Opinion Lexicon dataset, by considering the text’s variating sentiment polarity scores with a Python library called TextBlob, and with the help of neural network training. I have created a neural …


Parameter Estimation For Patient Enrollment In Clinical Trials, Junyan Liu 2023 William & Mary

Parameter Estimation For Patient Enrollment In Clinical Trials, Junyan Liu

Undergraduate Honors Theses

In this paper, we study the Poisson-gamma model for recruitment time in clinical trials. We proved several properties of this model that match our intuitions from a reliability perspective, did simulations on this model, and used different optimization methods to estimate the parameters. Although the behaviors of the optimization methods were unfavorable and unstable, we identified certain conditions and provided potential explanations for this phenomenon and further insights into the Poisson-gamma model.


Ai Assisted Workflows For Computational Electromagnetics And Antenna Design, Oameed Noakoasteen 2023 University of New Mexico - Main Campus

Ai Assisted Workflows For Computational Electromagnetics And Antenna Design, Oameed Noakoasteen

Electrical and Computer Engineering ETDs

These days large volumes of data can be recorded and manipulated with relative ease. If valuable information can be extracted from them, these vast amounts of data can be a rich resource not just for the digital economy but also for scientific discovery and development of technology. When it comes to deriving valuable information from data, Machine Learning (ML) emerges as the key solution. To unlock the potential benefits of ML to science and technology, extensive research is needed to explore what algorithms are suitable and how they can be applied.

To shine light on various ways that ML can …


Understanding Collective Performance: Human Factors And Team Science, Joseph Keebler 2023 Embry-Riddle Aeronautical University

Understanding Collective Performance: Human Factors And Team Science, Joseph Keebler

Math Department Colloquium Series

This talk will focus on modern issues with team science. Joe will discuss a variety of projects he's been involved with aimed at improving teamwork in complex sociotechnical systems including military, aviation, and healthcare. He will discuss major theoretical facets of teamwork and provide evidence-based best practices that were utilized to improve teams in applied settings.


Digital Commons powered by bepress