Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 28 of 28

Full-Text Articles in Physical Sciences and Mathematics

Toward The Integration Of Behavioral Sensing And Artificial Intelligence, Subigya K. Nepal May 2024

Toward The Integration Of Behavioral Sensing And Artificial Intelligence, Subigya K. Nepal

Dartmouth College Ph.D Dissertations

The integration of behavioral sensing and Artificial Intelligence (AI) has increasingly proven invaluable across various domains, offering profound insights into human behavior, enhancing mental health monitoring, and optimizing workplace productivity. This thesis presents five pivotal studies that employ smartphone, wearable, and laptop-based sensing to explore and push the boundaries of what these technologies can achieve in real-world settings. This body of work explores the innovative and practical applications of AI and behavioral sensing to capture and analyze data for diverse purposes. The first part of the thesis comprises longitudinal studies on behavioral sensing, providing a detailed, long-term view of how …


Towards Scalable Autonomous Underwater Construction With Free-Floating Robots, Samuel Eric Lensgraf May 2024

Towards Scalable Autonomous Underwater Construction With Free-Floating Robots, Samuel Eric Lensgraf

Dartmouth College Ph.D Dissertations

This thesis presents the first free-floating autonomous underwater construction system. Our system built structures weighing up to 100Kg (75Kg in water). Our robot builds structures made of standard cinder blocks and custom designed interlocking cement blocks. It is the first construction robot that uses active buoyancy compensation to efficiently transport building materials. It is also the first construction robot that can reconfigure visual fiducial markers on a foundation during the construction process to expand its working area.

Underwater construction is a challenging problem for free-floating robots. Currents can buffet the robot, and visibility conditions can change. We focus on achieving …


Poster, Performed: Understanding Public Opinions Of Authorship In Generative Artificial Intelligence Models Via Analogy, Wylie Z. Kasai Jan 2024

Poster, Performed: Understanding Public Opinions Of Authorship In Generative Artificial Intelligence Models Via Analogy, Wylie Z. Kasai

Dartmouth College Master’s Theses

Over the last decade, generative artificial intelligence models have advanced significantly and provided the public with several tools to create new works of art. However, the true authorship of these works has been debated due to their training on web-scraped data. Serving as an analogy to these larger models, Poster, Performed is an interactive artificial intelligence exhibition project that uses image assets submitted by the public to create poster compositions with custom image processing algorithms. During the course of a four-day exhibition, visitors were asked to identify the exhibition’s primary artist from five options: (1) participants who submitted image assets, …


Disentangling Cyclic Causality: An Instance-Based Framework For Causal Discovery, Chase A. Yakaboski Jan 2024

Disentangling Cyclic Causality: An Instance-Based Framework For Causal Discovery, Chase A. Yakaboski

Dartmouth College Ph.D Dissertations

Correlation does not imply causation" is one of the fundamental principles taught in science, emphasizing that associations between variables do not necessarily indicate causality. Yet, over the past three decades, extensive research has begun to challenge this perspective by developing sophisticated methods to differentiate causal from correlative relationships. This research suggests that correlations often involve a blend of confounded and causal interactions, which, given certain assumptions, can be disentangled to uncover actionable insights and deepen our understanding of physical, biological, and societal systems.

Accurately discovering causal relationships from data amidst cyclic dynamics remains a challenging open problem in causality research. …


Probing And Enhancing The Reliance Of Transformer Models On Poetic Information, Almas Abdibayev Dec 2023

Probing And Enhancing The Reliance Of Transformer Models On Poetic Information, Almas Abdibayev

Dartmouth College Ph.D Dissertations

Transformer models have achieved remarkable success in the widest variety of domains, spanning not just a multitude of tasks within natural language processing, but also those in computer vision, speech, and reinforcement learning. The key to this success is largely attributed to the self-attention mechanism, particularly its ability to scale in performance as it grows in the number of parameters. Extensive effort has been underway to study the major linguistic properties learned by these models during the course of their pretraining. However, the role of certain finer linguistic phenomena present in language and their utilization by Transformers has not been …


Energy-Aware Path Planning For Fixed-Wing Seaplane Uavs, Benjamin Atkinson Wolsieffer Sep 2023

Energy-Aware Path Planning For Fixed-Wing Seaplane Uavs, Benjamin Atkinson Wolsieffer

Dartmouth College Master’s Theses

Fixed-wing unmanned aerial vehicles (UAVs) are commonly used for remote sensing applications over water bodies, such as monitoring water quality or tracking harmful algal blooms. However, there are some types of measurements that are difficult to accurately obtain from the air. In existing work, water samples have been collected in situ either by hand, with an unmanned surface vehicle (USV), or with a vertical takeoff and landing (VTOL) UAV such as a multirotor. We propose a path planner, landing control algorithm, and energy estimator that will allow a low-cost and energy efficient fixed-wing UAV to carry out a combined remote …


Self-Supervised Pretraining And Transfer Learning On Fmri Data With Transformers, Sean Paulsen Aug 2023

Self-Supervised Pretraining And Transfer Learning On Fmri Data With Transformers, Sean Paulsen

Dartmouth College Ph.D Dissertations

Transfer learning is a machine learning technique founded on the idea that knowledge acquired by a model during “pretraining” on a source task can be transferred to the learning of a target task. Successful transfer learning can result in improved performance, faster convergence, and reduced demand for data. This technique is particularly desirable for the task of brain decoding in the domain of functional magnetic resonance imaging (fMRI), wherein even the most modern machine learning methods can struggle to decode labelled features of brain images. This challenge is due to the highly complex underlying signal, physical and neurological differences between …


System-Characterized Artificial Intelligence Approaches For Cardiac Cellular Systems And Molecular Signature Analysis, Ziqian Wu Jun 2023

System-Characterized Artificial Intelligence Approaches For Cardiac Cellular Systems And Molecular Signature Analysis, Ziqian Wu

Dartmouth College Ph.D Dissertations

The dissertation presents a significant advancement in the field of cardiac cellular systems and molecular signature systems by employing machine learning and generative artificial intelligence techniques. These methodologies are systematically characterized and applied to address critical challenges in these domains. A novel computational model is developed, which combines machine learning tools and multi-physics models. The main objective of this model is to accurately predict complex cellular dynamics, taking into account the intricate interactions within the cardiac cellular system. Furthermore, a comprehensive framework based on generative adversarial networks (GANs) is proposed. This framework is designed to generate synthetic data that faithfully …


Stereotypes And Language Models: Understanding How Language Models Encode Stereotypes, Debiasing Language Models, And Examining How Stereotypes Affect Conversations, Brian C. Wang Jun 2023

Stereotypes And Language Models: Understanding How Language Models Encode Stereotypes, Debiasing Language Models, And Examining How Stereotypes Affect Conversations, Brian C. Wang

Computer Science Senior Theses

This thesis describes a variety of approaches in examining how language models encode stereotypes (understanding stereotypes from a model point-of-view), debiasing language models, and using language models to understand how stereotypes affect conversations (understanding stereotypes from a conversational point-of-view). We present a novel approach for textual clues analysis that makes language models more interpretable, combining the understanding of what stereotypes the internal structures of language models have encoded during their initial training (via attention-based analysis) and understanding what textual clues are most relevant to identifying stereotypes for models trained to detect stereotypes (via SHAP-based analysis). We find that different pre-trained …


Sarcasm Detection In English And Arabic Tweets Using Transformer Models, Rishik Lad Jun 2023

Sarcasm Detection In English And Arabic Tweets Using Transformer Models, Rishik Lad

Computer Science Senior Theses

This thesis describes our approach toward the detection of sarcasm and its various types in English and Arabic Tweets through methods in deep learning. There are five problems we attempted: (1) detection of sarcasm in English Tweets, (2) detection of sarcasm in Arabic Tweets, (3) determining the type of sarcastic speech subcategory for English Tweets, (4) determining which of two semantically equivalent English Tweets is sarcastic, and (5) determining which of two semantically equivalent Arabic Tweets is sarcastic. All tasks were framed as classification problems, and our contributions are threefold: (a) we developed an English binary classifier system with RoBERTa, …


Data-Optimized Spatial Field Predictions For Robotic Adaptive Sampling: A Gaussian Process Approach, Zachary Nathan May 2023

Data-Optimized Spatial Field Predictions For Robotic Adaptive Sampling: A Gaussian Process Approach, Zachary Nathan

Computer Science Senior Theses

We introduce a framework that combines Gaussian Process models, robotic sensor measurements, and sampling data to predict spatial fields. In this context, a spatial field refers to the distribution of a variable throughout a specific area, such as temperature or pH variations over the surface of a lake. Whereas existing methods tend to analyze only the particular field(s) of interest, our approach optimizes predictions through the effective use of all available data. We validated our framework on several datasets, showing that errors can decline by up to two-thirds through the inclusion of additional colocated measurements. In support of adaptive sampling, …


Product Review Classification Using Machine Learning And Statistical Data Analysis, Kajal Singh May 2023

Product Review Classification Using Machine Learning And Statistical Data Analysis, Kajal Singh

Independent Student Projects and Publications

The aim of the paper is to implement and analyze the machine learning models for product review dataset. The project focuses on binary classification, multi-class classification, and clustering approaches to analyze and categorize product reviews. The performance of the models over each of the five classification tasks is measured by the 5-fold cross-validation scores over the training data.


Combating Fake News: A Gravity Well Simulation To Model Echo Chamber Formation In Social Media, Jeremy E. Thompson Jan 2023

Combating Fake News: A Gravity Well Simulation To Model Echo Chamber Formation In Social Media, Jeremy E. Thompson

Dartmouth College Ph.D Dissertations

Fake news has become a serious concern as distributing misinformation has become easier and more impactful. A solution is critically required. One solution is to ban fake news, but that approach could create more problems than it solves, and would also be problematic from the beginning, as it must first be identified to be banned. We initially propose a method to automatically recognize suspected fake news, and to provide news consumers with more information as to its veracity. We suggest that fake news is comprised of two components: premises and misleading content. Fake news can be condensed down to a …


Leveraging Context Patterns For Medical Entity Classification, Garrett Johnston Jun 2022

Leveraging Context Patterns For Medical Entity Classification, Garrett Johnston

Computer Science Senior Theses

The ability of patients to understand health-related text is important for optimal health outcomes. A system that can automatically annotate medical entities could help patients better understand health-related text. Such a system would also accelerate manual data annotation for this low-resource domain as well as assist in down- stream medical NLP tasks such as finding textual similarity, identifying conflicting medical advice, and aspect-based sentiment analysis. In this work, we investigate a state-of-the-art entity set expansion model, BootstrapNet, for the task of medical entity classification on a new dataset of medical advice text. We also propose EP SBERT, a simple model …


Symplectically Integrated Symbolic Regression Of Hamiltonian Dynamical Systems, Daniel Dipietro Jun 2022

Symplectically Integrated Symbolic Regression Of Hamiltonian Dynamical Systems, Daniel Dipietro

Computer Science Senior Theses

Here we present Symplectically Integrated Symbolic Regression (SISR), a novel technique for learning physical governing equations from data. SISR employs a deep symbolic regression approach, using a multi-layer LSTMRNN with mutation to probabilistically sample Hamiltonian symbolic expressions. Using symplectic neural networks, we develop a model-agnostic approach for extracting meaningful physical priors from the data that can be imposed on-the-fly into the RNN output, limiting its search space. Hamiltonians generated by the RNN are optimized and assessed using a fourth-order symplectic integration scheme; prediction performance is used to train the LSTM-RNN to generate increasingly better functions via a risk-seeking policy gradients …


Entity Based Sentiment Analysis For Textual Health Advice, Dae Lim Chung Apr 2022

Entity Based Sentiment Analysis For Textual Health Advice, Dae Lim Chung

Computer Science Senior Theses

This work explores entity based sentiment analysis for textual health advice through deep learning. We fine tuned a pretrained BERT model to analyze sentiments across five different predetermined categories which consist of food, medicine, disease, exercise, and vitality for three different sentiments: positive, negative, and neutral. Original set of annotated medical dataset from Dartmouth College’s Persist Lab was used to conduct the experiments. For the aim of tailoring the data for the purpose of entity based sentiment analysis, we explored data transformation techniques to generate optimum training examples. During the experiments, we were able to discover that the wide variety …


Exploiting Group Structures To Infer Social Interactions From Videos, Maksim Bolonkin Sep 2021

Exploiting Group Structures To Infer Social Interactions From Videos, Maksim Bolonkin

Dartmouth College Ph.D Dissertations

In this thesis, we consider the task of inferring the social interactions between humans by analyzing multi-modal data. Specifically, we attempt to solve some of the problems in interaction analysis, such as long-term deception detection, political deception detection, and impression prediction. In this work, we emphasize the importance of using knowledge about the group structure of the analyzed interactions. Previous works on the matter mostly neglected this aspect and analyzed a single subject at a time. Using the new Resistance dataset, collected by our collaborators, we approach the problem of long-term deception detection by designing a class of histogram-based features …


Fine-Grained Detection Of Hate Speech Using Bertoxic, Yakoob Khan Jun 2021

Fine-Grained Detection Of Hate Speech Using Bertoxic, Yakoob Khan

Dartmouth College Undergraduate Theses

This thesis describes our approach towards the fine-grained detection of hate speech using deep learning. We leverage the transformer encoder architecture to propose BERToxic, a system that fine-tunes a pre-trained BERT model to locate toxic text spans in a given text and utilizes additional post-processing steps to refine the prediction boundaries. The post-processing steps involve (1) labeling character offsets between consecutive toxic tokens as toxic and (2) assigning a toxic label to words that have at least one token labeled as toxic. Through experiments, we show that these two post-processing steps improve the performance of our model by 4.16% on …


Lexical Complexity Prediction With Assembly Models, Aadil Islam Jun 2021

Lexical Complexity Prediction With Assembly Models, Aadil Islam

Dartmouth College Undergraduate Theses

Tuning the complexity of one's writing is essential to presenting ideas in a logical, intuitive manner to audiences. This paper describes a system submitted by team BigGreen to LCP 2021 for predicting the lexical complexity of English words in a given context. We assemble a feature engineering-based model and a deep neural network model with an underlying Transformer architecture based on BERT. While BERT itself performs competitively, our feature engineering-based model helps in extreme cases, eg. separating instances of easy and neutral difficulty. Our handcrafted features comprise a breadth of lexical, semantic, syntactic, and novel phonetic measures. Visualizations of BERT …


Exploring The Long Tail, Joseph H. Hajjar Jun 2021

Exploring The Long Tail, Joseph H. Hajjar

Dartmouth College Undergraduate Theses

The migration of datasets online has created a near-infinite inventory for big name retailers such as Amazon and Netflix, giving rise to recommendation systems to assist users in navigating the massive catalog. This has also allowed for the possibility of retailers storing much less popular, uncommon items which would not appear in a more traditional brick-and-mortar setting due to the cost of storage. Nevertheless, previous work has highlighted the profit potential which lies in the so-called "long tail'' of niche, unpopular items. Unfortunately, due to the limited amount of data in this subset of the inventory, recommendation systems often struggle …


Object Manipulation With Modular Planar Tensegrity Robots, Maxine Perroni-Scharf Jun 2021

Object Manipulation With Modular Planar Tensegrity Robots, Maxine Perroni-Scharf

Dartmouth College Undergraduate Theses

This thesis explores the creation of a novel two-dimensional tensegrity-based mod- ular system. When individual planar modules are linked together, they form a larger tensegrity robot that can be used to achieve non-prehensile manipulation. The first half of this dissertation focuses on the study of preexisting types of tensegrity mod- ules and proposes different possible structures and arrangements of modules. The second half describes the construction and actuation of a modular 2D robot com- posed of planar three-bar tensegrity structures. We conclude that tensegrity modules are suitably adapted to object manipulation and propose a future extension of the modular 2D …


Exploring The Use Of Social Media To Infer Relationships Between Demographics, Psychographics And Vaccine Hesitancy, Abhimanyu Kapur Jun 2021

Exploring The Use Of Social Media To Infer Relationships Between Demographics, Psychographics And Vaccine Hesitancy, Abhimanyu Kapur

Computer Science Senior Theses

The growing popularity of social media as a platform to obtain information and share one's opinions on various topics makes it a rich source of information for research. In this study, we aimed to develop a framework to infer relationships between demographic and psychographic characteristics of a user and their opinion on a specific narrative - in this case, their stance on taking the COVID-19 vaccine. Twitter was the chosen platform due to the large USA user base and easily available data. Demographic traits included Race, Age, Gender, and Human-vs-Organization Status. Psychographic traits included the Big Five personality traits (Conscientiousness, …


Learning And Simulation Algorithms For Constraint Physical Systems, Shuqi Yang Apr 2021

Learning And Simulation Algorithms For Constraint Physical Systems, Shuqi Yang

Dartmouth College Master’s Theses

This thesis explores two computational approaches to learn and simulate complex physical systems exhibiting constraint characteristics. The target applications encompass both solids and fluids. On the solid side, we proposed a new family of data-driven simulators to predict the behaviors of an unknown physical system by learning its underpinning constraints. We devised a neural projection operator facilitated by an embedded recursive neural network to interactively enforce the learned underpinning constraints and to predict its various physical behaviors. Our method can automatically uncover a broad range of constraints from observation point data, such as length, angle, bending, collision, boundary effects, and …


Cup-Net: Compressed Ultrafast Photography Using Convolutional Neural Networks, Matthew Parker Jun 2020

Cup-Net: Compressed Ultrafast Photography Using Convolutional Neural Networks, Matthew Parker

ENGS 88 Honors Thesis (AB Students)

Compressed ultrafast photography (CUP) is a cutting-edge imaging technique that uses a variation of the traditional streak camera to obtain video at 100 billion frames per second with a single exposure. In order to achieve this level of temporal detail, CUP leverages compressed sensing (CS). Compressed sensing theory states that a compressed representation of an image can be directly acquired using a non-adaptive measurement matrix so long as the encoding matrix follows certain properties such as restrictive isometry and incoherence. This compressed representation of the original scene can later be reconstructed back into the original form. CUP applies CS by …


Abso2luteu-Net: Tissue Oxygenation Calculation Using Photoacoustic Imaging And Convolutional Neural Networks, Kevin Hoffer-Hawlik, Geoffrey P. Luke Jan 2019

Abso2luteu-Net: Tissue Oxygenation Calculation Using Photoacoustic Imaging And Convolutional Neural Networks, Kevin Hoffer-Hawlik, Geoffrey P. Luke

ENGS 88 Honors Thesis (AB Students)

Photoacoustic (PA) imaging uses incident light to generate ultrasound signals within tissues. Using PA imaging to accurately measure hemoglobin concentration and calculate oxygenation (sO2) requires prior tissue knowledge and costly computational methods. However, this thesis shows that machine learning algorithms can accurately and quickly estimate sO2. absO2luteU-Net, a convolutional neural network, was trained on Monte Carlo simulated multispectral PA data and predicted sO2 with higher accuracy compared to simple linear unmixing, suggesting machine learning can solve the fluence estimation problem. This project was funded by the Kaminsky Family Fund and the Neukom Institute.


Evaluating Prose Style Transfer With The Bible, Keith Carlson, Allen Riddell, Daniel Rockmore Sep 2018

Evaluating Prose Style Transfer With The Bible, Keith Carlson, Allen Riddell, Daniel Rockmore

Dartmouth Scholarship

In the prose style transfer task a system, provided with text input and a target prose style, produces output which preserves the meaning of the input text but alters the style. These systems require parallel data for evaluation of results and usually make use of parallel data for training. Currently, there are few publicly available corpora for this task. In this work, we identify a high-quality source of aligned, stylistically distinct text in different versions of the Bible. We provide a standardized split, into training, development and testing data, of the public domain versions in our corpus. This corpus is …


Artificial Intelligence And Amikacin Exposures Predictive Of Outcomes In Multidrug-Resistant Tuberculosis Patients, Chawangwa Modongo, Jotam G. Pasipanodya, Shashikant Srivastava, Nicola Zetola, Scott Williams, Giorgio Sirugo, Tawanda Gumbo Jul 2016

Artificial Intelligence And Amikacin Exposures Predictive Of Outcomes In Multidrug-Resistant Tuberculosis Patients, Chawangwa Modongo, Jotam G. Pasipanodya, Shashikant Srivastava, Nicola Zetola, Scott Williams, Giorgio Sirugo, Tawanda Gumbo

Dartmouth Scholarship

Aminoglycosides such as amikacin continue to be part of the backbone of treatment of multidrug-resistant tuberculosis (MDR- TB). We measured amikacin concentrations in 28 MDR-TB patients in Botswana receiving amikacin therapy together with oral levofloxacin, ethionamide, cycloserine, and pyrazinamide and calculated areas under the concentration-time curves from 0 to 24 h (AUC0 –24). The patients were followed monthly for sputum culture conversion based on liquid cultures. The median duration of amikacin therapy was 184 (range, 28 to 866) days, at a median dose of 17.30 (range 11.11 to 19.23) mg/kg. Only 11 (39%) pa- tients had sputum culture conversion during …


Improving Structure Mcmc For Bayesian Networks Through Markov Blanket Resampling, Chengwei Su, Mark E. Borsuk Apr 2016

Improving Structure Mcmc For Bayesian Networks Through Markov Blanket Resampling, Chengwei Su, Mark E. Borsuk

Dartmouth Scholarship

Algorithms for inferring the structure of Bayesian networks from data have become an increasingly popular method for uncovering the direct and indirect influences among variables in complex systems. A Bayesian approach to structure learning uses posterior probabilities to quantify the strength with which the data and prior knowledge jointly support each possible graph feature. Existing Markov Chain Monte Carlo (MCMC) algorithms for estimating these posterior probabilities are slow in mixing and convergence, especially for large networks. We present a novel Markov blanket resampling (MBR) scheme that intermittently reconstructs the Markov blanket of nodes, thus allowing the sampler to more effectively …