Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Keyword
-
- Machine learning (7)
- Machine Learning (5)
- Deep learning (3)
- Robotics (3)
- Artificial Intelligence (2)
-
- Data Science (2)
- NLP (2)
- Natural Language Processing (2)
- Transformers (2)
- Twitter (2)
- Adaptive sampling (1)
- Ai (1)
- Ai art (1)
- Aircraft (1)
- Algorithm (1)
- Argumentation (1)
- Art (1)
- Artificial intelligence (1)
- Authorship (1)
- Autonomous (1)
- BERT (1)
- Brain decoding (1)
- CLS (1)
- Cardiac cellular system (1)
- Character-level Tokenization (1)
- ChatGPT (1)
- Classification (1)
- Clustering (1)
- Collaborative filtering (1)
- Computational Physics (1)
Articles 1 - 19 of 19
Full-Text Articles in Physical Sciences and Mathematics
Poster, Performed: Understanding Public Opinions Of Authorship In Generative Artificial Intelligence Models Via Analogy, Wylie Z. Kasai
Poster, Performed: Understanding Public Opinions Of Authorship In Generative Artificial Intelligence Models Via Analogy, Wylie Z. Kasai
Dartmouth College Master’s Theses
Over the last decade, generative artificial intelligence models have advanced significantly and provided the public with several tools to create new works of art. However, the true authorship of these works has been debated due to their training on web-scraped data. Serving as an analogy to these larger models, Poster, Performed is an interactive artificial intelligence exhibition project that uses image assets submitted by the public to create poster compositions with custom image processing algorithms. During the course of a four-day exhibition, visitors were asked to identify the exhibition’s primary artist from five options: (1) participants who submitted image assets, …
Probing And Enhancing The Reliance Of Transformer Models On Poetic Information, Almas Abdibayev
Probing And Enhancing The Reliance Of Transformer Models On Poetic Information, Almas Abdibayev
Dartmouth College Ph.D Dissertations
Transformer models have achieved remarkable success in the widest variety of domains, spanning not just a multitude of tasks within natural language processing, but also those in computer vision, speech, and reinforcement learning. The key to this success is largely attributed to the self-attention mechanism, particularly its ability to scale in performance as it grows in the number of parameters. Extensive effort has been underway to study the major linguistic properties learned by these models during the course of their pretraining. However, the role of certain finer linguistic phenomena present in language and their utilization by Transformers has not been …
Energy-Aware Path Planning For Fixed-Wing Seaplane Uavs, Benjamin Atkinson Wolsieffer
Energy-Aware Path Planning For Fixed-Wing Seaplane Uavs, Benjamin Atkinson Wolsieffer
Dartmouth College Master’s Theses
Fixed-wing unmanned aerial vehicles (UAVs) are commonly used for remote sensing applications over water bodies, such as monitoring water quality or tracking harmful algal blooms. However, there are some types of measurements that are difficult to accurately obtain from the air. In existing work, water samples have been collected in situ either by hand, with an unmanned surface vehicle (USV), or with a vertical takeoff and landing (VTOL) UAV such as a multirotor. We propose a path planner, landing control algorithm, and energy estimator that will allow a low-cost and energy efficient fixed-wing UAV to carry out a combined remote …
Self-Supervised Pretraining And Transfer Learning On Fmri Data With Transformers, Sean Paulsen
Self-Supervised Pretraining And Transfer Learning On Fmri Data With Transformers, Sean Paulsen
Dartmouth College Ph.D Dissertations
Transfer learning is a machine learning technique founded on the idea that knowledge acquired by a model during “pretraining” on a source task can be transferred to the learning of a target task. Successful transfer learning can result in improved performance, faster convergence, and reduced demand for data. This technique is particularly desirable for the task of brain decoding in the domain of functional magnetic resonance imaging (fMRI), wherein even the most modern machine learning methods can struggle to decode labelled features of brain images. This challenge is due to the highly complex underlying signal, physical and neurological differences between …
System-Characterized Artificial Intelligence Approaches For Cardiac Cellular Systems And Molecular Signature Analysis, Ziqian Wu
Dartmouth College Ph.D Dissertations
The dissertation presents a significant advancement in the field of cardiac cellular systems and molecular signature systems by employing machine learning and generative artificial intelligence techniques. These methodologies are systematically characterized and applied to address critical challenges in these domains. A novel computational model is developed, which combines machine learning tools and multi-physics models. The main objective of this model is to accurately predict complex cellular dynamics, taking into account the intricate interactions within the cardiac cellular system. Furthermore, a comprehensive framework based on generative adversarial networks (GANs) is proposed. This framework is designed to generate synthetic data that faithfully …
Stereotypes And Language Models: Understanding How Language Models Encode Stereotypes, Debiasing Language Models, And Examining How Stereotypes Affect Conversations, Brian C. Wang
Computer Science Senior Theses
This thesis describes a variety of approaches in examining how language models encode stereotypes (understanding stereotypes from a model point-of-view), debiasing language models, and using language models to understand how stereotypes affect conversations (understanding stereotypes from a conversational point-of-view). We present a novel approach for textual clues analysis that makes language models more interpretable, combining the understanding of what stereotypes the internal structures of language models have encoded during their initial training (via attention-based analysis) and understanding what textual clues are most relevant to identifying stereotypes for models trained to detect stereotypes (via SHAP-based analysis). We find that different pre-trained …
Sarcasm Detection In English And Arabic Tweets Using Transformer Models, Rishik Lad
Sarcasm Detection In English And Arabic Tweets Using Transformer Models, Rishik Lad
Computer Science Senior Theses
This thesis describes our approach toward the detection of sarcasm and its various types in English and Arabic Tweets through methods in deep learning. There are five problems we attempted: (1) detection of sarcasm in English Tweets, (2) detection of sarcasm in Arabic Tweets, (3) determining the type of sarcastic speech subcategory for English Tweets, (4) determining which of two semantically equivalent English Tweets is sarcastic, and (5) determining which of two semantically equivalent Arabic Tweets is sarcastic. All tasks were framed as classification problems, and our contributions are threefold: (a) we developed an English binary classifier system with RoBERTa, …
Data-Optimized Spatial Field Predictions For Robotic Adaptive Sampling: A Gaussian Process Approach, Zachary Nathan
Data-Optimized Spatial Field Predictions For Robotic Adaptive Sampling: A Gaussian Process Approach, Zachary Nathan
Computer Science Senior Theses
We introduce a framework that combines Gaussian Process models, robotic sensor measurements, and sampling data to predict spatial fields. In this context, a spatial field refers to the distribution of a variable throughout a specific area, such as temperature or pH variations over the surface of a lake. Whereas existing methods tend to analyze only the particular field(s) of interest, our approach optimizes predictions through the effective use of all available data. We validated our framework on several datasets, showing that errors can decline by up to two-thirds through the inclusion of additional colocated measurements. In support of adaptive sampling, …
Combating Fake News: A Gravity Well Simulation To Model Echo Chamber Formation In Social Media, Jeremy E. Thompson
Combating Fake News: A Gravity Well Simulation To Model Echo Chamber Formation In Social Media, Jeremy E. Thompson
Dartmouth College Ph.D Dissertations
Fake news has become a serious concern as distributing misinformation has become easier and more impactful. A solution is critically required. One solution is to ban fake news, but that approach could create more problems than it solves, and would also be problematic from the beginning, as it must first be identified to be banned. We initially propose a method to automatically recognize suspected fake news, and to provide news consumers with more information as to its veracity. We suggest that fake news is comprised of two components: premises and misleading content. Fake news can be condensed down to a …
Leveraging Context Patterns For Medical Entity Classification, Garrett Johnston
Leveraging Context Patterns For Medical Entity Classification, Garrett Johnston
Computer Science Senior Theses
The ability of patients to understand health-related text is important for optimal health outcomes. A system that can automatically annotate medical entities could help patients better understand health-related text. Such a system would also accelerate manual data annotation for this low-resource domain as well as assist in down- stream medical NLP tasks such as finding textual similarity, identifying conflicting medical advice, and aspect-based sentiment analysis. In this work, we investigate a state-of-the-art entity set expansion model, BootstrapNet, for the task of medical entity classification on a new dataset of medical advice text. We also propose EP SBERT, a simple model …
Symplectically Integrated Symbolic Regression Of Hamiltonian Dynamical Systems, Daniel Dipietro
Symplectically Integrated Symbolic Regression Of Hamiltonian Dynamical Systems, Daniel Dipietro
Computer Science Senior Theses
Here we present Symplectically Integrated Symbolic Regression (SISR), a novel technique for learning physical governing equations from data. SISR employs a deep symbolic regression approach, using a multi-layer LSTMRNN with mutation to probabilistically sample Hamiltonian symbolic expressions. Using symplectic neural networks, we develop a model-agnostic approach for extracting meaningful physical priors from the data that can be imposed on-the-fly into the RNN output, limiting its search space. Hamiltonians generated by the RNN are optimized and assessed using a fourth-order symplectic integration scheme; prediction performance is used to train the LSTM-RNN to generate increasingly better functions via a risk-seeking policy gradients …
Entity Based Sentiment Analysis For Textual Health Advice, Dae Lim Chung
Entity Based Sentiment Analysis For Textual Health Advice, Dae Lim Chung
Computer Science Senior Theses
This work explores entity based sentiment analysis for textual health advice through deep learning. We fine tuned a pretrained BERT model to analyze sentiments across five different predetermined categories which consist of food, medicine, disease, exercise, and vitality for three different sentiments: positive, negative, and neutral. Original set of annotated medical dataset from Dartmouth College’s Persist Lab was used to conduct the experiments. For the aim of tailoring the data for the purpose of entity based sentiment analysis, we explored data transformation techniques to generate optimum training examples. During the experiments, we were able to discover that the wide variety …
Exploiting Group Structures To Infer Social Interactions From Videos, Maksim Bolonkin
Exploiting Group Structures To Infer Social Interactions From Videos, Maksim Bolonkin
Dartmouth College Ph.D Dissertations
In this thesis, we consider the task of inferring the social interactions between humans by analyzing multi-modal data. Specifically, we attempt to solve some of the problems in interaction analysis, such as long-term deception detection, political deception detection, and impression prediction. In this work, we emphasize the importance of using knowledge about the group structure of the analyzed interactions. Previous works on the matter mostly neglected this aspect and analyzed a single subject at a time. Using the new Resistance dataset, collected by our collaborators, we approach the problem of long-term deception detection by designing a class of histogram-based features …
Fine-Grained Detection Of Hate Speech Using Bertoxic, Yakoob Khan
Fine-Grained Detection Of Hate Speech Using Bertoxic, Yakoob Khan
Dartmouth College Undergraduate Theses
This thesis describes our approach towards the fine-grained detection of hate speech using deep learning. We leverage the transformer encoder architecture to propose BERToxic, a system that fine-tunes a pre-trained BERT model to locate toxic text spans in a given text and utilizes additional post-processing steps to refine the prediction boundaries. The post-processing steps involve (1) labeling character offsets between consecutive toxic tokens as toxic and (2) assigning a toxic label to words that have at least one token labeled as toxic. Through experiments, we show that these two post-processing steps improve the performance of our model by 4.16% on …
Lexical Complexity Prediction With Assembly Models, Aadil Islam
Lexical Complexity Prediction With Assembly Models, Aadil Islam
Dartmouth College Undergraduate Theses
Tuning the complexity of one's writing is essential to presenting ideas in a logical, intuitive manner to audiences. This paper describes a system submitted by team BigGreen to LCP 2021 for predicting the lexical complexity of English words in a given context. We assemble a feature engineering-based model and a deep neural network model with an underlying Transformer architecture based on BERT. While BERT itself performs competitively, our feature engineering-based model helps in extreme cases, eg. separating instances of easy and neutral difficulty. Our handcrafted features comprise a breadth of lexical, semantic, syntactic, and novel phonetic measures. Visualizations of BERT …
Exploring The Long Tail, Joseph H. Hajjar
Exploring The Long Tail, Joseph H. Hajjar
Dartmouth College Undergraduate Theses
The migration of datasets online has created a near-infinite inventory for big name retailers such as Amazon and Netflix, giving rise to recommendation systems to assist users in navigating the massive catalog. This has also allowed for the possibility of retailers storing much less popular, uncommon items which would not appear in a more traditional brick-and-mortar setting due to the cost of storage. Nevertheless, previous work has highlighted the profit potential which lies in the so-called "long tail'' of niche, unpopular items. Unfortunately, due to the limited amount of data in this subset of the inventory, recommendation systems often struggle …
Object Manipulation With Modular Planar Tensegrity Robots, Maxine Perroni-Scharf
Object Manipulation With Modular Planar Tensegrity Robots, Maxine Perroni-Scharf
Dartmouth College Undergraduate Theses
This thesis explores the creation of a novel two-dimensional tensegrity-based mod- ular system. When individual planar modules are linked together, they form a larger tensegrity robot that can be used to achieve non-prehensile manipulation. The first half of this dissertation focuses on the study of preexisting types of tensegrity mod- ules and proposes different possible structures and arrangements of modules. The second half describes the construction and actuation of a modular 2D robot com- posed of planar three-bar tensegrity structures. We conclude that tensegrity modules are suitably adapted to object manipulation and propose a future extension of the modular 2D …
Exploring The Use Of Social Media To Infer Relationships Between Demographics, Psychographics And Vaccine Hesitancy, Abhimanyu Kapur
Exploring The Use Of Social Media To Infer Relationships Between Demographics, Psychographics And Vaccine Hesitancy, Abhimanyu Kapur
Computer Science Senior Theses
The growing popularity of social media as a platform to obtain information and share one's opinions on various topics makes it a rich source of information for research. In this study, we aimed to develop a framework to infer relationships between demographic and psychographic characteristics of a user and their opinion on a specific narrative - in this case, their stance on taking the COVID-19 vaccine. Twitter was the chosen platform due to the large USA user base and easily available data. Demographic traits included Race, Age, Gender, and Human-vs-Organization Status. Psychographic traits included the Big Five personality traits (Conscientiousness, …
Learning And Simulation Algorithms For Constraint Physical Systems, Shuqi Yang
Learning And Simulation Algorithms For Constraint Physical Systems, Shuqi Yang
Dartmouth College Master’s Theses
This thesis explores two computational approaches to learn and simulate complex physical systems exhibiting constraint characteristics. The target applications encompass both solids and fluids. On the solid side, we proposed a new family of data-driven simulators to predict the behaviors of an unknown physical system by learning its underpinning constraints. We devised a neural projection operator facilitated by an embedded recursive neural network to interactively enforce the learned underpinning constraints and to predict its various physical behaviors. Our method can automatically uncover a broad range of constraints from observation point data, such as length, angle, bending, collision, boundary effects, and …