Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Articles 1 - 21 of 21

Full-Text Articles in Physical Sciences and Mathematics

Social Media Sentiment Analysis With A Deep Neural Network: An Enhanced Approach Using User Behavioral Information, Ahmed Sulaiman M. Alharbi Dec 2019

Dissertations

Sentiment analysis on social media such as Twitter has become a very important and challenging task. Due to the characteristics of such data (including tweet length, spelling errors, abbreviations, and special characters), the sentiment analysis task in such an environment requires a non-traditional approach. Moreover, social media sentiment analysis constitutes a fundamental problem with many interesting applications, such as Business Intelligence, Medical Monitoring, and National Security. Most current social media sentiment classification methods judge the sentiment polarity primarily according to textual content and neglect other information on these platforms. In this research, we propose deep learning-based frameworks that …


Extracting And Representing Entities, Types, And Relations, Patrick Verga Oct 2019

Doctoral Dissertations

Making complex decisions in areas like science, government policy, finance, and clinical treatments requires integrating and reasoning over disparate data sources. While some decisions can be made from a single source of information, others require considering multiple pieces of evidence and how they relate to one another. Knowledge graphs (KGs) provide a natural approach for addressing this type of problem: they can serve as long-term stores of abstracted knowledge organized around concepts and their relationships, and can be populated from heterogeneous sources including databases and text. KGs can facilitate higher level reasoning, influence the interpretation of new data, and …


Knowledge Base Question Answering With A Matching-Aggregation Model And Question-Specific Contextual Relations, Yunshi Lan, Shuohang Wang, Jing Jiang Oct 2019

Research Collection School Of Computing and Information Systems

Making use of knowledge bases to answer questions (KBQA) is a key direction in question answering systems. Researchers have developed a diverse range of methods to address this problem, but the existing methods still have some limitations. Specifically, the existing neural network-based methods for KBQA have not taken advantage of the recent “matching-aggregation” framework for sequence matching, and when representing a candidate answer entity, they may not choose the most useful context of the candidate for matching. In this paper, we explore the use of a “matching-aggregation” framework to match candidate answers with questions. We further make …


Machine Learning-Based Models For Assessing Impacts Before, During And After Hurricane Events, Julie L. Harvey Sep 2019

Electronic Theses and Dissertations

Social media provides an abundance of real-time information that can be used before, during, and after extreme weather events. Government officials, emergency managers, and other decision makers can use social media data for decision-making, preparation, and assistance. Machine learning-based models can be used to analyze data collected from social media. In this study, social media data and cloud cover temperature (as physical sensor data) were analyzed using machine learning techniques. Data was collected from Twitter regarding Hurricane Florence from September 11, 2018 through September 20, 2018 and Hurricane Michael from October 1, 2018 through October 18, 2018. Natural language …


Poetry: Identification, Entity Recognition, And Retrieval, John J. Foley IV Jul 2019

Doctoral Dissertations

Modern advances in natural language processing (NLP) and information retrieval (IR) provide for the ability to automatically analyze, categorize, process and search textual resources. However, generalizing these approaches remains an open problem: models that appear to understand certain types of data must be re-trained on other domains. Often, models make assumptions about the length, structure, discourse model and vocabulary used by a particular corpus. Trained models can often become biased toward an original dataset, learning that – for example – all capitalized words are names of people or that short documents are more relevant than longer documents. As a result, …


Exploring The Dynamics Of Scientific Research, Shilpa Lakhanpal Jun 2019

Dissertations

Scientific research papers present the research endeavors of numerous scientists around the world, and are documented across a multitude of technical conference proceedings and other such publications. Given the plethora of such research data, if we could automate the extraction of key interesting areas of research, and provide access to this new information, it would make literature searches far easier for researchers. This in turn could be very useful for them in furthering their research agenda. With this goal in mind, we have endeavored to provide such solutions through our research. Specifically, the focus of our research is to design, analyze …


A Data Driven Approach To Identify Journalistic 5Ws From Text Documents, Venkata Krishna Mohan Sunkara Jun 2019

Department of Computer Science and Engineering: Dissertations, Theses, and Student Research

Textual understanding is the process of automatically extracting accurate high-quality information from text. The amount of textual data available from different sources such as news, blogs and social media is growing exponentially. These data encode significant latent information which if extracted accurately can be valuable in a variety of applications such as medical report analyses, news understanding and societal studies. Natural language processing techniques are often employed to develop customized algorithms to extract such latent information from text.

Journalistic 5Ws refer to the basic information in news articles that describes an event and include where, when, who, what and why …


Music Mood Classification Using Convolutional Neural Networks, Revanth Akella May 2019

Master's Projects

Grouping music into moods is useful as music migrates to online streaming services, since it can help with recommendations. To establish the connection between music and mood, we develop an end-to-end, open source approach for mood classification using lyrics. We develop a pipeline for tag extraction, lyric extraction, and establishing classification models for classifying music into moods. We investigate techniques to classify music into moods using lyrics and audio features. Using various natural language processing methods with machine learning and deep learning, we perform a comparative study across different classification and mood models. The results indicate that features …


Chatbots With Personality Using Deep Learning, Susmit Gaikwad May 2019

Master's Projects

Natural Language Processing (NLP) requires the computational modelling of the complex relationships between the syntax and semantics of a language. While traditional machine learning methods are used to solve NLP problems, they cannot imitate the human ability for language comprehension. With the growth in deep learning, these complexities within NLP are easier to model and can be used to build many computer applications. A particular example of this is a chatbot, where a human user has a conversation with a computer program that generates responses based on the user’s input. In this project, we study the methods used in building chatbots, …


Smart Home Simulation In The Virtual World, Thomas Jones-Moore, David Son May 2019

Scholars Week

The goal of this project is to produce a 'smart home' by using IoT and RFID-like things in the virtual world to help solve problems, such as CPR training. The simulation can be used as an evaluation platform for suggested hardware, to arrive at a desired (or best-fit) set of smart objects or combinations with computer vision. A cost model determines the best fit based on accuracy, lowest cost, easiest deployment, etc.


Question Answering With Textual Sequence Matching, Shuohang Wang Apr 2019

Dissertations and Theses Collection (Open Access)

Question answering (QA) is one of the most important applications in natural language processing. With the explosive growth of text data on the Internet, intelligently finding answers to questions will help humans collect useful information more efficiently. My research in this thesis mainly focuses on solving the question answering problem with a textual sequence matching model, which builds vectorized representations for pairs of text sequences to enable better reasoning. The thesis consists of three major parts.

In Part I, we propose two general models for building vectorized representations over a pair of sentences, which can be directly used to solve the …


Test: A Terminology Extraction System For Technology Related Terms, Murhaf Hossari, Soumyabrata Dev, John Kelleher Jan 2019

Conference papers

Tracking developments in the highly dynamic data-technology landscape is vital to keeping up with novel technologies and tools in the various areas of Artificial Intelligence (AI). However, it is difficult to keep track of all the relevant technology keywords. In this paper, we propose a novel system that addresses this problem. This tool is used to automatically detect the existence of new technologies and tools in text, and extract terms used to describe these new technologies. The extracted new terms can be logged as new AI technologies as they are found on-the-fly on the web. It can be subsequently classified …


Untapped Potential Of Clinical Text For Opioid Surveillance, Amy L. Olex, Tamas Gal, Majid Afshar, Dmitriy Dligach, Niranjan Karnik, Travis Oakes, Brihat Sharma, Meng Xie, Bridget T. McInnes, Julian Solway, Abel Kho, William Cramer, F. Gerard Moeller Jan 2019

Wright Center for Clinical and Translational Research Works

Accurate surveillance is needed to combat the growing opioid epidemic. To investigate the potential volume of missed opioid overdoses, we compare overdose encounters identified by ICD-10-CM codes and by an NLP pipeline from two different medical systems. Our results show that the NLP pipeline identified a larger percentage of opioid overdose (OOD) encounters than ICD-10-CM codes. Thus, incorporating sophisticated NLP techniques into current diagnostic methods has the potential to improve surveillance of the incidence of opioid overdoses.
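The comparison described in this abstract can be pictured as simple set arithmetic over encounter identifiers. The sketch below is only an illustration under stated assumptions: the encounter IDs are invented, the helper name `coverage` is hypothetical, and treating the union of both methods as the reference set is an assumption, not something the abstract specifies.

```python
def coverage(identified, reference):
    """Percentage of reference encounters that a method identified."""
    return 100.0 * len(identified & reference) / len(reference)

# Hypothetical encounter IDs, invented purely for illustration.
icd_hits = {"e1", "e2", "e3"}        # flagged by diagnosis codes
nlp_hits = {"e2", "e3", "e4", "e5"}  # flagged by a text-based pipeline

# Assumption: the union of both methods stands in for the true set of overdoses.
reference = icd_hits | nlp_hits

icd_coverage = coverage(icd_hits, reference)
nlp_coverage = coverage(nlp_hits, reference)

# Encounters the code-based method would have missed.
missed_by_icd = nlp_hits - icd_hits
```

In a real study the reference set would come from manual chart review or another gold standard rather than from the union of the two methods.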


Automatic Concept Identification Of Software Requirements In Turkish, Fatma Bozyiğit, Özlem Aktaş, Deniz Kilinç Jan 2019

Turkish Journal of Electrical Engineering and Computer Sciences

Software requirements include description of the features for the target system and express the expectations of users. In the analysis phase, requirements are transformed into easy-to-understand conceptual models that facilitate communication between stakeholders. Although creating conceptual models using requirements is mostly implemented manually by analysts, the number of models that automate this process has increased recently. Most of the models and tools are developed to analyze requirements in English, and there is no study for agglutinative languages such as Turkish or Finnish. In this study, we propose an automatic concept identification model that transforms Turkish requirements into Unified Modeling Language …


A Hybrid Sentiment Analysis Method For Turkish, Buket Erşahin, Özlem Aktaş, Deniz Kilinç, Mustafa Erşahin Jan 2019

Turkish Journal of Electrical Engineering and Computer Sciences

This paper presents a hybrid methodology for Turkish sentiment analysis, which combines the lexicon-based and machine learning (ML)-based approaches. On the lexicon-based side, we use a sentiment dictionary that is extended with a synonyms lexicon. Besides this, we tackle the classification problem with three supervised classifiers, naive Bayes, support vector machines, and J48, on the ML side. Our hybrid methodology combines these two approaches by generating a new lexicon-based value according to our feature generation algorithm and feeds it as one of the features to machine learning classifiers. Despite the linguistic challenges caused by the morphological structure of Turkish, the …
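The general idea of feeding a lexicon-based value to a classifier as one extra feature can be sketched as follows. This is a minimal illustration, not the authors' feature generation algorithm: the tiny English lexicon, the synonym map, the summed-polarity score, and the function names `lexicon_score` and `build_features` are all assumptions made for the example.

```python
from collections import Counter

def lexicon_score(tokens, lexicon, synonyms):
    """Sum word polarities, mapping synonyms onto their lexicon entry first."""
    total = 0
    for tok in tokens:
        base = synonyms.get(tok, tok)  # fall back to the token itself
        total += lexicon.get(base, 0)  # unknown words contribute nothing
    return total

def build_features(text, vocabulary, lexicon, synonyms):
    """Bag-of-words counts followed by one lexicon-based feature."""
    tokens = text.lower().split()
    counts = Counter(tokens)
    bow = [counts[word] for word in vocabulary]
    return bow + [lexicon_score(tokens, lexicon, synonyms)]

# Hypothetical resources, invented for illustration only.
LEXICON = {"good": 1, "bad": -1}
SYNONYMS = {"great": "good", "awful": "bad"}
VOCAB = ["good", "great", "bad", "awful", "movie"]

features = build_features("a great great movie", VOCAB, LEXICON, SYNONYMS)
```

The resulting vector could then be passed to any supervised classifier, such as the naive Bayes, support vector machine, or decision tree models named in the abstract.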


Automated Essay Evaluation Using Natural Language Processing And Machine Learning, Harshanthi Ghanta Jan 2019

Theses and Dissertations

The goal of automated essay evaluation is to assign grades to essays and provide feedback using computers. Automated evaluation is increasingly being used in classrooms and online exams. The aim of this project is to develop machine learning models for performing automated essay scoring and evaluate their performance. In this research, a publicly available essay data set was used to train and test the efficacy of the adopted techniques. Natural language processing techniques were used to extract features from essays in the dataset. Three different existing machine learning algorithms were used on the chosen dataset. The data was divided into …


The D&D Sorting Hat: Predicting Dungeons And Dragons Characters From Textual Backstories, Joseph C. MacInnes Jan 2019

Senior Independent Study Theses

Dungeons and Dragons is a tabletop roleplaying game which focuses heavily on character interaction and creating narratives. The current state of the game's character creation process often bogs down new players in decisions related to game mechanics, not a character's identity and personality. This independent study investigates the use of machine learning and natural language processing to make these decisions for a player based on their character's backstory - the textual biography or description of a character. The study presents a collection of existing characters and uses these examples to create a family of models capable of predicting a character's …


Application Of Boolean Logic To Natural Language Complexity In Political Discourse, Austin Taing Jan 2019

Theses and Dissertations--Computer Science

Press releases serve as a major influence on public opinion of a politician, since they are a primary means of communicating with the public and directing discussion. Thus, the public’s ability to digest them is an important factor for politicians to consider. This study employs several well-studied measures of linguistic complexity and proposes a new one to examine whether politicians change their language to become more or less difficult to parse in different situations. This study uses 27,500 press releases from the US Senate from 2004 to 2008 and examines election cycles and natural disasters, namely hurricanes, as situations where politicians’ language …


Reorganize Your Blogs: Supporting Blog Re-Visitation With Natural Language Processing And Visualization, Shuo Niu, D. Scott McCrickard, Timothy L. Stelter, Alan Dix, G. Don Taylor Jan 2019

Computer Science

Temporally-connected personal blogs contain voluminous textual content, presenting challenges in re-visiting and reflecting on experiences. Other data repositories have benefited from natural language processing (NLP) and interactive visualizations (VIS) to support exploration, but little is known about how these techniques could be used with blogs to present experiences and support multimodal interaction with blogs, particularly for authors. This paper presents the effect of reorganization—reorganizing the large blog set with NLP and presenting abstract topics with VIS—to support novel re-visitation experiences to blogs. The BlogCloud tool, a blog re-visitation tool that reorganizes blog paragraphs around user-searched keywords, implements reorganization and similarity-based …


Automatic Slide Generation For Scientific Papers, Athar Sefid, Jian Wu, Prasenjit Mitra, C. Lee Giles Jan 2019

Computer Science Faculty Publications

We describe our approach for automatically generating presentation slides for scientific papers using deep neural networks. Such slides can give authors a starting point for their slide generation process. Extractive summarization techniques are applied to rank and select important sentences from the original document. Previous work identified important sentences based only on a limited number of features that were extracted from the position and structure of sentences in the paper. Our method extends previous work by (1) extracting a more comprehensive list of surface features, (2) considering the semantics, or meaning, of the sentence, and (3) using context around the …
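To make "rank and select important sentences" concrete, here is a minimal extractive baseline that uses only two surface features: sentence position and average word frequency. It is a sketch under stated assumptions and does not reproduce the neural method the abstract describes; the scoring formula and the name `rank_sentences` are hypothetical.

```python
from collections import Counter

def rank_sentences(sentences, top_k=2):
    """Return the top_k highest-scoring sentences in document order."""
    words = [s.lower().split() for s in sentences]
    freq = Counter(w for ws in words for w in ws)
    scored = []
    for i, ws in enumerate(words):
        position = 1.0 / (i + 1)  # earlier sentences score higher
        salience = sum(freq[w] for w in ws) / len(ws) if ws else 0.0
        scored.append((position + salience, i))
    best = sorted(scored, reverse=True)[:top_k]
    # Restore original order so the selection reads naturally.
    return [sentences[i] for _, i in sorted(best, key=lambda pair: pair[1])]

# Toy input, invented for illustration only.
doc = ["slides summarize papers", "we rank sentences", "slides rank papers"]
selected = rank_sentences(doc, top_k=2)
```

A fuller system would replace the hand-written score with a learned model over many more features, which is the direction the abstract points to.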


Global Inference For Aspect And Opinion Terms Co-Extraction Based On Multi-Task Neural Networks, Jianfei Yu, Jing Jiang, Rui Xia Jan 2019

Research Collection School Of Computing and Information Systems

Extracting aspect terms and extracting opinion terms are two fundamental tasks in opinion mining. The recent success of deep learning has inspired various neural network architectures, which have been shown to achieve highly competitive performance in these two tasks. However, most existing methods fail to explicitly consider the syntactic relations among aspect terms and opinion terms, which may lead to inconsistencies between the model predictions and the syntactic constraints. To this end, we first apply a multi-task learning framework to implicitly capture the relations between the two tasks, and then propose a global inference method by explicitly modelling several syntactic …