Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Computer Engineering (58)
- Physical Sciences and Mathematics (50)
- Computer Sciences (48)
- Electrical and Computer Engineering (28)
- Artificial Intelligence and Robotics (13)
-
- Other Computer Engineering (6)
- Social and Behavioral Sciences (6)
- Operations Research, Systems Engineering and Industrial Engineering (5)
- Civil and Environmental Engineering (4)
- Digital Communications and Networking (4)
- Business (3)
- Civil Engineering (3)
- Computational Engineering (3)
- Data Science (3)
- Engineering Science and Materials (3)
- Theory and Algorithms (3)
- Chemistry (2)
- Communication (2)
- Data Storage Systems (2)
- Databases and Information Systems (2)
- Information Security (2)
- Library and Information Science (2)
- Medicine and Health Sciences (2)
- Sociology (2)
- Systems and Communications (2)
- Transportation Engineering (2)
- Anthropology (1)
- Applied Mathematics (1)
- Arts and Humanities (1)
- Institution
-
- TÜBİTAK (15)
- Old Dominion University (8)
- Wright State University (8)
- Technological University Dublin (7)
- University of South Carolina (6)
-
- Air Force Institute of Technology (4)
- Louisiana State University (4)
- University of Nebraska - Lincoln (4)
- University of Central Florida (3)
- Virginia Commonwealth University (3)
- California Polytechnic State University, San Luis Obispo (2)
- New Jersey Institute of Technology (2)
- Purdue University (2)
- University of Massachusetts Amherst (2)
- Binghamton University (1)
- Brigham Young University (1)
- City University of New York (CUNY) (1)
- Claremont Colleges (1)
- Clemson University (1)
- Embry-Riddle Aeronautical University (1)
- Institute of Business Administration (1)
- Missouri University of Science and Technology (1)
- Sheridan College (1)
- Singapore Management University (1)
- Tennessee State University (1)
- University of Alabama at Birmingham (1)
- University of Kentucky (1)
- University of Louisville (1)
- University of Nevada, Las Vegas (1)
- University of Wisconsin Milwaukee (1)
- Publication Year
- Publication
-
- Turkish Journal of Electrical Engineering and Computer Sciences (15)
- Theses and Dissertations (7)
- Browse all Theses and Dissertations (6)
- Dissertations (5)
- Publications (5)
-
- Electronic Theses and Dissertations (4)
- Articles (3)
- Computer Science Faculty Publications (3)
- Chemical and Life Science Engineering Publications (2)
- Computer Science and Engineering Faculty Publications (2)
- Department of Computer Science and Engineering: Dissertations, Theses, and Student Research (2)
- Doctoral Dissertations (2)
- LSU Doctoral Dissertations (2)
- LSU Master's Theses (2)
- Modeling, Simulation and Visualization Student Capstone Conference (2)
- All Dissertations (1)
- All ETDs from UAB (1)
- CGU Theses & Dissertations (1)
- Chemistry Faculty Research & Creative Works (1)
- Computer Engineering (1)
- Computer Science Faculty Research (1)
- Department of Electrical and Computer Engineering Technical Reports (1)
- Department of Electrical and Computer Engineering: Faculty Publications (1)
- Dissertations, Theses, and Capstone Projects (1)
- Electrical & Computer Engineering Theses & Dissertations (1)
- Engineering Management & Systems Engineering Faculty Publications (1)
- Faculty & Staff Scholarship (1)
- International Conference on Information and Communication Technologies (1)
- Journal of Digital Forensics, Security and Law (1)
- Library Philosophy and Practice (e-journal) (1)
- Publication Type
Articles 1 - 30 of 87
Full-Text Articles in Engineering
Text-To-Sql: A Methodical Review Of Challenges And Models, Ali Buğra Kanburoğlu, Faik Boray Tek
Text-To-Sql: A Methodical Review Of Challenges And Models, Ali Buğra Kanburoğlu, Faik Boray Tek
Turkish Journal of Electrical Engineering and Computer Sciences
This survey focuses on Text-to-SQL, automated translation of natural language queries into SQL queries. Initially, we describe the problem and its main challenges. Then, by following the PRISMA systematic review methodology, we survey the existing Text-to-SQL review papers in the literature. We apply the same method to extract proposed Text-to-SQL models and classify them with respect to used evaluation metrics and benchmarks. We highlight the accuracies achieved by various models on Text-to-SQL datasets and discuss execution-guided evaluation strategies. We present insights into model training times and implementations of different models. We also explore the availability of Text-to-SQL datasets in non-English …
A Survey On Few-Shot Class-Incremental Learning, Songsong Tian, Lusi Li, Weijun Li, Hang Ran, Xin Ning, Prayag Tiwari
A Survey On Few-Shot Class-Incremental Learning, Songsong Tian, Lusi Li, Weijun Li, Hang Ran, Xin Ning, Prayag Tiwari
Computer Science Faculty Publications
Large deep learning models are impressive, but they struggle when real-time data is not available. Few-shot class-incremental learning (FSCIL) poses a significant challenge for deep neural networks to learn new tasks from just a few labeled samples without forgetting the previously learned ones. This setup can easily leads to catastrophic forgetting and overfitting problems, severely affecting model performance. Studying FSCIL helps overcome deep learning model limitations on data volume and acquisition time, while improving practicality and adaptability of machine learning models. This paper provides a comprehensive survey on FSCIL. Unlike previous surveys, we aim to synthesize few-shot learning and incremental …
A Chinese Power Text Classification Algorithm Based On Deep Active Learning, Song Deng, Qianliang Li, Renjie Dai, Siming Wei, Di Wu, Yi He, Xindong Wu
A Chinese Power Text Classification Algorithm Based On Deep Active Learning, Song Deng, Qianliang Li, Renjie Dai, Siming Wei, Di Wu, Yi He, Xindong Wu
Computer Science Faculty Publications
The construction of knowledge graph is beneficial for grid production, electrical safety protection, fault diagnosis and traceability in an observable and controllable way. Highly-precision text classification algorithm is crucial to build a professional knowledge graph in power system. Unfortunately, there are a large number of poorly described and specialized texts in the power business system, and the amount of data containing valid labels in these texts is low. This will bring great challenges to improve the precision of text classification models. To offset the gap, we propose a classification algorithm for Chinese text in the power system based on deep …
Exploring The Impact Of Training Datasets On Turkish Stance Detection, Muhammed Sai̇d Zengi̇n, Berk Utku Yeni̇sey, Mücahi̇d Kutlu
Exploring The Impact Of Training Datasets On Turkish Stance Detection, Muhammed Sai̇d Zengi̇n, Berk Utku Yeni̇sey, Mücahi̇d Kutlu
Turkish Journal of Electrical Engineering and Computer Sciences
Stance detection has garnered considerable attention from researchers due to its broad range of applications, including fact-checking and social computing. While state-of-the-art stance detection models are usually based on supervised machine learning methods, their effectiveness is heavily reliant on the quality of training data. This problem is more prevalent in stance detection task because the stance of a text is intimately tied to the target under consideration. While numerous datasets exist for stance detection, determining their suitability for a specific target can be challenging. In this work, we focus on Turkish stance detection and explore the impact of training data …
Inéire: An Interpretable Nlp Pipeline Summarizing Inclusive Policy Making Concerning Migrants In Ireland, Arefeh Kazem, Arjumand Younus, Mingyeong Jeon, Muhammad Atif Qureshi, Simon Caton
Inéire: An Interpretable Nlp Pipeline Summarizing Inclusive Policy Making Concerning Migrants In Ireland, Arefeh Kazem, Arjumand Younus, Mingyeong Jeon, Muhammad Atif Qureshi, Simon Caton
Articles
Reaching marginal and other migrant communities to elicit their political views and opinions is a well-known challenge. Social media has enabled a certain amount of online activism and participation, especially in societies with abundant multicultural identities. However, it can be quite challenging to isolate the voice of the migrant in English-speaking countries, especially with an abundance of content in English on social media. In this paper, we pursue a case study of Ireland’s Twitter landscape, specifically migrant and native activists. We present a methodology that can accurately ( >80% ) isolate the Irish migrant voice with as little as 25 …
Behind Derogatory Migrants' Terms For Venezuelan Migrants: Xenophobia And Sexism Identification With Twitter Data And Nlp, Joseph Martínez, Melissa Miller-Felton, Jose Padilla, Erika Frydenlund
Behind Derogatory Migrants' Terms For Venezuelan Migrants: Xenophobia And Sexism Identification With Twitter Data And Nlp, Joseph Martínez, Melissa Miller-Felton, Jose Padilla, Erika Frydenlund
Modeling, Simulation and Visualization Student Capstone Conference
The sudden arrival of many migrants can present new challenges for host communities and create negative attitudes that reflect that tension. In the case of Colombia, with the influx of over 2.5 million Venezuelan migrants, such tensions arose. Our research objective is to investigate how those sentiments arise in social media. We focused on monitoring derogatory terms for Venezuelans, specifically veneco and veneca. Using a dataset of 5.7 million tweets from Colombian users between 2015 and 2021, we determined the proportion of tweets containing those terms. We observed a high prevalence of xenophobic and defamatory language correlated with the …
Towards Nlp-Based Conceptual Modeling Frameworks, David Shuttleworth, Jose Padilla
Towards Nlp-Based Conceptual Modeling Frameworks, David Shuttleworth, Jose Padilla
Modeling, Simulation and Visualization Student Capstone Conference
This paper presents preliminary research using Natural Language Processing (NLP) to support the development of conceptual modeling frameworks. NLP-based frameworks are intended to lower the barrier of entry for non-modelers to develop models and to facilitate communication across disciplines considering simulations in research efforts. NLP drives conceptual modeling in two ways. Firstly, it attempts to automate the generation of conceptual models and simulation specifications, derived from non-modelers’ narratives, while standardizing the conceptual modeling process and outcome. Secondly, as the process is automated, it is simpler to replicate and be followed by modelers and non-modelers. This allows for using a common …
Solving Turkish Math Word Problems By Sequence-To-Sequence Encoder-Decoder Models, Esi̇n Gedi̇k, Tunga Güngör
Solving Turkish Math Word Problems By Sequence-To-Sequence Encoder-Decoder Models, Esi̇n Gedi̇k, Tunga Güngör
Turkish Journal of Electrical Engineering and Computer Sciences
Solving math word problems (MWP) is a challenging task due to the semantic gap between natural language texts and mathematical equations. The main purpose of the task is to take a written math problem as input and produce a proper equation as output for solving that problem. This paper describes a sequence-to-sequence (seq2seq) neural model for automatically solving Turkish MWPs based on their semantic meanings in the text. It comprises a bidirectional encoder to comprehend the semantics of the problem by encoding the input sequence and a decoder with attention to extract the equation by tracking the semantic meanings of …
Know An Emotion By The Company It Keeps: Word Embeddings From Reddit/Coronavirus, Alejandro García-Rudolph, David Sanchez-Pinsach, Dietmar Frey, Eloy Opisso, Katryna Cisek, John Kelleher
Know An Emotion By The Company It Keeps: Word Embeddings From Reddit/Coronavirus, Alejandro García-Rudolph, David Sanchez-Pinsach, Dietmar Frey, Eloy Opisso, Katryna Cisek, John Kelleher
Articles
Social media is a crucial communication tool (e.g., with 430 million monthly active users in online forums such as Reddit), being an objective of Natural Language Processing (NLP) techniques. One of them (word embeddings) is based on the quotation, “You shall know a word by the company it keeps,” highlighting the importance of context in NLP. Meanwhile, “Context is everything in Emotion Research.” Therefore, we aimed to train a model (W2V) for generating word associations (also known as embeddings) using a popular Coronavirus Reddit forum, validate them using public evidence and apply them to the discovery of context for specific …
Exploring Gender Bias In Semantic Representations For Occupational Classification In Nlp: Techniques And Mitigation Strategies, Joseph Michael O'Carroll
Exploring Gender Bias In Semantic Representations For Occupational Classification In Nlp: Techniques And Mitigation Strategies, Joseph Michael O'Carroll
Dissertations
Gender bias in Natural Language Processing (NLP) models is a non-trivial problem that can perpetuate and amplify existing societal biases. This thesis investigates gender bias in occupation classification and explores the effectiveness of different debiasing methods for language models to reduce the impact of bias in the model’s representations. The study employs a data-driven empirical methodology focusing heavily on experimentation and result investigation. The study uses five distinct semantic representations and models with varying levels of complexity to classify the occupation of individuals based on their biographies.
A Structure-Aware Generative Adversarial Network For Bilingual Lexicon Induction, Bocheng Han, Qian Tao, Lusi Li, Zhihao Xiong
A Structure-Aware Generative Adversarial Network For Bilingual Lexicon Induction, Bocheng Han, Qian Tao, Lusi Li, Zhihao Xiong
Computer Science Faculty Publications
Bilingual lexicon induction (BLI) is the task of inducing word translations with a learned mapping function that aligns monolingual word embedding spaces in two different languages. However, most previous methods treat word embeddings as isolated entities and fail to jointly consider both the intra-space and inter-space topological relations between words. This limitation makes it challenging to align words from embedding spaces with distinct topological structures, especially when the assumption of isomorphism may not hold. To this end, we propose a novel approach called the Structure-Aware Generative Adversarial Network (SA-GAN) model to explicitly capture multiple topological structure information to achieve accurate …
A Structured Narrative Prompt For Prompting Narratives From Large Language Models: Sentiment Assessment Of Chatgpt-Generated Narratives And Real Tweets, Christopher J. Lynch, Erik J. Jensen, Virginia Zamponi, Kevin O'Brien, Erika Frydenlund, Ross Gore
A Structured Narrative Prompt For Prompting Narratives From Large Language Models: Sentiment Assessment Of Chatgpt-Generated Narratives And Real Tweets, Christopher J. Lynch, Erik J. Jensen, Virginia Zamponi, Kevin O'Brien, Erika Frydenlund, Ross Gore
VMASC Publications
Large language models (LLMs) excel in providing natural language responses that sound authoritative, reflect knowledge of the context area, and can present from a range of varied perspectives. Agent-based models and simulations consist of simulated agents that interact within a simulated environment to explore societal, social, and ethical, among other, problems. Simulated agents generate large volumes of data and discerning useful and relevant content is an onerous task. LLMs can help in communicating agents' perspectives on key life events by providing natural language narratives. However, these narratives should be factual, transparent, and reproducible. Therefore, we present a structured narrative prompt …
Advancing Internet Viewpoint Diversity: A Novel Algorithm And A Corpus Creation Tool, Jeffrey Harwell
Advancing Internet Viewpoint Diversity: A Novel Algorithm And A Corpus Creation Tool, Jeffrey Harwell
CGU Theses & Dissertations
A fundamental requirement for Western democracy is an informed and engaged electorate with access to a wide range of viewpoints. However, concerns have arisen regarding how information technology affects the diversity of viewpoints available. In response to an increasingly polarized society and worries surrounding filter bubbles and algorithmic bias, this research presents a novel tool for constructing internet-based topical corpora and an algorithm tailored for viewpoint detection and the curation of diverse search results.Following a comprehensive exploration of viewpoint diversity through the lenses of mass media, social psychology, and information retrieval, this dissertation presents an approach to operationalize viewpoint diversity …
L3 Ensembles: Lifelong Learning Approach For Ensemble Of Foundational Language Models*, Aidin Shiri, Kaushik Roy, Amit Sheth, Manas Gaur
L3 Ensembles: Lifelong Learning Approach For Ensemble Of Foundational Language Models*, Aidin Shiri, Kaushik Roy, Amit Sheth, Manas Gaur
Publications
Fine-tuning pre-trained foundational language models (FLM) for specific tasks is often impractical, especially for resource-constrained devices. This necessitates the development of a Lifelong Learning (L3) framework that continuously adapts to a stream of Natural Language Processing (NLP) tasks efficiently. We propose an approach that focuses on extracting meaningful representations from unseen data, constructing a structured knowledge base, and improving task performance incrementally. We conducted experiments on various NLP tasks to validate its effectiveness, including benchmarks like GLUE and SuperGLUE. We measured good performance across the accuracy, training efficiency, and knowledge transfer metrics. Initial experimental results show that the proposed L3 …
Data-Driven Strategies For Disease Management In Patients Admitted For Heart Failure, Ankita Agarwal
Data-Driven Strategies For Disease Management In Patients Admitted For Heart Failure, Ankita Agarwal
Browse all Theses and Dissertations
Heart failure is a syndrome which effects a patient’s quality of life adversely. It can be caused by different underlying conditions or abnormalities and involves both cardiovascular and non-cardiovascular comorbidities. Heart failure cannot be cured but a patient’s quality of life can be improved by effective treatment through medicines and surgery, and lifestyle management. As effective treatment of heart failure incurs cost for the patients and resource allocation for the hospitals, predicting length of stay of these patients during each hospitalization becomes important. Heart failure can be classified into two types: left sided heart failure and right sided heart failure. …
Comparative Adjudication Of Noisy And Subjective Data Annotation Disagreements For Deep Learning, Scott David Williams
Comparative Adjudication Of Noisy And Subjective Data Annotation Disagreements For Deep Learning, Scott David Williams
Browse all Theses and Dissertations
Obtaining accurate inferences from deep neural networks is difficult when models are trained on instances with conflicting labels. Algorithmic recognition of online hate speech illustrates this. No human annotator is perfectly reliable, so multiple annotators evaluate and label online posts in a corpus. Labeling scheme limitations, differences in annotators' beliefs, and limits to annotators' honesty and carefulness cause some labels to disagree. Consequently, decisive and accurate inferences become less likely. Some practical applications such as social research can tolerate some indecisiveness. However, an online platform using an indecisive classifier for automated content moderation could create more problems than it solves. …
Enhanced Neurologic Concept Recognition Using A Named Entity Recognition Model Based On Transformers, Sima Azizi, Daniel B. Hier, Donald C. Wunsch
Enhanced Neurologic Concept Recognition Using A Named Entity Recognition Model Based On Transformers, Sima Azizi, Daniel B. Hier, Donald C. Wunsch
Chemistry Faculty Research & Creative Works
Although Deep Learning Has Been Applied to the Recognition of Diseases and Drugs in Electronic Health Records and the Biomedical Literature, Relatively Little Study Has Been Devoted to the Utility of Deep Learning for the Recognition of Signs and Symptoms. the Recognition of Signs and Symptoms is Critical to the Success of Deep Phenotyping and Precision Medicine. We Have Developed a Named Entity Recognition Model that Uses Deep Learning to Identify Text Spans Containing Neurological Signs and Symptoms and Then Maps These Text Spans to the Clinical Concepts of a Neuro-Ontology. We Compared a Model based on Convolutional Neural Networks …
Use Of Machine Learning And Natural Language Processing To Enhance Traffic Safety Analysis, Md Abu Sayed
Use Of Machine Learning And Natural Language Processing To Enhance Traffic Safety Analysis, Md Abu Sayed
Theses and Dissertations
Despite significant advances in vehicle technologies, safety data collection and analysis, and engineering advancements, tens of thousands of Americans die every year in motor vehicle crashes. Alarmingly, the trend of fatal and serious injury crashes appears to be heading in the wrong direction. In 2021, the actual rate of fatalities exceeded the predicted rate. This worrisome trend prompts and necessitates the development of advanced and holistic approaches to determining the causes of a crash (particularly fatal and major injuries). These approaches range from analyzing problems from multiple perspectives, utilizing available data sources, and employing the most suitable tools and technologies …
Identification Of Factors Contributing To Traffic Crashes By Analysis Of Text Narratives, Cristian D. Arteaga-Sanchez
Identification Of Factors Contributing To Traffic Crashes By Analysis Of Text Narratives, Cristian D. Arteaga-Sanchez
UNLV Theses, Dissertations, Professional Papers, and Capstones
The fatalities, injuries, and property damage that result from traffic crashes impose a significant burden on society. Current research and practice in traffic safety rely on analysis of quantitative data from crash reports to understand crash severity contributors and develop countermeasures. Despite advances from this effort, quantitative crash data suffers from drawbacks, such as the limited ability to capture all the information relevant to the crashes and the potential errors introduced during data collection. Crash narratives can help address these limitations, as they contain detailed descriptions of the context and sequence of events of the crash. However, the unstructured nature …
Natural Language Processing For Novel Writing, Leqing Qu, Okan Ersoy
Natural Language Processing For Novel Writing, Leqing Qu, Okan Ersoy
Department of Electrical and Computer Engineering Technical Reports
No abstract provided.
Diacritics Correction In Turkish With Context-Aware Sequence To Sequence Modeling, Asi̇ye Tuba Özge, Özge Bozal, Umut Özge
Diacritics Correction In Turkish With Context-Aware Sequence To Sequence Modeling, Asi̇ye Tuba Özge, Özge Bozal, Umut Özge
Turkish Journal of Electrical Engineering and Computer Sciences
Digital texts in many languages have examples of missing or misused diacritics which makes it hard for natural language processing applications to disambiguate the meaning of words. Therefore, diacritics restoration is a crucial step in natural language processing applications for many languages. In this study we approach this problem as bidirectional transformation of diacritical letters and their ASCII counterparts, rather than unidirectional diacritic restoration. We propose a context-aware character-level sequence to sequence model for this transformation. The model is language independent in the sense that no language-specific feature extraction is necessary other than the utilization of word embeddings and is …
Deception Detection Across Domains, Languages And Modalities, Subhadarshi Panda
Deception Detection Across Domains, Languages And Modalities, Subhadarshi Panda
Dissertations, Theses, and Capstone Projects
With the increase of deception and misinformation especially in social media, it has become crucial to develop machine learning methods to automatically identify deception. In this dissertation, we identify key challenges underlying text-based deception detection in a cross-domain setting, where we do not have training data in the target domain. We analyze the differences between domains and as a result develop methods to improve cross-domain deception detection. We additionally develop approaches that take advantage of cross-lingual properties to support deception detection across languages. This involves the usage of either multilingual NLP models or translation models. Finally, to better understand multi-modal …
Applied Deep Learning: Case Studies In Computer Vision And Natural Language Processing, Md Reshad Ul Hoque
Applied Deep Learning: Case Studies In Computer Vision And Natural Language Processing, Md Reshad Ul Hoque
Electrical & Computer Engineering Theses & Dissertations
Deep learning has proved to be successful for many computer vision and natural language processing applications. In this dissertation, three studies have been conducted to show the efficacy of deep learning models for computer vision and natural language processing. In the first study, an efficient deep learning model was proposed for seagrass scar detection in multispectral images which produced robust, accurate scars mappings. In the second study, an arithmetic deep learning model was developed to fuse multi-spectral images collected at different times with different resolutions to generate high-resolution images for downstream tasks including change detection, object detection, and land cover …
Digitalization Of Construction Project Requirements Using Natural Language Processing (Nlp) Techniques, Fahad Ul Hassan
Digitalization Of Construction Project Requirements Using Natural Language Processing (Nlp) Techniques, Fahad Ul Hassan
All Dissertations
Contract documents are a critical legal component of a construction project that specify all wishes and expectations of the owner toward the design, construction, and handover of a project. A single contract package, especially of a design-build (DB) project, comprises hundreds of documents including thousands of requirements. Precise comprehension and management of the requirements are critical to ensure that all important explicit and implicit requirements of the project scope are captured, managed, and completed. Since requirements are mainly written in a natural human language, the current manual methods impose a significant burden on practitioners to process and restructure them into …
An Empirical Study Of Memorization In Nlp, Xiaosen Zheng, Jing Jiang
An Empirical Study Of Memorization In Nlp, Xiaosen Zheng, Jing Jiang
Research Collection School Of Computing and Information Systems
A recent study by Feldman (2020) proposed a long-tail theory to explain the memorization behavior of deep learning models. However, memorization has not been empirically verified in the context of NLP, a gap addressed by this work. In this paper, we use three different NLP tasks to check if the long-tail theory holds. Our experiments demonstrate that top-ranked memorized training instances are likely atypical, and removing the top-memorized training instances leads to a more serious drop in test accuracy compared with removing training instances randomly. Furthermore, we develop an attribution method to better understand why a training instance is memorized. …
Toward Suicidal Ideation Detection With Lexical Network Features And Machine Learning, Ulya Bayram, William Lee, Daniel Santel, Ali Minai, Peggy Clark, Tracy Glauser, John Pestian
Toward Suicidal Ideation Detection With Lexical Network Features And Machine Learning, Ulya Bayram, William Lee, Daniel Santel, Ali Minai, Peggy Clark, Tracy Glauser, John Pestian
Northeast Journal of Complex Systems (NEJCS)
In this study, we introduce a new network feature for detecting suicidal ideation from clinical texts and conduct various additional experiments to enrich the state of knowledge. We evaluate statistical features with and without stopwords, use lexical networks for feature extraction and classification, and compare the results with standard machine learning methods using a logistic classifier, a neural network, and a deep learning method. We utilize three text collections. The first two contain transcriptions of interviews conducted by experts with suicidal (n=161 patients that experienced severe ideation) and control subjects (n=153). The third collection consists of interviews conducted by experts …
Event-Related Microblog Retrieval In Turkish, Çağri Toraman
Event-Related Microblog Retrieval In Turkish, Çağri Toraman
Turkish Journal of Electrical Engineering and Computer Sciences
Microblogs, such as tweets, are short messages in which users are able to share any opinion and information. Microblogs are mostly related to real-life events reported in news articles. Finding event-related microblogs is important to analyze online social networks and understand public opinion on events. However, finding such microblogs is a challenging task due to the dynamic nature of microblogs and their limited length. In this study, assuming that news articles are given as queries and microblogs as documents, we find event-related microblogs in Turkish. In order to represent news articles and microblogs, we examine encoding methods, namely traditional bag-of-words …
Narrative Analysis Of Open-Source Social Media Activity In The Indopacom Aor, Aaron K. Glenn
Narrative Analysis Of Open-Source Social Media Activity In The Indopacom Aor, Aaron K. Glenn
Theses and Dissertations
Emotion classification can be a powerful tool to derive narratives from social media data. Recurrent Neural Networks (RNN) can meet or exceed the performance of state-of-the-art traditional machine learning techniques using exclusively open-source data and models. Specifically, these results show that RNN variants can produce more than an 8% gain in accuracy in comparison to Logistic Regression and SVM techniques and a 15% gain over Random Forest when using FastText embeddings. This research found a statistical significance in the performance of a single layer Bi-directional Long Short-Term Memory (Bi-LSTM) model over a 2-layer stacked Bi-LSTM model. This research also found …
Analysis Of Twitter Networks To Aid Open Source Intelligence Capabilities: A Multilayer Network Approach, Austin P. Logan
Analysis Of Twitter Networks To Aid Open Source Intelligence Capabilities: A Multilayer Network Approach, Austin P. Logan
Theses and Dissertations
Open Source Intelligence using social media is a practice which gives military intelligence analysts a window into the thoughts and minds of an online population. Using Social Network Analysis, user interactions on Twitter will be modeled as a weighted and directed network. Topic modeling through Latent Dirichlet Allocation uncovers the topics of discussion in Tweets and is then integrated into a multi-layer network which allows users to be connected to the conversations with which they have participated. Influential users in this network as well as highly connected groups of individuals are then discovered to paint a picture for intelligence analysts …
Nlp@Vcu: Crop Characteristic Extraction Framework, Cora Lewis, Bridget Mcinnes, Getiria Onsongo
Nlp@Vcu: Crop Characteristic Extraction Framework, Cora Lewis, Bridget Mcinnes, Getiria Onsongo
Summer REU Program
We developed a crop characteristic extraction framework. Starting from a custom SpaCy named entity recognition model, we added pre-trained word embeddings and a part-of-speech based entity expansion post-processing step. Then, we implemented an evaluation framework that functioned as a 5-fold cross validation wrapper for SpaCy custom training. Preliminary results showed improvement in the extraction framework after these additions.