Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Physical Sciences and Mathematics (83)
- Computer Sciences (76)
- Artificial Intelligence and Robotics (43)
- Arts and Humanities (43)
- Discourse and Text Linguistics (23)
-
- Semantics and Pragmatics (23)
- Phonetics and Phonology (22)
- Applied Linguistics (20)
- Communication (19)
- Psycholinguistics and Neurolinguistics (18)
- Psychology (18)
- Engineering (17)
- First and Second Language Acquisition (16)
- Language Description and Documentation (14)
- Computer Engineering (13)
- Other Computer Sciences (13)
- Library and Information Science (12)
- Syntax (12)
- Other Linguistics (11)
- Anthropological Linguistics and Sociolinguistics (10)
- Cognition and Perception (9)
- Communication Technology and New Media (9)
- Comparative and Historical Linguistics (9)
- Databases and Information Systems (9)
- Medicine and Health Sciences (9)
- Business (8)
- Data Science (8)
- Institution
-
- City University of New York (CUNY) (54)
- Technological University Dublin (29)
- University of Kentucky (15)
- Selected Works (12)
- University of Massachusetts Amherst (12)
-
- Brigham Young University (9)
- SelectedWorks (7)
- University of Nebraska - Lincoln (7)
- Chapman University (6)
- Western University (5)
- East Tennessee State University (4)
- Air Force Institute of Technology (3)
- Bard College (3)
- California Polytechnic State University, San Luis Obispo (3)
- Minnesota State University, Mankato (3)
- Montclair State University (3)
- University of Central Florida (3)
- Binghamton University (2)
- Boise State University (2)
- Claremont Colleges (2)
- Florida International University (2)
- Iowa State University (2)
- Portland State University (2)
- University of Tennessee, Knoxville (2)
- Ursinus College (2)
- Bowling Green State University (1)
- COBRA (1)
- Cedarville University (1)
- Clemson University (1)
- Dartmouth College (1)
- Keyword
-
- Computational linguistics (20)
- Natural Language Processing (15)
- Natural language processing (12)
- Machine learning (10)
- Computational Linguistics (7)
-
- Machine Learning (7)
- Dialogue (6)
- Linguistics (6)
- Phonology (6)
- Prosody (6)
- WordNet (6)
- Data mining (5)
- Deep learning (5)
- Prepositions (5)
- Sentiment analysis (5)
- Social media (5)
- Artificial intelligence (4)
- BERT (4)
- Computational linguistics: LSA, Second language assesment (4)
- Corpus (4)
- Corpus linguistics (4)
- Semantics (4)
- Situated Dialog (4)
- Sociolinguistics (4)
- Spatial Language (4)
- Spatial Templates (4)
- Stylometry (4)
- Text classification (4)
- Text mining (4)
- Authorship attribution (3)
- Publication Year
- Publication
-
- Dissertations, Theses, and Capstone Projects (49)
- Conference papers (18)
- Faculty Publications (10)
- Theses and Dissertations--Linguistics (9)
- Doctoral Dissertations (8)
-
- Masters Theses (6)
- Publications and Research (5)
- Articles (4)
- Communication Sciences and Disorders Faculty Articles and Research (4)
- Conference Papers (4)
- Electronic Thesis and Dissertation Repository (4)
- Yves Bestgen (4)
- Electronic Theses and Dissertations (3)
- Other Resources (3)
- Theses and Dissertations (3)
- CGU Faculty Publications and Research (2)
- CSE Conference and Workshop Papers (2)
- Commonwealth Computational Summit (2)
- Computer Science Summer Fellows (2)
- Department of Linguistics Faculty Scholarship and Creative Works (2)
- Dissertations (2)
- Electronic Literature Organization Conference 2020 (2)
- Elena Cotos (2)
- FIU Electronic Theses and Dissertations (2)
- Honors Theses (2)
- Jonathan Howell (2)
- Library Philosophy and Practice (e-journal) (2)
- Master's Theses (2)
- Northeast Journal of Complex Systems (NEJCS) (2)
- Robert Frank (2)
Articles 31 - 60 of 233
Full-Text Articles in Computational Linguistics
Automatic Transcription Of Northern Prinmi Oral Art: Approaches And Challenges To Automatic Speech Recognition For Language Documentation, Connor Bechler
Automatic Transcription Of Northern Prinmi Oral Art: Approaches And Challenges To Automatic Speech Recognition For Language Documentation, Connor Bechler
Theses and Dissertations--Linguistics
One significant issue facing language documentation efforts is the transcription bottleneck: each documented recording must be transcribed and annotated, and these tasks are extremely labor intensive (Ćavar et al., 2016). Researchers have sought to accelerate these tasks with partial automation via forced alignment, natural language processing, and automatic speech recognition (ASR) (Neubig et al., 2020). Neural network—especially transformer-based—approaches have enabled large advances in ASR over the last decade. Models like XLSR-53 promise improved performance on under-resourced languages by leveraging massive data sets from many different languages (Conneau et al., 2020). This project extends these efforts to a novel context, applying …
‘A Category Of Their Own’: Quantitative Methods In The Use Of Pile-Sort Data In Perceptual Dialectology, Zachary Ty Gill
‘A Category Of Their Own’: Quantitative Methods In The Use Of Pile-Sort Data In Perceptual Dialectology, Zachary Ty Gill
Theses and Dissertations--Linguistics
The purpose of this study is to investigate how Mississippi Gulf Coast Creoles perceive language differences in their home area. A pile-sort task was carried out in which respondents were given stacks of cards with local communities written on them and instructed to stack together the regions where people “talk the same.” Once the piles were made, the fieldworker discussed their sortings with the respondents. The stacks were analyzed by means of a hierarchal agglomerative cluster analysis and non-parametric multidimensional scaling with k-means cluster analysis overlays to extract the perceived dialect areas. The groupings reveal that respondent strategies are based …
Brazilian Portuguese-Russian (Braporus) Corpus: Automatic Transcription And Acoustic Quality Of Elderly Speech During Covid-19 Pandemic, Irina A. Sekerina, Anna Smirnova Henriques, Aleksandra Skorobogatova, Natalia Tyulina, Tatiana V. Kachkovskaia, Svetlana Ruseishvili, Sandra Madureira
Brazilian Portuguese-Russian (Braporus) Corpus: Automatic Transcription And Acoustic Quality Of Elderly Speech During Covid-19 Pandemic, Irina A. Sekerina, Anna Smirnova Henriques, Aleksandra Skorobogatova, Natalia Tyulina, Tatiana V. Kachkovskaia, Svetlana Ruseishvili, Sandra Madureira
Publications and Research
This article presents the Brazilian Portuguese-Russian (BraPoRus) corpus, whose goal is to collect, analyze, and preserve for posterity the spoken heritage Russian still used today in Brazil by approximately 1,500 elderly bilingual heritage Russian–Brazilian Portuguese speakers. Their unique 100-year-old variety of moribund Russian is disappearing because it has not been passed to their descendants born in Brazil. During the COVID-19 pandemic, we remotely collected 170 h of speech samples in heritage Russian from 26 participants (Mage = 75.7 years) in naturalistic settings using Zoom or a phone call. To estimate the quality of collected data, we focus on two methodological …
Technology In The Classroom: The Features Language Teachers Should Consider, Sophie Cuocci, Padideh Fattahi Marnani
Technology In The Classroom: The Features Language Teachers Should Consider, Sophie Cuocci, Padideh Fattahi Marnani
Journal of English Learner Education
The fast development of technology and the new generation of highly computer literate students led to consider the integration of technology in school as essential. Throughout the last two decades, research has identified multiple factors leading to the successful and unsuccessful integration of technology in the classroom. Educators must consider these factors when deciding on which technology tools to use and how to integrate them to their lessons. Simultaneously, the increasing number of English learners in the United States calls for the identification of teaching strategies that will best support their needs. Many language teachers now rely on teaching techniques …
Creating Data From Unstructured Text With Context Rule Assisted Machine Learning (Craml), Stephen Meisenbacher, Peter Norlander
Creating Data From Unstructured Text With Context Rule Assisted Machine Learning (Craml), Stephen Meisenbacher, Peter Norlander
School of Business: Faculty Publications and Other Works
Popular approaches to building data from unstructured text come with limitations, such as scalability, interpretability, replicability, and real-world applicability. These can be overcome with Context Rule Assisted Machine Learning (CRAML), a method and no-code suite of software tools that builds structured, labeled datasets which are accurate and reproducible. CRAML enables domain experts to access uncommon constructs within a document corpus in a low-resource, transparent, and flexible manner. CRAML produces document-level datasets for quantitative research and makes qualitative classification schemes scalable over large volumes of text. We demonstrate that the method is useful for bibliographic analysis, transparent analysis of proprietary data, …
Data-Driven Neuroanatomical Subtypes In Various Stages Of Schizophrenia: Linking Cortical Thickness, Glutamate, And Language Functioning, Liangbing Liang
Data-Driven Neuroanatomical Subtypes In Various Stages Of Schizophrenia: Linking Cortical Thickness, Glutamate, And Language Functioning, Liangbing Liang
Electronic Thesis and Dissertation Repository
The considerable variation in the spatial distribution of cortical thickness changes has been used to parse heterogeneity in schizophrenia. We aimed to recover a ‘cortical impoverishment’ subgroup with widespread cortical thinning. We applied hierarchical cluster analysis to cortical thickness data of three datasets in different stages of psychosis and studied the cognitive, functional, neurochemical, language and symptom profiles of the observed subgroups. Our consensus-based clustering procedure consistently produced a subgroup characterized by significantly lower cortical thickness. This ‘cortical impoverishment’ subgroup was associated with a higher symptom burden in a clinically stable sample and higher glutamate levels with language impairments in …
Phonotactic Learning With Distributional Representations, Max A. Nelson
Phonotactic Learning With Distributional Representations, Max A. Nelson
Doctoral Dissertations
This dissertation explores the possibility that the phonological grammar manipulates phone representations based on learned distributional class memberships rather than those based on substantive linguistic features. In doing so, this work makes three primary contributions. First, I propose three novel algorithms for learning a phonological class system from the distributional statistics of a language, all of which are based on partitioning graph representations of phone distributions. Second, I propose a new method for fitting Maximum Entropy phonotactic grammars, MaxEntGrams, which offers theoretical complexity improvements over the widely-adopted approach taken by Hayes and Wilson [2008]. Third, I present a series of …
Restrictive Tier Induction, Seoyoung Kim
Restrictive Tier Induction, Seoyoung Kim
Doctoral Dissertations
This dissertation proposes the Restrictive Tier Learner, which automatically induces only the tiers that are absolutely necessary in capturing phonological long-distance dependencies. The core of my learner is the addition of an extra evaluation step to the existing Inductive Projection Learner (Gouskova and Gallagher 2020), where the necessity and accuracy of the candidate tiers are determined. An important building block of my learner is a typological observation, namely the dichotomy between trigram-bound and unbounded patterns. The fact that this dichotomy is attested in both consonant interactions and vowel interactions allows for a unified approach to be used. Another important piece …
From Sesame Street To Beyond: Multi-Domain Discourse Relation Classification With Pretrained Bert, Isaac R. Raff
From Sesame Street To Beyond: Multi-Domain Discourse Relation Classification With Pretrained Bert, Isaac R. Raff
Dissertations, Theses, and Capstone Projects
Research efforts in transfer learning have gained massive popularity in recent years. Pretrained language models have demonstrated the most successful results in producing high quality neural networks capable of quality inference after training across domains via transfer learning. This study expands on the domain transfer introduced in \cite{ferracane-etal-2019-news} exploring neural methods for transfer learning of discourse parsing between a news source domain and a medical target domain. \cite{ferracane-etal-2019-news} specifically discuss transfer learning from news articles to PubMed medical journal articles. Experiments in transfer learning in the current work expand to include three domains: Wall Street Journal articles previously annotated with …
Linguistic Abstractions In Children’S Very Early Utterances, Qihui Xu
Linguistic Abstractions In Children’S Very Early Utterances, Qihui Xu
Dissertations, Theses, and Capstone Projects
How early do children produce multiword utterances? Do children's early utterances reflect abstract syntactic knowledge or are they the result of data-driven learning? We examine this issue through corpus analysis, computational modeling, and adult simulation experiments. Chapter 1 investigates when children start producing multiword utterances; we use corpora to establish the development of multiword utterances and a probabilistic computational model to account for the quantitative change of early multiword utterances. We find that multiword utterances of different lengths appear early in acquisition and increase together, and the length growth pattern can be viewed as a probabilistic and dynamic process.
Chapter …
Predicting Stress In Russian Using Modern Machine-Learning Tools, John Schriner
Predicting Stress In Russian Using Modern Machine-Learning Tools, John Schriner
Dissertations, Theses, and Capstone Projects
In the Russian language, stress on a word is determined via often complex patterns and rules. In this paper, after examining nearly a century of research in stress rules and methods in Russian, we turn to see if modern machine learning tools can aid in predicting stress. Using A.A. Zaliznyak’s dictionary grammar and over 300,000 word forms, we derived stress codes to aid in predicting which syllable primary stress falls on. We trained an LSTM neural network on the data and conducted eight experiments with added features such as lemma, part of speech, and morphology. While the model performed better …
Towards Explaining Variation In Entrainment, Andreas Weise
Towards Explaining Variation In Entrainment, Andreas Weise
Dissertations, Theses, and Capstone Projects
Entrainment refers to the tendency of human speakers to adapt to their interlocutors to become more similar to them. This affects various dimensions and occurs in many contexts, allowing for rich applications in human-computer interaction. However, it is not exhibited by every speaker in every conversation but varies widely across features, speakers, and contexts, hindering broad application. This variation, whose guiding principles are poorly understood even after decades of entrainment research, is the subject of this thesis. We begin with a comprehensive literature review that serves as the foundation of our own work and provides a reference to guide future …
Spectral Analysis Of Multiscale Cultural Traits On Twitter, Chandler Squires, Nikhil Kunapuli, Yaneer Bar-Yam, Alfredo Morales
Spectral Analysis Of Multiscale Cultural Traits On Twitter, Chandler Squires, Nikhil Kunapuli, Yaneer Bar-Yam, Alfredo Morales
Northeast Journal of Complex Systems (NEJCS)
Understanding and mapping the emergence and boundaries of cultural areas is a challenge for social sciences. In this paper, we present a method for analyzing the cultural composition of regions via Twitter hashtags. Cultures can be described as distinct combination of traits which we capture via principal component analysis (PCA). We investigate the top 8 PCA components of an area including France, Spain, and Portugal, in terms of the geographic distribution of their hashtag composition. We also discuss relationships between components and the insights those relationships can provide into the structure of a cultural space. Finally, we compare the spatial …
Generic Ab Initio, James A. Heilpern, Earl Kjar Brown, William G. Eggington, Zachary D. Smith
Generic Ab Initio, James A. Heilpern, Earl Kjar Brown, William G. Eggington, Zachary D. Smith
Buffalo Law Review
From comic conventions to disbanded dioceses, courts continue to struggle with a unique but puzzling question of trademark law. Federal law protects certain terms that refer to a product or service from a specific producer instead of to a product generally. Terms that refer to products are considered generic and cannot receive protection. Courts have also held that a term that was generic at the time the party adopted the mark cannot receive protection, even if the public later views it as being specific to a particular producer. But, many marks were adopted decades or centuries ago. As a result, …
Corrective Feedback Timing In Kanji Writing Instruction Apps, Phoenix Mulgrew
Corrective Feedback Timing In Kanji Writing Instruction Apps, Phoenix Mulgrew
Honors Theses
The focus of this research paper is to determine the correct time to provide corrective feedback to people who are learning how to write Japanese kanji. To do this, we developed a system that is able to recognize Japanese kanji that is handwritten onto an iPad screen and check for errors such as wrong stroke order. Previous research has achieved success in developing similar systems, but this project is unique because the research question involves the timing of corrective feedback. In particular, we are looking at whether immediate or delayed corrective feedback results in better learning.
A Machine Learning Approach To Text-Based Sarcasm Detection, Lara I. Novic
A Machine Learning Approach To Text-Based Sarcasm Detection, Lara I. Novic
Dissertations, Theses, and Capstone Projects
Sarcasm and indirect language are commonplace for humans to produce and recognize but difficult for machines to detect. While artificial intelligence can accurately analyze sentiment and emotion in speech and text, it may struggle with insincere and sardonic content, although it is possible to train a machine to identify uttered and written sarcasm. This paper aims to detect sarcasm using logistic regression and a support vector machine (SVM) and compare their results to a baseline.
The models are trained on headlines from a Kaggle dataset containing headlines from the satirical news website The Onion and serious news website Huffpost (formerly …
Covert Determiners In Appalachian English Narrative Declarative Sentences, William Oliver
Covert Determiners In Appalachian English Narrative Declarative Sentences, William Oliver
Dissertations, Theses, and Capstone Projects
In this thesis, I explore the syntax and semantics of covert determiners (Ds) in matrix subject determiner phrases (DPs) with definite specific interpretations. To conduct my investigation, I used the Audio-Aligned and Parsed Corpus of Appalachian English (AAPCAppE), a million-word Penn Treebank corpus, and the software CorpusSearch, a Java program that searches Penn Treebank corpora. My research shows that Appalachian English contains a linguistic phenomenon where speakers drop the D, replacing overt Ds with covert Ds, in definite specific DPs. For example, where Standard English speakers say The doctor came by horseback, Appalachian speakers may use a covert D …
Integrating Cultural Knowledge Into Artificially Intelligent Systems: Human Experiments And Computational Implementations, Anurag Acharya
Integrating Cultural Knowledge Into Artificially Intelligent Systems: Human Experiments And Computational Implementations, Anurag Acharya
FIU Electronic Theses and Dissertations
With the advancement of Artificial Intelligence, it seems as if every aspect of our lives is impacted by AI in one way or the other. As AI is used for everything from driving vehicles to criminal justice, it becomes crucial that it overcome any biases that might hinder its fair application. We are constantly trying to make AI be more like humans. But most AI systems so far fail to address one of the main aspects of humanity: our culture and the differences between cultures. We cannot truly consider AI to have understood human reasoning without understanding culture. So it …
Metaphor Detection In Poems In Misurata Arabic Sub-Dialect : An Lstm Model, Azza Abugharsa
Metaphor Detection In Poems In Misurata Arabic Sub-Dialect : An Lstm Model, Azza Abugharsa
Theses, Dissertations and Culminating Projects
Natural Language Processing (NLP) in Arabic is witnessing an increasing interest in investigating different topics in the field. One of the topics that have drawn attention is the automatic processing of Arabic figurative language. The focus in previous projects is on detecting and interpreting metaphors in comments from social media as well as phrases and/or headlines from news articles. The current project focuses on metaphor detection in poems written in the Misurata Arabic sub-dialect spoken in Misurata, located in the North African region. The dataset is initially annotated by a group of linguists, and their annotation is treated as the …
“I Can See The Forest For The Trees”: Examining Personality Traits With Trasformers, Alexander Moore
“I Can See The Forest For The Trees”: Examining Personality Traits With Trasformers, Alexander Moore
All Dissertations
Our understanding of Personality and its structure is rooted in linguistic studies operating under the assumptions made by the Lexical Hypothesis: personality characteristics that are important to a group of people will at some point be codified in their language, with the number of encoded representations of a personality characteristic indicating their importance. Qualitative and quantitative efforts in the dimension reduction of our lexicon throughout the mid-20th century have played a vital role in the field’s eventual arrival at the widely accepted Five Factor Model (FFM). However, there are a number of presently unresolved conflicts regarding the breadth and …
Toward Suicidal Ideation Detection With Lexical Network Features And Machine Learning, Ulya Bayram, William Lee, Daniel Santel, Ali Minai, Peggy Clark, Tracy Glauser, John Pestian
Toward Suicidal Ideation Detection With Lexical Network Features And Machine Learning, Ulya Bayram, William Lee, Daniel Santel, Ali Minai, Peggy Clark, Tracy Glauser, John Pestian
Northeast Journal of Complex Systems (NEJCS)
In this study, we introduce a new network feature for detecting suicidal ideation from clinical texts and conduct various additional experiments to enrich the state of knowledge. We evaluate statistical features with and without stopwords, use lexical networks for feature extraction and classification, and compare the results with standard machine learning methods using a logistic classifier, a neural network, and a deep learning method. We utilize three text collections. The first two contain transcriptions of interviews conducted by experts with suicidal (n=161 patients that experienced severe ideation) and control subjects (n=153). The third collection consists of interviews conducted by experts …
Prácticas Comunicativas Digitales Y Construcción De Subjetividades: El Uso Del Podcast En La Escuela, María Isabel Guevara Rodríguez
Prácticas Comunicativas Digitales Y Construcción De Subjetividades: El Uso Del Podcast En La Escuela, María Isabel Guevara Rodríguez
Doctorado en Educación y Sociedad
No abstract provided.
From An Art To A Science: Features And Methodology In Computational Authorship Identification, Jonathan I. Manczur
From An Art To A Science: Features And Methodology In Computational Authorship Identification, Jonathan I. Manczur
Dissertations, Theses, and Capstone Projects
Nearly thirty years ago, the United States Supreme Court revaluated the criteria for accepting forensic science and expert testimony, challenging Forensic Linguistics to assert itself as a reputable science. Much work has been produced in the interim to that end, but much still needs to be accomplished to satisfy the judicial standards. Computational linguistics has the potential to provide that necessary analytical framework. This paper’s intent is two-fold. First, there are two competing theories on the proper features necessary to identify an unknown author. Four features were drawn from the syntactic computational linguistics tradition and four from computational stylometry to …
Exploring The Personality Of Virtual Tutors In Conversational Foreign Language Practice, Johanna Dobbriner, Cathy Ennis, Robert J. Ross
Exploring The Personality Of Virtual Tutors In Conversational Foreign Language Practice, Johanna Dobbriner, Cathy Ennis, Robert J. Ross
Conference papers
Fluid interaction between virtual agents and humans requires the understanding of many issues of conversational pragmatics. One such issue is the interaction between communication strategy and personality. As a step towards developing models of personality driven pragmatics policies, in this paper, we present our initial experiment to explore differences in user interaction with two contrasting avatar personalities. Each user saw a single personality in a video-call setting and gave feedback on the interaction. Our expectations, that a more extroverted outgoing positive personality would be a more successful tutor, were only partially confirmed. While this personality did induce longer conversations in …
Label Imputation For Homograph Disambiguation: Theoretical And Practical Approaches, Jennifer M. Seale
Label Imputation For Homograph Disambiguation: Theoretical And Practical Approaches, Jennifer M. Seale
Dissertations, Theses, and Capstone Projects
This dissertation presents the first implementation of label imputation for the task of homograph disambiguation using 1) transcribed audio, and 2) parallel, or translated, corpora. For label imputation from parallel corpora, a hypothesis of interlingual alignment between homograph pronunciations and text word forms is developed and formalized. Both audio and parallel corpora label imputation techniques are tested empirically in experiments that compare homograph disambiguation model performance using: 1) hand-labeled training data, and 2) hand-labeled training data augmented with label-imputed data. Regularized, multinomial logistic regression and pre-trained ALBERT, BERT, and XLNet language models fine-tuned as token classifiers are developed for homograph …
Detection And Morphological Analysis Of Novel Russian Loanwords, Yulia Spektor
Detection And Morphological Analysis Of Novel Russian Loanwords, Yulia Spektor
Dissertations, Theses, and Capstone Projects
This paper investigates recent English loanwords in Russian and explores ways in which computational methods can help further theoretical research. The goal of the study is two-fold: to find new, previously unattested loanwords borrowed over the last decade and to examine the rate of adaptation of the new borrowings, attested by the degree to which they conform to the constraints of the Russian language. First, we train a finite-state pipeline that combines character n-gram language models, which encode phonotactic and lexical properties of loanwords, with a binary classifier to detect loanwords. The model achieves state-of-the-art performance results during evaluation, surpassing …
Learning Phonology With Sequence-To-Sequence Neural Networks, Brandon Prickett
Learning Phonology With Sequence-To-Sequence Neural Networks, Brandon Prickett
Doctoral Dissertations
This dissertation tests sequence-to-sequence neural networks to see whether they can simulate human phonological learning and generalization in a number of artificial language experiments. These experiments and simulations are organized into three chapters: one on opaque interactions, one on computational complexity in phonology, and one on reduplication. The first chapter focuses on two biases involving interactions that have been proposed in the past: a bias for transparent patterns and a bias for patterns that maximally utilize all of the processes in a language. The second chapter looks at harmony patterns of varying complexity to see whether both Formal Language Theory …
The Public Innovations Explorer: A Geo-Spatial & Linked-Data Visualization Platform For Publicly Funded Innovation Research In The United States, Seth Schimmel
Dissertations, Theses, and Capstone Projects
The Public Innovations Explorer (https://sethsch.github.io/innovations-explorer/app/index.html) is a web-based tool created using Node.js, D3.js and Leaflet.js that can be used for investigating awards made by Federal agencies and departments participating in the Small Business Innovation Research (SBIR) and Small Business Technology Transfer (STTR) grant-making programs between 2008 and 2018. By geocoding the publicly available grants data from SBIR.gov, the Public Innovations Explorer allows users to identify companies performing publicly-funded innovative research in each congressional district and obtain dynamic district-level summaries of funding activity by agency and year. Applying spatial clustering techniques on districts' employment levels across major economic sectors provides users …
Predicting Stock Price Movements Using Sentiment And Subjectivity Analyses, Andrew Kirby
Predicting Stock Price Movements Using Sentiment And Subjectivity Analyses, Andrew Kirby
Dissertations, Theses, and Capstone Projects
In a quick search online, one can find many tools which use information from news headlines to make predictions concerning the trajectory of a given stock. But what if we went further, looking instead into the text of the article, to extract this and other information? Here, the goal is to extract the sentence in which a stock ticker symbol is mentioned from a news article, then determine sentiment and subjectivity values from that sentence, and finally make a prediction on whether or not the value of that stock will go up or not in a 24-hour timespan. Bloomberg News …
Plprepare: A Grammar Checker For Challenging Cases, Jacob Hoyos
Plprepare: A Grammar Checker For Challenging Cases, Jacob Hoyos
Electronic Theses and Dissertations
This study investigates one of the Polish language’s most arbitrary cases: the genitive masculine inanimate singular. It collects and ranks several guidelines to help language learners discern its proper usage and also introduces a framework to provide detailed feedback regarding arbitrary cases. The study tests this framework by implementing and evaluating a hybrid grammar checker called PLPrepare. PLPrepare performs similarly to other grammar checkers and is able to detect genitive case usages and provide feedback based on a number of error classifications.