Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Artificial Intelligence and Robotics (2)
- Arts and Humanities (2)
- Computer Sciences (2)
- Phonetics and Phonology (2)
- Physical Sciences and Mathematics (2)
-
- Psychology (2)
- Semantics and Pragmatics (2)
- Applied Linguistics (1)
- Art and Design (1)
- Communication (1)
- Computer Engineering (1)
- Digital Communications and Networking (1)
- Engineering (1)
- English Language and Literature (1)
- Film and Media Studies (1)
- Game Design (1)
- Interactive Arts (1)
- Journalism Studies (1)
- Library and Information Science (1)
- Mass Communication (1)
- OS and Networks (1)
- Other Film and Media Studies (1)
- Psycholinguistics and Neurolinguistics (1)
- Social Psychology (1)
- Theory and Philosophy (1)
- Typological Linguistics and Linguistic Diversity (1)
- Keyword
-
- Prosody (2)
- Acoustic prominence (1)
- American Sign Language (1)
- Applied linguistics (1)
- Brows (1)
-
- Cognate detection (1)
- Computational linguistics (1)
- Computer animation (1)
- Corpus (1)
- Corpus linguistics. (1)
- Corpus lingusitics (1)
- Corpus-based research (1)
- Data mining (1)
- Deception Detection (NLP) (1)
- Deception detection (1)
- Digg (1)
- Digital humanities (1)
- Edit distance (1)
- Language corpus (1)
- Language learning and teaching (1)
- Low-resource (1)
- Machine learning (1)
- Natural language processing (1)
- Network analysis (1)
- Neural networks (1)
- News (1)
- News verification (1)
- Nonmanual signals (1)
- Parsing (1)
- Ph.D. Dissertation (1)
Articles 1 - 12 of 12
Full-Text Articles in Computational Linguistics
Phonologically Informed Edit Distance Algorithms For Word Alignment With Low-Resource Languages, Richard T. Mccoy, Robert Frank
Phonologically Informed Edit Distance Algorithms For Word Alignment With Low-Resource Languages, Richard T. Mccoy, Robert Frank
Robert Frank
We present three methods for weighting edit distance algorithms based on linguistic information. These methods base their penalties on (i) phonological features, (ii) distributional character embeddings, or (iii) differences between cognate words. We also introduce a novel method for evaluating edit distance through the task of low-resource word alignment by using edit-distance neighbors in a high-resource pivot language to inform alignments from the low-resource language. At this task, the cognate-based scheme outperforms our other methods and the Levenshtein edit distance baseline, showing that NLP applications can benefit from information about cross-linguistic phonological patterns.
Jabberwocky Parsing: Dependency Parsing With Lexical Noise, Jungo Kasai, Robert Frank
Jabberwocky Parsing: Dependency Parsing With Lexical Noise, Jungo Kasai, Robert Frank
Robert Frank
Parsing models have long benefited from the use of lexical information, and indeed current state-of-the art neural network models for dependency parsing achieve substantial improvements by benefiting from distributed representations of lexical information. At the same time, humans can easily parse sentences with unknown or even novel words, as in Lewis Carroll’s poem Jabberwocky. In this paper, we carry out jabberwocky parsing experiments, exploring how robust a state-of-the-art neural network parser is to the absence of lexical information. We find that current parsing models, at least under usual training regimens, are in fact overly dependent on lexical information, and perform …
Acoustic Classification Of Focus: On The Web And In The Lab, Jonathan Howell, Mats Rooth, Michael Wagner
Acoustic Classification Of Focus: On The Web And In The Lab, Jonathan Howell, Mats Rooth, Michael Wagner
Jonathan Howell
General Analysis Of An Online Language Corpus, Kerwin A. Livingstone
General Analysis Of An Online Language Corpus, Kerwin A. Livingstone
Kerwin A. Livingstone
Corpus-based research is rapidly gaining ground in the field of Applied Linguistics. More interesting is the evidence of many online language corpora which can be easily accessed, with just the click of the mouse. A quick navigation of the Web will produce different kinds of corpora in a vast number of language areas. Given the need to find new and exciting ways to improve the language learning and teaching process, corpus linguistics does have potential for generating significant learner experiences. Taking into consideration the above-mentioned, this paper deals with the general analysis of an online language corpus. The specific corpus …
Linguistics As Structure In Computer Animation: Toward A More Effective Synthesis Of Brow Motion In American Sign Language, Rosalee Wolfe, Peter Cook, John C. Mcdonald, Jerry Schnepp
Linguistics As Structure In Computer Animation: Toward A More Effective Synthesis Of Brow Motion In American Sign Language, Rosalee Wolfe, Peter Cook, John C. Mcdonald, Jerry Schnepp
Jerry C Schnepp
Computer-generated three-dimensional animation holds great promise for synthesizing utterances in American Sign Language (ASL) that are not only grammatical, but well tolerated by members of the Deaf community. Unfortunately, animation poses several challenges stemming from the necessity of grappling with massive amounts of data. However, the linguistics of ASL can aid in surmounting the challenge by providing structure and rules for organizing animation data. An exploration of the linguistic and extra linguistic behavior of the brows from an animator’s viewpoint yields a new approach for synthesizing nonmanuals that differs from the conventional animation of anatomy and instead offers a different …
Towards News Verification: Deception Detection Methods For News Discourse, Victoria Rubin, Niall Conroy, Yimin Chen
Towards News Verification: Deception Detection Methods For News Discourse, Victoria Rubin, Niall Conroy, Yimin Chen
Victoria Rubin
News verification is a process of determining whether a particular news report is truthful or deceptive. Deliberately deceptive (fabricated) news creates false conclusions in the readers’ minds. Truthful (authentic) news matches the writer’s knowledge. How do you tell the difference between the two in an automated way? To investigate this question, we analyzed rhetorical structures, discourse constituent parts and their coherence relations in deceptive and truthful news sample from NPR’s “Bluff the Listener”. Subsequently, we applied a vector space model to cluster the news by discourse feature similarity, achieving 63% accuracy. Our predictive model is not significantly better than chance …
Predicting Survey Responses: How And Why Semantics Shape Survey Statistics On Organizational Behaviour, Ketil Arnulf, Kai R. Larsen, Øyvind Martinsen, Chih How Bong
Predicting Survey Responses: How And Why Semantics Shape Survey Statistics On Organizational Behaviour, Ketil Arnulf, Kai R. Larsen, Øyvind Martinsen, Chih How Bong
Kai R.T. Larsen
Some disciplines in the social sciences rely heavily on collecting survey responses to detect empirical relationships among variables. We explored whether these relationships were a priori predictable from the semantic properties of the survey items, using language processing algorithms which are now available as new research methods. Language processing algorithms were used to calculate the semantic similarity among all items in state-of-the-art surveys from Organisational Behaviour research. These surveys covered areas such as transformational leadership, work motivation and work outcomes. This information was used to explain and predict the response patterns from real subjects. Semantic algorithms explained 60–86% of the …
What's In A Letter?, Aaron J. Schein
What's In A Letter?, Aaron J. Schein
Aaron J Schein
Sentiment analysis is a burgeoning field in natural language processing used to extract and categorize opinion in evaluative documents. We look at recommendation letters, which pose unique challenges to standard sentiment analysis systems. Our dataset is eighteen letters from applications to UMass Worcester Memorial Medical Center’s residency program in Obstetrics and Gynecology. Given a small dataset, we develop a method intended for use by domain experts to systematically explore their intuitions about the topical make-up of documents on which they make critical decisions. By leveraging WordNet and the WordNet Propagation algorithm, the method allows a user to develop topic seed …
Using Textual Features To Predict Popular Content On Digg, Paul H. Miller
Using Textual Features To Predict Popular Content On Digg, Paul H. Miller
Paul H Miller
Over the past few years, collaborative rating sites, such as Netflix, Digg and Stumble, have become increasingly prevalent sites for users to find trending content. I used various data mining techniques to study Digg, a social news site, to examine the influence of content on popularity. What influence does content have on popularity, and what influence does content have on users’ decisions? Overwhelmingly, prior studies have consistently shown that predicting popularity based on content is difficult and maybe even inherently impossible. The same submission can have multiple outcomes and content neither determines popularity, nor individual user decisions. My results show …
Computational Style Processing, Foaad Khosmood
Computational Style Processing, Foaad Khosmood
Foaad Khosmood
Our main thesis is that computational processing of natural language styles can be accomplished using corpus analysis methods and language transformation rules. We demonstrate this first by statistically modeling natural language styles, and second by developing tools that carry out style processing, and finally by running experiments using the tools and evaluating the results. Specifically, we present a model for style in natural languages, and demonstrate style processing in three ways: Our system analyzes styles in quantifiable terms according to our model (analysis), associates documents based on stylistic similarity to known corpora (classification) and manipulates texts to match a desired …
Prosodylab-Aligner: A Tool For Forced Alignment Of Laboratory Speech, Kyle Gorman, Jonathan Howell, Michael Wagner
Prosodylab-Aligner: A Tool For Forced Alignment Of Laboratory Speech, Kyle Gorman, Jonathan Howell, Michael Wagner
Jonathan Howell
The Variable Elision Of Unstressed Vowels In European Portuguese: A Case Study, David James Silva
The Variable Elision Of Unstressed Vowels In European Portuguese: A Case Study, David James Silva
David Silva