Open Access. Powered by Scholars. Published by Universities.®

Social and Behavioral Sciences Commons

Articles 1 - 4 of 4

Full-Text Articles in Social and Behavioral Sciences

Identification Of Informativeness In Text Using Natural Language Stylometry, Rushdi Shams Aug 2014

Electronic Thesis and Dissertation Repository

In this age of information overload, one experiences a rapidly growing over-abundance of written text. To assist with handling this bounty, this plethora of texts is now widely used to develop and optimize statistical natural language processing (NLP) systems. Surprisingly, the use of more fragments of text to train these statistical NLP systems may not necessarily lead to improved performance. We hypothesize that those fragments that help the most with training are those that contain the desired information. Therefore, determining informativeness in text has become a central issue in our view of NLP. Recent developments in this field have spawned …


The Relationship Between Implicit And Explicit Processing In Statistical Language Learning, Nicolette B. Noonan Jun 2014

Electronic Thesis and Dissertation Repository

Statistical language learning is an implicit process wherein language learners track sequential statistics in fluent speech, and it may facilitate the learning of word boundaries. This process is well studied; however, the cognitive mechanisms supporting it remain poorly understood. The present thesis investigated whether domain-specific or cross-domain explicit working memory engagement would impair implicit statistical learning of word boundaries in fluent speech. Participants (n = 110) were exposed to an implicit statistical word segmentation paradigm while concurrently engaged in no other task (control), or an explicit domain-specific (verbal) or cross-domain (visuospatial) working memory task of either low- or high- …


Cosine Similarity For Article Section Classification: Using Structured Abstracts As A Proxy For An Annotated Corpus, Arthur T. Bugorski Jun 2014

Electronic Thesis and Dissertation Repository

During the last decade, the amount of research published in biomedical journals has grown significantly and at an accelerating rate. To fully explore all of this literature, new tools and techniques are needed for both information retrieval and processing. One such tool is the identification and extraction of key claims. In an effort to work toward claim-extraction, we aim to identify the key areas in the body of the article referred to by text in the abstract. In this project, our work is preliminary to that goal in that we attempt to match specific clauses in the abstract with …


A Bayesian Model Of Stress Assignment In Reading, Olessia Jouravlev Jan 2014

Electronic Thesis and Dissertation Repository

The goal of the present thesis was to introduce a Bayesian model of stress assignment in reading. According to this model, readers compute probabilities of stress patterns by assessing prior beliefs about the likelihoods of stress patterns in a language and combining that information with non-lexical evidence for stress patterns provided by the word. The choice of a response is thought of as a random walk-type process which takes the system from a starting point to a response boundary. The calculated Bayesian probabilities determine the drift rate towards each boundary such that the probability of an error and the response …