Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Computer Sciences

Theses and Dissertations

Annotation

Full-Text Articles in Physical Sciences and Mathematics

Pseudo-Data Generation For Improving Clinical Named Entity Recognition, Jeffrey T. Smith Jan 2020

Theses and Dissertations

One of the primary challenges for clinical Named Entity Recognition (NER) is the availability of annotated training data. Technical and legal hurdles prevent the creation and release of corpora related to electronic health records (EHRs). In this work, we look at the impact of pseudo-data generation on clinical NER using gazetteering and thresholding with a neural network model. We report that gazetteers can result in the inclusion of proper terms with the exclusion of determiners and pronouns in preceding and middle positions. Gazetteers that had higher numbers of terms inclusive to the original dataset had a higher impact. We also …


The Annotation Cost Of Context Switching: How Topic Models And Active Learning [May Not] Work Together, Nozomu Okuda Aug 2017

Theses and Dissertations

The labeling of language resources is a time-consuming task, whether aided by machine learning or not. Much of the prior work in this area has focused on accelerating human annotation in the context of machine learning, yielding a variety of active learning approaches. Most of these attempt to lead an annotator to label the items which are most likely to improve the quality of an automated, machine learning-based model. These active learning approaches seek to understand the effect of item selection on the machine learning model, but give significantly less emphasis to the effect of item selection on the …


Technique And Cue Selection For Graphical Presentation Of Generic Hyperdimensional Data, Lee Mont Howard Jun 2012

Theses and Dissertations

The process of visualizing n-D data presents the user with four problems: finding a hyperdimensional graphics package capable of rendering n-D data, finding a suitable presentation technique supported by the package that allows insight to be gained, using the provided user interface to interact with the presentation technique to explore the information in the data, and finding a way to share the information gained with others. Many graphics packages have been written to solve the first problem. However, existing packages do not sufficiently solve the other three problems. A hyperdimensional graphics package that sufficiently solves all these problems simplifies the …


Pixel Based Note Taking Through Perceptual Structure Inference, Mitchell Kent Harris Oct 2010

Theses and Dissertations

Knowledge workers need effective annotation tools to assimilate information. Unfortunately, many digital annotators are limited in the range of documents that they accept. Those that do accept many different documents do so by converting documents to images, thus losing any awareness about the original content of the document. We introduce a digital note taker that is both universal and content aware. By constructing a hierarchical context tree of document images, the structure of a document is inferred from the image. This hierarchical context tree is shown to be useful by demonstrating how it facilitates selection of document elements, reflowing documents …


Interactive Football Summarization, Brandon B. Moon Dec 2009

Theses and Dissertations

Football fans do not have the time to watch every game in its entirety and need an effective solution that summarizes the story of the game for them. Human-generated summaries are often too short, requiring time and resources to create. We utilize the advantages of Interactive TV to create an automatic football summarization service that is cohesive, provides context, covers the necessary plays, and is concise. First, we construct a degree of interest function that ranks each play based on detailed, play-by-play game events as well as viewing statistics collected from an interactive viewing environment. This allows us to select the …


Obstacle Annotation By Demonstration, Michael David Clement Mar 2007

Theses and Dissertations

By observing human driving with a “digital head” (combined video camera and accelerometers) and taking a few hand annotations, we can automatically annotate regions in a robot's field of view that should be interpreted as obstacles to be avoided. This is accomplished by detecting the movement for a given frame in a video. Some hand annotations of video frames are necessary and they are used to create Probability Grids. Using the movement data and the Probability Grids, it is possible to annotate large amounts of video data quickly in an automated system.


Screencrayons: Using Screen Captures For Annotation And Research, Trent Alan Taufer Dec 2006

Theses and Dissertations

In a world full of digital information we should be able to easily collect, organize, annotate, and leverage information from many different sources. This should be easy to do and not interrupt our normal workflow. A system to support information collection and organization should be user-friendly and as unobtrusive as possible, while still allowing for flexible and intelligent annotation. It should also be able to leverage the inherent information content of a collection of annotated information. We present a system that will demonstrate how these ideas can come together to make information collection easier and more productive. The system facilitates …


On-Line Electronic Document Collaboration And Annotation, Trev R. Harmon Nov 2006

Theses and Dissertations

The Internet provides a powerful medium for communication and collaboration. The ability one has to connect and interact with web-based tools from anywhere in the world makes the Internet ideal for such tasks. However, the lack of native tools can be a hindrance when deploying collaborative initiatives, as many current projects require specialized software in order to operate. This thesis demonstrates that, with the comparatively recent advances in browser technology and Document Object Model (DOM) implementation, a web-based collaborative annotation system can be developed that can be accessed by a user through a standards-compliant web browser. Such a system, demonstrated to …