Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 44

Full-Text Articles in Physical Sciences and Mathematics

Decision Tree Ensemble: Small Heterogeneous Is Better Than Large Homogeneous, Mike Gashler, Christophe G. Giraud-Carrier, Tony R. Martinez Dec 2008

Decision Tree Ensemble: Small Heterogeneous Is Better Than Large Homogeneous, Mike Gashler, Christophe G. Giraud-Carrier, Tony R. Martinez

Faculty Publications

Using decision trees that split on randomly selected attributes is one way to increase the diversity within an ensemble of decision trees. Another approach increases diversity by combining multiple tree algorithms. The random forest approach has become popular because it is simple and yields good results with common datasets. We present a technique that combines heterogeneous tree algorithms and contrast it with homogeneous forest algorithms. Our results indicate that random forests do poorly when faced with irrelevant attributes, while our heterogeneous technique handles them robustly. Further, we show that large ensembles of random trees are more susceptible to diminishing returns …


Learning-Based Fusion For Data Deduplication, Sabra Dinerstein, Parris K. Egbert, Stephen W. Clyde, Jared Dinerstein Dec 2008

Learning-Based Fusion For Data Deduplication, Sabra Dinerstein, Parris K. Egbert, Stephen W. Clyde, Jared Dinerstein

Faculty Publications

Rule-based deduplication utilizes expert domain knowledge to identify and remove duplicate data records. Achieving high accuracy in a rule-based system requires the creation of rules containing a good combination of discriminatory clues. Unfortunately, accurate rule-based deduplication often requires significant manual tuning of both the rules and the corresponding thresholds. This need for manual tuning reduces the efficacy of rule-based deduplication and its applicability to real-world data sets. No adequate solution exists for this problem. We propose a novel technique for rule-based deduplication. We apply individual deduplication rules, and combine the resultant match scores via learning-based information fusion. We show empirically …


Nowhere To Hide: Finding Plagiarized Documents Based On Sentence Similarity, Nathaniel Gustafson, Yiu-Kai D. Ng, Maria Soledad Pera Dec 2008

Nowhere To Hide: Finding Plagiarized Documents Based On Sentence Similarity, Nathaniel Gustafson, Yiu-Kai D. Ng, Maria Soledad Pera

Faculty Publications

Plagiarism is a serious problem that infringes copyrighted documents/materials, which is an unethical practice and decreases the economic incentive received by authors (owners) of the original copies. Unfortunately, plagiarism is getting worse due to the increasing number of online publications on the Web, which facilitates locating and paraphrasing information. In solving this problem, we propose a novel plagiarism-detection method, called SimPaD, which (i) establishes the degree of resemblance between any two documents D1 and D2 based on their sentence-to-sentence similarity computed by using pre-defined word-correlation factors, and (ii) generates a graphical view of sentences that are similar (or the same) …


Sequence Alignment With Traceback On Reconfigurable Hardware, Scott Lloyd, Quinn O. Snell Dec 2008

Sequence Alignment With Traceback On Reconfigurable Hardware, Scott Lloyd, Quinn O. Snell

Faculty Publications

Biological sequence alignment is an essential tool used in molecular biology and biomedical applications. The growing volume of genetic data and the complexity of sequence alignment present a challenge in obtaining alignment results in a timely manner. Known methods to accelerate alignment on reconfigurable hardware only address sequence comparison, limit the sequence length, or exhibit memory and I/O bottlenecks. A space-efficient, global sequence alignment algorithm and architecture is presented that accelerates the forward scan and traceback in hardware without memory and I/O limitations. With 256 processing elements in FPGA technology, a performance gain over 300 times that of a desktop …


Antiphase Ordering And Surface Phases In Lithium Aluminate, Richard R. Vanfleet, J. A. Simmons, D. W. Hill, M. M. C. Chou, B. H. Chai Nov 2008

Antiphase Ordering And Surface Phases In Lithium Aluminate, Richard R. Vanfleet, J. A. Simmons, D. W. Hill, M. M. C. Chou, B. H. Chai

Faculty Publications

Antiphase domains are seen in single crystal gamma lithium aluminate (gamma-LiAlO2) with 16.7 nm periodicity in the <110> direction. Alternate domains have a (1/2) [001] shift. Beta phase lithium aluminate (beta-LiAlO2) is seen to form on the surface of the as-received wafers with an epitaxial strain limited relationship with the bulk gamma phase. The orthorhombic beta phase aligns with the a and b axes (0.528 and 0.630 nm) matching with the tetragonal gamma phase's a and c axes (0.5168 and 0.6268 nm). The gamma and beta phases are seen to have different etch rates. The beta phase converts back to the …


Extreme-Ultraviolet Polarimeter Utilizing Laser-Generated High-Order Harmonics, Nicole Brimhall, Matthew Turner, Nicholas Herrick, David D. Allred, R. Steven Turley, Michael Ware, Justin Peatross Oct 2008

Extreme-Ultraviolet Polarimeter Utilizing Laser-Generated High-Order Harmonics, Nicole Brimhall, Matthew Turner, Nicholas Herrick, David D. Allred, R. Steven Turley, Michael Ware, Justin Peatross

Faculty Publications

We describe an extreme-ultraviolet (EUV) polarimeter that employs laser-generated high-order harmonics as the light source. The polarimeter is designed to characterize materials and thin films for use with EUV light. Laser high harmonics are highly directional with easily rotatable linear polarization, not typically available with other EUV sources. The harmonics have good wavelength coverage, potentially spanning the entire EUV from a few to a hundred nanometers. Our instrument is configured to measure reflectances from 14 to 30 nm and has ~180 spectral resolution (lambda/delta lambda). The reflection from a sample surface can be measured over a continuous range of incident …


Using Vagueness Measures To Re-Rank Documents Retrieved By A Fuzzy Set Information Retrieval Model, Stephen Lynn, Yiu-Kai D. Ng Oct 2008

Using Vagueness Measures To Re-Rank Documents Retrieved By A Fuzzy Set Information Retrieval Model, Stephen Lynn, Yiu-Kai D. Ng

Faculty Publications

Traditional information retrieval (IR) systems evaluate user queries and retrieve/rank documents based on matching keywords in user queries with words in documents. These exact word-matching and ranking approaches ignore too many relevant documents that do not contain the exact keywords as specified in a user query. Instead of considering these traditional approaches, we propose to retrieve documents using a fuzzy set IR model and rank retrieved documents for any vague query using the “vagueness score” of the documents based on the word senses as defined in WordNet. Using the vagueness scores, we rank the most highest “relevant” documents of a …


Enhancement Of Unusual Color In Aerial Video Sequences For Assisting Wilderness Search And Rescue, Bryan S. Morse, Nathan D. Rasmussen, Daniel Thornton Oct 2008

Enhancement Of Unusual Color In Aerial Video Sequences For Assisting Wilderness Search And Rescue, Bryan S. Morse, Nathan D. Rasmussen, Daniel Thornton

Faculty Publications

The use of aerial video for search and surveillance has been popularized by the increased use of camera-equipped unmanned aerial vehicles. For many search applications, objects may also be missed by observers due to their small size, brief visibility, or the inherent monotony of the scene. This paper presents a novel method for automatically emphasizing unusually colored objects to improve their detectability. We use a hue histogram and a local saliency measure to find unusually colored objects, then boost the saturation of these objects while desaturating more common colors, thus drawing the observer’s attention and facilitating video search.


Scalable Multicast Routing For Ad Hoc Networks, Manoj Pandey, Daniel Zappala Oct 2008

Scalable Multicast Routing For Ad Hoc Networks, Manoj Pandey, Daniel Zappala

Faculty Publications

Routing in a mobile ad hoc network is challenging because nodes can move at any time, invalidating a previously-discovered route. Multicast routing is even more challenging, because a source needs to maintain a route to potentially many group members simultaneously. Providing scalable solutions to this problem typically requires building a hierarchy or an overlay network to reduce the cost of route discovery and maintenance. In this paper, we show that a much simpler alternative is possible, by using source specific semantics and relying on the unicast routing protocol to find all routes. This separation of concerns enables the multicast routing …


Hop-By-Hop Multicast Transport For Mobile Ad Hoc Wireless Networks, Manoj Pandey, Daniel Zappala Oct 2008

Hop-By-Hop Multicast Transport For Mobile Ad Hoc Wireless Networks, Manoj Pandey, Daniel Zappala

Faculty Publications

Multicast transport is a challenging problem because the source must provide congestion control and reliability for a tree, rather than a single path. This problem is made even more difficult in mobile ad hoc networks due to problems caused by contention, spatial reuse, and mobility. In this paper, we design a hop-by-hop multicast transport protocol, which pushes transport functionality into the core of the network. Although this requires per-flow state, a hop-by-hop approach simplifies congestion control, enables local recovery of lost packets, and provides low delay and efficient use of wireless capacity. We use a simulation study to demonstrate the …


Autonomous And Intelligent Radio Switching For Heterogeneous Wireless Networks, Qiuyi Duan, Charles D. Knutson, Lei Wang, Daniel Zappala Sep 2008

Autonomous And Intelligent Radio Switching For Heterogeneous Wireless Networks, Qiuyi Duan, Charles D. Knutson, Lei Wang, Daniel Zappala

Faculty Publications

As wireless devices continue to become more prevalent, heterogeneous wireless networks - in which communicating devices have at their disposal multiple types of radios - will become the norm. Communication between nodes in these networks ought to be as simple as possible; they should be able to seamlessly switch between different radios and network stacks on the fly in order to better serve the user. To make this a possibility, we consider the challenging problems of when two communicating devices should decide to switch to a different radio, and which radio they should choose. We design an Autonomous and Intelligent …


Improving Live Sequence Chart To Automata Transformation For Verification, Rahul Kumar, Eric G. Mercer Aug 2008

Improving Live Sequence Chart To Automata Transformation For Verification, Rahul Kumar, Eric G. Mercer

Faculty Publications

This paper presents a Live Sequence Chart (LSC) to automata transformation algorithm that enables the verification of communication protocol implementations. Using this LSC to automata transformation a communication protocol implementation can be verified using a single verification run as opposed to previous techniques that rely on a three stage verification approach. The novelty and simplicity of the transformation algorithm lies in its placement of accept states in the automata generated from the LSC. We present in detail an example of the transformation as well as the transformation algorithm. Further, we present a detailed analysis and an empirical study comparing the …


Recombination Fluorescence In Ultracold Neutral Plasmas, Scott D. Bergeson, F. Robicheaux Aug 2008

Recombination Fluorescence In Ultracold Neutral Plasmas, Scott D. Bergeson, F. Robicheaux

Faculty Publications

We present the first measurements and simulations of recombination fluorescence from ultracold neutral calcium plasmas. This method probes three-body recombination at times less than 1 µs, shorter than previously published time scales. For the lowest initial electron temperatures, the recombination rate scales with the density as n22, significantly slower than the predicted n3. Recombination fluorescence opens a new diagnostic window in ultracold plasmas. In most cases it probes deeply bound level populations that depend critically on electron energetics. However, a perturbation in the calcium 4snd Rydberg series allows our fluorescence measurements to probe the population in weakly bound levels that …


Watertight Trimmed Nurbs, Thomas W. Sederberg, Xin Li, Hongwei Lin, Heather Ipson Aug 2008

Watertight Trimmed Nurbs, Thomas W. Sederberg, Xin Li, Hongwei Lin, Heather Ipson

Faculty Publications

This paper addresses the long-standing problem of the unavoidable gaps that arise when expressing the intersection of two NURBS surfaces using conventional trimmed-NURBS representation. The solution converts each trimmed NURBS into an untrimmed T-Spline, and then merges the untrimmed T-Splines into a single, watertight model. The solution enables watertight fillets of NURBS models, as well as arbitrary feature curves that do not have to follow isoparameter curves. The resulting T-Spline representation can be exported without error as a collection of NURBS surfaces.


An Ultrahigh Stability, Low-Noise Laser Current Driver With Digital Control, Christopher J. Erickson, Marshall Van Zijll, Greg Doermann, Dallin S. Durfee Jul 2008

An Ultrahigh Stability, Low-Noise Laser Current Driver With Digital Control, Christopher J. Erickson, Marshall Van Zijll, Greg Doermann, Dallin S. Durfee

Faculty Publications

We present a low-noise, high modulation-bandwidth design for a laser current driver with excellent long-term stability. The driver improves upon the commonly used Hall–Libbrecht design. The current driver can be operated remotely by way of a microprocessing unit, which controls the current set point digitally. This allows precise repeatability and improved accuracy and stability. It also allows the driver to be placed near the laser for reduced noise and for lower phase lag when using the modulation input. We present the theory of operation for our driver in detail, and give a thorough characterization of its stability, noise, set-point accuracy …


A Review Of Fibroblast Populated Collagen Lattices, J. C. Dallon, Paul H. Ehrlich Jul 2008

A Review Of Fibroblast Populated Collagen Lattices, J. C. Dallon, Paul H. Ehrlich

Faculty Publications

Bellaes introduction of the fibroblast populated collagen lattice (FPCL) (1) has facilitated the study of collagen-cell interactions. As a result of the numerous modifications of the casting of FPCL's, the in vivo applications of these in vitro findings has been confusing. Here experimental FPCL contraction findings are viewed in regard to three proposed mechanisms responsible for lattice contraction. The cellular mechanisms responsible for generating FPCL contraction are: cell contraction, cell tractional forces related to cell locomotion, and initial cell elongation and spreading.


Data-Driven Programming And Behavior For Autonomous Virtual Characters, Jonathan Dinerstein, Parris K. Egbert, Michael A. Goodrich, Dan A. Ventura Jul 2008

Data-Driven Programming And Behavior For Autonomous Virtual Characters, Jonathan Dinerstein, Parris K. Egbert, Michael A. Goodrich, Dan A. Ventura

Faculty Publications

In the creation of autonomous virtual characters, two levels of autonomy are common. They are often called motion synthesis (low-level autonomy) and behavior synthesis (high-level autonomy), where an action (i.e. motion) achieves a short-term goal and a behavior is a sequence of actions that achieves a long-term goal. There exists a rich literature addressing many aspects of this general problem (and it is discussed in the full paper). In this paper we present a novel technique for behavior (high-level) autonomy and utilize existing motion synthesis techniques. Creating an autonomous virtual character with behavior synthesis abilities frequently includes three stages: forming …


The Role Of Upstream Sequences In Selecting The Reading Frame On Tmrna, Allen R. Buskirk, Mickey R. Miller, David W. Healey, Jonathan D. Dewey, Stephen G. Robison Jun 2008

The Role Of Upstream Sequences In Selecting The Reading Frame On Tmrna, Allen R. Buskirk, Mickey R. Miller, David W. Healey, Jonathan D. Dewey, Stephen G. Robison

Faculty Publications

tmRNA acts first as a tRNA and then as an mRNA to rescue stalled ribosomes in eubacteria. Two unanswered questions about tmRNA function remain: how does tmRNA, lacking an anticodon, bypass the decoding machinery and enter the ribosome? Secondly, how does the ribosome choose the proper codon to resume translation on tmRNA? According to the -1 triplet hypothesis, the answer to both questions lies in the unique properties of the three nucleotides upstream of the first tmRNA codon. These nucleotides assume an A-form conformation that mimics the codon-anticodon interaction, leading to recognition by the decoding center and choice of the …


Algorithm For Generating Derivative Structures, Gus L. W. Hart, Rodney W. Forcade Jun 2008

Algorithm For Generating Derivative Structures, Gus L. W. Hart, Rodney W. Forcade

Faculty Publications

We present an algorithm for generating all derivative superstructures--for arbitrary parent structures and for any number of atom types. This algorithm enumerates superlattices and atomic configurations in a geometry-independent way. The key concept is to use the quotient group associated with each superlattice to determine all unique atomic configurations. The run time of the algorithm scales linearly with the number of unique structures found.


Link Quality Prediction For Wireless Devices With Multiple Radios, Qiuyi Duan, Charles D. Knutson, Lei Wang, Daniel Zappala Jun 2008

Link Quality Prediction For Wireless Devices With Multiple Radios, Qiuyi Duan, Charles D. Knutson, Lei Wang, Daniel Zappala

Faculty Publications

Communication between wireless devices ought to be as simple as possible; they should be able to seamlessly switch between different radios and network stacks on the fly in order to better serve the user. To make this a possibility, we consider the challenging problem of predicting link quality in a changing mobile environment. In this paper we present an algorithm that uses Weighted Least Squares Regression to predict whether a given link can meet application requirements in terms of throughput, delay, and jitter. We use a simulation study to demonstrate that our algorithm is able to predict link quality accurately …


Assessing The Costs Of Sampling Methods In Active Learning For Annotation, James Carroll, Robbie Haertel, Peter Mcclanahan, Eric K. Ringger, Kevin Seppi Jun 2008

Assessing The Costs Of Sampling Methods In Active Learning For Annotation, James Carroll, Robbie Haertel, Peter Mcclanahan, Eric K. Ringger, Kevin Seppi

Faculty Publications

Traditional Active Learning (AL) techniques assume that the annotation of each datum costs the same. This is not the case when annotating sequences; some sequences will take longer than others. We show that the AL technique which performs best depends on how cost is measured. Applying an hourly cost model based on the results of an annotation user study, we approximate the amount of time necessary to annotate a given sentence. This model allows us to evaluate the effectiveness of AL sampling methods in terms of time spent in annotation. We acheive a 77% reduction in hours from a random …


Or Best Offer: A Privacy Policy Negotiation Protocol, Eric G. Mercer, Kent E. Seamons, Daniel D. Walker Jun 2008

Or Best Offer: A Privacy Policy Negotiation Protocol, Eric G. Mercer, Kent E. Seamons, Daniel D. Walker

Faculty Publications

Privacy policy languages, such as P3P, allow websites to publish their privacy practices and policies in machine readable form. Currently, software agents designed to protect users’ privacy follow a “take it or leave it” approach that is inflexible and gives the server ultimate control. Privacy policy negotiation is one approach to leveling the playing field by allowing a client to negotiate with a server to determine how that server collects and uses the client’s data. We present a privacy policy negotiation protocol, “Or Best Offer”, that includes a formal model for specifying privacy preferences and reasoning about privacy policies. The …


Application And Evaluation Of Spatiotemporal Enhancement Of Live Aerial Video Using Temporally Local Mosaics, Dennis Eggett, Cameron Engh, Damon Gerhardt, Michael A. Goodrich, Bryan S. Morse, Nathan Rasmussen, Daniel Thornton Jun 2008

Application And Evaluation Of Spatiotemporal Enhancement Of Live Aerial Video Using Temporally Local Mosaics, Dennis Eggett, Cameron Engh, Damon Gerhardt, Michael A. Goodrich, Bryan S. Morse, Nathan Rasmussen, Daniel Thornton

Faculty Publications

Camera-equipped mini-UAVs are popular for many applications, including search and surveillance, but video from them is commonly plagued with distracting jittery motions and disorienting rotations that make it difficult for human viewers to detect objects of interest and infer spatial relationships. For time-critical search situations there are also inherent tradeoffs between detection and search speed. These problems make the use of dynamic mosaics to expand the spatiotemporal properties of the video appealing. However, for many applications it may not be necessary to maintain full mosaics of all of the video but to mosaic and retain only a number of recent …


The Enigmatic Young Object: Walker 90/V590 Monocerotis, M. D. Joner, M. R. Perez, B. Mccollum, M. E. Van Dend Ancker May 2008

The Enigmatic Young Object: Walker 90/V590 Monocerotis, M. D. Joner, M. R. Perez, B. Mccollum, M. E. Van Dend Ancker

Faculty Publications

Aims. We assess the evolutionary status of the intriguing object Walker 90/V590 Mon, which is located about 20 arcmin northwest of the Cone Nebula near the center of the open cluster NGC 2264. This object, according to its most recent optical spectral type determination (B7), which we confirmed, is at least 3 mag too faint in V for the cluster distance, but it shows the classical signs of a young pre-main sequence object, such as highly variable H emission, Mg II emission, IR excess, UV continuum, and optical variability. Methods. We analyzed a collection of archival and original data on …


On The Steering Of Sound Energy Through A Supercritical Plate By A Near-Field Transducer Array, Brian E. Anderson, Stephen A. Hambric, Jack W. Hughes May 2008

On The Steering Of Sound Energy Through A Supercritical Plate By A Near-Field Transducer Array, Brian E. Anderson, Stephen A. Hambric, Jack W. Hughes

Faculty Publications

The ability to direct sound energy through the flexural vibrations of a submerged plate at various angles of incidence using a near-field transducer array is investigated. An alumina bar is placed in front of a one-dimensional, eight-element transducer array, between the array and the water. Operating in a receive mode, data were taken as a function of angle of incidence and compared to data taken without the presence of the alumina bar. The array was also operated in transmit mode and results were compared to corresponding receive mode data, showing that reciprocity holds. Results show that in fact sound energy …


Metallicity And Effective Temperature Of The Secondary Or Rs Ophicuhi, R. L. Pearson Iii, Ya. V. Pavlenko, A. Evans, T. Kerr, L. Yakovina, C. E. Woodward, D. Lynch, R. Rudy, R. W. Russell Apr 2008

Metallicity And Effective Temperature Of The Secondary Or Rs Ophicuhi, R. L. Pearson Iii, Ya. V. Pavlenko, A. Evans, T. Kerr, L. Yakovina, C. E. Woodward, D. Lynch, R. Rudy, R. W. Russell

Faculty Publications

Context. The recurrent nova RS Ophiuchi undergoes nova eruptions every 10-20 years as a result of thermonuclear runaway on the surface of a white dwarf close to the Chandrasekhar limit. Both the progress of the eruption and its aftermath depend on the (poorly known) composition of the red giant in the RS Oph system. Aims. Our aim is to understand better the effect of the giant secondary on the recurrent nova eruption. Methods. Synthetic spectra were computed for a grid of M-giant model atmospheres having a range of effective temperatures 3200 < Teff < 4400 K, gravities 0 < log g < 1 and abundances -4 < [Fe/H] < 0.5, and fit to infrared spectra of RS Oph as it returned to quiescence after its 2006 eruption. We have modelled the infrared spectrum in the range 1.4-2.5µm to determine metallicity and effective temperature of the red giant. Results. We find Teff= 4100 ±100 K, log g = 0.0 ±0.5, [Fe/H] = 0.0 ±0.5, [C/H] = -0.8 ±0.2, [N/H] = +0.6 ±0.3 in the atmosphere of the secondary, and demonstrate that inclusion of some dust "veiling" in the spectra cannot improve our fits.


Skill Evaluation In Women's Volleyball, Lindsay W. Florence, Gilbert W. Fellingham, Pat R. Vehrs, Nina P. Mortensen Apr 2008

Skill Evaluation In Women's Volleyball, Lindsay W. Florence, Gilbert W. Fellingham, Pat R. Vehrs, Nina P. Mortensen

Faculty Publications

The Brigham Young University Women's Volleyball Team recorded and rated all skills (pass, set, attack, etc.) and recorded rally outcomes (point for BYU, rally continues, point for opponent) for the entire 2006 home volleyball season. Only sequences of events occurring on BYU's side of the net were considered. Events followed one of these general patterns: serve-outcome, pass-set-attack-outcome, or block-dig-set-attack-outcome. These sequences of events were assumed to be first-order Markov chains where the quality of each contact depended only on the quality of the previous contact but not explicitly on contacts further removed in the sequence. We represented these sequences in …


Comment On “Contact Conditions For The Charge In The Theory Of The Electrical Double Layer”, Douglas Henderson, L. B. Bhuiyan Mar 2008

Comment On “Contact Conditions For The Charge In The Theory Of The Electrical Double Layer”, Douglas Henderson, L. B. Bhuiyan

Faculty Publications

Exact results in any field, including statistical mechanics, are both aesthetically pleasing and very valuable in assessing theoretical approximations.


Compiling And Annotating A Syriac Corpus, George Busby, James Carroll, Marc Carmen, Carl Griffin, Robbie Haertel, Kristian Heal, Joshua Heaton, Deryle W. Lonsdale, Peter Mcclanahan, Eric K. Ringger, Kevin Seppi, David Taylor Mar 2008

Compiling And Annotating A Syriac Corpus, George Busby, James Carroll, Marc Carmen, Carl Griffin, Robbie Haertel, Kristian Heal, Joshua Heaton, Deryle W. Lonsdale, Peter Mcclanahan, Eric K. Ringger, Kevin Seppi, David Taylor

Faculty Publications

PDF of Powerpoint Presentation on compiling and annotating a Syriac corpus. This presentation was given at the Conference of the American Association for Corpus Linguistics in 2008.


Accelerating Corpus Annotation Through Active Learning, George Busby, Marc Carmen, James Carroll, Robbie Haertel, Deryle W. Lonsdale, Peter Mcclanahan, Eric K. Ringger, Kevin Seppi Mar 2008

Accelerating Corpus Annotation Through Active Learning, George Busby, Marc Carmen, James Carroll, Robbie Haertel, Deryle W. Lonsdale, Peter Mcclanahan, Eric K. Ringger, Kevin Seppi

Faculty Publications

PDF of Powerpoint Presentation on accelerating corpus annotation through active learning. This presentation was given at the Conference of the American Association for Corpus Linguistics in 2008.