Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 41

Full-Text Articles in Physical Sciences and Mathematics

Bounding The Probability Of Error For High Precision Optical Character Recognition, Gary B. Huang, Andrew Kae, Carl Doersch, Erik G. Learned-Miller Feb 2012

Bounding The Probability Of Error For High Precision Optical Character Recognition, Gary B. Huang, Andrew Kae, Carl Doersch, Erik G. Learned-Miller

Erik G Learned-Miller

We consider a model for which it is important, early in proces sing, to estimate some variables with high precision, but perhaps at relatively low recall. I f some variables can be identified with near certainty, they can be conditioned upon, allowing furt her inference to be done efficiently. Specifically, we consider optical character recognition (O CR) systems that can be bootstrapped by identifying a subset of correctly translated document wo rds with very high precision. This “clean set” is subsequently used as document-specific train ing data. While OCR systems produce confidence measures for the identity of each letter or …


Background Modeling Using Adaptive Pixelwise Kernel Variances In A Hybrid Feature Space, Erik G. Learned-Miller, Manjunath Narayana, Allen Hanson Dec 2011

Background Modeling Using Adaptive Pixelwise Kernel Variances In A Hybrid Feature Space, Erik G. Learned-Miller, Manjunath Narayana, Allen Hanson

Erik G Learned-Miller

Recent work on background subtraction has shown de- velopments on two major fronts. In one, there has been increasing sophistication of probabilistic models, from mix- tures of Gaussians at each pixel [ 7 ], to kernel density esti- mates at each pixel [ 1 ], and more recently to joint domain- range density estimates that incorporate spatial informa- tion [ 6 ]. Another line of work has shown the benefits of increasingly complex feature representations, including the use of texture information, local binary patterns, and re- cently scale-invariant local ternary patterns [ 4 ]. In this work, we use joint …


Scene Text Recognition With Bilateral Regression, Jacqueline Feild, Erik G. Learned-Miller Dec 2011

Scene Text Recognition With Bilateral Regression, Jacqueline Feild, Erik G. Learned-Miller

Erik G Learned-Miller

This paper focuses on improving the recognition of text in images of natural scenes, such as storefront signs or street signs. This is a difficult problem due to lighting conditions, variation in font shape and color, and complex backgrounds. We present a word recognition system that addresses these difficulties using an innovative technique to extract and recognize foreground text in an image. First, we develop a new method, called bilateral regression, for extracting and modeling one coherent (although not necessarily contiguous) region from an image. The method models smooth color changes across an image region without being corrupted by neighboring …


Unsupervised Joint Alignment And Clustering Using Bayesian Nonparametrics, Marwan Mattar, Allen Hanson, Erik G. Learned-Miller Dec 2011

Unsupervised Joint Alignment And Clustering Using Bayesian Nonparametrics, Marwan Mattar, Allen Hanson, Erik G. Learned-Miller

Erik G Learned-Miller

Joint alignment of a collection of functions is the process of independently transforming the func- tions so that they appear more similar to each other. Typically, such unsupervised alignment al- gorithms fail when presented with complex data sets arising from multiple modalities or make re- strictive assumptions about the form of the func- tions or transformations, limiting their general- ity. We present a transformed Bayesian infinite mixture model that can simultaneously align and cluster a data set. Our model and associated learning scheme offer two key advantages: the optimal number of clusters is determined in a data-driven fashion through the …


Learning Hierarchical Representations For Face Verification, Gary B. Huang, Honglak Lee, Erik G. Learned-Miller Dec 2011

Learning Hierarchical Representations For Face Verification, Gary B. Huang, Honglak Lee, Erik G. Learned-Miller

Erik G Learned-Miller

Most modern face recognition systems rely on a feature representation given by a hand-crafted image descriptor, such as Local Binary Patterns (LBP), and achieve improved performance by combining several such representations. In this paper, we propose deep learning as a natural source for obtaining additional, complementary representations. To learn features in high-resolution images, we make use of convolutional deep belief networks. Moreover, to take advantage of global structure in an object class, we develop local convolutional restricted Boltzmann machines, a novel convolutional learning model that exploits the global structure by not assuming stationarity of features across the image, while maintaining …


Half-Wits: Software Techniques For Low-Voltage Probabilistic Storage On Microcontrollers With Nor Flash Memory., Mastooreh Salajegheh, Yue Wang, Anxiao Jiang, Erik G. Learned-Miller, Kevin Fu Dec 2011

Half-Wits: Software Techniques For Low-Voltage Probabilistic Storage On Microcontrollers With Nor Flash Memory., Mastooreh Salajegheh, Yue Wang, Anxiao Jiang, Erik G. Learned-Miller, Kevin Fu

Erik G Learned-Miller

No abstract provided.


Improvements In Joint Domain-Range Modeling For Background Subtraction, Manjunath Narayana, Allen Hanson, Erik G. Learned-Miller Dec 2011

Improvements In Joint Domain-Range Modeling For Background Subtraction, Manjunath Narayana, Allen Hanson, Erik G. Learned-Miller

Erik G Learned-Miller

In many algorithms for background modeling, a distribution over feature values is modeled at each pixel. These models, however, do not account for the dependencies that may exist among nearby pixels. The joint domain-range kernel density estimate (KDE) model by Sheikh and Shah [ 7 ], which is not a pixel-wise model, represents the background and foreground processes by combining the three color dimensions and two spatial dimensions into a five-dimensional joint space. The Sheikh and Shah model, as we will show, has a peculiar dependence on the size of the image. In contrast, we build three-dimensional color distributions at …


Enforcing Similarity Constraints With Integer Programming For Better Scene Text Recognition, David L. Smith, Jacqueline Feild, Erik G. Learned-Miller Dec 2010

Enforcing Similarity Constraints With Integer Programming For Better Scene Text Recognition, David L. Smith, Jacqueline Feild, Erik G. Learned-Miller

Erik G Learned-Miller

The recognition of text in everyday scenes is made dif- ficult by viewing conditions, unusual fonts, and lack of lin- guistic context. Most methods integrate a priori appear- ance information and some sort of hard or soft constraint on the allowable strings. Weinman and Learned-Miller [ 14 ] showed that the similarity among characters, as a supple- ment to the appearance of the characters with respect to a model, could be used to improve scene text recognition. In this work, we make further improvements to scene text recognition by taking a novel approach to the incorpora- tion of similarity. In …


Forensic Triage For Mobile Phones With Dec0de, Robert Walls, Brian N. Levine, Erik G. Learned-Miller Dec 2010

Forensic Triage For Mobile Phones With Dec0de, Robert Walls, Brian N. Levine, Erik G. Learned-Miller

Erik G Learned-Miller

We present DEC0DE, a system for recovering information from phones with unknown storage formats, a critical problem for forensic triage. Because phones have myr- iad custom hardware and software, we examine only the stored data. Via flexible descriptions of typical data struc- tures, and using a classic dynamic programming algo- rithm, we are able to identify call logs and address book entries in phones across varied models and manufactur- ers. We designed DEC0DE by examining the formats of one set of phone models, and we evaluate its performance on other models. Overall, we are able to obtain high performance for …


Distribution Fields, Laura Sevilla Lara, Erik G. Learned-Miller Dec 2010

Distribution Fields, Laura Sevilla Lara, Erik G. Learned-Miller

Erik G Learned-Miller

No abstract provided.


Online Domain-Adaptation Of A Pre-Trained Cascade Of Classifiers, Vidit Jain, Erik G. Learned-Miller Dec 2010

Online Domain-Adaptation Of A Pre-Trained Cascade Of Classifiers, Vidit Jain, Erik G. Learned-Miller

Erik G Learned-Miller

Many classifiers are trained with massive training sets only to be applied at test time on data from a different dis- tribution. How can we rapidly and simply adapt a classifier to a new test distribution, even when we do not have ac- cess to the original training data? We present an on-line approach for rapidly adapting a “black box” classifier to a new test data set without retraining the classifier or ex- amining the original optimization criterion. Assuming the original classifier outputs a continuous number for which a threshold gives the class, we reclassify points near the ori g- …


Exploiting Half-Wits: Smarter Storage For Low-Power Devices, Mastooreh Salajegheh, Yue Wang, Kevin Fu, Anxiao Jiang, Erik G. Learned-Miller Dec 2010

Exploiting Half-Wits: Smarter Storage For Low-Power Devices, Mastooreh Salajegheh, Yue Wang, Kevin Fu, Anxiao Jiang, Erik G. Learned-Miller

Erik G Learned-Miller

This work analyzes the stochastic behavior of writing to embedded flash memory at voltages lower than recom- mended by a microcontroller’s specifications to reduce energy consumption. Flash memory integrated within a microcontroller typically requires the entire chip to op- erate on common supply voltage almost double what the CPU portion requires. Our approach tolerates a lower supply voltage so that the CPU may operate in a more en- ergy efficient manner. Energy efficient coding algorithms then cope with flash memory that behaves unpredictably. Our software-only coding algorithms ( in-place writes , multiple-place writes , RS-Berger codes ) enable reliable storage …


Improving State-Of-The-Art Ocr Through High-Precision Document-Specific Modeling, Andrew Kae, Gary B. Huang, Carl Doersch, Erik G. Learned-Miller Dec 2009

Improving State-Of-The-Art Ocr Through High-Precision Document-Specific Modeling, Andrew Kae, Gary B. Huang, Carl Doersch, Erik G. Learned-Miller

Erik G Learned-Miller

Optical character recognition (OCR) remains a difficult problem for noisy documents or documents not scanned at high resolution. Many current approaches rely on stored font models that are vulnerable to cases in which the docu- ment is noisy or is written in a font dissimilar to the stored fonts. We address these problems by learning character models directly from the document itself, rather than using pre-stored font models. This method has had some success in the past, but we are able to achieve substantial improve- ment in error reduction through a novel method for creating nearly error-free document-specifictraining data and …


Reverse Engineering For Mobile Systems Forensics With Ares, John Tuttle, Robert J. Walls, Erik G. Learned-Miller, Brian Neil Levine Dec 2009

Reverse Engineering For Mobile Systems Forensics With Ares, John Tuttle, Robert J. Walls, Erik G. Learned-Miller, Brian Neil Levine

Erik G Learned-Miller

We present Ares ,areverseengineeringtechniqueforassist- ing in the analysis of data recovered for the investigation of mobile and embedded systems. The focus of investigations into insider activity is most often on the data stored on the insider’s computers and digital devices — call logs, email messaging, calendar entries, text messages, and browser his- tory — rather than on the status of the system’s security. Ares is novel in that it uses a data-driven approach that in- corporates natural language processing techniques to infer the layout of input data that has been created according to some unknown specification. While some other reverse …


Learning Class-Specific Image Transformations With Higher-Order Boltzmann Machines, Erik G. Learned-Miller, Gary Huang Dec 2009

Learning Class-Specific Image Transformations With Higher-Order Boltzmann Machines, Erik G. Learned-Miller, Gary Huang

Erik G Learned-Miller

In this paper, we examine the problem of learning a rep- resentation of image transformations specific to a complex object class, such as faces. Learning such a representation for a specific object class would allow us to perform im- proved, pose-invariant visual verification, such as uncon- strained face verification. We build off of the method of using factored higher-order Boltzmann machines to model such image transformations. Using this approach will po- tentially enable us to use the model as one component of a larger deep architecture. This will allow us to use the fea- ture information in an ordinary deep …


Bounding The Probability Of Error For High Precision Recognition, Andrew Kae, Gary B. Huang, Erik G. Learned-Miller Dec 2008

Bounding The Probability Of Error For High Precision Recognition, Andrew Kae, Gary B. Huang, Erik G. Learned-Miller

Erik G Learned-Miller

We consider models for which it is important, early in proces sing, to estimate some variables with high precision, but perhaps at relative ly low rates of recall. If some variables can be identified with near certainty, then th ey can be conditioned upon, allowing further inference to be done efficiently. Spe cifically, we consider optical character recognition (OCR) systems that can be boo tstrapped by identify- ing a subset of correctly translated document words with ver y high precision. This “clean set” is subsequently used as document-specific train ing data. While many current OCR systems produce measures of confidence …


Non-Parametric Curve Alignment, Marwan Mattar, Michael G. Ross, Erik G. Learned-Miller Dec 2008

Non-Parametric Curve Alignment, Marwan Mattar, Michael G. Ross, Erik G. Learned-Miller

Erik G Learned-Miller

Congealing is a flexible nonparametric data-driven framework for the joint alignment of data. It has been successfully applied to the joint alignment of binary images of digits, binary images of object silhouettes, grayscale MRI images, color images of cars and faces, and 3D brain volumes. This research enhances congealing to practically and effectively apply it to curve data. We develop a parameterized set of nonlinear transformations that allow us to apply congealing to this type of data. We present positive results on aligning synthetic and real curve data sets and conclude with a discussion on extending this work to simultaneous …


Cryptogram Decoding For Optical Character Recognition, Gary Huang, Erik G. Learned-Miller, Andrew Mccallum Jul 2006

Cryptogram Decoding For Optical Character Recognition, Gary Huang, Erik G. Learned-Miller, Andrew Mccallum

Erik G Learned-Miller

OCR systems for printed documents typically require large numbers of font styles and character models to work well. When given an unseen font, performance degrades even in the absence of noise. In this paper, we perform OCR in an unsupervised fashion without using any character models by using a cryptogram decoding algorithm. We present results on real and artificial OCR data.


The Umass Mobile Manipulator Uman: An Experimental Platform For Autonomous Mobile Manipulation, Dov Katz, Emily Horrell, Yuandong Yang, Brendan Burns, Thomas Buckley, Anna Grishkan, Volodymyr Zhylkovskyy, Oliver Brock, Erik G. Learned-Miller Jul 2006

The Umass Mobile Manipulator Uman: An Experimental Platform For Autonomous Mobile Manipulation, Dov Katz, Emily Horrell, Yuandong Yang, Brendan Burns, Thomas Buckley, Anna Grishkan, Volodymyr Zhylkovskyy, Oliver Brock, Erik G. Learned-Miller

Erik G Learned-Miller

Object identification is the task of identifying specific objects belonging to the same class such as cars. We often need to recognize an object that we have only seen a few times. In fact, we often observe only one example of a particular object before we need to recognize it again. Thus we are interested in building a system which can learn to extract distinctive markers from a single example and which can then be used to identify the object in another image as “same ” or “different”. Previous work by Ferencz et al. introduced the notion of hyper-features, which …


Detecting Acromegaly: Screening For Disease With A Morphable Model, Erik G. Learned-Miller, Qifeng Lung, Angela Paisley, Peter Trainer, Volker Blanz, Katrin Dedden, Ralph Miller Dec 2005

Detecting Acromegaly: Screening For Disease With A Morphable Model, Erik G. Learned-Miller, Qifeng Lung, Angela Paisley, Peter Trainer, Volker Blanz, Katrin Dedden, Ralph Miller

Erik G Learned-Miller

Acromegaly is a rare disorder which affects about 50 of every million people. The disease typically causes swelling of the hands, feet, and face, and eventually permanent changes to areas such as the jaw, brow ridge, and cheek bones. The disease is often missed by physicians and progresses beyond where it might if it were identified and treated earlier. We consider a semi-automated approach to detecting acromegaly, using a novel combination of support vector machines (SVMs) and a morphable model. Our training set consists of 24 frontal photographs of acromegalic patients and 25 of disease-free subjects. We modelled each subject's …


Data Driven Image Models Through Continuous Joint Alignment, Erik G. Learned-Miller Dec 2005

Data Driven Image Models Through Continuous Joint Alignment, Erik G. Learned-Miller

Erik G Learned-Miller

This paper presents a family of techniques that we call congealing for modeling image classes from data. The idea is to start with a set of images and make them appear as similar as possible by removing variability along the known axes of variation. This technique can be used to eliminate "nuisance” variables such as affine deformations from handwritten digits or unwanted bias fields from magnetic resonance images. In addition to separating and modeling the latent images—i.e., the images without the nuisance variables—we can model the nuisance variables themselves, leading to factorized generative image models. When nuisance variable distributions are …


Joint Feature Selection For Object Detection And Recognition, Jerod J. Weinman, Allen Hanson, Erik G. Learned-Miller Dec 2005

Joint Feature Selection For Object Detection And Recognition, Jerod J. Weinman, Allen Hanson, Erik G. Learned-Miller

Erik G Learned-Miller

Object detection and recognition systems, such as face detectors and face recognizers, are often trained separately and operated in a feed-forward fashion. Selecting a small number of features for these tasks is important to prevent over-fitting and reduce computation. However, when a system has such related or sequential tasks, selecting features for these tasks independently may not be optimal. We propose a framework for choosing features to be shared between object detection and recognition tasks. The result is a system that achieves better performance by joint training and is faster because some features for identification have already been computed for …


Discriminative Training Of Hyper-Feature Models For Object Identification, Vidit Jain, Erik G. Learned-Miller Dec 2005

Discriminative Training Of Hyper-Feature Models For Object Identification, Vidit Jain, Erik G. Learned-Miller

Erik G Learned-Miller

Object identification is the task of identifying specific objects belonging to the same class such as cars. We often need to recognize an object that we have only seen a few times. In fact, we often observe only one example of a particular object before we need to recognize it again. Thus we are interested in building a system which can learn to extract distinctive markers from a single example and which can then be used to identify the object in another image as “same ” or “different”. Previous work by Ferencz et al. introduced the notion of hyper-features, which …


Improving Recognition Of Novel Input With Similarity, Jerod J. Weinman, Erik G. Learned-Miller Dec 2005

Improving Recognition Of Novel Input With Similarity, Jerod J. Weinman, Erik G. Learned-Miller

Erik G Learned-Miller

Many sources of information relevant to computer vision and machine learning tasks are often underused. One example is the similarity between the elements from a novel source, such as a speaker, writer, or printed font. By comparing instances emitted by a source, we help ensure that similar instances are given the same label. Previous approaches have clustered instances prior to recognition. We propose a probabilistic framework that unifies similarity with prior identity and contextual information. By fusing information sources in a single model, we eliminate unrecoverable errors that result from processing the information in separate stages and improve overall accuracy. …


Building A Classification Cascade For Visual Identification From One Example, Andras Ferencz, Erik G. Learned-Miller, Jitendra Malik Sep 2005

Building A Classification Cascade For Visual Identification From One Example, Andras Ferencz, Erik G. Learned-Miller, Jitendra Malik

Erik G Learned-Miller

Object identification (OID) is specialized recognition where the category is known (e.g. cars) and the algorithm recognizes an object's exact identity (e.g. Bob's BMW). Two special challenges characterize OID. (1) Interclass variation is often small (many cars look alike) and may be dwarfed by illumination or pose changes. (2) There may be many classes but few or just one positive "training" examples per class. Due to (1), a solution must locate possibly subtle object-specific salient features (a door handle) while avoiding distracting ones (a specular highlight). However, (2) rules out direct techniques of feature selection. We describe an online algorithm …


Joint Mri Bias Removal Using Entropy Minimization Across Images, Erik G. Learned-Miller, Parvez Ahammad Dec 2004

Joint Mri Bias Removal Using Entropy Minimization Across Images, Erik G. Learned-Miller, Parvez Ahammad

Erik G Learned-Miller

The correction of bias in magnetic resonance images is an important problem in medical image processing. Most previous approaches have used a maximum likelihood method to increase the likelihood of the pixels in a single image by adaptively estimating a correction to the unknown image bias field. The pixel likelihoods are defined either in terms of a pre-existing tissue model, or non-parametrically in terms of the image's own pixel values. In both cases, the specific location of a pixel in the image is not used to calculate the likelihoods. We suggest a new approach in which we simultaneously eliminate the …


A Probabilistic Upper Bound On Differential Entropy, Joseph Destefano, Erik G. Learned-Miller Dec 2004

A Probabilistic Upper Bound On Differential Entropy, Joseph Destefano, Erik G. Learned-Miller

Erik G Learned-Miller

A novel probabilistic upper bound on the entropy of an unknown one-dimensional distribution, given the support of the distribution and a sample from that distribution, is presented. No knowledge beyond the support of the unknown distribution is required. Previous distribution-free bounds on the cumulative distribution function of a random variable given a sample of that variable are used to construct the bound. A simple, fast, and intuitive algorithm for computing the entropy bound from a sample is provided.


Detecting Acromegaly: Screening For Disease With A Morphable Model, Qifeng Lu, Erik G. Learned-Miller, Angela Paisley, Peter Trainer, Volker Blanz, Katrin Dedden, Ralph Miller Dec 2004

Detecting Acromegaly: Screening For Disease With A Morphable Model, Qifeng Lu, Erik G. Learned-Miller, Angela Paisley, Peter Trainer, Volker Blanz, Katrin Dedden, Ralph Miller

Erik G Learned-Miller

Acromegaly is a rare disorder which affects about 50 of every million people. The disease typically causes swelling of the hands, feet, and face, and eventually permanent changes to areas such as the jaw, brow ridge, and cheek bones. The disease is often missed by physicians and progresses beyond where it might if it were identified and treated earlier. We consider a semi-automated approach to detecting acromegaly, using a novel combination of support vector machines (SVMs) and a morphable model. Our training set consists of 24 frontal photographs of acromegalic patients and 25 of disease-free subjects. We modelled each subject's …


Sign Classification For The Visually Impaired, Marwan A. Mattar, Allen R. Hanson, Erik G. Learned-Miller Dec 2004

Sign Classification For The Visually Impaired, Marwan A. Mattar, Allen R. Hanson, Erik G. Learned-Miller

Erik G Learned-Miller

Our world is populated with visual information that a sighted person makes use of daily. Unfortunately, the visually impaired are deprived from such information, which limits their mobility in unconstrained environments. To help alleviate this we are developing a wearable system that is capable of detecting and recognizing signs in natural scenes. The system is composed of two main components, sign detection and recognition. The sign detector, uses a conditional maximum entropy model to find regions in an image that correspond to a sign. The sign recognizer matches the hypothesized sign regions with sign images in a database. The system …


Efficient Population Registration Of 3d Data, Lilla Zöllei, Erik G. Learned-Miller, Eric Grimson, William Wells Dec 2004

Efficient Population Registration Of 3d Data, Lilla Zöllei, Erik G. Learned-Miller, Eric Grimson, William Wells

Erik G Learned-Miller

We present a population registration framework that acts on large collections or populations of data volumes. The data alignment procedure runs in a simultaneous fashion, with every member of the population approaching the central tendency of the collection at the same time. Such a mechanism eliminates the need for selecting a particular reference frame a priori, resulting in a non-biased estimate of a digital atlas. Our algorithm adopts an affine congealing framework with an information theoretic objective function and is optimized via a gradientbased stochastic approximation process embedded in a multi-resolution setting. We present experimental results on both synthetic and …