Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
Articles 1 - 30 of 41
Full-Text Articles in Physical Sciences and Mathematics
Bounding The Probability Of Error For High Precision Optical Character Recognition, Gary B. Huang, Andrew Kae, Carl Doersch, Erik G. Learned-Miller
Bounding The Probability Of Error For High Precision Optical Character Recognition, Gary B. Huang, Andrew Kae, Carl Doersch, Erik G. Learned-Miller
Erik G Learned-Miller
We consider a model for which it is important, early in proces sing, to estimate some variables with high precision, but perhaps at relatively low recall. I f some variables can be identified with near certainty, they can be conditioned upon, allowing furt her inference to be done efficiently. Specifically, we consider optical character recognition (O CR) systems that can be bootstrapped by identifying a subset of correctly translated document wo rds with very high precision. This “clean set” is subsequently used as document-specific train ing data. While OCR systems produce confidence measures for the identity of each letter or …
Background Modeling Using Adaptive Pixelwise Kernel Variances In A Hybrid Feature Space, Erik G. Learned-Miller, Manjunath Narayana, Allen Hanson
Background Modeling Using Adaptive Pixelwise Kernel Variances In A Hybrid Feature Space, Erik G. Learned-Miller, Manjunath Narayana, Allen Hanson
Erik G Learned-Miller
Recent work on background subtraction has shown de- velopments on two major fronts. In one, there has been increasing sophistication of probabilistic models, from mix- tures of Gaussians at each pixel [ 7 ], to kernel density esti- mates at each pixel [ 1 ], and more recently to joint domain- range density estimates that incorporate spatial informa- tion [ 6 ]. Another line of work has shown the benefits of increasingly complex feature representations, including the use of texture information, local binary patterns, and re- cently scale-invariant local ternary patterns [ 4 ]. In this work, we use joint …
Scene Text Recognition With Bilateral Regression, Jacqueline Feild, Erik G. Learned-Miller
Scene Text Recognition With Bilateral Regression, Jacqueline Feild, Erik G. Learned-Miller
Erik G Learned-Miller
This paper focuses on improving the recognition of text in images of natural scenes, such as storefront signs or street signs. This is a difficult problem due to lighting conditions, variation in font shape and color, and complex backgrounds. We present a word recognition system that addresses these difficulties using an innovative technique to extract and recognize foreground text in an image. First, we develop a new method, called bilateral regression, for extracting and modeling one coherent (although not necessarily contiguous) region from an image. The method models smooth color changes across an image region without being corrupted by neighboring …
Unsupervised Joint Alignment And Clustering Using Bayesian Nonparametrics, Marwan Mattar, Allen Hanson, Erik G. Learned-Miller
Unsupervised Joint Alignment And Clustering Using Bayesian Nonparametrics, Marwan Mattar, Allen Hanson, Erik G. Learned-Miller
Erik G Learned-Miller
Joint alignment of a collection of functions is the process of independently transforming the func- tions so that they appear more similar to each other. Typically, such unsupervised alignment al- gorithms fail when presented with complex data sets arising from multiple modalities or make re- strictive assumptions about the form of the func- tions or transformations, limiting their general- ity. We present a transformed Bayesian infinite mixture model that can simultaneously align and cluster a data set. Our model and associated learning scheme offer two key advantages: the optimal number of clusters is determined in a data-driven fashion through the …
Learning Hierarchical Representations For Face Verification, Gary B. Huang, Honglak Lee, Erik G. Learned-Miller
Learning Hierarchical Representations For Face Verification, Gary B. Huang, Honglak Lee, Erik G. Learned-Miller
Erik G Learned-Miller
Most modern face recognition systems rely on a feature representation given by a hand-crafted image descriptor, such as Local Binary Patterns (LBP), and achieve improved performance by combining several such representations. In this paper, we propose deep learning as a natural source for obtaining additional, complementary representations. To learn features in high-resolution images, we make use of convolutional deep belief networks. Moreover, to take advantage of global structure in an object class, we develop local convolutional restricted Boltzmann machines, a novel convolutional learning model that exploits the global structure by not assuming stationarity of features across the image, while maintaining …
Half-Wits: Software Techniques For Low-Voltage Probabilistic Storage On Microcontrollers With Nor Flash Memory., Mastooreh Salajegheh, Yue Wang, Anxiao Jiang, Erik G. Learned-Miller, Kevin Fu
Half-Wits: Software Techniques For Low-Voltage Probabilistic Storage On Microcontrollers With Nor Flash Memory., Mastooreh Salajegheh, Yue Wang, Anxiao Jiang, Erik G. Learned-Miller, Kevin Fu
Erik G Learned-Miller
No abstract provided.
Improvements In Joint Domain-Range Modeling For Background Subtraction, Manjunath Narayana, Allen Hanson, Erik G. Learned-Miller
Improvements In Joint Domain-Range Modeling For Background Subtraction, Manjunath Narayana, Allen Hanson, Erik G. Learned-Miller
Erik G Learned-Miller
In many algorithms for background modeling, a distribution over feature values is modeled at each pixel. These models, however, do not account for the dependencies that may exist among nearby pixels. The joint domain-range kernel density estimate (KDE) model by Sheikh and Shah [ 7 ], which is not a pixel-wise model, represents the background and foreground processes by combining the three color dimensions and two spatial dimensions into a five-dimensional joint space. The Sheikh and Shah model, as we will show, has a peculiar dependence on the size of the image. In contrast, we build three-dimensional color distributions at …
Enforcing Similarity Constraints With Integer Programming For Better Scene Text Recognition, David L. Smith, Jacqueline Feild, Erik G. Learned-Miller
Enforcing Similarity Constraints With Integer Programming For Better Scene Text Recognition, David L. Smith, Jacqueline Feild, Erik G. Learned-Miller
Erik G Learned-Miller
The recognition of text in everyday scenes is made dif- ficult by viewing conditions, unusual fonts, and lack of lin- guistic context. Most methods integrate a priori appear- ance information and some sort of hard or soft constraint on the allowable strings. Weinman and Learned-Miller [ 14 ] showed that the similarity among characters, as a supple- ment to the appearance of the characters with respect to a model, could be used to improve scene text recognition. In this work, we make further improvements to scene text recognition by taking a novel approach to the incorpora- tion of similarity. In …
Forensic Triage For Mobile Phones With Dec0de, Robert Walls, Brian N. Levine, Erik G. Learned-Miller
Forensic Triage For Mobile Phones With Dec0de, Robert Walls, Brian N. Levine, Erik G. Learned-Miller
Erik G Learned-Miller
We present DEC0DE, a system for recovering information from phones with unknown storage formats, a critical problem for forensic triage. Because phones have myr- iad custom hardware and software, we examine only the stored data. Via flexible descriptions of typical data struc- tures, and using a classic dynamic programming algo- rithm, we are able to identify call logs and address book entries in phones across varied models and manufactur- ers. We designed DEC0DE by examining the formats of one set of phone models, and we evaluate its performance on other models. Overall, we are able to obtain high performance for …
Distribution Fields, Laura Sevilla Lara, Erik G. Learned-Miller
Distribution Fields, Laura Sevilla Lara, Erik G. Learned-Miller
Erik G Learned-Miller
No abstract provided.
Online Domain-Adaptation Of A Pre-Trained Cascade Of Classifiers, Vidit Jain, Erik G. Learned-Miller
Online Domain-Adaptation Of A Pre-Trained Cascade Of Classifiers, Vidit Jain, Erik G. Learned-Miller
Erik G Learned-Miller
Many classifiers are trained with massive training sets only to be applied at test time on data from a different dis- tribution. How can we rapidly and simply adapt a classifier to a new test distribution, even when we do not have ac- cess to the original training data? We present an on-line approach for rapidly adapting a “black box” classifier to a new test data set without retraining the classifier or ex- amining the original optimization criterion. Assuming the original classifier outputs a continuous number for which a threshold gives the class, we reclassify points near the ori g- …
Exploiting Half-Wits: Smarter Storage For Low-Power Devices, Mastooreh Salajegheh, Yue Wang, Kevin Fu, Anxiao Jiang, Erik G. Learned-Miller
Exploiting Half-Wits: Smarter Storage For Low-Power Devices, Mastooreh Salajegheh, Yue Wang, Kevin Fu, Anxiao Jiang, Erik G. Learned-Miller
Erik G Learned-Miller
This work analyzes the stochastic behavior of writing to embedded flash memory at voltages lower than recom- mended by a microcontroller’s specifications to reduce energy consumption. Flash memory integrated within a microcontroller typically requires the entire chip to op- erate on common supply voltage almost double what the CPU portion requires. Our approach tolerates a lower supply voltage so that the CPU may operate in a more en- ergy efficient manner. Energy efficient coding algorithms then cope with flash memory that behaves unpredictably. Our software-only coding algorithms ( in-place writes , multiple-place writes , RS-Berger codes ) enable reliable storage …
Improving State-Of-The-Art Ocr Through High-Precision Document-Specific Modeling, Andrew Kae, Gary B. Huang, Carl Doersch, Erik G. Learned-Miller
Improving State-Of-The-Art Ocr Through High-Precision Document-Specific Modeling, Andrew Kae, Gary B. Huang, Carl Doersch, Erik G. Learned-Miller
Erik G Learned-Miller
Optical character recognition (OCR) remains a difficult problem for noisy documents or documents not scanned at high resolution. Many current approaches rely on stored font models that are vulnerable to cases in which the docu- ment is noisy or is written in a font dissimilar to the stored fonts. We address these problems by learning character models directly from the document itself, rather than using pre-stored font models. This method has had some success in the past, but we are able to achieve substantial improve- ment in error reduction through a novel method for creating nearly error-free document-specifictraining data and …
Reverse Engineering For Mobile Systems Forensics With Ares, John Tuttle, Robert J. Walls, Erik G. Learned-Miller, Brian Neil Levine
Reverse Engineering For Mobile Systems Forensics With Ares, John Tuttle, Robert J. Walls, Erik G. Learned-Miller, Brian Neil Levine
Erik G Learned-Miller
We present Ares ,areverseengineeringtechniqueforassist- ing in the analysis of data recovered for the investigation of mobile and embedded systems. The focus of investigations into insider activity is most often on the data stored on the insider’s computers and digital devices — call logs, email messaging, calendar entries, text messages, and browser his- tory — rather than on the status of the system’s security. Ares is novel in that it uses a data-driven approach that in- corporates natural language processing techniques to infer the layout of input data that has been created according to some unknown specification. While some other reverse …
Learning Class-Specific Image Transformations With Higher-Order Boltzmann Machines, Erik G. Learned-Miller, Gary Huang
Learning Class-Specific Image Transformations With Higher-Order Boltzmann Machines, Erik G. Learned-Miller, Gary Huang
Erik G Learned-Miller
In this paper, we examine the problem of learning a rep- resentation of image transformations specific to a complex object class, such as faces. Learning such a representation for a specific object class would allow us to perform im- proved, pose-invariant visual verification, such as uncon- strained face verification. We build off of the method of using factored higher-order Boltzmann machines to model such image transformations. Using this approach will po- tentially enable us to use the model as one component of a larger deep architecture. This will allow us to use the fea- ture information in an ordinary deep …
Bounding The Probability Of Error For High Precision Recognition, Andrew Kae, Gary B. Huang, Erik G. Learned-Miller
Bounding The Probability Of Error For High Precision Recognition, Andrew Kae, Gary B. Huang, Erik G. Learned-Miller
Erik G Learned-Miller
We consider models for which it is important, early in proces sing, to estimate some variables with high precision, but perhaps at relative ly low rates of recall. If some variables can be identified with near certainty, then th ey can be conditioned upon, allowing further inference to be done efficiently. Spe cifically, we consider optical character recognition (OCR) systems that can be boo tstrapped by identify- ing a subset of correctly translated document words with ver y high precision. This “clean set” is subsequently used as document-specific train ing data. While many current OCR systems produce measures of confidence …
Non-Parametric Curve Alignment, Marwan Mattar, Michael G. Ross, Erik G. Learned-Miller
Non-Parametric Curve Alignment, Marwan Mattar, Michael G. Ross, Erik G. Learned-Miller
Erik G Learned-Miller
Congealing is a flexible nonparametric data-driven framework for the joint alignment of data. It has been successfully applied to the joint alignment of binary images of digits, binary images of object silhouettes, grayscale MRI images, color images of cars and faces, and 3D brain volumes. This research enhances congealing to practically and effectively apply it to curve data. We develop a parameterized set of nonlinear transformations that allow us to apply congealing to this type of data. We present positive results on aligning synthetic and real curve data sets and conclude with a discussion on extending this work to simultaneous …
Cryptogram Decoding For Optical Character Recognition, Gary Huang, Erik G. Learned-Miller, Andrew Mccallum
Cryptogram Decoding For Optical Character Recognition, Gary Huang, Erik G. Learned-Miller, Andrew Mccallum
Erik G Learned-Miller
OCR systems for printed documents typically require large numbers of font styles and character models to work well. When given an unseen font, performance degrades even in the absence of noise. In this paper, we perform OCR in an unsupervised fashion without using any character models by using a cryptogram decoding algorithm. We present results on real and artificial OCR data.
The Umass Mobile Manipulator Uman: An Experimental Platform For Autonomous Mobile Manipulation, Dov Katz, Emily Horrell, Yuandong Yang, Brendan Burns, Thomas Buckley, Anna Grishkan, Volodymyr Zhylkovskyy, Oliver Brock, Erik G. Learned-Miller
The Umass Mobile Manipulator Uman: An Experimental Platform For Autonomous Mobile Manipulation, Dov Katz, Emily Horrell, Yuandong Yang, Brendan Burns, Thomas Buckley, Anna Grishkan, Volodymyr Zhylkovskyy, Oliver Brock, Erik G. Learned-Miller
Erik G Learned-Miller
Object identification is the task of identifying specific objects belonging to the same class such as cars. We often need to recognize an object that we have only seen a few times. In fact, we often observe only one example of a particular object before we need to recognize it again. Thus we are interested in building a system which can learn to extract distinctive markers from a single example and which can then be used to identify the object in another image as “same ” or “different”. Previous work by Ferencz et al. introduced the notion of hyper-features, which …
Detecting Acromegaly: Screening For Disease With A Morphable Model, Erik G. Learned-Miller, Qifeng Lung, Angela Paisley, Peter Trainer, Volker Blanz, Katrin Dedden, Ralph Miller
Detecting Acromegaly: Screening For Disease With A Morphable Model, Erik G. Learned-Miller, Qifeng Lung, Angela Paisley, Peter Trainer, Volker Blanz, Katrin Dedden, Ralph Miller
Erik G Learned-Miller
Acromegaly is a rare disorder which affects about 50 of every million people. The disease typically causes swelling of the hands, feet, and face, and eventually permanent changes to areas such as the jaw, brow ridge, and cheek bones. The disease is often missed by physicians and progresses beyond where it might if it were identified and treated earlier. We consider a semi-automated approach to detecting acromegaly, using a novel combination of support vector machines (SVMs) and a morphable model. Our training set consists of 24 frontal photographs of acromegalic patients and 25 of disease-free subjects. We modelled each subject's …
Data Driven Image Models Through Continuous Joint Alignment, Erik G. Learned-Miller
Data Driven Image Models Through Continuous Joint Alignment, Erik G. Learned-Miller
Erik G Learned-Miller
This paper presents a family of techniques that we call congealing for modeling image classes from data. The idea is to start with a set of images and make them appear as similar as possible by removing variability along the known axes of variation. This technique can be used to eliminate "nuisance” variables such as affine deformations from handwritten digits or unwanted bias fields from magnetic resonance images. In addition to separating and modeling the latent images—i.e., the images without the nuisance variables—we can model the nuisance variables themselves, leading to factorized generative image models. When nuisance variable distributions are …
Joint Feature Selection For Object Detection And Recognition, Jerod J. Weinman, Allen Hanson, Erik G. Learned-Miller
Joint Feature Selection For Object Detection And Recognition, Jerod J. Weinman, Allen Hanson, Erik G. Learned-Miller
Erik G Learned-Miller
Object detection and recognition systems, such as face detectors and face recognizers, are often trained separately and operated in a feed-forward fashion. Selecting a small number of features for these tasks is important to prevent over-fitting and reduce computation. However, when a system has such related or sequential tasks, selecting features for these tasks independently may not be optimal. We propose a framework for choosing features to be shared between object detection and recognition tasks. The result is a system that achieves better performance by joint training and is faster because some features for identification have already been computed for …
Discriminative Training Of Hyper-Feature Models For Object Identification, Vidit Jain, Erik G. Learned-Miller
Discriminative Training Of Hyper-Feature Models For Object Identification, Vidit Jain, Erik G. Learned-Miller
Erik G Learned-Miller
Object identification is the task of identifying specific objects belonging to the same class such as cars. We often need to recognize an object that we have only seen a few times. In fact, we often observe only one example of a particular object before we need to recognize it again. Thus we are interested in building a system which can learn to extract distinctive markers from a single example and which can then be used to identify the object in another image as “same ” or “different”. Previous work by Ferencz et al. introduced the notion of hyper-features, which …
Improving Recognition Of Novel Input With Similarity, Jerod J. Weinman, Erik G. Learned-Miller
Improving Recognition Of Novel Input With Similarity, Jerod J. Weinman, Erik G. Learned-Miller
Erik G Learned-Miller
Many sources of information relevant to computer vision and machine learning tasks are often underused. One example is the similarity between the elements from a novel source, such as a speaker, writer, or printed font. By comparing instances emitted by a source, we help ensure that similar instances are given the same label. Previous approaches have clustered instances prior to recognition. We propose a probabilistic framework that unifies similarity with prior identity and contextual information. By fusing information sources in a single model, we eliminate unrecoverable errors that result from processing the information in separate stages and improve overall accuracy. …
Building A Classification Cascade For Visual Identification From One Example, Andras Ferencz, Erik G. Learned-Miller, Jitendra Malik
Building A Classification Cascade For Visual Identification From One Example, Andras Ferencz, Erik G. Learned-Miller, Jitendra Malik
Erik G Learned-Miller
Object identification (OID) is specialized recognition where the category is known (e.g. cars) and the algorithm recognizes an object's exact identity (e.g. Bob's BMW). Two special challenges characterize OID. (1) Interclass variation is often small (many cars look alike) and may be dwarfed by illumination or pose changes. (2) There may be many classes but few or just one positive "training" examples per class. Due to (1), a solution must locate possibly subtle object-specific salient features (a door handle) while avoiding distracting ones (a specular highlight). However, (2) rules out direct techniques of feature selection. We describe an online algorithm …
Joint Mri Bias Removal Using Entropy Minimization Across Images, Erik G. Learned-Miller, Parvez Ahammad
Joint Mri Bias Removal Using Entropy Minimization Across Images, Erik G. Learned-Miller, Parvez Ahammad
Erik G Learned-Miller
The correction of bias in magnetic resonance images is an important problem in medical image processing. Most previous approaches have used a maximum likelihood method to increase the likelihood of the pixels in a single image by adaptively estimating a correction to the unknown image bias field. The pixel likelihoods are defined either in terms of a pre-existing tissue model, or non-parametrically in terms of the image's own pixel values. In both cases, the specific location of a pixel in the image is not used to calculate the likelihoods. We suggest a new approach in which we simultaneously eliminate the …
A Probabilistic Upper Bound On Differential Entropy, Joseph Destefano, Erik G. Learned-Miller
A Probabilistic Upper Bound On Differential Entropy, Joseph Destefano, Erik G. Learned-Miller
Erik G Learned-Miller
A novel probabilistic upper bound on the entropy of an unknown one-dimensional distribution, given the support of the distribution and a sample from that distribution, is presented. No knowledge beyond the support of the unknown distribution is required. Previous distribution-free bounds on the cumulative distribution function of a random variable given a sample of that variable are used to construct the bound. A simple, fast, and intuitive algorithm for computing the entropy bound from a sample is provided.
Detecting Acromegaly: Screening For Disease With A Morphable Model, Qifeng Lu, Erik G. Learned-Miller, Angela Paisley, Peter Trainer, Volker Blanz, Katrin Dedden, Ralph Miller
Detecting Acromegaly: Screening For Disease With A Morphable Model, Qifeng Lu, Erik G. Learned-Miller, Angela Paisley, Peter Trainer, Volker Blanz, Katrin Dedden, Ralph Miller
Erik G Learned-Miller
Acromegaly is a rare disorder which affects about 50 of every million people. The disease typically causes swelling of the hands, feet, and face, and eventually permanent changes to areas such as the jaw, brow ridge, and cheek bones. The disease is often missed by physicians and progresses beyond where it might if it were identified and treated earlier. We consider a semi-automated approach to detecting acromegaly, using a novel combination of support vector machines (SVMs) and a morphable model. Our training set consists of 24 frontal photographs of acromegalic patients and 25 of disease-free subjects. We modelled each subject's …
Sign Classification For The Visually Impaired, Marwan A. Mattar, Allen R. Hanson, Erik G. Learned-Miller
Sign Classification For The Visually Impaired, Marwan A. Mattar, Allen R. Hanson, Erik G. Learned-Miller
Erik G Learned-Miller
Our world is populated with visual information that a sighted person makes use of daily. Unfortunately, the visually impaired are deprived from such information, which limits their mobility in unconstrained environments. To help alleviate this we are developing a wearable system that is capable of detecting and recognizing signs in natural scenes. The system is composed of two main components, sign detection and recognition. The sign detector, uses a conditional maximum entropy model to find regions in an image that correspond to a sign. The sign recognizer matches the hypothesized sign regions with sign images in a database. The system …
Efficient Population Registration Of 3d Data, Lilla Zöllei, Erik G. Learned-Miller, Eric Grimson, William Wells
Efficient Population Registration Of 3d Data, Lilla Zöllei, Erik G. Learned-Miller, Eric Grimson, William Wells
Erik G Learned-Miller
We present a population registration framework that acts on large collections or populations of data volumes. The data alignment procedure runs in a simultaneous fashion, with every member of the population approaching the central tendency of the collection at the same time. Such a mechanism eliminates the need for selecting a particular reference frame a priori, resulting in a non-biased estimate of a digital atlas. Our algorithm adopts an affine congealing framework with an information theoretic objective function and is optimized via a gradientbased stochastic approximation process embedded in a multi-resolution setting. We present experimental results on both synthetic and …