Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Articles 1 - 30 of 53

Full-Text Articles in Physical Sciences and Mathematics

Terrain Constrained Stereo Correspondence, Gabrielle Inglis, Chris Roman Dec 2012

Christopher N. Roman

There is a persistent need in the oceanographic community for accurate three-dimensional reconstructions of seafloor structures. To meet this need, underwater mapping techniques have expanded to include the use of stereo vision and high-frequency multibeam sonar for mapping scenes tens to hundreds of square meters in size. Both techniques have relative advantages and disadvantages that depend on the task at hand and the desired accuracy. In this paper, we develop a method to constrain the often problematic stereo correspondence search to small sections of the image that correspond to estimated ranges along the epipolar lines calculated from coregistered …


Application Of Structured Light Imaging For High Resolution Mapping Of Underwater Archaeological Sites, Chris Roman, Gabrielle Inglis, James Rutter Dec 2012

Christopher N. Roman

This paper presents results from recent work using structured light laser profile imaging to create high-resolution bathymetric maps of underwater archaeological sites. Documenting the texture and structure of submerged sites is a difficult task, and many applicable acoustic and photographic mapping techniques have recently emerged. This effort was completed to evaluate laser profile imaging in comparison to stereo imaging and high-frequency multibeam mapping. An ROV-mounted camera and an inclined 532 nm sheet laser were used to create profiles of the bottom that were then merged into maps using platform navigation data. These initial results show very promising resolution …


P2CP: A New Cloud Storage Model To Enhance Performance Of Cloud Services, Zhe Sun, Jun Shen, Ghassan Beydoun Dec 2012

Associate Professor Ghassan Beydoun

This paper presents a storage model named Peer to Cloud and Peer (P2CP). Assuming that the P2CP model follows the Poisson process or Little’s law, we prove that the speed and availability of P2CP are generally better than those of the pure Peer to Peer (P2P) model, the Peer to Server and Peer (P2SP) model, or the cloud model. A key feature of our P2CP is that it has three data transmission tunnels: the cloud-user data transmission tunnel, the clients’ data transmission tunnel, and the common data transmission tunnel. P2CP uses the cloud storage system as a common storage system. When …


Evaluating Usage Of WSMO And OWL-S In Semantic Web Services, Lina Azleny Kamaruddin, Jun Shen, Ghassan Beydoun Dec 2012

Associate Professor Ghassan Beydoun

Applying ontologies is the most promising approach to semantically enrich Web services. To facilitate this, two efforts contributed the most in enabling the creation of ontologies: OWL-S from the US and WSMO in Europe. These two compete and promote their ontologies from the design perspective, reflecting their inventors’ bias but not offering much help to Web service developers using them. To bypass existing biases and enable evaluation of ontologies expressed in these two languages, this paper provides a study of the two important facilitators, OWL-S and WSMO, surveying their usage in several SWS Projects and identifying their respective and outstanding …


A Pipeline For Structured Light Bathymetric Mapping, Gabrielle Inglis, Clara Smart, J. Vaughn, Chris Roman Oct 2012

Christopher N. Roman

This paper details a methodology for using structured light laser imaging to create high-resolution bathymetric maps of the sea floor. The system includes a pair of stereo cameras and an inclined 532 nm sheet laser mounted to a remotely operated vehicle (ROV). While a structured light system generally requires a single camera, a stereo vision setup is used here for in-situ calibration of the laser system geometry by triangulating points on the laser line. This allows for quick calibration at the survey site and does not require precise jigs or a controlled environment. A batch procedure to extract the …


Greedy Approximation Of Kernel PCA By Minimizing The Mapping Error, Peng Cheng, Wanqing Li, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

In this paper we propose a new kernel PCA (KPCA) speed-up algorithm that aims to find a reduced KPCA to approximate the kernel mapping. The algorithm works by greedily choosing a subset of the training samples that minimizes the mean square error of the kernel mapping between the original KPCA and the reduced KPCA. Experimental results have shown that the proposed algorithm is more efficient in computation and effective with lower mapping errors than previous algorithms.


Perceived Similarity And Visual Descriptions In Content-Based Image Retrieval, Yuan Zhong, Lei Ye, Wanqing Li, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

The use of low-level feature descriptors is pervasive in content-based image retrieval tasks, and the answer to the question of how well these features describe users’ intention is inconclusive. In this paper we devise experiments to gauge the degree of alignment between the description of target images by humans and that implicitly provided by low-level image feature descriptors. Data was collected on how humans perceive similarity in images. Using images judged by humans to be similar as ground truth, the performance of some MPEG-7 visual feature descriptors was evaluated. It is found that various descriptors play different roles in different …


Emotional States Control For On-Line Game Avatars, Ce Zhan, Wanqing Li, Farzad Safaei, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

Although detailed animation has already been achieved in a number of Multi-player On-line Games (MOGs), players have to use text commands to control emotional states of avatars. Some systems have been proposed to implement real-time automatic facial expression recognition of players. Such systems can then be used to control avatars' emotional states by driving the MOG's "animation engine" instead of text commands. One of the challenges for such systems is the ability to detect and recognize facial components from low spatial resolution face images. In this paper a system based on an improved face detection method of Viola …


Modelling Of Color Cross-Talk In CMOS Image Sensors, Wanqing Li, Philip Ogunbona, Yan Shi, Igor Kharitonenko Sep 2012

Professor Philip Ogunbona

This paper presents a way to model the cross-talk effect in CMOS image sensors. Two algorithms are derived from the model; both of them work on the Bayer raw data and have low computational complexity. Experiments on a Macbeth color chart and real images have shown the effectiveness of the modeling in eliminating the cross-talk effect and producing better quality images with traditional color interpolation and correction algorithms designed for CCD image sensors.


Application Of Visual Modelling In Image Restoration And Colour Image Processing, Aziz Qureshi, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

This paper describes the application of human visual models in (i) defining a visually uniform colour representation space and (ii) the formulation of visually weighted Kalman filtering for image restoration. The former is useful in colour image quantisation and compression. For (i), the uniformity of chromaticity differences at the output of Frei’s colour vision model [3] is tested and compensated for by using MacAdam’s uniform chromaticity space. For (ii), the dynamical image model of the Kalman filter is visually weighted using the frequency response of Stockham’s model [1] of human vision.


Channel-Optimized Vector Trellis Source Coding For The AWGN Channel, Philip Secker, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

A channel-optimised (joint source and channel) trellis source coder is designed for the AWGN channel. The optimum decoder is a non-linear function of the real channel information. The extension to 2D vector alphabets coupled with modifications to the signal space are found to improve performance. Favourable comparisons are made against a trellis source coder/TCM system.


Visual Perceptual Process Model And Object Segmentation, Wanqing Li, P. Ogunbona, Lei Ye, Igor Kharitonenko Sep 2012

Professor Philip Ogunbona

Modeling the human visual process is crucial for automatic object segmentation that produces results consistent with human perception. Based on the latest understanding of how humans perform the task of extracting objects from images, we propose a graph-based computational framework to model the visual process. The model supports the hierarchical nature of human visual perception and consists of the key steps of human visual perception including pre-attentive (pre-constancy) grouping, figure-and-ground organization, and attentive (post-constancy) grouping. A divide-and-conquer implementation of the model based on the concept of shortest spanning tree (SST) has demonstrated the potential of the model for …


Industrial Computer Vision Using Undefined Feature Extraction, Phil Evans, John A. Fulcher, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

This paper presents an application of computer vision in a real-world uncontrolled environment found at BHP Steel Port Kembla. The task is visual identification of torpedo ladles at a Blast Furnace, which is achieved by reading numbers attached to each ladle. Number recognition is achieved through use of feature extraction using a Multi-Layer Perceptron (MLP) Artificial Neural Network (ANN). The novelty in the method used in this application is that the features the MLP is being trained to extract are undefined before the MLP is initialised. The results of the MLP …


Kernel PCA Of HOG Features For Posture Detection, Peng Cheng, Wanqing Li, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

Motivated by the non-linear manifold learning ability of the Kernel Principal Component Analysis (KPCA), we propose in this paper a method for detecting human postures from single images by employing KPCA to learn the manifold span of a set of HOG features that can effectively represent the postures. The main contribution of this paper is to apply the KPCA as a non-linear learning and open-set classification tool, which implicitly learns a smooth manifold from noisy data that scatter over the feature space. For a new instance of HOG feature, its distance to the manifold that is measured by its reconstruction …


Index Factorised Image Adaptive Vector Quantisation, Jamshid Shanbehzadeh, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

No abstract provided.


On Multiple Watermarking, Nicholas Paul Sheppard, Reihaneh Safavi-Naini, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

Mintzer and Braudaway once asked: If one watermark is good, are more better? In this paper, we discuss some techniques for embedding multiple watermarks into a single multimedia object and report some observations on implementations of these techniques.


Methods Of Channel-Optimised Trellis Source Coding For The AWGN Channel, Philip Secker, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

Improvements to channel-optimised trellis source coding for the AWGN channel are obtained by using, in various forms, real or ‘soft’ channel information. The proposed 1 bit/sample systems use a channel-optimised encoder matched to 1) a simple decision feedback detector, 2) an expanded codebook with 2-bit quantized information and 3) an optimum non-linear estimator decoder. The third system is further improved by considering vector alphabets and both constant and average energy constrained 2D signal constellations.


Face To Face Communications In Multiplayer Online Games: A Real-Time System, Ce Zhan, Wanqing Li, Farzad Safaei, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

Multiplayer online games (MOGs) bring HCI into a new era of human-human interactions in the computer world. Although current MOGs provide more interactivity and social interaction in the virtual world, natural facial expression as a key factor in emulating face to face communications has been neglected by game designers. In this work, we propose a real-time automatic system to recognize players’ facial expressions, so that the recognition results can be used to drive the MOG’s “facial expression engine” instead of “text commands”. Our major contributions are the evaluation, improvement and efficient implementation of existing algorithms to build a real-time system that …


Human Detection Based On Weighted Template Matching, Duc Thanh Nguyen, Philip Ogunbona, Wanqing Li Sep 2012

Professor Philip Ogunbona

This paper proposes a new two-stage human detection method involving matching and verification. A Bayesian framework is developed to verify the matching score obtained from a weighted distance measure. Performance evaluation indicates that the proposed method is able to utilize the flexible matching scheme and produce superior true positive, true negative and low misclassification rates.


Compression Performance Of JPEG Encryption Scheme, C. Kailasanathan, R. Safavi-Naini, P. Ogunbona Sep 2012

Professor Philip Ogunbona

Recent developments in Internet and Web-based technologies require faster communication of multimedia data in a secure form. A number of encryption schemes for MPEG have been proposed. In this paper, we evaluate the compression performance of JPEG which has been encrypted with the zig-zag permutation algorithm, suggest a security enhancement to the scheme, and propose an alternative to the entropy coding recommended by JPEG to compensate for the compression drop occurring due to permutation.


Human Detection With Contour-Based Local Motion Binary Patterns, Duc Thanh Nguyen, Philip Ogunbona, Wanqing Li Sep 2012

Professor Philip Ogunbona

This paper presents a human detection method using contour-based local motion features. The local motion is encoded using a variant of the popular Local Binary Pattern (LBP) called Non-Redundant Local Binary Pattern (NRLBP) descriptor computed on the difference image of two consecutive frames. In addition, the local motion features are extracted along the human's boundary contour. Localising features on the contours has the advantage of utilizing a precise human shape description. A motivation of the proposed method is that most informative movements are performed on boundary contours of the body parts, e.g. legs of pedestrians. Evaluation of the …


Image Compression Based On Genealogical Relation Of The TSVQ Indices, Jamshid Shanbehzadeh, Philip Ogunbona, Abdoihosein Sarafzadeh Sep 2012

Professor Philip Ogunbona

The indices obtained by tree-structured vector quantisation (TSVQ) have an interesting property that enables them to give information about the correlation between two image blocks. If two image blocks are highly correlated, they may have an identical index, or the same ancestors. The existence of high inter-block correlation in natural images results in having neighboring blocks with the same genealogy. This characteristic can be used to compress the indices. This paper introduces a novel method to exploit the genealogical relation between the image block indices obtained from a TSVQ. The performance of this scheme in terms of PSNR versus average rate …


An Audio Representation For Content Based Retrieval, Kathy Melih, Ruben Gonzalez, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

Despite the increasing interest in multimedia data retrieval, audio data has received little attention. This is due not to a lack of interest but rather to unique difficulties posed by the medium. In particular, existing unstructured audio representations do not easily lend themselves to content based retrieval and especially browsing. This paper aims to address this oversight by developing an audio representation that provides direct support for browsing and content based retrieval. This support is the result of a structured representation based on psychoacoustic principles in which salient attributes of audio are directly accessible. In addition, the representation is compact thus …


An MPEG Tolerant Authentication System For Video Data, Takeyuki Uehara, R. Safavi-Naini, P. Ogunbona Sep 2012

Professor Philip Ogunbona

We propose a secure video authentication algorithm that is tolerant to visual degradation due to MPEG lossy compression to a designed level. The authentication process generates a tag that is sent with video data and the level of protection can be adjusted so that longer tags are used for higher security, and that the protection is distributed such that higher security is provided for regions of interest in the image. The computation required for authentication and verification can be largely performed as part of MPEG compression and so generation and verification of the tag can be integrated into the compression …


Human Detection Using Local Shape And Non-Redundant Binary Patterns, Duc Thanh Nguyen, Wanqing Li, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

Motivated by the advantages of using shape matching technique in detecting objects in various postures and viewpoints and the discriminative power of local patterns in object recognition, this paper proposes a human detection method combining both shape and appearance cues. In particular, local shapes of the body parts are detected using template matching. Based on body parts' shapes, local appearance features are extracted. We introduce a novel local binary pattern (LBP) descriptor, called Non-Redundant LBP (NRLBP), to encode local appearance of human. The proposed method was evaluated and compared with other state-of-the-art human detection methods on two commonly used datasets: …


Texture Analysis Using Gabor Wavelets, Golshah Naghdy, Jianli Wang, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

Receptive field profiles of simple cells in the visual cortex have been shown to resemble even-symmetric or odd-symmetric Gabor filters. Computational models employed in the analysis of textures have been motivated by two-dimensional Gabor functions arranged in a multi-channel architecture. More recently, wavelets have emerged as a powerful tool for non-stationary signal analysis capable of encoding scale-space information efficiently. A multi-resolution implementation in the form of a dyadic decomposition of the signal of interest has been popularized by many researchers. In this paper, a Gabor wavelet configured in a 'rosette' fashion is used as a multi-channel filter-bank feature extractor for …


Image Content Annotation Based On Visual Features, Lei Ye, Philip Ogunbona, J. Wang Sep 2012

Professor Philip Ogunbona

Automatic image content annotation techniques attempt to explore structural visual features of images that describe image content and associate them with image semantics. In this paper, two types of concept spaces, atomic concept and collective concept spaces, are defined and the annotation problems in those spaces are formulated as feature classification and Bayesian inference, respectively. A scheme of image content annotation in this framework is presented and evaluated as an application of photo categorization using MPEG-7 VCE2 dataset and its ground truth. The experimental results show a promising performance.


Edge Image Description Using Fractal Interpolation, P Motallebi, P O. Ogunbona Sep 2012

Professor Philip Ogunbona

Edge images derived from compressed image databases are described using fractal techniques. The proposed method is able to give affine transformation-invariant description suitable for use in a query-by-example database application. Comparison among the proposed method, polynomial interpolation and spline interpolation is given. It is concluded that fractal interpolation can give a compact description of image contours and is able to cope with random perturbation of the coordinates of the contour points by as much as 25 percent.


Detecting Humans Under Occlusion Using Variational Mean Field Method, Duc Thanh Nguyen, Philip Ogunbona, Wanqing Li Sep 2012

Professor Philip Ogunbona

This paper proposes a human detection method using variational mean field approximation for occlusion reasoning. In the method, parts of human objects are detected individually using template matching. Initial detection hypotheses with spatial layout information are represented in a graphical model and refined through a Bayesian estimation. In this paper, mean field method is employed for such an estimation. The proposed method was evaluated on the popular CAVIAR-INRIA dataset. Experimental results show that the proposed algorithm is able to detect humans in severe occlusion within reasonable processing time.


A Novel Template Matching Method For Human Detection, Duc Thanh Nguyen, Wanqing Li, Philip Ogunbona Sep 2012

Professor Philip Ogunbona

This paper proposes a novel weighted template matching method. It employs a generalized distance transform (GDT) and an orientation map (OM). The GDT allows us to weight the distance transform more on the strong edge points and the OM provides supplementary local orientation information for matching. Based on the matching method, a two-stage human detection method consisting of template matching and Bayesian verification is developed. Experimental results have shown that the proposed method can effectively reduce the false positive and false negative detection rates and perform better than the conventional Chamfer matching method.