Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 17 of 17

Full-Text Articles in Physical Sciences and Mathematics

Predicting Software Fault Proneness Using Machine Learning, Sanjay Ghanathey Dec 2018

Predicting Software Fault Proneness Using Machine Learning, Sanjay Ghanathey

Electronic Thesis and Dissertation Repository

Context: Continuous Integration (CI) is a DevOps technique which is widely used in practice. Studies show that its adoption rates will increase even further. At the same time, it is argued that maintaining product quality requires extensive and time consuming, testing and code reviews. In this context, if not done properly, shorter sprint cycles and agile practices entail higher risk for the quality of the product. It has been reported in literature [68], that lack of proper test strategies, poor test quality and team dependencies are some of the major challenges encountered in continuous integration and deployment.

Objective: The objective …


Partitioning And Offloading For Iot And Video Streaming Applications That Utilize Computing Resources At The Network Edge, Navid Bayat Dec 2018

Partitioning And Offloading For Iot And Video Streaming Applications That Utilize Computing Resources At The Network Edge, Navid Bayat

Electronic Thesis and Dissertation Repository

The Internet of Things (IoT) is a concept in which physical objects embedded with sensors, actuators, and network connectivity can communicate and react to their surroundings. IoT applications connect physical objects for the purpose of decision making by sensing and analysing generated data from the embedded sensors in physical objects. IoT applications are growing rapidly as sensors become less expensive. Sensors generate large amounts of data that may meaningless unless the data is used to derive knowledge with in a certain period of time. Stream processing paradigm is used by IoT to provide requirements of IoT applications. In a stream …


Secured Data Masking Framework And Technique For Preserving Privacy In A Business Intelligence Analytics Platform, Osama Ali Dec 2018

Secured Data Masking Framework And Technique For Preserving Privacy In A Business Intelligence Analytics Platform, Osama Ali

Electronic Thesis and Dissertation Repository

The main concept behind business intelligence (BI) is how to use integrated data across different business systems within an enterprise to make strategic decisions. It is difficult to map internal and external BI’s users to subsets of the enterprise’s data warehouse (DW), resulting that protecting the privacy of this data while maintaining its utility is a challenging task. Today, such DW systems constitute one of the most serious privacy breach threats that an enterprise might face when many internal users of different security levels have access to BI components. This thesis proposes a data masking framework (iMaskU: Identify, Map, Apply, …


Complexity Results For Fourier-Motzkin Elimination, Delaram Talaashrafi Dec 2018

Complexity Results For Fourier-Motzkin Elimination, Delaram Talaashrafi

Electronic Thesis and Dissertation Repository

In this thesis, we propose a new method for removing all the redundant inequalities generated by Fourier-Motzkin elimination. This method is based on Kohler’s work and an improved version of Balas’ work. Moreover, this method only uses arithmetic operations on matrices. Algebraic complexity estimates and experimental results show that our method outperforms alternative approaches based on linear programming.


High Performance Sparse Multivariate Polynomials: Fundamental Data Structures And Algorithms, Alex Brandt Aug 2018

High Performance Sparse Multivariate Polynomials: Fundamental Data Structures And Algorithms, Alex Brandt

Electronic Thesis and Dissertation Repository

Polynomials may be represented sparsely in an effort to conserve memory usage and provide a succinct and natural representation. Moreover, polynomials which are themselves sparse – have very few non-zero terms – will have wasted memory and computation time if represented, and operated on, densely. This waste is exacerbated as the number of variables increases. We provide practical implementations of sparse multivariate data structures focused on data locality and cache complexity. We look to develop high-performance algorithms and implementations of fundamental polynomial operations, using these sparse data structures, such as arithmetic (addition, subtraction, multiplication, and division) and interpolation. We revisit …


From Large-Scale Molecular Clouds To Filaments And Cores : Unveiling The Role Of The Magnetic Fields In Star Formation, Sayantan Auddy Jul 2018

From Large-Scale Molecular Clouds To Filaments And Cores : Unveiling The Role Of The Magnetic Fields In Star Formation, Sayantan Auddy

Electronic Thesis and Dissertation Repository

I present a comprehensive study of the role of strong magnetic fields in characterizing the structure of molecular clouds. We run three-dimensional turbulent non-ideal magnetohydrodynamic simulations (with ambipolar diffusion) to see the effect of magnetic fields on the evolution of the column density probability distribution function (PDF). Our results indicate a systematic dependence of the column density PDF of molecular clouds on magnetic field strength and turbulence, with observationally distinguishable outcomes between supercritical (gravity dominated) and subcritical (magnetic field dominated) initial conditions. We find that most cases develop a direct power-law PDF, and only the subcritical clouds with turbulence are …


Finding Nonlinear Relationships In Functional Magnetic Resonance Imaging Data With Genetic Programming, James Hughes Jul 2018

Finding Nonlinear Relationships In Functional Magnetic Resonance Imaging Data With Genetic Programming, James Hughes

Electronic Thesis and Dissertation Repository

The human brain is a complex, nonlinear dynamic chaotic system that is poorly understood. When faced with these difficult to understand systems, it is common to observe the system and develop models such that the underlying system might be deciphered. When observing neurological activity within the brain with functional magnetic resonance imaging (fMRI), it is common to develop linear models of functional connectivity; however, these models are incapable of describing the nonlinearities we know to exist within the system.

A genetic programming (GP) system was developed to perform symbolic regression on recorded fMRI data. Symbolic regression makes fewer assumptions than …


Baseline Assisted Classification Of Heart Rate Variability, Elham Harirpoush Jun 2018

Baseline Assisted Classification Of Heart Rate Variability, Elham Harirpoush

Electronic Thesis and Dissertation Repository

Recently, among various analysis methods of physiological signals, automatic analysis of Electrocardiogram (ECG) signals, especially heart rate variability (HRV) has received significant attention in the field of machine learning. Heart rate variability is an important indicator of health prediction and it is applicable to various fields of scientific research. Heart rate variability is based on measuring the differences in time between consecutive heartbeats (also known as RR interval), and the most common measuring techniques are divided into the time domain and frequency domain. In this research study, a classifier based on analysis of HRV signal is developed to classify different …


Word Blending And Other Formal Models Of Bio-Operations, Zihao Wang May 2018

Word Blending And Other Formal Models Of Bio-Operations, Zihao Wang

Electronic Thesis and Dissertation Repository

As part of ongoing efforts to view biological processes as computations, several formal models of DNA-based processes have been proposed and studied in the formal language literature. In this thesis, we survey some classical formal language word and language operations, as well as several bio-operations, and we propose a new operation inspired by a DNA recombination lab protocol known as Cross-pairing Polymerase Chain Reaction, or XPCR. More precisely, we define and study a word operation called word blending which models a special case of XPCR, where two words x w p and q w y sharing a non-empty overlap part …


Analysis Challenges For High Dimensional Data, Bangxin Zhao Apr 2018

Analysis Challenges For High Dimensional Data, Bangxin Zhao

Electronic Thesis and Dissertation Repository

In this thesis, we propose new methodologies targeting the areas of high-dimensional variable screening, influence measure and post-selection inference. We propose a new estimator for the correlation between the response and high-dimensional predictor variables, and based on the estimator we develop a new screening technique termed Dynamic Tilted Current Correlation Screening (DTCCS) for high dimensional variables screening. DTCCS is capable of picking up the relevant predictor variables within a finite number of steps. The DTCCS method takes the popular used sure independent screening (SIS) method and the high-dimensional ordinary least squares projection (HOLP) approach as its special cases.

Two methods …


Putting Fürer's Algorithm Into Practice With The Bpas Library, Linxiao Wang Apr 2018

Putting Fürer's Algorithm Into Practice With The Bpas Library, Linxiao Wang

Electronic Thesis and Dissertation Repository

Fast algorithms for integer and polynomial multiplication play an important role in scientific computing as well as other disciplines. In 1971, Schönhage and Strassen designed an algorithm that improved the multiplication time for two integers of at most n bits to O(log n log log n). In 2007, Martin Fürer presented a new algorithm that runs in O (n log n · 2 ^O(log* n)) , where log*n is the iterated logarithm of n. We explain how we can put Fürer’s ideas into practice for multiplying polynomials over a prime field Z/pZ, which characteristic is a Generalized Fermat prime of …


A Framework For Modelling User Activity Preferences, Roberto Barboza Junior Apr 2018

A Framework For Modelling User Activity Preferences, Roberto Barboza Junior

Electronic Thesis and Dissertation Repository

The availability of location data increases every day and brings the opportunity to mine these data and extract valuable knowledge about human behaviour. More specifically, these data may contain information about users’ activities, which can enable, for example, services to improve advertising campaigns or enhance the user experience of a mobile application. However, several techniques ignore the fact that users’ context other than location and time, such as weather conditions, influences their behaviour. Moreover, several studies focus only on a single data source, addressing either data collected without any type of user interaction, such as GPS data, or data spontaneously …


Computational Modelling Of Human Transcriptional Regulation By An Information Theory-Based Approach, Ruipeng Lu Apr 2018

Computational Modelling Of Human Transcriptional Regulation By An Information Theory-Based Approach, Ruipeng Lu

Electronic Thesis and Dissertation Repository

ChIP-seq experiments can identify the genome-wide binding site motifs of a transcription factor (TF) and determine its sequence specificity. Multiple algorithms were developed to derive TF binding site (TFBS) motifs from ChIP-seq data, including the entropy minimization-based Bipad that can derive both contiguous and bipartite motifs. Prior studies applying these algorithms to ChIP-seq data only analyzed a small number of top peaks with the highest signal strengths, biasing their resultant position weight matrices (PWMs) towards consensus-like, strong binding sites; nor did they derive bipartite motifs, disabling the accurate modelling of binding behavior of dimeric TFs.

This thesis presents a novel …


Pelee: A Real-Time Object Detection System On Mobile Devices, Jun Wang Apr 2018

Pelee: A Real-Time Object Detection System On Mobile Devices, Jun Wang

Electronic Thesis and Dissertation Repository

There has been a rising interest in running high-quality Convolutional Neural Network (CNN) models under strict constraints on memory and computational budget. A number of efficient architectures have been proposed in recent years, for example, MobileNet, ShuffleNet, and NASNet-A. However, all these architectures are heavily dependent on depthwise separable convolution which lacks efficient implementation in most deep learning frameworks. Meanwhile, there are few studies that combine efficient models with fast object detection algorithms. This research tries to explore the design of an efficient CNN architecture for both image classification tasks and object detection tasks. We propose an efficient architecture named …


Some Applications Of Higher-Order Hidden Markov Models In The Exotic Commodity Markets, Heng Xiong Feb 2018

Some Applications Of Higher-Order Hidden Markov Models In The Exotic Commodity Markets, Heng Xiong

Electronic Thesis and Dissertation Repository

The liberalisation of regional and global commodity markets over the last several decades resulted in certain commodity price behaviours that require new modelling and estimation approaches. Such new approaches have important implications to the valuation and utilisation of commodity derivatives. Derivatives are becoming increasingly crucial for market participants in hedging their exposure to volatile price swings and in managing risks associated with derivative trading. The modelling of commodity-based variables is an integral part of risk management and optimal-investment strategies for commodity-linked portfolios. The characteristics of commodity price evolution cannot be captured sufficiently by one-state driven models even with the inclusion …


Sol: Segmentation With Overlapping Labels, Karin Ng Jan 2018

Sol: Segmentation With Overlapping Labels, Karin Ng

Electronic Thesis and Dissertation Repository

Image segmentation is a fundamental problem in Computer Vision which involves segmenting an image into two or more segments. These segments usually correspond to objects of interest in the image, i.e. liver, kidney’s etc. The classic approach to this problem segments the image into mutually exclusive segments. However, this approach is not well-suited when segmenting overlapping objects, e.g. cells, or when segmenting a single object into multiple parts that are not necessarily mutually exclusive. Moreover, we show that optimization methods for multi-part object segmentation with different priors/constraints may better avoid local minima in case of a relaxation allowing parts to …


Feature Based Calibration Of A Network Of Kinect Sensors, Xiaoyang Li Jan 2018

Feature Based Calibration Of A Network Of Kinect Sensors, Xiaoyang Li

Electronic Thesis and Dissertation Repository

The availability of affordable depth sensors in conjunction with common RGB cameras, such as the Microsoft Kinect, can provide robots with a complete and instantaneous representation of the current surrounding environment. However, in the problem of calibrating multiple camera systems, traditional methods bear some drawbacks, such as requiring human intervention. In this thesis, we propose an automatic and reliable calibration framework that can easily estimate the extrinsic parameters of a Kinect sensor network. Our framework includes feature extraction, Random Sample Consensus and camera pose estimation from high accuracy correspondences. We also implement a robustness analysis of position estimation algorithms. The …