Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 14 of 14

Full-Text Articles in Engineering

Exploring Computational Chemistry On Emerging Architectures, David Dewayne Jenkins Dec 2012

Exploring Computational Chemistry On Emerging Architectures, David Dewayne Jenkins

Doctoral Dissertations

Emerging architectures, such as next generation microprocessors, graphics processing units, and Intel MIC cards, are being used with increased popularity in high performance computing. Each of these architectures has advantages over previous generations of architectures including performance, programmability, and power efficiency. With the ever-increasing performance of these architectures, scientific computing applications are able to attack larger, more complicated problems. However, since applications perform differently on each of the architectures, it is difficult to determine the best tool for the job. This dissertation makes the following contributions to computer engineering and computational science. First, this work implements the computational chemistry variational …


Parallel For Loops On Heterogeneous Resources, Frederick Edward Weber Dec 2012

Parallel For Loops On Heterogeneous Resources, Frederick Edward Weber

Doctoral Dissertations

In recent years, Graphics Processing Units (GPUs) have piqued the interest of researchers in scientific computing. Their immense floating point throughput and massive parallelism make them ideal for not just graphical applications, but many general algorithms as well. Load balancing applications and taking advantage of all computational resources in a machine is a difficult challenge, especially when the resources are heterogeneous. This dissertation presents the clUtil library, which vastly simplifies developing OpenCL applications for heterogeneous systems. The core focus of this dissertation lies in clUtil's ParallelFor construct and our novel PINA scheduler which can efficiently load balance work onto multiple …


Kernel-Assisted And Topology-Aware Mpi Collective Communication Among Multicore Or Many-Core Clusters, Teng Ma Dec 2012

Kernel-Assisted And Topology-Aware Mpi Collective Communication Among Multicore Or Many-Core Clusters, Teng Ma

Doctoral Dissertations

Multicore or many-core clusters have become the most prominent form of High Performance Computing (HPC) systems. Hardware complexity and hierarchies not only exist in the inter-node layer, i.e., hierarchical networks, but also exist in internals of multicore compute nodes, e.g., Non Uniform Memory Accesses (NUMA), network-style interconnect, and memory and shared cache hierarchies.

Message Passing Interface (MPI), the most widely adopted in the HPC communities, suffers from decreased performance and portability due to increased hardware complexity of multiple levels. We identified three critical issues specific to collective communication: The first problem arises from the gap between logical collective topologies and …


Extending Structural Learning Paradigms For High-Dimensional Machine Learning And Analysis, Christopher Todd Symons Dec 2012

Extending Structural Learning Paradigms For High-Dimensional Machine Learning And Analysis, Christopher Todd Symons

Doctoral Dissertations

Structure-based machine-learning techniques are frequently used in extensions of supervised learning, such as active, semi-supervised, multi-modal, and multi-task learning. A common step in many successful methods is a structure-discovery process that is made possible through the addition of new information, which can be user feedback, unlabeled data, data from similar tasks, alternate views of the problem, etc. Learning paradigms developed in the above-mentioned fields have led to some extremely flexible, scalable, and successful multivariate analysis approaches. This success and flexibility offer opportunities to expand the use of machine learning paradigms to more complex analyses. In particular, while information is often …


Dynamic Task Execution On Shared And Distributed Memory Architectures, Asim Yarkhan Dec 2012

Dynamic Task Execution On Shared And Distributed Memory Architectures, Asim Yarkhan

Doctoral Dissertations

Multicore architectures with high core counts have come to dominate the world of high performance computing, from shared memory machines to the largest distributed memory clusters. The multicore route to increased performance has a simpler design and better power efficiency than the traditional approach of increasing processor frequencies. But, standard programming techniques are not well adapted to this change in computer architecture design.

In this work, we study the use of dynamic runtime environments executing data driven applications as a solution to programming multicore architectures. The goals of our runtime environments are productivity, scalability and performance. We demonstrate productivity by …


Hard And Soft Error Resilience For One-Sided Dense Linear Algebra Algorithms, Peng Du Aug 2012

Hard And Soft Error Resilience For One-Sided Dense Linear Algebra Algorithms, Peng Du

Doctoral Dissertations

Dense matrix factorizations, such as LU, Cholesky and QR, are widely used by scientific applications that require solving systems of linear equations, eigenvalues and linear least squares problems. Such computations are normally carried out on supercomputers, whose ever-growing scale induces a fast decline of the Mean Time To Failure (MTTF). This dissertation develops fault tolerance algorithms for one-sided dense matrix factorizations, which handles Both hard and soft errors.

For hard errors, we propose methods based on diskless checkpointing and Algorithm Based Fault Tolerance (ABFT) to provide full matrix protection, including the left and right factor that are normally seen in …


Vision-Based Robot Control In The Context Of Human-Machine Interactions, Andrzej Nycz Aug 2012

Vision-Based Robot Control In The Context Of Human-Machine Interactions, Andrzej Nycz

Doctoral Dissertations

This research has explored motion control based on visual servoing – in the context of complex human-machine interactions and operations in realistic environments. Two classes of intelligent robotic systems were studied in this context: operator assistance with a high dexterity telerobotic manipulator performing remote tooling-centric tasks, and a bio-robot for X-ray imaging of lower extremity human skeletal joints during natural walking. The combination of human-machine interactions and practical application scenarios has led to the following fundamental contributions: 1) exploration and evaluation of a new concept of acquiring fluoroscope images of musculoskeletal features of interest during natural human motion, 2) creation …


Air: Adaptive Dynamic Precision Iterative Refinement, Jun Kyu Lee Aug 2012

Air: Adaptive Dynamic Precision Iterative Refinement, Jun Kyu Lee

Doctoral Dissertations

In high performance computing, applications often require very accurate solutions while minimizing runtimes and power consumption. Improving the ratio of the number of logic gates implementing floating point arithmetic operations to the total number of logic gates enables greater efficiency, potentially with higher performance and lower power consumption. Software executing on the fixed hardware in Von-Neuman architectures faces limitations on improving this ratio, since processors require extensive supporting logic to fetch and decode instructions while employing arithmetic units with statically defined precision. This dissertation explores novel approaches to improve computing architectures for linear system applications not only by designing application-specific …


Coalition Formation And Execution In Multi-Robot Tasks, Yu Zhang Aug 2012

Coalition Formation And Execution In Multi-Robot Tasks, Yu Zhang

Doctoral Dissertations

In this research, I explore several related problems in distributed robot systems that must be addressed in order to achieve multi-robot tasks, in which individual robots may not possess all the required capabilities. While most previous research work on multi-robot cooperation mainly concentrates on loosely-coupled multi-robot tasks, a more challenging problem is to also address tightly-coupled multi- robot tasks involving close robot interactions, which often require capability sharing. Three related topics towards addressing these tasks are discussed, as follows:

Forming coalitions, which determines how robots should form into subgroups (i.e., coalitions) to address individual tasks. To achieve system autonomy, the …


Reliability Models For Hpc Applications And A Cloud Economic Model, Thanadech Thanakornworakij Jul 2012

Reliability Models For Hpc Applications And A Cloud Economic Model, Thanadech Thanakornworakij

Doctoral Dissertations

With the enormous number of computing resources in HPC and Cloud systems, failures become a major concern. Therefore, failure behaviors such as reliability, failure rate, and mean time to failure need to be understood to manage such a large system efficiently.

This dissertation makes three major contributions in HPC and Cloud studies. First, a reliability model with correlated failures in a k-node system for HPC applications is studied. This model is extended to improve accuracy by accounting for failure correlation. Marshall-Olkin Multivariate Weibull distribution is improved by excess life, conditional Weibull, to better estimate system reliability. Also, the univariate …


Deep Machine Learning With Spatio-Temporal Inference, Thomas Paul Karnowski May 2012

Deep Machine Learning With Spatio-Temporal Inference, Thomas Paul Karnowski

Doctoral Dissertations

Deep Machine Learning (DML) refers to methods which utilize hierarchies of more than one or two layers of computational elements to achieve learning. DML may draw upon biomemetic models, or may be simply biologically-inspired. Regardless, these architectures seek to employ hierarchical processing as means of mimicking the ability of the human brain to process a myriad of sensory data and make meaningful decisions based on this data. In this dissertation we present a novel DML architecture which is biologically-inspired in that (1) all processing is performed hierarchically; (2) all processing units are identical; and (3) processing captures both spatial and …


Advanced Neural Networks And Their Applications In Smart Grids, Bipul Luitel Jan 2012

Advanced Neural Networks And Their Applications In Smart Grids, Bipul Luitel

Doctoral Dissertations

"Today, neural networks (NN) are used in several system identification and nonlinear control system applications. However, their implementation on large systems has been limited either because of their degraded performance or the amount of time and computation required for implementing them being impractical for real applications. The primary emphasis of this research is on the development of advanced NN - simultaneous recurrent neural networks (SRN) and cellular neural networks (CNN); and methods for learning the complexity of large systems.

US electricity infrastructure is one of the largest and the most critical infrastructure consisting of thousands of generators connected by transmission …


Real-Time Localization Using Received Signal Strength, Mohammed Rana Basheer Jan 2012

Real-Time Localization Using Received Signal Strength, Mohammed Rana Basheer

Doctoral Dissertations

"Locating and tracking assets in an indoor environment is a fundamental requirement for several applications which include for instance network enabled manufacturing. However, translating time of flight-based GPS technique for indoor solutions has proven very costly and inaccurate primarily due to the need for high resolution clocks and the non-availability of reliable line of sight condition between the transmitter and receiver. In this dissertation, localization and tracking of wireless devices using radio signal strength (RSS) measurements in an indoor environment is undertaken. This dissertation is presented in the form of five papers.

The first two papers deal with localization and …


Null Convention Logic Applications Of Asynchronous Design In Nanotechnology And Cryptographic Security, Jun Wu Jan 2012

Null Convention Logic Applications Of Asynchronous Design In Nanotechnology And Cryptographic Security, Jun Wu

Doctoral Dissertations

"This dissertation presents two Null Convention Logic (NCL) applications of asynchronous logic circuit design in nanotechnology and cryptographic security. The first application is the Asynchronous Nanowire Reconfigurable Crossbar Architecture (ANRCA); the second one is an asynchronous S-Box design for cryptographic system against Side-Channel Attacks (SCA). The following are the contributions of the first application: 1) Proposed a diode- and resistor-based ANRCA (DR-ANRCA). Three configurable logic block (CLB) structures were designed to efficiently reconfigure a given DR-PGMB as one of the 27 arbitrary NCL threshold gates. A hierarchical architecture was also proposed to implement the higher level logic that requires a …