Open Access. Powered by Scholars. Published by Universities.®

Computer Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 14 of 14

Full-Text Articles in Computer Engineering

Design And Implementation Of A Hybrid Solver On Cpu And Gpu Multi-Target Machines, Lin Ma, Xuesong Zhang, Xinlin Lei, Tie Bao Apr 2022

Design And Implementation Of A Hybrid Solver On Cpu And Gpu Multi-Target Machines, Lin Ma, Xuesong Zhang, Xinlin Lei, Tie Bao

Journal of System Simulation

Abstract: The traditional parallel solving methods for the ordinary differential equations mainly include the task-oriented parallelism and the method-oriented parallelism. However, these two solving algorithms have serious shortcomings, which can only use CPU resource or just design for the homogeneous form of ODE(ordinary differential equations) clusters. By using RIDC(revisionist integral deferred correction) algorithm, a hybrid solver based on CPU and GPU multi-target machine is designed, which solves the differential equation system based on the pipeline form. Meanwhile, the parallel calculation within a single equation group and between the different equation groups is realized, which can give full play to the …


Gpu-Accelerated Simulation For Class Of Multi-Agent Based Models, Zhao Yuan, Jiachang Cheng, Wang Lu, Yueming Hu Sep 2020

Gpu-Accelerated Simulation For Class Of Multi-Agent Based Models, Zhao Yuan, Jiachang Cheng, Wang Lu, Yueming Hu

Journal of System Simulation

Abstract: A parallel agent-based model of Von Thünen Model was proposed driven by graphics processing units (GPUs). The Von Thünen Model often involved the simulation of large numbers of geographically located individual decision-makers and a massive number of individual-level interactions. This simulation required substantial computational power. GPU-enabled computing resources provided a massively parallel processing platform based on a fine-grained shared memory paradigm. This massively parallel processing platform held considerable promise for meeting the computing requirement of agent-based models of spatial problems. A dynamic relationship table rebuilding method was proposed to enable the use of GPUs for parallel agent-based modeling of …


Towards Practical Homomorphic Encryption And Efficient Implementation, Gyana R. Sahu Aug 2020

Towards Practical Homomorphic Encryption And Efficient Implementation, Gyana R. Sahu

Dissertations

Cloud computing has gained significant traction over the past few years and its application continues to soar as evident from its rapid adoption in various industries. One of the major challenges involved in cloud computing services is the security of sensitive information as cloud servers have been often found to be vulnerable to snooping by malicious adversaries. Such data privacy concerns can be addressed to a greater extent by enforcing cryptographic measures. Fully homomorphic encryption (FHE), a special form of public key encryption has emerged as a primary tool in deploying such cryptographic security assurances without sacrificing many of the …


Real-Time Rendering Of Large-Scale River In Inland River Ship Simulator, Xiaoming Zhai, Yin Yong, Helong Shen Aug 2020

Real-Time Rendering Of Large-Scale River In Inland River Ship Simulator, Xiaoming Zhai, Yin Yong, Helong Shen

Journal of System Simulation

Abstract: To improve the reality of river visual in inland river ship simulator, a method of rendering large scale river which could be applied for inland river ship simulator was proposed. The river flowing field was built though the conception of stream function, and the algorithm of Poisson disk sampling was used to control the distribution of river flowing field. This could balance the relationship between efficiency and effect of river surface. The parallelism and programmability of GPU was adopt to render river surface. The results show the river flowing field generated by algorithm of stream function can reflect …


Flow Visualization Based On Enhanced Streamline Line Integral Convolution, Han Min, Haichao Zhang, Maosong Bian, Danchen Zheng Aug 2020

Flow Visualization Based On Enhanced Streamline Line Integral Convolution, Han Min, Haichao Zhang, Maosong Bian, Danchen Zheng

Journal of System Simulation

Abstract: In order to show the flow information clearly and reduce the calculation of linear integral convolution, an enhanced streamline linear integral convolution algorithm was proposed. The generation of streamlines were improved. Critical points were detected and the points’ area with a gradient fill was generated. Combined two integration methods in different areas, the integration step was updated adaptively. Utilizing the parallelism of GPU and GLSL (OpenGL shading language), the algorithm further improved the sharpness of the output image. Experiments show that the improved linear integral convolution algorithm posses both the speedability and the precision. And the flow visualization of …


Survey On Parallel Collision Detection Algorithms, Fuchang Liu, Shuangjian Wang, Zhigeng Pan, Jinrong Wang Jun 2020

Survey On Parallel Collision Detection Algorithms, Fuchang Liu, Shuangjian Wang, Zhigeng Pan, Jinrong Wang

Journal of System Simulation

Abstract: The demand for real-time collision detection is increasing in different applications. Exploiting the parallel computing capability of multi-core CPUs and GPUs to accelerate the speed of collision detection algorithms has attracted abroad attention. This paper reviews the development history of collision detection algorithms and classified the existing algorithms from multiple perspectives. Moreover, we analyze the strengths and weaknesses of more than ten representative parallel collision detection algorithms based on multi-core CPUs and GPUs from the aspects of the scalability, memory consumption and workload balancing. Finally, the problem of present parallel collision detection research and potential direction of following research …


Real-Time Simulation Of Realistic Melting In Fluid-Solid Interaction, Xuqiang Shao, Song Yu Jun 2020

Real-Time Simulation Of Realistic Melting In Fluid-Solid Interaction, Xuqiang Shao, Song Yu

Journal of System Simulation

Abstract: In order to realize real-time simulation of realistic melting in fluid-solid coupling, a particle-based GPU parallel simulation method was proposed. This method employed an implicit heat diffusion model which took the heat conduction properties of materials to update the temperatures of each particle. Based on the lattice shape matching method, a stable topology change method was proposed to simulate the conversion of solid particles to the fluid particles. In order to improve the time performance of each time step, a parallel simulation method entirely implemented on GPU was put forward, and an iterative filling algorithm was designed to dynamically …


Algorithms And Framework For Computing 2-Body Statistics On Graphics Processing Units, Napath Pitaksirianan Feb 2020

Algorithms And Framework For Computing 2-Body Statistics On Graphics Processing Units, Napath Pitaksirianan

USF Tampa Graduate Theses and Dissertations

Various types of two-body statistics (2-BS) are regarded as essential components of low-level data analysis in scientific database systems. In relational algebraic terms, a 2-BS is essentially a Cartesian product between two datasets (or two instances of the same dataset) followed by a user-defined aggregate. The quadratic complexity of these computations hinders the timely processing of data. Thus using modern parallel hardware has become an obvious solution to meet such challenges. This dissertation presents our recent work in designing and optimizing parallel algorithms for 2-BS computation on Graphics Processing Units (GPUs). The unique architecture, however, provides abundant opportunities for optimizing …


Analysis Of 3d Cone-Beam Ct Image Reconstruction Performance On A Fpga, Devin Held Dec 2016

Analysis Of 3d Cone-Beam Ct Image Reconstruction Performance On A Fpga, Devin Held

Electronic Thesis and Dissertation Repository

Efficient and accurate tomographic image reconstruction has been an intensive topic of research due to the increasing everyday usage in areas such as radiology, biology, and materials science. Computed tomography (CT) scans are used to analyze internal structures through capture of x-ray images. Cone-beam CT scans project a cone-shaped x-ray to capture 2D image data from a single focal point, rotating around the object. CT scans are prone to multiple artifacts, including motion blur, streaks, and pixel irregularities, therefore must be run through image reconstruction software to reduce visual artifacts. The most common algorithm used is the Feldkamp, Davis, and …


Novel Software Defined Radio Architecture With Graphics Processor Acceleration, Lalith Narasimhan Dec 2015

Novel Software Defined Radio Architecture With Graphics Processor Acceleration, Lalith Narasimhan

Dissertations

Wireless has become one of the most pervasive core technologies in the modern world. Demand for faster data rates, improved spectrum efficiency, higher system access capacity, seamless protocol integration, improved security and robustness under varying channel environments has led to the resurgence of programmable software defined radio (SDR) as an alternative to traditional ASIC based radios. Future SDR implementations will need support for multiple standards on platforms with multi-Gb/s connectivity, parallel processing and spectrum sensing capabilities. This dissertation implemented key technologies of importance in addressing these issues namely development of cost effective multi-mode reconfigurable SDR and providing a framework to …


Gpu/Cpu Performance Of Image Processing Tasks For Use In The Cam 2 System, Jonathan Cottom, Yung-Hsiang Lu, Young-Sol Koh Aug 2015

Gpu/Cpu Performance Of Image Processing Tasks For Use In The Cam 2 System, Jonathan Cottom, Yung-Hsiang Lu, Young-Sol Koh

The Summer Undergraduate Research Fellowship (SURF) Symposium

Over the past several years, graphics processing units (GPU) have increasingly been viewed as the future of image processing engines. Currently, the Continuous Analysis of Many CAMeras (CAM2) project performs its processing on CPUs, which will potentially be more costly as the system scales to service more users. This study seeks to analyze the performance gains of GPU processing and evaluate the advantage of supporting GPU-accelerated analysis for CAM2 users. The platform for comparing the CPU and GPU performance has been the NVIDIA Jetson TK1. The target hardware implementation is an Amazon cloud instance, where final cost …


Efficient, Scalable, Parallel, Matrix-Matrix Multiplication, Enrique Portillo Jan 2013

Efficient, Scalable, Parallel, Matrix-Matrix Multiplication, Enrique Portillo

Open Access Theses & Dissertations

For the past decade, power/energy consumption has become a limiting factor for large-scale and embedded High Performance Computing (HPC) systems. This is especially true for systems that include accelerators, e.g., high-end computing devices, such as Graphics Processing Units (GPUs), with terascale computing capabilities and high power draws that greatly surpass that of multi-core CPUs. Accordingly, improving the node-level power/energy efficiency of an application can have a direct and positive impact on both classes of HPC systems.

The research reported in this thesis explores the use of software techniques to enhance the execution-time and power-consumption performance of applications executed on a …


Exploring Computational Chemistry On Emerging Architectures, David Dewayne Jenkins Dec 2012

Exploring Computational Chemistry On Emerging Architectures, David Dewayne Jenkins

Doctoral Dissertations

Emerging architectures, such as next generation microprocessors, graphics processing units, and Intel MIC cards, are being used with increased popularity in high performance computing. Each of these architectures has advantages over previous generations of architectures including performance, programmability, and power efficiency. With the ever-increasing performance of these architectures, scientific computing applications are able to attack larger, more complicated problems. However, since applications perform differently on each of the architectures, it is difficult to determine the best tool for the job. This dissertation makes the following contributions to computer engineering and computational science. First, this work implements the computational chemistry variational …


Parallel For Loops On Heterogeneous Resources, Frederick Edward Weber Dec 2012

Parallel For Loops On Heterogeneous Resources, Frederick Edward Weber

Doctoral Dissertations

In recent years, Graphics Processing Units (GPUs) have piqued the interest of researchers in scientific computing. Their immense floating point throughput and massive parallelism make them ideal for not just graphical applications, but many general algorithms as well. Load balancing applications and taking advantage of all computational resources in a machine is a difficult challenge, especially when the resources are heterogeneous. This dissertation presents the clUtil library, which vastly simplifies developing OpenCL applications for heterogeneous systems. The core focus of this dissertation lies in clUtil's ParallelFor construct and our novel PINA scheduler which can efficiently load balance work onto multiple …