Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 30

Full-Text Articles in Physical Sciences and Mathematics

On Modeling Contention For Shared Caches In Multi-Core Processors With Techniques From Ecology, Amy Apon, Wesley Emeneker Aug 2014

On Modeling Contention For Shared Caches In Multi-Core Processors With Techniques From Ecology, Amy Apon, Wesley Emeneker

Amy W. Apon

Multi-core x86_64 processors introduced an important change in architecture, a shared last level cache. Historically, each processor has had access to a large private cache that seamlessly and transparently (to end users) interfaced with main memory. Previously, processes or threads only had to compete for memory bandwidth, but now they are competing for actual space. Competition for space and environmental resources is a problem studied in other scientific domains. This paper introduces methods from ecology to model multi-core cache usage with the competitive Lotka–Volterra equations. A model is presented and validated for characterizing the interaction of cores through shared caching, …


A Component-Based End-To-End Simulation Of The Linux File System, Amy Apon, Hai Nguyen Aug 2014

A Component-Based End-To-End Simulation Of The Linux File System, Amy Apon, Hai Nguyen

Amy W. Apon

The Linux file system is designed with components utilizing a layered architecture. The upper components hide details of the lower components, and each layer presents unified and simple interfaces to the layers above and below. This design helps Linux to be flexible as well as to provide support for multiple types of storage devices. In this paper, this component architecture is used to develop a realistic simulation without having to model lower level details of the hardware layer or particular storage devices. A detailed simulation-based performance model of the Linux ext3 file system has been developed using Colored Petri Nets. …


Capacity Planning Of A Commodity Cluster In An Academic Environment: A Case Study, Linh B. Ngo, Amy W. Apon, Baochuan Lu, Hung Bui, Nathan Hamm, Larry Dowdy, Doug Hoffman, Denny Brewer Jul 2014

Capacity Planning Of A Commodity Cluster In An Academic Environment: A Case Study, Linh B. Ngo, Amy W. Apon, Baochuan Lu, Hung Bui, Nathan Hamm, Larry Dowdy, Doug Hoffman, Denny Brewer

Amy W. Apon

In this paper, the design of a simulation model for evaluating two alternative supercomputer configurations in an academic environment is presented. The workload is analyzed and modeled, and its #11;effect on the relative performance of both systems is studied. The Integrated Capacity Planning Environment (ICPE) toolkit, developed for commodity cluster capacity planning, is successfully applied to the target environment. The ICPE is a tool for workload modeling, simulation modeling, and what-if analysis. A new characterization strategy is applied to the workload to more accurately model commodity cluster work- loads. Through "what-if" analysis, the sensitivity of the baseline system performance to …


A Case Study On Grid Performance Modeling, Amy Apon, Baochuan Lu, Larry Dowdy, Frank Robinson, Doug Hoffman, Denny Brewer Jul 2014

A Case Study On Grid Performance Modeling, Amy Apon, Baochuan Lu, Larry Dowdy, Frank Robinson, Doug Hoffman, Denny Brewer

Amy W. Apon

The purpose of this case study is to develop a performance model for an enterprise grid for performance management and capacity planning1. The target environment includes grid applications such as health-care and financial services where the data is located primarily within the resources of a worldwide corporation. The approach is to build a discrete event simulation model for a representative work-flow grid. Five work-flow classes, found using a customized k-means clustering algorithm characterize the workload of the grid. Analyzing the gap between the simulation and measurement data validates the model. The case study demonstrates that the simulation model can be …


An Infrastructure To Support Data Integration And Curation For Higher Educational Research, Linh B. Ngo, Amy W. Apon, Pengfei Xuan, Kimberley Ferguson, Christin Marshall, John Mccann, Yueli Zheng Jul 2014

An Infrastructure To Support Data Integration And Curation For Higher Educational Research, Linh B. Ngo, Amy W. Apon, Pengfei Xuan, Kimberley Ferguson, Christin Marshall, John Mccann, Yueli Zheng

Amy W. Apon

The recent challenges for higher education call for research that can offer a comprehensive understanding about the performance and efficiency of higher education institutions in their three primary missions: research, education, and service. In other for this to happen, it is necessary for researchers to have access to a multitude of data sources.However, due to the nature of their academic training, many higher education practitioners do not have access to expertise in working with different data sources. In this work, we describe a design and implementation for an infrastructure that will bring together the tools and the data to provide …


A Performance And Productivity Study Using Mpi, Titanium, And Fortress, Amy Apon, Chris Bryan, Wesley Emeneker Jul 2014

A Performance And Productivity Study Using Mpi, Titanium, And Fortress, Amy Apon, Chris Bryan, Wesley Emeneker

Amy W. Apon

The popularity of cluster computing has increased focus on usability, especially in the area of programmability. Languages and libraries that require explicit message passing have been the standard. New languages, designed for cluster computing, are coming to the forefront as a way to simplify parallel programming. Titanium and Fortress are examples of this new class of programming paradigms. This papers presents results from a productivity study of these two newcomers with MPI, the de- facto standard for parallel programming.


Accelerating Image Feature Comparisons Using Cuda On Commodity Hardware, Amy Apon, Seth Warn, Wesley Emeneker, John Gauch, Jackson Cothren Jul 2014

Accelerating Image Feature Comparisons Using Cuda On Commodity Hardware, Amy Apon, Seth Warn, Wesley Emeneker, John Gauch, Jackson Cothren

Amy W. Apon

Given multiple images of the same scene, image registration is the process of determining the correct transformation to bring the images into a common coordinate system—i.e., how the images fit together. Feature based registration applies a transformation function to the input images before performing the correlation step. The result of that transformation, also called feature extraction, is a list of significant points in the images, and the registration process will attempt to correlate these points, rather than directly comparing the input images.


High Performance Computing Instrumentation And Research Productivity In U.S. Universities, Amy W. Apon, Linh B. Ngo, Stanley Ahalt, Vijay Dantuluri, Constantin Gurdgiev, Moez Limayem, Michael Stealey Jul 2014

High Performance Computing Instrumentation And Research Productivity In U.S. Universities, Amy W. Apon, Linh B. Ngo, Stanley Ahalt, Vijay Dantuluri, Constantin Gurdgiev, Moez Limayem, Michael Stealey

Amy W. Apon

This paper studies the relationship between investments in High-Performance Computing (HPC) instrumentation and research competitiveness. Measures of institutional HPC investment are computed from data that is readily available from the Top 500 list, a list that has been published twice a year since 1993 that lists the fastest 500 computers in the world at that time. Institutions that are studied include US doctoral-granting institutions that fall into the very high or high research rankings according to the Carnegie Foundation classifications and additional institutions that have had entries in the Top 500 list. Research competitiveness is derived from federal funding data, …


Accelerating Sift On Parallel Architectures, Amy Apon, Seth Warn, Wesley Emeneker, Jackson Cothren Jul 2014

Accelerating Sift On Parallel Architectures, Amy Apon, Seth Warn, Wesley Emeneker, Jackson Cothren

Amy W. Apon

SIFT is a widely-used algorithm that extracts features from images; using it to extract information from hundreds of terabytes of aerial and satellite photographs requires parallelization in order to be feasible. We explore accelerating an existing serial SIFT implementation with OpenMP parallelization and GPU execution.


Massive Data Processing On The Acxiom Cluster Testbed, Amy Apon, Pawel Wolinski, Dennis Reed, Greg Amerson, Prathima Gorjala Jul 2014

Massive Data Processing On The Acxiom Cluster Testbed, Amy Apon, Pawel Wolinski, Dennis Reed, Greg Amerson, Prathima Gorjala

Amy W. Apon

No abstract provided.


Cluster Computing In The Classroom: Topics, Guidelines, And Experiences, Amy Apon, Rajkumar Buyya, Hai Jin, Jens Mache Jul 2014

Cluster Computing In The Classroom: Topics, Guidelines, And Experiences, Amy Apon, Rajkumar Buyya, Hai Jin, Jens Mache

Amy W. Apon

With the progress of research on cluster computing, more and more universities have begun to offer various courses covering cluster computing. A wide variety of content can be taught in these courses. Because of this, a difficulty that arises is the selection of appropriate course material. The selection is complicated by the fact that some content in cluster computing is also covered by other courses such as operating systems, networking, or computer architecture. In addition, the background of students enrolled in cluster computing courses varies. These aspects of cluster computing make the development of good course material difficult. Combining our …


Inital Starting Point Analysis For K-Means Clustering: A Case Study, Amy Apon, Frank Robinson, Denny Brewer, Larry Dowdy, Doug Hoffman, Baochuan Lu Jul 2014

Inital Starting Point Analysis For K-Means Clustering: A Case Study, Amy Apon, Frank Robinson, Denny Brewer, Larry Dowdy, Doug Hoffman, Baochuan Lu

Amy W. Apon

Workload characterization is an important part of systems performance modeling. Clustering is a method used to find classes of jobs within workloads. K-Means is one of the most popular clustering algorithms. Initial starting point values are needed as input parameters when performing k-means clustering. This paper shows that the results of the running the k-means algorithm on the same workload will vary depending on the values chosen as initial starting points. Fourteen methods of composing initial starting point values are compared in a case study. The results indicate that a synthetic method, scrambled midpoints, is an effective starting point method …


The Great Plains Network (Gpn) Middleware Test Bed, Amy Apon, Gregory Monaco, Gordon Springer Jul 2014

The Great Plains Network (Gpn) Middleware Test Bed, Amy Apon, Gregory Monaco, Gordon Springer

Amy W. Apon

GPN (Great Plains Network) is a consortium of public universities in seven mid-western states. GPN goals include regional strategic planning and the development of a collaboration environment, middleware services and a regional grid for sharing computational, storage and data resources. A major challenge is to arrive at a common authentication and authorization service, based on the set of heterogeneous identity providers at each institution. GPN has built a prototype middleware test bed that includes Shibboleth and other NMI-EDIT middleware components. The test bed includes several prototype end-user applications, and is being used to further our research into fine-grained access control …


Assessing The Effect Of High Performance Computing Capabilities On Academic Research Output, Amy Apon, Linh B. Ngo, Michael E. Payne, Paul W. Wilson Jul 2014

Assessing The Effect Of High Performance Computing Capabilities On Academic Research Output, Amy Apon, Linh B. Ngo, Michael E. Payne, Paul W. Wilson

Amy W. Apon

This paper uses nonparametric methods and some new results on hypothesis testing with nonparametric efficiency estimators and applies these to analyze the effect of locally-available high performance computing (HPC) resources on universities efficiency in producing research and other outputs. We find that locally-available HPC resources enhance the technical efficiency of research output in Chemistry, Civil Engineering, Physics, and History, but not in Computer Science, Economics, nor English; we find mixed results for Biology. Out research results provide a critical first step in a quantitative economic model for investments in HPC.


Evaluation Of Real-Time Fiber Communications For Parallel Collective Operations, Amy Apon, Parvathi Rajagopal Jul 2014

Evaluation Of Real-Time Fiber Communications For Parallel Collective Operations, Amy Apon, Parvathi Rajagopal

Amy W. Apon

Real-Time Fiber Communications (RTFC) is a gigabit speed network that has been designed for damage tolerant local area networks. In addition to its damage tolerant characteristics, it has several features that make it attractive as a possible interconnection technology for parallel applications in a cluster of workstations. These characteristics include support for broadcast and multicast messaging, memory cache in the network interface card, and support for very fine grain writes to the network cache. Broadcast data is captured in network cache of all workstations in the network providing a distributed shared memory capability. In this paper, RTFC is introduced. The …


Measuring The Effects Of Thread Placement On The Kendall Square Ksr1, Amy Apon, T D. Wagner, E Smirni, M Madhukar, L W. Dowdy Jul 2014

Measuring The Effects Of Thread Placement On The Kendall Square Ksr1, Amy Apon, T D. Wagner, E Smirni, M Madhukar, L W. Dowdy

Amy W. Apon

This paper describes a measurement study of the effects of thread placement on memory access times on the Kendall Square multiprocessor, the KSRl. The KSRl uses a conventional shared memory programming model in a distributed memory architecture. The architecture is based on a ring of rings of 64-bit superscalar microprocessors. The KSRl has a Cache-Only Memory Architecture (COMA). Memory consists of the local cache memoria attached to each processor. Whenever an address is accessed, the data item is automatically copied to the local cache memory module, 80 that access times for subsequent references will be minimal. If a local cache …


The Ksr1: Experimentation And Modeling Of Poststore, Amy Apon, E Rosti, E Smirni, T D. Wagner, M Madhukar, L W. Dowdy Jul 2014

The Ksr1: Experimentation And Modeling Of Poststore, Amy Apon, E Rosti, E Smirni, T D. Wagner, M Madhukar, L W. Dowdy

Amy W. Apon

Kendall Square Research introduced the KSRl system in 1991. The architecture is based on a ring of rings of 64bit microprocessors. It is a distributed, shared memory system and is scalable. The memory structure is unique and is the key to understanding the system. Different levels of caching eliminates physical memory addressing and leads to the ALLCACHE TM scheme. Since requested data may be found in any of several caches, the initial access time is variable. Once pulled into the local (sub)cache, subsequent access times are fixed and minimal. Thus, the KSRl is a Cache-Only Memory Architecture (COMA) system. This …


Mpi Collective Operations Over Ip Multicast, Amy Apon, H A. Chen, Y O. Carrasco Jul 2014

Mpi Collective Operations Over Ip Multicast, Amy Apon, H A. Chen, Y O. Carrasco

Amy W. Apon

Many common implementations of Message Passing Inter- face (MPI) implement collective operations over point-to-point operations. This work examines IP multicast as a framework for collective operations. IP multicast is not reliable. If a receiver is not ready when a message is sent via IP multicast, the message is lost. Two techniques for ensuring that a message is not lost due to a slow receiving process are examined. The techniques are implemented and compared experimentally over both a shared and a switched Fast Ethernet. The average performance of collective operations is improved as a function of the number of participating processes …


Architectural Tradeoffs For Unifying Campus Grid Resources, Amy Apon, Bart Taylor Jul 2014

Architectural Tradeoffs For Unifying Campus Grid Resources, Amy Apon, Bart Taylor

Amy W. Apon

Most universities have a powerful collection of computing resources on campus for use in areas from high performance computing to general access student labs. However, these resources are rarely used to their full potential. Grid computing offers a way to unify these resources and to better utilize the capability they provide. The complexity of some grid tools makes learning to use them a daunting task for users not familiar with using the command line. Combining these tools together into a single web portal interface provides campus faculty and students with an easy way to access the campus resources. This paper …


Developing A Coherent Cyberinfrastructure From Local Campus To National Facilities: Challenges And Strategies, Amy Apon, Patrick Dreher, Vijay Agarwala, Stan Ahalt, Guy Almes, Sue Fratkin, Thomas Hauser, Jan Odegard, Jim Pepin, Craig Stewart Jul 2014

Developing A Coherent Cyberinfrastructure From Local Campus To National Facilities: Challenges And Strategies, Amy Apon, Patrick Dreher, Vijay Agarwala, Stan Ahalt, Guy Almes, Sue Fratkin, Thomas Hauser, Jan Odegard, Jim Pepin, Craig Stewart

Amy W. Apon

A fundamental goal of cyberinfrastructure (CI) is the integration of computing hardware, software, and network technology, along with data, information management, and human resources to advance scholarship and research. Such integration creates opportunities for researchers, educators, and learners to share ideas, expertise, tools, and facilities in new and powerful ways that cannot be realized if each of these components is applied independently. Bridging the gap between the reality of CI today and its potential in the immediate future is critical to building a balanced CI ecosystem that can support future scholarship and research. This report summarizes the observations and recommendations …


Writing A Supercomputer Proposal For The National Science Foundation's Major Research Instrumentation Solicitation, Amy Apon Jul 2014

Writing A Supercomputer Proposal For The National Science Foundation's Major Research Instrumentation Solicitation, Amy Apon

Amy W. Apon

No abstract provided.


The Circulating Processor Model Of Parallel Systems, Amy Apon, Lawrence Dowdy Jul 2014

The Circulating Processor Model Of Parallel Systems, Amy Apon, Lawrence Dowdy

Amy W. Apon

This paper introduces the circulating processor model for parallel computer systems. The circulating processor model is a product form queuing network model where the processors are allowed to circulate between the parallel applications instead of the more traditional circulating task model. Certain behaviors of parallel systems are better captured using this new approach. The circulating processor model may be load dependent or load dependent. The load dependent circulating processor model is exact for systems which contain a single parallel application. An exact error is calculated for the load independent circulating processor model for systems which contain a single parallel application. …


Community Funding Models For Computational Resources, Amy Apon, Jeff Pummill, Dana Brunson Jul 2014

Community Funding Models For Computational Resources, Amy Apon, Jeff Pummill, Dana Brunson

Amy W. Apon

As scientific research has extended far beyond the practicality and abilities of laboratory experiments, computational simulations have become the mainstay of enabling and furthering the research in a way never previously thought possible. It is becoming commonplace to model and simulate both the very large, such as black hole collisions in astrophysics, and the very small, such as subatomic particle behavior and interaction in high energy physics. In addition to the previous examples detailing extremes, practically every area of research currently utilizes and benefits from computational resources to simulate their work; financial modeling, weather forecasting, geological phenomena, geo-spatial data analysis, …


Shibboleth As A Tool For Authorized Access Control To The Subversion Repository System, Linh B. Ngo, Amy W. Apon Jul 2014

Shibboleth As A Tool For Authorized Access Control To The Subversion Repository System, Linh B. Ngo, Amy W. Apon

Amy W. Apon

Shibboleth is an architecture and protocol for allowing users to authenticate and be authorized to use a remote resource by logging into the identity management system that is maintained at their home institution. With Shibboleth, a federation of institutions can share resources among users and yet allow the administration of both the user access control to resources and the user identity and attribute information to be performed at the hosting or home institution. Subversion is a version control repository system that allows the creation of fine-grained permissions to files and directories. In this project an infrastructure, Shibbolized Subversion, has been …


Classroom Exercises For Grid Services, Amy Apon, Jens Mache, Yuriko Yara, Kurt Landrus Jul 2014

Classroom Exercises For Grid Services, Amy Apon, Jens Mache, Yuriko Yara, Kurt Landrus

Amy W. Apon

No abstract provided.


Load Dependent Single Chain Models Of Multichain Closed Queueing Networks, Amy Apon, Lawrence Dowdy Jul 2014

Load Dependent Single Chain Models Of Multichain Closed Queueing Networks, Amy Apon, Lawrence Dowdy

Amy W. Apon

No abstract provided.


Teaching Hdfs/Mapreduce Systems Concepts To Undergraduates, Linh B. Ngo, Amy W. Apon, Edward B. Duffy Jul 2014

Teaching Hdfs/Mapreduce Systems Concepts To Undergraduates, Linh B. Ngo, Amy W. Apon, Edward B. Duffy

Amy W. Apon

This paper presents the development of a Hadoop MapReduce module that has been taught in a course in distributed computing to upper undergraduate computer science students at Clemson University. The paper describes our teaching experiences and the feedback from the students over several semesters that have helped to shape the course. We provide suggested best practices for lecture materials, the computing platform, and the teaching methods. In addition, the computing platform and teaching methods can be extended to accommodate emerging technologies and modules for related courses.


Ampnet - A Highly Available Cluster Interconnection Network, Amy Apon, Larry Bilbur Jul 2014

Ampnet - A Highly Available Cluster Interconnection Network, Amy Apon, Larry Bilbur

Amy W. Apon

No abstract provided.


The Multigraph Modeling Tool, Amy Apon, C A. Childers, W H. Hooper, K D. Gordon, L W. Dowdy Jul 2014

The Multigraph Modeling Tool, Amy Apon, C A. Childers, W H. Hooper, K D. Gordon, L W. Dowdy

Amy W. Apon

No abstract provided.


Jummp: Job Uninterrupted Maneuverable Mapreduce Platform, William Moody, Linh Ngo, Edward Duffy, Amy Apon Feb 2014

Jummp: Job Uninterrupted Maneuverable Mapreduce Platform, William Moody, Linh Ngo, Edward Duffy, Amy Apon

Amy W. Apon

In this paper, we present JUMMP, the Job Uninterrupted Maneuverable MapReduce Platform, an automated scheduling platform that provides a customized Hadoop environment within a batch-scheduled cluster environment. JUMMP enables an interactive pseudo-persistent MapReduce platform within the existing administrative structure of an academic high performance computing center by “jumping” between nodes with minimal administrative effort. Jumping is implemented by the synchronization of stopping and starting daemon processes on different nodes in the cluster. Our experimental evaluation shows that JUMMP can be as efficient as a persistent Hadoop cluster on dedicated computing resources, depending on the jump time. Additionally, we show that …