Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 41

Full-Text Articles in Physical Sciences and Mathematics

The Transformation Of Science With Hpc, Big Data, And Ai, Jeffrey Kirk Oct 2017

The Transformation Of Science With Hpc, Big Data, And Ai, Jeffrey Kirk

Commonwealth Computational Summit

High performance computing has matured into an indispensable tool for not only academic research and government labs and agencies, but also for many industry sectors: energy, manufacturing, healthcare, financial services, even digital content creation. More recently, the advent of Big Data has enabled the use of HPC techniques for large scale data analysis, expanding the scope of HPC and the reach of it into more research and enterprise use cases. Since 2012, a new regime of data-driven analytics, deep learning, has erupted in popularity, fueled by both the massive performance increases in HPC technologies and in the explosive rate of …


Harnessing The Data Revolution, Chaitan Baru Oct 2017

Harnessing The Data Revolution, Chaitan Baru

Commonwealth Computational Summit

Harnessing Data for 21st Century Science and Engineering (aka Harnessing the Data Revolution, HDR) is one of NSF's six "Big Research Ideas," aimed at supporting fundamental research in data science and engineering; developing a cohesive, federated approach to the research data infrastructure needed to power this revolution; and developing of a 21st-century data-capable workforce. HDR will enable new modes of data-driven discovery allowing researchers to ask and answer new questions in frontier science and engineering, generate new knowledge and understanding by working with domain experts, and accelerate discovery and innovation. This initiative builds on NSF's history of data science investments. …


Tell Me Why? Tell Me More! Explaining Predictions, Iterated Learning Bias, And Counter-Polarization In Big Data Discovery Models, Olfa Nasraoui Oct 2017

Tell Me Why? Tell Me More! Explaining Predictions, Iterated Learning Bias, And Counter-Polarization In Big Data Discovery Models, Olfa Nasraoui

Commonwealth Computational Summit

Outline:

What can go Wrong in Machine Learning?

  • Unfair Machine Learning
  • Iterated Bias & Polarization
  • Black Box models

Tell me more: Counter-Polarization

Tell me why: Explanation Generation


Analysis Of Complex Vertebrate Genomes: Computational Challenges And Solutions, Jeramiah J. Smith Oct 2017

Analysis Of Complex Vertebrate Genomes: Computational Challenges And Solutions, Jeramiah J. Smith

Commonwealth Computational Summit

No abstract provided.


Accurate And Scalable Query Over Large Rna‐Seq Experiments, Jinze Liu Oct 2017

Accurate And Scalable Query Over Large Rna‐Seq Experiments, Jinze Liu

Commonwealth Computational Summit

No abstract provided.


Resource Efficient Design Of Quantum Circuits For Quantum Algorithms, Himanshu Thapliyal Oct 2017

Resource Efficient Design Of Quantum Circuits For Quantum Algorithms, Himanshu Thapliyal

Commonwealth Computational Summit

No abstract provided.


Additional Data Via Autonomous Systems To Supplement Traditional Sparse Sources For Weather Forecasting And Atmospheric Science, Suzanne Weaver Smith Oct 2017

Additional Data Via Autonomous Systems To Supplement Traditional Sparse Sources For Weather Forecasting And Atmospheric Science, Suzanne Weaver Smith

Commonwealth Computational Summit

No abstract provided.


Correct Model Selection In Multiple Regression Analyses Of Big Data, Katherine L. Thompson Oct 2017

Correct Model Selection In Multiple Regression Analyses Of Big Data, Katherine L. Thompson

Commonwealth Computational Summit

Goals:

  • Improve statistical modeling in a variety of application areas
  • Correctly identify the relationships present in data sets
  • Understand the difficulty in choosing the correct statistical model in big data


Integrative Computational Modeling For Developing Means To Manipulate Biological Cells And For Solving Complex Engineering Problems, Yu Zhao, Guigen Zhang Oct 2017

Integrative Computational Modeling For Developing Means To Manipulate Biological Cells And For Solving Complex Engineering Problems, Yu Zhao, Guigen Zhang

Commonwealth Computational Summit

A brief discussion on reductive vs integrative investigation

A case study: how integrative computational modeling helps advance the understanding and application of dielectrophoresis (DEP) in various situations

Other applications in advancing the design and development of nanopore, medical devices, novel materials, actuation devices, and coupled spectroscopic techniques, etc.


Cloud‐Based Text Analytics Harvesting, Cleaning And Analyzing Corporate Earnings Conference Calls, Michael Chuancai Zhang, Vikram Gazula, Dan Stone, Hong Xie Oct 2017

Cloud‐Based Text Analytics Harvesting, Cleaning And Analyzing Corporate Earnings Conference Calls, Michael Chuancai Zhang, Vikram Gazula, Dan Stone, Hong Xie

Commonwealth Computational Summit

No abstract provided.


Grading Complex Tasks Through Crowdsourcing, Lingyu Lyu Oct 2017

Grading Complex Tasks Through Crowdsourcing, Lingyu Lyu

Commonwealth Computational Summit

Crowdsourcing provides huge opportunities and scalability solutions for grading large scale tasks, such as MOOCs.

Reliability and quality of graders and crowdsourced data are challenging issues.

Workers might give random grades, which are spam; or provide biased grades, which need to be corrected.

The budget for hiring graders is limited, in many cases.


Parallelization Of A Three-Dimensional Full Multigrid Algorithm To Simulate Tumor Growth, Dylan Goodin, Chin F. Ng, Hermann B. Frieboes Oct 2017

Parallelization Of A Three-Dimensional Full Multigrid Algorithm To Simulate Tumor Growth, Dylan Goodin, Chin F. Ng, Hermann B. Frieboes

Commonwealth Computational Summit

We present the performance gains of an openMP implementation of a fully adaptive nonlinear full multigrid (FMG) algorithm to simulate three-dimensional multispecies desmoplastic tumor growth on computer systems of varying processing capabilities. The FMG algorithm is applied to solve a recently published thermodynamic mixture model that uses a diffuse interface approach with fourth-order reaction-advection-diffusion PDEs (Cahn-Hilliard-type equations) that are coupled, nonlinear, and numerically stiff. The model includes multiple cell species and extracellular matrix (ECM), with adhesive and elastic energy contributions in chemical potential terms, as well as including blood and lymphatic vessels represented as continuous vasculatures. Advection-reaction-diffusion PDEs are employed …


The Effect Of Inlet Pulsations On Primary Atomization Of Liquid Jets, Kyle Windland, Prashant Khare Oct 2017

The Effect Of Inlet Pulsations On Primary Atomization Of Liquid Jets, Kyle Windland, Prashant Khare

Commonwealth Computational Summit

Objectives

  • Elucidate the physics underlying the primary atomization of liquid jets.
  • Investigate the effect of inlet pulsations on the atomization process.
  • Identify the reliability of numerical predictions using uncertainty quantification techniques (UQ) and sensitivity analyses.


Computational Materials Characterization, Discovery, And Design With High Performance Computing, Qunfei Zhou, Xiaotao Liu, Tyler Maxwell, Thomas John Balk, Matthew J. Beck Oct 2017

Computational Materials Characterization, Discovery, And Design With High Performance Computing, Qunfei Zhou, Xiaotao Liu, Tyler Maxwell, Thomas John Balk, Matthew J. Beck

Commonwealth Computational Summit

No abstract provided.


New Explainable Active Learning Approach For Recommender Systems, Sami Khenissi, Behnoush Abdollahi, Wenlong Sun, Pegah Sagheb, Olfa Nasraoui Oct 2017

New Explainable Active Learning Approach For Recommender Systems, Sami Khenissi, Behnoush Abdollahi, Wenlong Sun, Pegah Sagheb, Olfa Nasraoui

Commonwealth Computational Summit

Introduction and Motivations

  • Recommender Systems are intelligent programs that analyze patterns between items and users to predict the user’s taste.

Objective

  • Design an efficient Active Learning Strategy to increase the explainability and the accuracy of an “Explainable Matrix Factorization” model.


A Network Tomography Approach For Traffic Monitoring In Smart Cities, Ruoxi Zhang, Sara Newman, Marco Ortolani, Simone Silvestri Oct 2017

A Network Tomography Approach For Traffic Monitoring In Smart Cities, Ruoxi Zhang, Sara Newman, Marco Ortolani, Simone Silvestri

Commonwealth Computational Summit

Various urban planning and managing activities required by a Smart City are feasible because of traffic monitoring. As such, this project proposes a network tomography-based approach that can be applied to road networks to achieve a cost-efficient, flexible, and scalable monitor deployment. Due to the algebraic approach of network tomography, the selection of monitoring intersections can be solved through the use of matrices, with its rows representing paths between two intersections, and its columns representing links in the road network. Because the goal of the algorithm is to provide an inexpensive monitor set, this problem can be translated into a …


Bio-Inspired Disaster Response Networks, Vijay K. Shah, Simone Silvestri, Sajal K. Das, Satyaki Roy Oct 2017

Bio-Inspired Disaster Response Networks, Vijay K. Shah, Simone Silvestri, Sajal K. Das, Satyaki Roy

Commonwealth Computational Summit

Large-scale natural disasters (e.g., Earthquake, Hurricane) –

  • Three times as many disasters between 1980 and 2016 compared to 1940-1980. (EM-DAT – The International Disaster Database)
  • Since 1990, 217 million people affected each year. (The New England Journal of Medicine)

Aftermath a disaster,

  • Loss of human lives and property
  • Lack of food, clean drinking water, shelter etc.
  • Disruption of infrastructure networks (e.g. cellular towers) and other public infrastructures (e.g. power sources) – Our focus !


Formation Of Supermassive Black Holes In The Early Universe: High-Resolution Numerical Simulations Of Radiation Transfer Inside Collapsing Gas, Yang Luo, Kazem Ardaneh, Isaac Shlosman, Kentaro Nagamine, John Wise, Mitchell C. Begelman Oct 2017

Formation Of Supermassive Black Holes In The Early Universe: High-Resolution Numerical Simulations Of Radiation Transfer Inside Collapsing Gas, Yang Luo, Kazem Ardaneh, Isaac Shlosman, Kentaro Nagamine, John Wise, Mitchell C. Begelman

Commonwealth Computational Summit

Observations of high-redshift quasars reveal that super massive black holes (SMBHs) with masses exceeding 109 M formed as early as redshift z ~ 7 [1,3,6]. This means that SMBHs have already formed ~700 million years after the Big Bang. How did such SMBHs could grow so quickly?

In this work, we use a modified and improved version of the blockstructured adaptive mesh refinement (AMR) code ENZO [2] to provide high spatial and temporal resolution for modeling the formation of SMBHs via direct collapse within dark matter (DM) halos at high redshifts. The radiation hydrodynamics equations are solved in …


Integrative Computational Modeling For Developing Means To Manipulate Biological Cells And For Solving Complex Engineering Problems, Yu Zhao, Guigen Zhang Oct 2017

Integrative Computational Modeling For Developing Means To Manipulate Biological Cells And For Solving Complex Engineering Problems, Yu Zhao, Guigen Zhang

Commonwealth Computational Summit

Computational modeling has become more widely used to guide the design of microfluidic devices for manipulating cells using Dielectrophoresis (DEP), and devise novel means for advancing the study of cellular science and engineering. Conventionally, cells are treated as volumeless points in the system, which allows study of the movement of groups of particles under the effect of field. However, this approach often neglects the distortion effect of particle on external field, as well as interactions among particles. Moreover, it ignores the complex inner structures of cell, which are the causes of distinctive cell behavior. To more accurately model the behavior …


Dynamic Load Balancing Based On Live Virtual Machine Migration, Manh Do, Michael Galloway Oct 2017

Dynamic Load Balancing Based On Live Virtual Machine Migration, Manh Do, Michael Galloway

Commonwealth Computational Summit

Recently, cloud computing is a new trend emerging in computer technology with a huge demand from the clients, which leads to the consumption of a tremendous amount of energy. Load balancing is taken into account as a vital part of managing income demand, improving the cloud system's performance and reducing the energy cost. Live virtual machine migration is a technique to perform the dynamic load balancing algorithm. To optimize the cloud cluster, there are three issues to consider: First, how does the cloud cluster distribute the virtual machine (VM) requests from clients to all physical machine (PM) when each machine …


Cloud-Based Text Analytics: Harvesting, Cleaning And Analyzing Corporate Earnings Conference Calls, Michael Chuancai Zhang, Vikram Gazula, Dan Stone, Hong Xie Oct 2017

Cloud-Based Text Analytics: Harvesting, Cleaning And Analyzing Corporate Earnings Conference Calls, Michael Chuancai Zhang, Vikram Gazula, Dan Stone, Hong Xie

Commonwealth Computational Summit

Does management language cohesion in earnings conference calls matter to the capital market? As a part of the research on the above question, and taking advantage of the modern IT technologies, this project:

  • harvested 115,882 earnings conference call transcripts from SeekingAlpha.com
  • parsed and structured 89,988 transcripts using regular expressions in Stata
  • analyzed 179,976 text files using Amazon Elastic Compute Cloud (Amazon EC2), which
  • saved almost 2 years (675 days) of the project time
As this project is related to big data, text analytics, and big computing, it may be a good case to show how we can benefit from modern …


High-Fidelity Simulations Of Water Jet In Air Crossflow, Austin Johnston, Prashant Khare Oct 2017

High-Fidelity Simulations Of Water Jet In Air Crossflow, Austin Johnston, Prashant Khare

Commonwealth Computational Summit

Objectives

  • Investigate detailed physics underlying liquid jets in crossflow configurations applicable to various applications such as, gas-turbine, scramjet, and afterburner fuel injection.
  • Develop models to predict the statistical behaviors of resulting droplets.


Adversarial Discriminative Domain Adaptation For Extracting Protein-Protein Interactions From Text, Anthony Rios, Ramakanth Kavuluru, Zhiyong Lu Oct 2017

Adversarial Discriminative Domain Adaptation For Extracting Protein-Protein Interactions From Text, Anthony Rios, Ramakanth Kavuluru, Zhiyong Lu

Commonwealth Computational Summit

Relation extraction is the process of extracting structured information from unstructured text. Recently, neural networks (NNs) have produced state-of-art results in extracting protein-protein interactions (PPIs) from text. While multiple corpora have been created to extract PPIs from text, most methods have shown poor cross-corpora generalization. In other words, models trained on one dataset perform poorly on other datasets for the same task. In the case of PPI, the F1 has been shown to vary by as much as 30% between different datasets. In this work, we utilize adversarial discriminative domain adaptation (ADDA) to improve the generalization between the source and …


Imputing Trust Network Information In Nmf‐Based Recommendation Systems, Fatemah Alghamedy, Jun Zhang Oct 2017

Imputing Trust Network Information In Nmf‐Based Recommendation Systems, Fatemah Alghamedy, Jun Zhang

Commonwealth Computational Summit

With the emergence of E‐commerce, recommendation system becomes a significant tool which can help both sellers and buyers. It helps sellers by increasing the profits and advertising items to customers. In addition, recommendation systems facilitate buyers to find items they are looking for easily.

In recommendation systems, the rating matrix R represents users' ratings for items. The rows in the rating matrix represent the users and the columns represent items. If particular user rates a particular item, then the value of the intersection of the user row and item column holds the rating value. The trust matrix T describes the …


Discovery Of Sex-Specific Regions In A Salamander Genome, Nataliya Y. Timoshevskaya, Melissa C. Keinath, Jeramiah J. Smith Oct 2017

Discovery Of Sex-Specific Regions In A Salamander Genome, Nataliya Y. Timoshevskaya, Melissa C. Keinath, Jeramiah J. Smith

Commonwealth Computational Summit

Biological Aspects:

Salamander (Ambystoma mexicanum) has a gigantic genome: ~32,000,000,000 bases (10X of size of human genome)

Sex is determined by a pair of morphologically identical chromosomes:

  • ZZ in male
  • ZW in female

Object:

  • Find (if there are any) genomic differences between chromosomes W and Z

Workflow:

  1. Sequencing and de novo assembly of the reference salamander genome
  2. Alignment of short sequences from male and female genomes to the reference
  3. Coverage analysis


Registration And Grouping Algorithms In Protein Nmr Derived Peak Lists And Their Application In Protein Nmr Reference Correction, Andrey Smelter, Xi Chen, Eric C. Rouchka, Hunter N. B. Moseley Oct 2017

Registration And Grouping Algorithms In Protein Nmr Derived Peak Lists And Their Application In Protein Nmr Reference Correction, Andrey Smelter, Xi Chen, Eric C. Rouchka, Hunter N. B. Moseley

Commonwealth Computational Summit

Nuclear magnetic resonance spectroscopy of proteins (protein NMR) is a powerful analytical technique for studying structure and dynamics of proteins. Almost all aspects of protein NMR have been accelerated by the development of software tools that enable the analysis of NMR spectral data and its utilization in studying protein structure and dynamics. This includes software for raw NMR processing, spectral visualization, protein resonance assignment, and structure determination. However, full automation of protein NMR data analysis is still a work in progress and data analysis still requires an expert NMR spectroscopist utilizing an array of software tools.

While manual resonance assignment …


Evolution Of Barred Galaxies In Spinning Dark Matter Halos: High Resolution N-Body Simulations At Dlx, Angela Collier, Isaac Shlosman, Clayton Heller Oct 2017

Evolution Of Barred Galaxies In Spinning Dark Matter Halos: High Resolution N-Body Simulations At Dlx, Angela Collier, Isaac Shlosman, Clayton Heller

Commonwealth Computational Summit

Observations show that galaxies are dominated by stellar disks immersed in much more massive, slowly tumbling dark matter (DM) halos. Large fraction of galactic disks, at least 75%, are barred (see Hubble Fork on the right). Stellar bars form either via spontaneous break of axial symmetry or via galaxy interactions.

The formation and evolution of stellar bars is not fully understood. Stellar bar evolution is highly nonlinear and cannot be treated analytically. The main approach to study these disk-halo systems is via numerical simulations, whose goal is to explain why galaxies have such a wide range of morphologies as shown …


Dark Matter Halo Mass Function From Hpc N-Body Simulations, Da Bi, Isaac Shlosman, Emilio Romano-Diaz Oct 2017

Dark Matter Halo Mass Function From Hpc N-Body Simulations, Da Bi, Isaac Shlosman, Emilio Romano-Diaz

Commonwealth Computational Summit

Dark matter (DM) dominates the matter in the Universe. Because of self-gravity, DM collapses and becomes clumpy, building the large-scale hierarchical structures. Baryons assemble within DM potential wells and form galaxies.

Because we can not directly observe DM halos, numerical simulations is the only way one can study their dynamics and other properties. Using N-body simulations, we can obtain the Halo Mass Function (HMF), which provides the abundance of DM halos as a function of their mass. The HMF depends weakly on cosmological redshift and is one of the basic tools in modern cosmology.

We use GIZMO --- a flexible, …


The Lamprey Genome: Illuminating Genomic Change Across Eons And Embryogenesis, Jeramiah J. Smith, Courtney K. M. Waterbury, Melissa C. Keinath, Cody B. Saraceno, Vladimir A. Timoshevskiy, Nataliya Y. Timoshevskaya Oct 2017

The Lamprey Genome: Illuminating Genomic Change Across Eons And Embryogenesis, Jeramiah J. Smith, Courtney K. M. Waterbury, Melissa C. Keinath, Cody B. Saraceno, Vladimir A. Timoshevskiy, Nataliya Y. Timoshevskaya

Commonwealth Computational Summit

The lamprey genome provides unique insights into both the deep evolutionary history of vertebrate genomes and the maintenance of genome structure/integrity over development. The lamprey lineage diverged from all other vertebrates approximately 500 million years ago. As such, comparisons between lamprey and other vertebrates permit reconstruction of ancient duplication and rearrangement events that defined the fundamental architecture and gene content of all extant vertebrate genomes. Lamprey also undergoes programmatic changes genome structure that result in the physical elimination of ~20% of its genomic DNA (~0.5Gb from a ~2 Gb genome) from all somatic cell lineages during early embryonic development. Here, …


Utilizing Nvidia Gpus For Waveform Analysis For The Nab Experiment, David G. Mathews, A. P. Sprow, C. B. Crawford Oct 2017

Utilizing Nvidia Gpus For Waveform Analysis For The Nab Experiment, David G. Mathews, A. P. Sprow, C. B. Crawford

Commonwealth Computational Summit

GPUs are composed of a large number of small computational cores compared to CPUs which are generally just a few larger cores. While the CPU excels at linear processes, the GPU excels at parallel tasks. For this project, the goal was to find a way to use the massive parallelism of a GPU to rapidly analyze waveform data from the Nab experiment.