Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 16 of 16

Full-Text Articles in Physical Sciences and Mathematics

Wrangle Your Data Like A Pro With The Data Processing Power Of Python, Geoffrey P. Timms, Jeremy M. Brown Oct 2017

Wrangle Your Data Like A Pro With The Data Processing Power Of Python, Geoffrey P. Timms, Jeremy M. Brown

Charleston Library Conference

Management, delivery, and marketing of library resources and collections necessitate interaction with a plethora of data from many sources and in many forms. Accessing and transforming data into meaningful information or different formats used in library automation can be time consuming, but a working knowledge of a programming language can improve efficiency in many facets of librarianship. From processing lists to creating extensible markup language (XML), from editing machine-readable cataloging (MARC) records before upload to automating statistical reports, the Python programming language and third-party application programming interfaces (APIs) can be used to accomplish both behind-the-scenes tasks and end-user facing projects. …


Improving The Accuracy For The Long-Term Hydrologic Impact Assessment (L-Thia) Model, Anqi Zhang, Lawrence Theller, Bernard A. Engel Aug 2017

Improving The Accuracy For The Long-Term Hydrologic Impact Assessment (L-Thia) Model, Anqi Zhang, Lawrence Theller, Bernard A. Engel

The Summer Undergraduate Research Fellowship (SURF) Symposium

Urbanization increases runoff by changing land use types from less impervious to impervious covers. Improving the accuracy of a runoff assessment model, the Long-Term Hydrologic Impact Assessment (L-THIA) Model, can help us to better evaluate the potential uses of Low Impact Development (LID) practices aimed at reducing runoff, as well as to identify appropriate runoff and water quality mitigation methods. Several versions of the model have been built over time, and inconsistencies have been introduced between the models. To improve the accuracy and consistency of the model, the equations and parameters (primarily curve numbers in the case of this model) …


Web-Based Interactive Social Media Visual Analytics, Diego Rodríguez-Baquero, Jiawei Zhang, David S. Ebert, Sorin A. Matei Aug 2017

Web-Based Interactive Social Media Visual Analytics, Diego Rodríguez-Baquero, Jiawei Zhang, David S. Ebert, Sorin A. Matei

The Summer Undergraduate Research Fellowship (SURF) Symposium

Real-time social media platforms enable quick information broadcasting and response during disasters and emergencies. Analyzing the massive amount of generated data to understand the human behavior requires data collection and acquisition, parsing, filtering, augmentation, processing, and representation. Visual analytics approaches allow decision makers to observe trends and abnormalities, correlate them with other variables and gain invaluable insight into these situations. In this paper, we propose a set of visual analytic tools for analyzing and understanding real-time social media data in times of crisis and emergency situations. First, we model the degree of risk of individuals’ movement based on evacuation zones …


Parallelization Of Molecular Docking Algorithms Using Cuda For Use In Drug Discovery, Brandon Stewart, Jonathan Fine, Gaurav Chopra Phd Aug 2017

Parallelization Of Molecular Docking Algorithms Using Cuda For Use In Drug Discovery, Brandon Stewart, Jonathan Fine, Gaurav Chopra Phd

The Summer Undergraduate Research Fellowship (SURF) Symposium

Traditional drug discovery methodology uses a multitude of software packages to design and evaluate new drug-like compounds. While software packages implement a wide variety of methods, the serial (i.e. single core) implementation for many of these algorithms, prohibit large scale docking, such as proteome-wide docking (i.e. thousands of compounds with thousands of proteins). Several docking algorithms can be parallelized, significantly reducing the runtime of the calculations, thus enabling large-scale docking. Implementing algorithms that take advantage of the distributed nature of graphical processing units (GPUs) via the Compute Unified Device Architecture (CUDA) enables us to efficiently implement massively parallel algorithms. Two …


Purdue Airsense: An Affordable Way To Measure And Study Air Pollution, Stephane Junior Nouafo Wanko, Shadi Tariq Azouz, Ruihang Du, Brandon Boor, Greg Michalski Aug 2017

Purdue Airsense: An Affordable Way To Measure And Study Air Pollution, Stephane Junior Nouafo Wanko, Shadi Tariq Azouz, Ruihang Du, Brandon Boor, Greg Michalski

The Summer Undergraduate Research Fellowship (SURF) Symposium

Air pollution is a major health hazard worldwide, accounting for one-eighth of all deaths in 2012 (World Health Organization). Globally, there is a severe lack of ground-based spatiotemporal monitoring of gaseous and particulate air pollutants, particularly in Africa, South and Central America, and the Middle East. This is in great part due to the high costs of air quality instrumentation that meet accuracy and reliability criteria set by monitoring agencies. The air quality data that is available is often not presented to the public in a user-friendly manner. Taking advantage of recent developments in low-cost sensing technologies, an integrated sensor …


Optimization And Control Of Production Of Graphene, Atharva Hans, Nimish M. Awalgaonkar, Majed Alrefae, Ilias Bilionis, Timothy S. Fisher Aug 2017

Optimization And Control Of Production Of Graphene, Atharva Hans, Nimish M. Awalgaonkar, Majed Alrefae, Ilias Bilionis, Timothy S. Fisher

The Summer Undergraduate Research Fellowship (SURF) Symposium

Graphene is a 2-dimensional element of high practical importance. Despite its exceptional properties, graphene’s real applications in industrial or commercial products have been limited. There are many methods to produce graphene, but none has been successful in commercializing its production. Roll-to-roll plasma chemical vapor deposition (CVD) is used to manufacture graphene at large scale. In this research, we present a Bayesian linear regression model to predict the roll-to-roll plasma system’s electrode voltage and current; given a particular set of inputs. The inputs of the plasma system are power, pressure and concentration of gases; hydrogen, methane, oxygen, nitrogen and argon. This …


Visually Analyzing The Impacts Of Essential Air Service Funding Decisions, Rohan Kashuka, Chittayong Surakitbanharn, Calvin Yau, David S. Ebert Aug 2017

Visually Analyzing The Impacts Of Essential Air Service Funding Decisions, Rohan Kashuka, Chittayong Surakitbanharn, Calvin Yau, David S. Ebert

The Summer Undergraduate Research Fellowship (SURF) Symposium

Essential Air Service (EAS) is a U.S. government subsidy program which ensures maintenance of commercial airline services in small deregulated communities. The program’s budget currently is around $250 million annually, which is used as subsidy for airlines to maintain a minimal level of scheduled air service in relatively smaller airports. It is evident that 2% of the FAA budget is being spent to maintain air service in smaller communities, but there is not enough evidence to prove that all the current decisions made by Congress about EAS are advantageous. To understand these decisions, 15 years of data produced by the …


Applying Machine Learning To Computational Chemistry: Can We Predict Molecular Properties Faster Without Compromising Accuracy?, Hanjing Xu, Pradeep Gurunathan, Lyudmila Slipchenko Aug 2017

Applying Machine Learning To Computational Chemistry: Can We Predict Molecular Properties Faster Without Compromising Accuracy?, Hanjing Xu, Pradeep Gurunathan, Lyudmila Slipchenko

The Summer Undergraduate Research Fellowship (SURF) Symposium

Non-covalent interactions are crucial in analyzing protein folding and structure, function of DNA and RNA, structures of molecular crystals and aggregates, and many other processes in the fields of biology and chemistry. However, it is time and resource consuming to calculate such interactions using quantum-mechanical formulations. Our group has proposed previously that the effective fragment potential (EFP) method could serve as an efficient alternative to solve this problem. However, one of the computational bottlenecks of the EFP method is obtaining parameters for each molecule/fragment in the system, before the actual EFP simulations can be carried out. Here we present a …


Development Of A Water Quality Status And Trend Detection Tool*, Ruchir Aggarwal, Valeria Mijares, Margaret W. Gitau Aug 2017

Development Of A Water Quality Status And Trend Detection Tool*, Ruchir Aggarwal, Valeria Mijares, Margaret W. Gitau

The Summer Undergraduate Research Fellowship (SURF) Symposium

Water Quality Index (WQI) models have been developed since the early 1970s. They present a means by which water quality status and trends can be compared across time and space on the basis of a composite value computed using existing water quality data. There is a need for a tool that can bring the different water quality parameters together and calculate the WQIs so as to facilitate data use in predictive modeling and water quality management. We are developing a software tool that can be used by water quality managers and others with different technical backgrounds to calculate WQI of …


Machine Learning In Xenon1t Analysis, Dillon A. Davis, Rafael F. Lang, Darryl P. Masson Aug 2017

Machine Learning In Xenon1t Analysis, Dillon A. Davis, Rafael F. Lang, Darryl P. Masson

The Summer Undergraduate Research Fellowship (SURF) Symposium

In process of analyzing large amounts of quantitative data, it can be quite time consuming and challenging to uncover populations of interest contained amongst the background data. Therefore, the ability to partially automate the process while gaining additional insight into the interdependencies of key parameters via machine learning seems quite appealing. As of now, the primary means of reviewing the data is by manually plotting data in different parameter spaces to recognize key features, which is slow and error prone. In this experiment, many well-known machine learning algorithms were applied to a dataset to attempt to semi-automatically identify known populations, …


Structure-Force Field Generator For Molecular Dynamics Simulations, Carlos M. Patiño, Lorena Alzate, Alejandro Strachan Aug 2017

Structure-Force Field Generator For Molecular Dynamics Simulations, Carlos M. Patiño, Lorena Alzate, Alejandro Strachan

The Summer Undergraduate Research Fellowship (SURF) Symposium

Atomistic and molecular simulations have become an important research field due to the progress made in computer performance and the necessity of new and improved materials. Despite this, first principle simulations of large molecules are still not possible because the high computational time and resources required. Other methods, such as molecular dynamics, allow the simplification of calculations by defining energy terms to describe multiple atom interactions without compromising accuracy significantly. A group of these energy terms is called a force field, and each force field has its own descriptions and parameters. The objective of this project was to develop a …


Predictive Power And Validity Of Connectome Predictive Modeling: A Replication And Extension, Michael Wang, Joaquin Goni, Enrico Amico Aug 2017

Predictive Power And Validity Of Connectome Predictive Modeling: A Replication And Extension, Michael Wang, Joaquin Goni, Enrico Amico

The Summer Undergraduate Research Fellowship (SURF) Symposium

Neuroimaging, particularly functional magnetic resonance imaging (fMRI), is a rapidly growing research area and has applications ranging from disease classification to understanding neural development. With new advancements in imaging technology, researchers must employ new techniques to accommodate the influx of high resolution data sets. Here, we replicate a new technique: connectome-based predictive modeling (CPM), which constructs a linear predictive model of brain connectivity and behavior. CPM’s advantages over classic machine learning techniques include its relative ease of implementation and transparency compared to “black box” opaqueness and complexity. Is this method efficient, powerful, and reliable in the prediction of behavioral measures …


Predicting Locations Of Pollution Sources Using Convolutional Neural Networks, Yiheng Chi, Nickolas D. Winovich, Guang Lin Aug 2017

Predicting Locations Of Pollution Sources Using Convolutional Neural Networks, Yiheng Chi, Nickolas D. Winovich, Guang Lin

The Summer Undergraduate Research Fellowship (SURF) Symposium

Pollution is a severe problem today, and the main challenge in water and air pollution controls and eliminations is detecting and locating pollution sources. This research project aims to predict the locations of pollution sources given diffusion information of pollution in the form of array or image data. These predictions are done using machine learning. The relations between time, location, and pollution concentration are first formulated as pollution diffusion equations, which are partial differential equations (PDEs), and then deep convolutional neural networks are built and trained to solve these PDEs. The convolutional neural networks consist of convolutional layers, reLU layers …


Learning To Identify Depth Edges In Real-World Images With 3d Ground Truth, Krista A. Ehinger, Kevin T. Joseph, Wendy J. Adams, Erich W. Graf, James H. Elder May 2017

Learning To Identify Depth Edges In Real-World Images With 3d Ground Truth, Krista A. Ehinger, Kevin T. Joseph, Wendy J. Adams, Erich W. Graf, James H. Elder

MODVIS Workshop

No abstract provided.


Scoring Scene Symmetry, Morteza Rezanejad, John D. Wilder, Sven Dickinson, Allan Jepson, Dirk B. Walther, Kaleem Siddiqi May 2017

Scoring Scene Symmetry, Morteza Rezanejad, John D. Wilder, Sven Dickinson, Allan Jepson, Dirk B. Walther, Kaleem Siddiqi

MODVIS Workshop

No abstract provided.


Large-Scale Discovery Of Visual Features For Object Recognition, Drew Linsley, Sven Eberhardt, Dan Shiebler, Thomas Serre May 2017

Large-Scale Discovery Of Visual Features For Object Recognition, Drew Linsley, Sven Eberhardt, Dan Shiebler, Thomas Serre

MODVIS Workshop

A central goal in vision science is to identify features that are important for object and scene recognition. Reverse correlation methods have been used to uncover features important for recognizing faces and other stimuli with low intra-class variability. However, these methods are less successful when applied to natural scenes with variability in their appearance.

To rectify this, we developed Clicktionary, a web-based game for identifying features for recognizing real-world objects. Pairs of participants play together in different roles to identify objects: A “teacher” reveals image regions diagnostic of the object’s category while a “student” tries to recognize the object. Aggregating …