Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

Data Science

Discipline
Institution
Publication Year
Publication

Articles 1 - 15 of 15

Full-Text Articles in Engineering

Machine Learning Modeling Of Polymer Coating Formulations: Benchmark Of Feature Representation Schemes, Nelson I. Evbarunegbe Nov 2023

Machine Learning Modeling Of Polymer Coating Formulations: Benchmark Of Feature Representation Schemes, Nelson I. Evbarunegbe

Masters Theses

Polymer coatings offer a wide range of benefits across various industries, playing a crucial role in product protection and extension of shelf life. However, formulating them can be a non-trivial task given the multitude of variables and factors involved in the production process, rendering it a complex, high-dimensional problem. To tackle this problem, machine learning (ML) has emerged as a promising tool, showing considerable potential in enhancing various polymer and chemistry-based applications, particularly those dealing with high dimensional complexities.

Our research aims to develop a physics-guided ML approach to facilitate the formulations of polymer coatings. As the first step, this …


Leveraging Artificial Intelligence And Geomechanical Data For Accurate Shear Stress Prediction In Co2 Sequestration Within Saline Aquifers (Smart Proxy Modeling), Munirah Alawadh Jan 2023

Leveraging Artificial Intelligence And Geomechanical Data For Accurate Shear Stress Prediction In Co2 Sequestration Within Saline Aquifers (Smart Proxy Modeling), Munirah Alawadh

Graduate Theses, Dissertations, and Problem Reports

This research builds upon the success of a previous project that used a Smart Proxy Model (SPM) to predict pressure and saturation in Carbon Capture and Storage (CCS) operations into saline aquifers. The Smart Proxy Model is a data-driven machine learning model that can replicate the output of a sophisticated numerical simulation model for each time step in a short amount of time, using Artificial Intelligence (AI) and large volumes of subsurface data. This study aims to develop the Smart Proxy Model further by incorporating geomechanical datadriven techniques to predict shear stress by using a neural network, specifically through supervised …


Comparative Analysis Of Artificial Intelligence And Numerical Reservoir Simulation In Marcellus Shale Wells, Arya Maher Sattari Jan 2023

Comparative Analysis Of Artificial Intelligence And Numerical Reservoir Simulation In Marcellus Shale Wells, Arya Maher Sattari

Graduate Theses, Dissertations, and Problem Reports

This dissertation addresses the limitations of conventional numerical reservoir simulation techniques in the context of unconventional shale plays and proposes the use of data-driven artificial intelligence (AI) models as a promising alternative. Traditional methods, while providing valuable insights, often rely on simplifying assumptions and are constrained by time, resources, and data quality. The research leverages AI models to handle the complexities of shale behavior more effectively, facilitating accurate predictions and optimizations with less resource expenditure.

Two specific methodologies are investigated for this purpose: traditional numerical reservoir simulations using Computer Modelling Group's GEM reservoir simulation software, and an AI-based Shale Analytics …


Enhancing Management Of Built And Natural Water And Sanitation Systems With Data Science, Nelson Da Luz Jun 2022

Enhancing Management Of Built And Natural Water And Sanitation Systems With Data Science, Nelson Da Luz

Doctoral Dissertations

In the age of the data revolution, the civil engineer can enhance the management of infrastructure systems using new techniques focused on data. This dissertation present three studies in which data science approaches are used to enhance management of water and sanitation systems in both the built and natural environments. Chapters 1 and 2 focus on improving methods for data collection relating to water quality monitoring. In Chapter 1, the efficacy of different water quality sampling program designs is evaluated as the programs relate to meeting monitoring goals. Considerations include how timing, location, and distribution system operations can affect monitoring …


Strainer: State Transcript Rating For Informed News Entity Retrieval, Thomas M. Gerrity Jun 2022

Strainer: State Transcript Rating For Informed News Entity Retrieval, Thomas M. Gerrity

Master's Theses

Over the past two decades there has been a rapid decline in public oversight of state and local governments. From 2003 to 2014, the number of journalists assigned to cover the proceedings in state houses has declined by more than 30\%. During the same time period, non-profit projects such as Digital Democracy sought to collect and store legislative bill and hearing information on behalf of the public. More recently, AI4Reporters, an offshoot of Digital Democracy, seeks to actively summarize interesting legislative data.

This thesis presents STRAINER, a parallel project with AI4Reporters, as an active data retrieval and filtering system for …


Quadratic Neural Network Architecture As Evaluated Relative To Conventional Neural Network Architecture, Reid Taylor Apr 2022

Quadratic Neural Network Architecture As Evaluated Relative To Conventional Neural Network Architecture, Reid Taylor

Senior Theses

Current work in the field of deep learning and neural networks revolves around several variations of the same mathematical model for associative learning. These variations, while significant and exceptionally applicable in the real world, fail to push the limits of modern computational prowess. This research does just that: by leveraging high order tensors in place of 2nd order tensors, quadratic neural networks can be developed and can allow for substantially more complex machine learning models which allow for self-interactions of collected and analyzed data. This research shows the theorization and development of mathematical model necessary for such an idea to …


Statistical Modeling, Learning And Computing For Stochastic Dynamics Of Complex Systems, Mohammadmahdi Hajiha Dec 2021

Statistical Modeling, Learning And Computing For Stochastic Dynamics Of Complex Systems, Mohammadmahdi Hajiha

Graduate Theses and Dissertations

With the recent advances in sensor technology, it is much easier to collect and store streams of system operational and environmental (SOE) data. These data can be used as input to model the underlying behavior of complex engineered systems and phenomenons if appropriate algorithms with well-defined assumptions are developed. This dissertation is comprised of the research work to show the applicability of SOE data when fed into proposed tailored algorithms. The first purposes of these algorithms are to estimate and analyze the reliability of a system as elaborated in Chapter 2. This chapter provides the derivation of closed-form expressions that …


Convolutional Neural Networks For Deflate Data Encoding Classification Of High Entropy File Fragments, Nehal Ameen May 2021

Convolutional Neural Networks For Deflate Data Encoding Classification Of High Entropy File Fragments, Nehal Ameen

University of New Orleans Theses and Dissertations

Data reconstruction is significantly improved in terms of speed and accuracy by reliable data encoding fragment classification. To date, work on this problem has been successful with file structures of low entropy that contain sparse data, such as large tables or logs. Classifying compressed, encrypted, and random data that exhibit high entropy is an inherently difficult problem that requires more advanced classification approaches. We explore the ability of convolutional neural networks and word embeddings to classify deflate data encoding of high entropy file fragments after establishing ground truth using controlled datasets. Our model is designed to either successfully classify file …


Implementation Of A Computer-Vision System As A Supportive Diagnostic Tool For Parkinson’S Disease, Diego Machado Reyes May 2020

Implementation Of A Computer-Vision System As A Supportive Diagnostic Tool For Parkinson’S Disease, Diego Machado Reyes

Honors Theses

Parkinson’s disease is the second most common neurodegenerative disorder, affecting nearly 1 million people in the US and it is predicted that the number will keep increasing. Parkinson’s disease is difficult to diagnose due to its similarity with other diseases that share the parkinsonian symptoms and the subjectivity of its assessment, thus increasing the probabilities of misdiagnosis. Therefore, it is relevant to develop diagnostic tools that are quantitatively based and monitoring tools to improve the patient’s quality of life. Computer-based assessment systems have shown to be successful in this field through diverse approaches that can be classified into two main …


Data Science Methods For Standardization, Safety, And Quality Assurance In Radiation Oncology, Khajamoinuddin Syed Jan 2020

Data Science Methods For Standardization, Safety, And Quality Assurance In Radiation Oncology, Khajamoinuddin Syed

Theses and Dissertations

Radiation oncology is the field of medicine that deals with treating cancer patients through ionizing radiation. The clinical modality or technique used to treat the cancer patients in the radiation oncology domain is referred to as radiation therapy. Radiation therapy aims to deliver precisely measured dose irradiation to a defined tumor volume (target) with as minimal damage as possible to surrounding healthy tissue (organs-at-risk), resulting in eradication of the tumor, high quality of life, and prolongation of survival. A typical radiotherapy process requires the use of different clinical systems at various stages of the workflow. The data generated in these …


Combining Human Factors And Data Science Methods To Evaluate The Use Of Free Text Communication Orders In Electronic Health Records, Swaminathan Kandaswamy Oct 2019

Combining Human Factors And Data Science Methods To Evaluate The Use Of Free Text Communication Orders In Electronic Health Records, Swaminathan Kandaswamy

Doctoral Dissertations

Medication errors are a leading cause of death in the United States. Electronic Health Records (EHR) along with Computerized Provider Order Entry (CPOE) are considered promising ways to reduce these errors. However, EHR systems have not eliminated medication errors. Moreover, in some cases they have facilitated errors due to issues such as poor usability and negative effects on clinical workflows. The use of unexpected free text within a CPOE system can serve as a marker that the system does not adequately support clinical workflow. Prior studies have looked at the use of free text within medication orders, but the inclusion …


Automated Development Of Semantic Data Models Using Scientific Publications, Martha O. Perez-Arriaga May 2018

Automated Development Of Semantic Data Models Using Scientific Publications, Martha O. Perez-Arriaga

Computer Science ETDs

The traditional methods for analyzing information in digital documents have evolved with the ever-increasing volume of data. Some challenges in analyzing scientific publications include the lack of a unified vocabulary and a defined context, different standards and formats in presenting information, various types of data, and diverse areas of knowledge. These challenges hinder detecting, understanding, comparing, sharing, and querying information rapidly.

I design a dynamic conceptual data model with common elements in publications from any domain, such as context, metadata, and tables. To enhance the models, I use related definitions contained in ontologies and the Internet. Therefore, this dissertation generates …


Demand Side Management In Smart Grid Using Big Data Analytics, Sidhant Chatterjee Dec 2017

Demand Side Management In Smart Grid Using Big Data Analytics, Sidhant Chatterjee

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Smart Grids are the next generation electrical grid system that utilizes smart meter-ing devices and sensors to manage the grid operations. Grid management includes the prediction of load and and classification of the load patterns and consumer usage behav-iors. These predictions can be performed using machine learning methods which are often supervised. Supervised machine learning signifies that the algorithm trains the model to efficiently predict decisions based on the previously available data.

Smart grids are employed with numerous smart meters that send user statistics to a central server. The data can be accumulated and processed using data mining and machine …


Force Field Development With Gomc A Fast New Monte Carlo Molecular Simulation Code, Jason Richard Mick Jan 2016

Force Field Development With Gomc A Fast New Monte Carlo Molecular Simulation Code, Jason Richard Mick

Wayne State University Dissertations

In this work GOMC (GPU Optimized Monte Carlo) a new fast, flexible, and free molecular Monte Carlo code for the simulation atomistic chemical systems is presented. The results of a large Lennard-Jonesium simulation in the Gibbs ensemble is presented. Force fields developed using the code are also presented. To fit the models a quantitative fitting process is outlined using a scoring function and heat maps. The presented n-6 force fields include force fields for noble gases and branched alkanes. These force fields are shown to be the most accurate LJ or n-6 force fields to date for these compounds, capable …


Tspoons: Tracking Salience Profiles Of Online News Stories, Kimberly Laurel Paterson Jun 2014

Tspoons: Tracking Salience Profiles Of Online News Stories, Kimberly Laurel Paterson

Master's Theses

News space is a relatively nebulous term that describes the general discourse concerning events that affect the populace. Past research has focused on qualitatively analyzing news space in an attempt to answer big questions about how the populace relates to the news and how they respond to it. We want to ask when do stories begin? What stories stand out among the noise? In order to answer the big questions about news space, we need to track the course of individual stories in the news. By analyzing the specific articles that comprise stories, we can synthesize the information gained from …