Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Articles 1 - 13 of 13

Full-Text Articles in Engineering

Person Identification With Convolutional Neural Networks, Kang Zheng Oct 2019

Theses and Dissertations

Person identification aims at matching persons across images or videos captured by different cameras, without requiring the presence of persons’ faces. It is an important problem in the computer vision community and has many important real-world applications, such as person search, security surveillance, and no-checkout stores. However, this problem is very challenging due to various factors, such as illumination variation, view changes, human pose deformation, and occlusion. Traditional approaches generally focus on hand-crafting features and/or learning distance metrics for matching to tackle these challenges. With Convolutional Neural Networks (CNNs), feature extraction and metric learning can be combined in …


Cybersecurity Issues In The Context Of Cryptographic Shuffling Algorithms And Concept Drift: Challenges And Solutions, Hatim Alsuwat Oct 2019

Theses and Dissertations

In this dissertation, we investigate and address two kinds of data integrity threats. We first study the limitations of secure cryptographic shuffling algorithms regarding preservation of data dependencies. We then study the limitations of machine learning models regarding concept drift detection. We propose solutions to address these threats.

Shuffling algorithms have been used to protect the confidentiality of sensitive data. However, these algorithms may not preserve data dependencies, such as functional dependencies and data-driven associations. We present two solutions for addressing these shortcomings: (1) Functional dependencies preserving shuffle, and (2) Data-driven associations preserving shuffle. For preserving functional dependencies, …


Properties, Learning Algorithms, And Applications Of Chain Graphs And Bayesian Hypergraphs, Mohammad Ali Javidian Oct 2019

Theses and Dissertations

Probabilistic graphical models (PGMs) use graphs, either undirected, directed, or mixed, to represent possible dependencies among the variables of a multivariate probability distribution. PGMs, such as Bayesian networks and Markov networks, are now widely accepted as a powerful and mature framework for reasoning and decision making under uncertainty in knowledge-based systems. With the increase of their popularity, the range of graphical models being investigated and used has also expanded. Several types of graphs with different conditional independence interpretations - also known as Markov properties - have been proposed and used in graphical models.

The graphical structure of a …


Challenges In Large-Scale Machine Learning Systems: Security And Correctness, Emad Alsuwat Oct 2019

Theses and Dissertations

In this research, we address the impact of data integrity on machine learning algorithms. We study how an adversary could corrupt Bayesian network structure learning algorithms by inserting contaminated data items. We investigate the resilience of two commonly used Bayesian network structure learning algorithms, namely the PC and LCD algorithms, against data poisoning attacks that aim to corrupt the learned Bayesian network model.

Data poisoning attacks are one of the most important emerging security threats against machine learning systems. These attacks aim to corrupt machine learning models by contaminating datasets in the training phase. The lack of resilience of …


Stacked Modelling Framework, Kareem Abdelfatah Oct 2019

Theses and Dissertations

The thesis develops a predictive modeling framework based on stacked Gaussian processes and applies it to two main applications in environmental and chemical engineering. First, a network of independently trained Gaussian processes (StackedGP) is introduced to obtain analytical predictions of quantities of interest (model outputs) with quantified uncertainties. The StackedGP framework supports component-based modeling in different fields such as environmental and chemical science, enhances predictions of quantities of interest through a cascade of intermediate predictions usually addressed by cokriging, and propagates uncertainties through emulated dynamical systems driven by uncertain forcing variables. By using analytical first and …


FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model For Identifying DNA-Binding Proteins By Multi-View Sequence Features Via Chou's Five-Step Rule, Yi Zou, Yijie Ding, Jijun Tang, Fei Guo, Li Peng Sep 2019

Faculty Publications

DNA-binding proteins play an important role in cell metabolism. In biological laboratories, detection methods for DNA-binding proteins include yeast one-hybrid methods, bacterial singles, X-ray crystallography and others, but these methods involve a lot of labor, material and time. In recent years, many computation-based approaches have been proposed to detect DNA-binding proteins. In this paper, a machine learning-based method, called the Fuzzy Kernel Ridge Regression model based on Multi-View Sequence Features (FKRR-MVSF), is proposed to identify DNA-binding proteins. First of all, multi-view sequence features are extracted from protein sequences. Next, a Multiple Kernel Learning (MKL) algorithm …


FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model For Identifying DNA-Binding Proteins By Multi-View Sequence Features Via Chou's Five-Step Rule, Yi Zou, Yijie Ding, Jijun Tang, Fei Guo, Li Peng Sep 2019

Faculty Publications

DNA-binding proteins play an important role in cell metabolism. In biological laboratories, detection methods for DNA-binding proteins include yeast one-hybrid methods, bacterial singles, X-ray crystallography and others, but these methods involve a lot of labor, material and time. In recent years, many computation-based approaches have been proposed to detect DNA-binding proteins. In this paper, a machine learning-based method, called the Fuzzy Kernel Ridge Regression model based on Multi-View Sequence Features (FKRR-MVSF), is proposed to identify DNA-binding proteins. First of all, multi-view sequence features are extracted from protein sequences. Next, a Multiple Kernel Learning (MKL) algorithm …


A Review Of Text Corpus-Based Tourism Big Data Mining, Qin Li, Shaobo Li, Sen Zhang, Jie Hu, Jianjun Hu Aug 2019

Faculty Publications

With the massive growth of the Internet, text data has become one of the main formats of tourism big data. Because text is an effective means of expressing tourists’ opinions, mining such data has great potential to inspire innovations for tourism practitioners. In the past decade, a variety of text mining techniques have been proposed and applied to tourism analysis to develop tourism value analysis models, build tourism recommendation systems, create tourist profiles, and make policies for supervising tourism markets. The successes of these techniques have been further boosted by the progress of natural language processing (NLP), machine learning, and …


A Review Of Text Corpus-Based Tourism Big Data Mining, Qin Li, Shaobo Li, Sen Zhang, Jie Hu, Jianjun Hu Aug 2019

Faculty Publications

With the massive growth of the Internet, text data has become one of the main formats of tourism big data. Because text is an effective means of expressing tourists’ opinions, mining such data has great potential to inspire innovations for tourism practitioners. In the past decade, a variety of text mining techniques have been proposed and applied to tourism analysis to develop tourism value analysis models, build tourism recommendation systems, create tourist profiles, and make policies for supervising tourism markets. The successes of these techniques have been further boosted by the progress of natural language processing (NLP), machine learning, and …


Personalized Product Evaluation Based On GRA-TOPSIS And Kansei Engineering, Huafeng Quan, Shaobo Li, Hongjing Wei, Jianjun Hu Jul 2019

Faculty Publications

With the improvement of human living standards, users’ requirements have shifted from function to emotion. Helping users pick out the most suitable product based on their subjective requirements is of great importance for enterprises. This paper proposes a Kansei engineering-based grey relational analysis and technique for order preference by similarity to ideal solution (KE-GRA-TOPSIS) method to produce a personalized ranking of alternative products based on users’ subjective requirements. The KE-GRA-TOPSIS method integrates five methods, including Kansei Engineering (KE), analytic hierarchy process (AHP), entropy, game theory, and grey relational analysis-TOPSIS (GRA-TOPSIS). First, an evaluation system is established by KE and AHP. Second, we define …


Using Big Data Analytics To Improve HIV Medical Care Utilisation In South Carolina: A Study Protocol, Bankole Olatosi, Jiajia Zhang, Sharon Weissman, Jianjun Hu, Mohammad Rifat Haider, Xiaoming Li Jun 2019

Faculty Publications

Introduction: Linkage and retention in HIV medical care remain problematic in the USA. Extensive health utilisation data collection through electronic health records (EHR) and claims data represents new opportunities for scientific discovery. Big data science (BDS) is a powerful tool for investigating HIV care utilisation patterns. The South Carolina (SC) office of Revenue and Fiscal Affairs (RFA) data warehouse captures individual-level longitudinal health utilisation data for persons living with HIV (PLWH). The data warehouse includes EHR, claims and data from private institutions, housing, prisons, mental health, Medicare, Medicaid, State Health Plan and the department of health and human services. The …


Deep Autoencoder Neural Networks For Short-Term Traffic Congestion Prediction Of Transportation Networks, Sen Zhang, Yong Yao, Jie Hu, Yong Zhao, Shaobo Li, Jianjun Hu May 2019

Faculty Publications

Traffic congestion prediction is critical for implementing intelligent transportation systems that improve the efficiency and capacity of transportation networks. However, despite its importance, traffic congestion prediction has been investigated far less than traffic flow prediction, partly due to the severe lack of large-scale high-quality traffic congestion data and advanced algorithms. This paper proposes an accessible and general workflow to acquire large-scale traffic congestion data and to create traffic congestion datasets based on image analysis. With this workflow we create a dataset named Seattle Area Traffic Congestion Status (SATCS) based on traffic congestion map snapshots from a publicly available …


Convolutional Neural Networks For Crystal Material Property Prediction Using Hybrid Orbital-Field Matrix And Magpie Descriptors, Zhuo Cao, Yabo Dan, Zheng Xiong, Chengcheng Niu, Xiang Li, Songrong Qian, Jianjun Hu Apr 2019

Faculty Publications

Computational prediction of crystal material properties can enable large-scale in silico screening. Recent studies in materials informatics have focused on expert design of multi-dimensional interpretable material descriptors/features. However, successes of deep learning models such as Convolutional Neural Networks (CNN) in image recognition and speech recognition have demonstrated their automated feature extraction capability to effectively capture the characteristics of the data and achieve superior prediction performance. Here, we propose CNN-OFM-Magpie, a CNN model with OFM (Orbital-field Matrix) and Magpie descriptors to predict the formation energy of 4030 crystal materials by exploiting the complementarity of two-dimensional OFM features and Magpie features. Experiments …