Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 17 of 17

Full-Text Articles in Entire DC Network

Machine Learning-Based Models For Assessing Impacts Before, During And After Hurricane Events, Julie L. Harvey Sep 2019

Machine Learning-Based Models For Assessing Impacts Before, During And After Hurricane Events, Julie L. Harvey

Electronic Theses and Dissertations

Social media provides an abundant amount of real-time information that can be used before, during, and after extreme weather events. Government officials, emergency managers, and other decision makers can use social media data for decision-making, preparation, and assistance. Machine learning-based models can be used to analyze data collected from social media. Social media data and cloud cover temperature as physical sensor data was analyzed in this study using machine learning techniques. Data was collected from Twitter regarding Hurricane Florence from September 11, 2018 through September 20, 2018 and Hurricane Michael from October 1, 2018 through October 18, 2018. Natural language …


An Explainable Recommender System Based On Semantically-Aware Matrix Factorization., Mohammed Sanad Alshammari Aug 2019

An Explainable Recommender System Based On Semantically-Aware Matrix Factorization., Mohammed Sanad Alshammari

Electronic Theses and Dissertations

Collaborative Filtering techniques provide the ability to handle big and sparse data to predict the ratings for unseen items with high accuracy. Matrix factorization is an accurate collaborative filtering method used to predict user preferences. However, it is a black box system that recommends items to users without being able to explain why. This is due to the type of information these systems use to build models. Although rich in information, user ratings do not adequately satisfy the need for explanation in certain domains. White box systems, in contrast, can, by nature, easily generate explanations. However, their predictions are less …


Clustering Of Multiple Instance Data., Andrew D. Karem May 2019

Clustering Of Multiple Instance Data., Andrew D. Karem

Electronic Theses and Dissertations

An emergent area of research in machine learning that aims to develop tools to analyze data where objects have multiple representations is Multiple Instance Learning (MIL). In MIL, each object is represented by a bag that includes a collection of feature vectors called instances. A bag is positive if it contains at least one positive instance, and negative if no instances are positive. One of the main objectives in MIL is to identify a region in the instance feature space with high correlation to instances from positive bags and low correlation to instances from negative bags -- this region is …


Clustering Heterogeneous Autism Spectrum Disorder Data., Mariem Boujelbene May 2019

Clustering Heterogeneous Autism Spectrum Disorder Data., Mariem Boujelbene

Electronic Theses and Dissertations

Autism spectrum disorder (ASD) is a developmental disorder that affects communication and behavior. Several studies have been conducted in the past years to develop a better understanding of the disease and therefore a better diagnosis and a better treatment by analyzing diverse data sets consisting of behavioral surveys and tests, phenotype description, and brain imagery. However, data analysis is challenged by the diversity, complexity and heterogeneity of patient cases and by the need for integrating diverse data sets to reach a better understanding of ASD. The aim of our study is to mine homogeneous groups of patients from a heterogeneous …


Modeling And Counteracting Exposure Bias In Recommender Systems., Sami Khenissi May 2019

Modeling And Counteracting Exposure Bias In Recommender Systems., Sami Khenissi

Electronic Theses and Dissertations

Recommender systems are becoming widely used in everyday life. They use machine learning algorithms which learn to predict our preferences and thus influence our choices among a staggering array of options online, such as movies, books, products, and even news articles. Thus what we discover and see online, and consequently our opinions and decisions, are becoming increasingly affected by automated predictions made by learning machines. Similarly, the predictive accuracy of these learning machines heavily depends on the feedback data, such as ratings and clicks, that we provide them. This mutual influence can lead to closed-loop interactions that may cause unknown …


Studying And Handling Iterated Algorithmic Biases In Human And Machine Learning Interaction., Wenlong Sun May 2019

Studying And Handling Iterated Algorithmic Biases In Human And Machine Learning Interaction., Wenlong Sun

Electronic Theses and Dissertations

Algorithmic bias consists of biased predictions born from ingesting unchecked information, such as biased samples and biased labels. Furthermore, the interaction between people and algorithms can exacerbate bias such that neither the human nor the algorithms receive unbiased data. Thus, algorithmic bias can be introduced not only before and after the machine learning process but sometimes also in the middle of the learning process. With a handful of exceptions, only a few categories of bias have been studied in Machine Learning, and there are few, if any, studies of the impact of bias on both human behavior and algorithm performance. …


Supervised Machine Learning Techniques For Short-Term Load Forecasting, Harish Amarasundar Jan 2019

Supervised Machine Learning Techniques For Short-Term Load Forecasting, Harish Amarasundar

Electronic Theses and Dissertations

Electric Load Forecasting is essential for the utility companies for energy management based on the demand. Machine Learning Algorithms has been in the forefront for prediction algorithms. This Thesis is mainly aimed to provide utility companies with a better insight about the wide range of Techniques available to forecast the load demands based on different scenarios. Supervised Machine Learning Algorithms were used to come up with the best possible solution for Short-Term Electric Load forecasting. The input Data set has the hourly load values, Weather data set and other details of a Day. The models were evaluated using MAPE and …


Deep Learning-Based Visual Crack Detection Using Google Street View Images, Mohsen Maniat Jan 2019

Deep Learning-Based Visual Crack Detection Using Google Street View Images, Mohsen Maniat

Electronic Theses and Dissertations

The need for developing an economical and efficient quality assessment system for pave-ment motivates this study to take advantage of available new technologies and provide a novel approach to address this need. In this study, the utility of using Google Street View (GSV) for evaluating the quality of pavement is investigated. GSV is a technology featured in Google Maps and Google Earth that provides interactive panoramas along many streets throughout the world. This technology provides a large data set of pavement images that can be used for pavement evaluation. Advanced deep learning algorithms are utilized to automate the pavement assessment …


Applying Machine Learning Algorithms For The Analysis Of Biological Sequences And Medical Records, Shaopeng Gu Jan 2019

Applying Machine Learning Algorithms For The Analysis Of Biological Sequences And Medical Records, Shaopeng Gu

Electronic Theses and Dissertations

The modern sequencing technology revolutionizes the genomic research and triggers explosive growth of DNA, RNA, and protein sequences. How to infer the structure and function from biological sequences is a fundamentally important task in genomics and proteomics fields. With the development of statistical and machine learning methods, an integrated and user-friendly tool containing the state-of-the-art data mining methods are needed. Here, we propose SeqFea-Learn, a comprehensive Python pipeline that integrating multiple steps: feature extraction, dimensionality reduction, feature selection, predicting model constructions based on machine learning and deep learning approaches to analyze sequences. We used enhancers, RNA N6- methyladenosine sites and …


Applied Deep Learning In Orthopaedics, William Stewart Burton Ii Jan 2019

Applied Deep Learning In Orthopaedics, William Stewart Burton Ii

Electronic Theses and Dissertations

The reemergence of deep learning in recent years has led to its successful application in a wide variety of fields. As a subfield of machine learning, deep learning offers an array of powerful algorithms for data-driven applications. Orthopaedics stands to benefit from the potential of deep learning for advancements in the field. This thesis investigated applications of deep learning for the field of orthopaedics through the development of three distinct projects.

First, algorithms were developed for the automatic segmentation of the structures in the knee from MRI. The resulting algorithms can be used to accurately segment full MRI scans in …


Multi-Step Forecast Of The Implied Volatility Surface Using Deep Learning, Nikita Medvedev Jan 2019

Multi-Step Forecast Of The Implied Volatility Surface Using Deep Learning, Nikita Medvedev

Electronic Theses and Dissertations

Implied volatility is an essential input to price an option. Machine learning architectures have shown strengths in learning option pricing formulas and estimating implied volatility cross-sectionally. However, implied volatility time series forecasting is typically done using the univariate time series and often for short intervals. When a univariate implied volatility series is forecasted, important implied volatility properties such as volatility skew and the term structure are lost. More importantly, short term forecasts can’t take advantage of the long term persistence in the volatility series. The thesis attempts to bridge the gap between machine learning-based implied volatility modeling and multivariate multi-step …


Real-Time Automated Weld Quality Analysis From Ultrasonic B-Scan Using Deep Learning, Zarreen Naowal Reza Jan 2019

Real-Time Automated Weld Quality Analysis From Ultrasonic B-Scan Using Deep Learning, Zarreen Naowal Reza

Electronic Theses and Dissertations

Resistance spot welding is a widely used process for joining metals using electrically generated heat or Joule heating. It is one of the most commonly used techniques in automotive industry to weld sheet metals in order to form a car body. Although, industrial robots are used as automated spot welders in massive scale in the industries, the weld quality inspection process still requires human involvement to decide if a weld should be passed as acceptable or not. Not only it is a tedious and error- prone job, but also it costs industries lots of time and money. Therefore, making this …


Computational Drug Repurposing For Breast Cancer Subtypes, Roopesh Dhara Jan 2019

Computational Drug Repurposing For Breast Cancer Subtypes, Roopesh Dhara

Electronic Theses and Dissertations

Breast cancer makes up 25 percent of all new cancer diagnoses globally according to the American Cancer Society(ACS). Developing a highly effective drug can be a time consuming and an expensive ordeal. Drug repurposing is a tremendous approach which takes away some disadvantages of traditional drug development procedures making it both time and cost effective. In this thesis, we are interested in finding good drugs for each of the ten subtypes of breast cancer. Repurposing incorporates identifying unique indications of pre-approved drugs and utilizing them to observe the anti-correlation between the perturbation data and disease data. If anti-correlation, whether it …


Data Patterns Discovery Using Unsupervised Learning, Rachel A. Lewis Jan 2019

Data Patterns Discovery Using Unsupervised Learning, Rachel A. Lewis

Electronic Theses and Dissertations

Self-care activities classification poses significant challenges in identifying children’s unique functional abilities and needs within the exceptional children healthcare system. The accuracy of diagnosing a child's self-care problem, such as toileting or dressing, is highly influenced by an occupational therapists’ experience and time constraints. Thus, there is a need for objective means to detect and predict in advance the self-care problems of children with physical and motor disabilities. We use clustering to discover interesting information from self-care problems, perform automatic classification of binary data, and discover outliers. The advantages are twofold: the advancement of knowledge on identifying self-care problems in …


Regression Tree Construction For Reinforcement Learning Problems With A General Action Space, Anthony S. Bush Jr Jan 2019

Regression Tree Construction For Reinforcement Learning Problems With A General Action Space, Anthony S. Bush Jr

Electronic Theses and Dissertations

Part of the implementation of Reinforcement Learning is constructing a regression of values against states and actions and using that regression model to optimize over actions for a given state. One such common regression technique is that of a decision tree; or in the case of continuous input, a regression tree. In such a case, we fix the states and optimize over actions; however, standard regression trees do not easily optimize over a subset of the input variables\cite{Card1993}. The technique we propose in this thesis is a hybrid of regression trees and kernel regression. First, a regression tree splits over …


Ifocus: A Framework For Non-Intrusive Assessment Of Student Attention Level In Classrooms, Narayanan Veliyath Jan 2019

Ifocus: A Framework For Non-Intrusive Assessment Of Student Attention Level In Classrooms, Narayanan Veliyath

Electronic Theses and Dissertations

The process of learning is not merely determined by what the instructor teaches, but also by how the student receives that information. An attentive student will naturally be more open to obtaining knowledge than a bored or frustrated student. In recent years, tools such as skin temperature measurements and body posture calculations have been developed for the purpose of determining a student's affect, or emotional state of mind. However, measuring eye-gaze data is particularly noteworthy in that it can collect measurements non-intrusively, while also being relatively simple to set up and use. This paper details how data obtained from such …


Computer Vision-Based Traffic Sign Detection And Extraction: A Hybrid Approach Using Gis And Machine Learning, Zihao Wu Jan 2019

Computer Vision-Based Traffic Sign Detection And Extraction: A Hybrid Approach Using Gis And Machine Learning, Zihao Wu

Electronic Theses and Dissertations

Traffic sign detection and positioning have drawn considerable attention because of the recent development of autonomous driving and intelligent transportation systems. In order to detect and pinpoint traffic signs accurately, this research proposes two methods. In the first method, geo-tagged Google Street View images and road networks were utilized to locate traffic signs. In the second method, both traffic signs categories and locations were identified and extracted from the location-based GoPro video. TensorFlow is the machine learning framework used to implement these two methods. To that end, 363 stop signs were detected and mapped accurately using the first method (Google …