Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 32

Full-Text Articles in Physical Sciences and Mathematics

Automatically Classifying Non-Functional Requirements With Feature Extraction And Supervised Machine Learning Techniques, Mahtab Ezzatikarami Dec 2020

Automatically Classifying Non-Functional Requirements With Feature Extraction And Supervised Machine Learning Techniques, Mahtab Ezzatikarami

Electronic Thesis and Dissertation Repository

Abstract. Context and Motivation: Non-functional requirements (NFRs) of a system need to be classified into different types such as usability, performance, etc. This would enable stakeholders to ensure the completeness of their work by extracting specific NFRs related to their expertise. Question/Problem: Because of the size and complexity of requirement specification documents, the manual classification of NFRs is time-consuming, labour-intensive, and error-prone. We thus need an automated solution that can provide a highly accurate and efficient categorization of NFRs. Principal ideas/results: In this investigation, using natural language processing and supervised machine learning (SML) techniques, we investigate with feature extraction techniques …


Regional Integration: Physician Perceptions On Electronic Medical Record Use And Impact In South West Ontario, Sadiq Raji Dec 2020

Regional Integration: Physician Perceptions On Electronic Medical Record Use And Impact In South West Ontario, Sadiq Raji

Electronic Thesis and Dissertation Repository

Regional initiatives in the health care context in Canada are typically organized and administered along geographic boundaries or operational units. Regional integration of Electronic Medical Records (EMR) has been continuing across Canadian provinces in recent years, yet the use and impact of regionally integrated EMRs are not routinely assessed and questions remain about their impact on and use in physicians’ practices. Are stated goals of simplifying connections and sharing of electronic health information collected and managed by many health services providers being met? What are physicians’ perspectives on the use and impact of regionally integrated EMR? In this thesis, I …


Applying Front End Compiler Process To Parse Polynomials In Parallel, Amha W. Tsegaye Dec 2020

Applying Front End Compiler Process To Parse Polynomials In Parallel, Amha W. Tsegaye

Electronic Thesis and Dissertation Repository

Parsing large expressions, in particular large polynomial expressions, is an important task for computer algebra systems. Despite of the apparent simplicity of the problem, its efficient software implementation brings various challenges. Among them is the fact that this is a memory bound application for which a multi-threaded implementation is necessarily limited by the characteristics of the memory organization of supporting hardware.

In this thesis, we design, implement and experiment with a multi-threaded parser for large polynomial expressions. We extract parallelism by splitting the input character string, into meaningful sub-strings that can be parsed concurrently before being merged into a single …


Longitudinal Partitioning Waveform Relaxation Methods For The Analysis Of Transmission Line Circuits, Tarik Menkad Dec 2020

Longitudinal Partitioning Waveform Relaxation Methods For The Analysis Of Transmission Line Circuits, Tarik Menkad

Electronic Thesis and Dissertation Repository

Three research projects are presented in this manuscript. Projects one and two describe two waveform relaxation algorithms (WR) with longitudinal partitioning for the time-domain analysis of transmission line circuits. Project three presents theoretical results about the convergence of WR for chains of general circuits.

The first WR algorithm uses a assignment-partition procedure that relies on inserting external series combinations of positive and negative resistances into the circuit to control the speed of convergence of the algorithm. The convergence of the subsequent WR method is examined, and fast convergence is cast as a generic optimization problem in the frequency-domain. An automatic …


Making Sense Of Online Public Health Debates With Visual Analytics Systems, Anton Ninkov Nov 2020

Making Sense Of Online Public Health Debates With Visual Analytics Systems, Anton Ninkov

Electronic Thesis and Dissertation Repository

Online debates occur frequently and on a wide variety of topics. Particularly, online debates about various public health topics (e.g., vaccines, statins, cannabis, dieting plans) are prevalent in today’s society. These debates are important because of the real-world implications they can have on public health. Therefore, it is important for public health stakeholders (i.e., those with a vested interest in public health) and the general public to have the ability to make sense of these debates quickly and effectively. This dissertation investigates ways of enabling sense-making of these debates with the use of visual analytics systems (VASes). VASes are computational …


Exploring Explicit And Implicit Feature Spaces In Natural Language Processing Using Self-Enrichment And Vector Space Analysis, Vincent Sippola Oct 2020

Exploring Explicit And Implicit Feature Spaces In Natural Language Processing Using Self-Enrichment And Vector Space Analysis, Vincent Sippola

Electronic Thesis and Dissertation Repository

Machine Learning in Natural Language Processing (NLP) deals directly with distributed representations of words and sentences. Words are transformed into vectors of real values, called embeddings, and used as the inputs to machine learning models. These architectures are then used to solve NLP tasks such as Sentiment Analysis and Natural Language Inference. While solving these tasks many models will create word embeddings and sentence embeddings as outputs. We are interested in how we can transform and analyze these output embeddings and modify our models, to both improve the task result and give us an understanding of the spaces. To this …


Framework For Kernel Based Bm3d Algorithm, Mena Abdelrahman Massoud Aug 2020

Framework For Kernel Based Bm3d Algorithm, Mena Abdelrahman Massoud

Electronic Thesis and Dissertation Repository

Patch-based approaches such as Block Matching and 3D collaborative Filtering (BM3D) algorithm represent the current state-of-the-art in image denoising. However, BM3D still suffers from degradation in performance in smooth areas as well as loss of image details, specifically in the presence of high noise levels.

Integrating shape adaptive methods with BM3D improves the denoising outcome including the visual quality of the denoised image; and also maintains image details. In this study, we proposed a framework that produces multiple images using various shapes. These images were aggregated at the pixel or patch levels for both stages in BM3D, and when appropriately …


Computational Methods For Predicting Protein-Protein Interactions And Binding Sites, Yiwei Li Aug 2020

Computational Methods For Predicting Protein-Protein Interactions And Binding Sites, Yiwei Li

Electronic Thesis and Dissertation Repository

Proteins are essential to organisms and participate in virtually every process within cells. Quite often, they keep the cells functioning by interacting with other proteins. This process is called protein-protein interaction (PPI). The bonding amino acid residues during the process of protein-protein interactions are called PPI binding sites. Identifying PPIs and PPI binding sites are fundamental problems in system biology.

Experimental methods for solving these two problems are slow and expensive. Therefore, great efforts are being made towards increasing the performance of computational methods.

We present DELPHI, a deep learning based program for PPI site prediction and SPRINT, an algorithmic …


Simulation Approaches To X-Ray C-Arm-Based Interventions, Daniel R. Allen Aug 2020

Simulation Approaches To X-Ray C-Arm-Based Interventions, Daniel R. Allen

Electronic Thesis and Dissertation Repository

Mobile C-Arm systems have enabled interventional spine procedures, such as facet joint injections, to be performed minimally-invasively under X-ray or fluoroscopy guidance. The downside to these procedures is the radiation exposure the patient and medical staff are subject to, which can vary greatly depending on the procedure as well as the skill and experience of the team. Standard training methods for these procedures involve the use of a physical C-Arm with real X-rays training on either cadavers or via an apprenticeship-based program. Many guidance systems have been proposed in the literature which aim to reduce the amount of radiation exposure …


Discrimination Of Leucine And Isoleucine In De Novo Peptide Sequencing Using Deep Neural Networks, Bingran Shen Aug 2020

Discrimination Of Leucine And Isoleucine In De Novo Peptide Sequencing Using Deep Neural Networks, Bingran Shen

Electronic Thesis and Dissertation Repository

De novo peptide sequencing from tandem MS data is a key technology in proteomics for understanding the structure of proteins, especially for first seen sequences. Although this technique has advanced rapidly in recent years and become more effective, one crucial problem remained unsolved. Due to the isomerism of leucine and isoleucine, they are practically indistinguishable in de novo sequencing using traditional tandem MS data. Some experimental attempts have been made to resolve this ambiguity such as EThCD fragmentation process. In this study, we took a data focused approach rather than only looking for characteristic satellite ions produced by the EThCD …


Optimized Machine Learning Models Towards Intelligent Systems, Mohammadnoor Ahmad Mohammad Injadat Jul 2020

Optimized Machine Learning Models Towards Intelligent Systems, Mohammadnoor Ahmad Mohammad Injadat

Electronic Thesis and Dissertation Repository

The rapid growth of the Internet and related technologies has led to the collection of large amounts of data by individuals, organizations, and society in general [1]. However, this often leads to information overload which occurs when the amount of input (e.g. data) a human is trying to process exceeds their cognitive capacities [2]. Machine learning (ML) has been proposed as one potential methodology capable of extracting useful information from large sets of data [1]. This thesis focuses on two applications. The first is education, namely e-Learning environments. Within this field, this thesis proposes different optimized ML ensemble models to …


Automated Anomaly Detection And Localization System For A Microservices Based Cloud System, Priyanka Prakash Naikade Jul 2020

Automated Anomaly Detection And Localization System For A Microservices Based Cloud System, Priyanka Prakash Naikade

Electronic Thesis and Dissertation Repository

Context: With an increasing number of applications running on a microservices-based cloud system (such as AWS, GCP, IBM Cloud), it is challenging for the cloud providers to offer uninterrupted services with guaranteed Quality of Service (QoS) factors. Problem Statement: Existing monitoring frameworks often do not detect critical defects among a large volume of issues generated, thus affecting recovery response times and usage of maintenance human resource. Also, manually tracing the root causes of the issues requires a significant amount of time. Objective: The objective of this work is to: (i) detect performance anomalies, in real-time, through monitoring KPIs (Key Performance …


Visual Analytics Of Electronic Health Records With A Focus On Acute Kidney Injury, Sheikh S. Abdullah Jul 2020

Visual Analytics Of Electronic Health Records With A Focus On Acute Kidney Injury, Sheikh S. Abdullah

Electronic Thesis and Dissertation Repository

The increasing use of electronic platforms in healthcare has resulted in the generation of unprecedented amounts of data in recent years. The amount of data available to clinical researchers, physicians, and healthcare administrators continues to grow, which creates an untapped resource with the ability to improve the healthcare system drastically. Despite the enthusiasm for adopting electronic health records (EHRs), some recent studies have shown that EHR-based systems hardly improve the ability of healthcare providers to make better decisions. One reason for this inefficacy is that these systems do not allow for human-data interaction in a manner that fits and supports …


Reinforcement Learning In Large, Structured Action Spaces: A Simulation Study Of Decision Support For Spinal Cord Injury Rehabilitation, Nathan Phelps Jul 2020

Reinforcement Learning In Large, Structured Action Spaces: A Simulation Study Of Decision Support For Spinal Cord Injury Rehabilitation, Nathan Phelps

Electronic Thesis and Dissertation Repository

Reinforcement learning (RL) has helped improve decision-making in several applications. However, applying traditional RL is challenging in some applications, such as rehabilitation of people with a spinal cord injury (SCI). Among other factors, using RL in this domain is difficult because there are many possible treatments (i.e., large action space) and few patients (i.e., limited training data). Treatments for SCIs have natural groupings, so we propose two approaches to grouping treatments so that an RL agent can learn effectively from limited data. One relies on domain knowledge of SCI rehabilitation and the other learns similarities among treatments using an embedding …


Extensions Of Classification Method Based On Quantiles, Yuanhao Lai Jun 2020

Extensions Of Classification Method Based On Quantiles, Yuanhao Lai

Electronic Thesis and Dissertation Repository

This thesis deals with the problem of classification in general, with a particular focus on heavy-tailed or skewed data. The classification problem is first formalized by statistical learning theory and several important classification methods are reviewed, where the distance-based classifiers, including the median-based classifier and the quantile-based classifier (QC), are especially useful for the heavy-tailed or skewed inputs. However, QC is limited by its model capacity and the issue of high-dimensional accumulated errors. Our objective of this study is to investigate more general methods while retaining the merits of QC.

We present four extensions of QC, which appear in chronological …


Triaging Twitter Users: An Exploratory Visual Analytics System, Parinaz Nasr Esfahani Jun 2020

Triaging Twitter Users: An Exploratory Visual Analytics System, Parinaz Nasr Esfahani

Electronic Thesis and Dissertation Repository

Twitter is one of the most popular microblogging and social networking services. Many people from a wide range of backgrounds use Twitter to contribute their thoughts on different topics through postings, known as ``tweets”. Analysts collect and analyze tweets to extract knowledge. To rely on tweets, it is crucial to assess Twitter users’ credibility. In recent years, researchers have proposed various techniques, especially data analytics models, for evaluating Twitter users and analyzing their behaviour; however, these techniques do not engage analysts in the process, leading to a lack of understanding and trust in results. In this thesis, an exploratory visual …


Design Of Interactive Visualizations For Next-Generation Ultra-Large Communication Networks, Wenjun Chen Jun 2020

Design Of Interactive Visualizations For Next-Generation Ultra-Large Communication Networks, Wenjun Chen

Electronic Thesis and Dissertation Repository

With the increasing size and complexity of next-generation communication networks, it is critical to utilize interactive information visualization to support the network monitoring, planning, and management. Effectively visualizing large-scale networks has been considered difficult with traditional methods because of the high link density and complicated node relationship. Given the limited screen space, it is essential to explore how to present ultra-large-scale network data efficiently that can assist Internet Service Provider’s (ISP) network planning and management activities. This work proposes a design of the real-time interactive visualization system that combines the idea of progressive disclosure and multiple panels to elegantly visualize …


A Hybrid Approach To Procedural Dungeon Generation, Mathias Paul Babin Jun 2020

A Hybrid Approach To Procedural Dungeon Generation, Mathias Paul Babin

Electronic Thesis and Dissertation Repository

This thesis presents a novel approach to the Procedural Content Generation (PCG) of both maze and dungeon environments. The solution we propose in this thesis borrows techniques from both Procedural Content Generation via Machine Learning as well as Constructive PCG methods. The approach we take involves decomposing the problem of level generation into a series of stages which begins with the production of macro-level functional structures and ends with micro-level aesthetic details; specifically, we train a Deep Convolutional Neural Network to produce high-quality mazes, which in turn, are transformed into the rooms of larger dungeon levels using a constructive algorithm. …


Machine Learning With Digital Signal Processing For Rapid And Accurate Alignment-Free Genome Analysis: From Methodological Design To A Covid-19 Case Study, Gurjit Singh Randhawa Jun 2020

Machine Learning With Digital Signal Processing For Rapid And Accurate Alignment-Free Genome Analysis: From Methodological Design To A Covid-19 Case Study, Gurjit Singh Randhawa

Electronic Thesis and Dissertation Repository

In the field of bioinformatics, taxonomic classification is the scientific practice of identifying, naming, and grouping of organisms based on their similarities and differences. The problem of taxonomic classification is of immense importance considering that nearly 86% of existing species on Earth and 91% of marine species remain unclassified. Due to the magnitude of the datasets, the need exists for an approach and software tool that is scalable enough to handle large datasets and can be used for rapid sequence comparison and analysis. We propose ML-DSP, a stand-alone alignment-free software tool that uses Machine Learning and Digital Signal Processing to …


Edge-Cloud Iot Data Analytics: Intelligence At The Edge With Deep Learning, Ananda Mohon M. Ghosh May 2020

Edge-Cloud Iot Data Analytics: Intelligence At The Edge With Deep Learning, Ananda Mohon M. Ghosh

Electronic Thesis and Dissertation Repository

Rapid growth in numbers of connected devices, including sensors, mobile, wearable, and other Internet of Things (IoT) devices, is creating an explosion of data that are moving across the network. To carry out machine learning (ML), IoT data are typically transferred to the cloud or another centralized system for storage and processing; however, this causes latencies and increases network traffic. Edge computing has the potential to remedy those issues by moving computation closer to the network edge and data sources. On the other hand, edge computing is limited in terms of computational power and thus is not well suited for …


Range Flow: New Algorithm Design And Quantitative And Qualitative Analysis, Seereen Noorwali Apr 2020

Range Flow: New Algorithm Design And Quantitative And Qualitative Analysis, Seereen Noorwali

Electronic Thesis and Dissertation Repository

Optical flow computation is one of the oldest and most active research fields in computer vision and image processing. It encompasses the following areas: motion estimation, video compression, object detection and tracking, image dominant plane extraction, movement detection, robot navigation, visual odometry, traffic analysis, and vehicle tracking. Optical flow methods calculate the motion between two image frames. In 2D images, optical flow specifies how far each pixel moves between adjacent frames; in 3D images, it specifies how much each voxel moves between adjacent volumes in the dataset. Since 1980, several algorithms have successfully estimated 2D and 3D optical flow. Notably, …


Identifying External Cross-References Using Natural Language Processing (Nlp), Elham Rahmani Apr 2020

Identifying External Cross-References Using Natural Language Processing (Nlp), Elham Rahmani

Electronic Thesis and Dissertation Repository

[Context and motivation] Software engineers build systems that need to be compliant with relevant regulations. These regulations are stated in authoritative documents from which regulatory requirements need to be elicited. Project contract contains cross-references to these regulatory requirements in external documents. [Problem] Exploring and identifying the regulatory requirements in voluminous textual data is enormously time consuming, and hence costly, and error-prone in sizable software projects. [Principal idea and novelty] We use Natural Language Processing (NLP), Pattern Recognition and Web Scrapping techniques for automatically extracting external cross-references from contractual requirements and prepare a map for representing related external cross-references …


A Visual Analytics System For Investigating Multimorbidity Using Supervised Machine Learning, Maede Sadat Nouri Apr 2020

A Visual Analytics System For Investigating Multimorbidity Using Supervised Machine Learning, Maede Sadat Nouri

Electronic Thesis and Dissertation Repository

Patterns of multimorbidity are complex and difficult to summarise using static visualization techniques like tables and charts. We present a visual analytics system with the goal of facilitating the process of making sense of data collected from patients with multimorbidity. The system reveals underlying patterns in the data visually and interactively, which enables users to easily assess both prevalence and correlation estimates of different chronic diseases among multimorbid patients with varying characteristics. To do so, the system uses count-based conditional probability, binary logistic regression, softmax regression and decision tree models to dynamically compute and visualize prevalence and correlation estimates for …


Physical Dispersions Of Meteor Showers Through High Precision Optical Observations, Denis Vida Apr 2020

Physical Dispersions Of Meteor Showers Through High Precision Optical Observations, Denis Vida

Electronic Thesis and Dissertation Repository

Meteoroids ejected from comets form meteoroid streams which disperse over time due to gravitational perturbations and non-gravitational forces. When stream meteoroids collide with the Earth's atmosphere, they are visible as meteors emanating from a common point-like area (radiant) in the sky. Measuring the size of meteor shower radiant areas can provide insight into stream formation and age. The tight radiant dispersion of young streams are difficult to determine due to measurement error, but if successfully measured, the dispersion could be used to constrain meteoroid ejection velocities from their parent comets. The estimated ejection velocity is an uncertain, model-dependent value with …


A Requirements Measurement Program For Systems Engineering Projects: Metrics, Indicators, Models, And Tools For Internal Stakeholders, Ibtehal Noorwali Apr 2020

A Requirements Measurement Program For Systems Engineering Projects: Metrics, Indicators, Models, And Tools For Internal Stakeholders, Ibtehal Noorwali

Electronic Thesis and Dissertation Repository

Software engineering (SE) measurement has shown to lead to improved quality and productivity in software and systems projects and, thus, has received significant attention in the literature, particularly for the design and development stages. In requirements engineering (RE), research and practice has recognized the importance of requirements measurement (RM) for tracking progress, identifying gaps in downstream deliverables related to requirements, managing requirements-related risks, reducing requirements errors and defects, and project management and decision making.

However, despite the recognized benefits of RM, research indicates that only 5\% of the literature on SE measurement addresses requirements. This small percentage is reflected in …


Deep Learning On Smart Meter Data: Non-Intrusive Load Monitoring And Stealthy Black-Box Attacks, Junfei Wang Apr 2020

Deep Learning On Smart Meter Data: Non-Intrusive Load Monitoring And Stealthy Black-Box Attacks, Junfei Wang

Electronic Thesis and Dissertation Repository

Climate change and environmental concerns are instigating widespread changes in modern electricity sectors due to energy policy initiatives and advances in sustainable technologies. To raise awareness of sustainable energy usage and capitalize on advanced metering infrastructure (AMI), a novel deep learning non-intrusive load monitoring (NILM) model is proposed to disaggregate smart meter readings and identify the operation of individual appliances. This model can be used by Electric power utility (EPU) companies and third party entities, and then utilized to perform active or passive consumer power demand management. Although machine learning (ML) algorithms are powerful, these remain vulnerable to adversarial attacks. …


Investigating Citation Linkage As A Sentence Similarity Measurement Task Using Deep Learning, Sudipta Singha Roy Mar 2020

Investigating Citation Linkage As A Sentence Similarity Measurement Task Using Deep Learning, Sudipta Singha Roy

Electronic Thesis and Dissertation Repository

Research publications reflect advancements in the corresponding research domain. In these research publications, scientists often use citations to bolster the presented research findings and portray the improvements that come with these findings, at the same time, to make the contents more understandable to the audience by navigating the flow of information. In the science domain, a citation refers to the document from where this information originates but doesn't specify the text span that is actually being cited. A more precise reference would indicate the text being referenced. This thesis develops a framework which can create a linkage between the citing …


Hierarchical Group And Attribute-Based Access Control: Incorporating Hierarchical Groups And Delegation Into Attribute-Based Access Control, Daniel Servos Mar 2020

Hierarchical Group And Attribute-Based Access Control: Incorporating Hierarchical Groups And Delegation Into Attribute-Based Access Control, Daniel Servos

Electronic Thesis and Dissertation Repository

Attribute-Based Access Control (ABAC) is a promising alternative to traditional models of access control (i.e. Discretionary Access Control (DAC), Mandatory Access Control (MAC) and Role-Based Access control (RBAC)) that has drawn attention in both recent academic literature and industry application. However, formalization of a foundational model of ABAC and large-scale adoption is still in its infancy. The relatively recent popularity of ABAC still leaves a number of problems unexplored. Issues like delegation, administration, auditability, scalability, hierarchical representations, etc. have been largely ignored or left to future work. This thesis seeks to aid in the adoption of ABAC by filling in …


Driving Maneuver Detection Using Knowledge Distillation Networks, Kyle Windsor Mar 2020

Driving Maneuver Detection Using Knowledge Distillation Networks, Kyle Windsor

Electronic Thesis and Dissertation Repository

In this thesis, we examine the current state of Advanced Driving Assistance Systems (ADAS) and their relation to maneuver prediction in the literature. We then attempt to solve the problem of variable inter-driver behavior by applying a novel distillation learning system using RoadLab data on tracked driver cephalo-ocular gaze behavior in tandem with high-resolution CANbus data. Current training-based methods in maneuver prediction are potentially subject to underfitting as drivers may exhibit different behavior when preparing to maneuver, but it has been shown that drivers can be grouped into at least two distinct behavior models. We use this information to personalize …


Network Impact Modeling And Analysis: A Qos Perspective, Tarandeep K. Randhawa Feb 2020

Network Impact Modeling And Analysis: A Qos Perspective, Tarandeep K. Randhawa

Electronic Thesis and Dissertation Repository

The International Data Corporation (IDC) estimated that total digital data created, replicated, and consumed was 4.4 Zettabytes (ZB) in the year 2013, 8 ZB in 2015, and predicted to reach 40 ZB by 2020. This massive amount of internet traffic put a great overhead on network capacity which may impact network Quality of Service (QoS) such as latency, jitter, throughput, packet loss, and load balancing. From the Internet Service Provider’s (ISP’s) perspective, understanding the possible impact of the future internet traffic on its network is critical for provisioning their network capacity in a cost-effective manner while meeting network QoS requirements. …