Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Institution
- Keyword
-
- Machine learning (8)
- Deep learning (6)
- Machine Learning (5)
- Logistic regression (3)
- Support Vector Machine (3)
-
- Twitter (3)
- Unsupervised learning (3)
- Accuracy (2)
- Big data (2)
- Data mining (2)
- GPU (2)
- Natural language processing (2)
- Sentiment analysis (2)
- Social media (2)
- Supervised learning (2)
- 2D maps (1)
- 911 (1)
- ADF (1)
- ARIMA (1)
- ARWU world university rankings (1)
- Active data bundle (ADB) (1)
- Adversarial learning (1)
- Algorithmic efficiency (1)
- Algorithms (1)
- Anomaly (1)
- Approximate algorithms (1)
- Area under the ROC curve (1)
- Artifact removal (1)
- Artificial intellgence (1)
- Artificial intelligence (1)
- Publication Type
Articles 1 - 30 of 39
Full-Text Articles in Physical Sciences and Mathematics
Quantitative Metrics For Mutation Testing, Amani M. Ayad
Quantitative Metrics For Mutation Testing, Amani M. Ayad
Dissertations
Program mutation is the process of generating versions of a base program by applying elementary syntactic modifications; this technique has been used in program testing in a variety of applications, most notably to assess the quality of a test data set. A good test set will discover the difference between the original program and mutant except if the mutant is semantically equivalent to the original program, despite being syntactically distinct.
Equivalent mutants are a major nuisance in the practice of mutation testing, because they introduce a significant amount of bias and uncertainty in the analysis of test results; indeed, mutants …
Early Detection Of Fake News On Social Media, Yang Liu
Early Detection Of Fake News On Social Media, Yang Liu
Dissertations
The ever-increasing popularity and convenience of social media enable the rapid widespread of fake news, which can cause a series of negative impacts both on individuals and society. Early detection of fake news is essential to minimize its social harm. Existing machine learning approaches are incapable of detecting a fake news story soon after it starts to spread, because they require certain amounts of data to reach decent effectiveness which take time to accumulate. To solve this problem, this research first analyzes and finds that, on social media, the user characteristics of fake news spreaders distribute significantly differently from those …
Bio-Inspired Learning And Hardware Acceleration With Emerging Memories, Shruti R. Kulkarni
Bio-Inspired Learning And Hardware Acceleration With Emerging Memories, Shruti R. Kulkarni
Dissertations
Machine Learning has permeated many aspects of engineering, ranging from the Internet of Things (IoT) applications to big data analytics. While computing resources available to implement these algorithms have become more powerful, both in terms of the complexity of problems that can be solved and the overall computing speed, the huge energy costs involved remains a significant challenge. The human brain, which has evolved over millions of years, is widely accepted as the most efficient control and cognitive processing platform. Neuro-biological studies have established that information processing in the human brain relies on impulse like signals emitted by neurons called …
Cancer Risk Prediction With Whole Exome Sequencing And Machine Learning, Abdulrhman Fahad M Aljouie
Cancer Risk Prediction With Whole Exome Sequencing And Machine Learning, Abdulrhman Fahad M Aljouie
Dissertations
Accurate cancer risk and survival time prediction are important problems in personalized medicine, where disease diagnosis and prognosis are tuned to individuals based on their genetic material. Cancer risk prediction provides an informed decision about making regular screening that helps to detect disease at the early stage and therefore increases the probability of successful treatments. Cancer risk prediction is a challenging problem. Lifestyle, environment, family history, and genetic predisposition are some factors that influence the disease onset. Cancer risk prediction based on predisposing genetic variants has been studied extensively. Most studies have examined the predictive ability of variants in known …
Using Machine Learning Classification Methods To Detect The Presence Of Heart Disease, Nestor Pereira
Using Machine Learning Classification Methods To Detect The Presence Of Heart Disease, Nestor Pereira
Dissertations
Cardiovascular disease (CVD) is the most common cause of death in Ireland, and probably, worldwide. According to the Health Service Executive (HSE) cardiovascular disease accounting for 36% of all deaths, and one important fact, 22% of premature deaths (under age 65) are from CVD.
Using data from the Heart Disease UCI Data Set (UCI Machine Learning), we use machine learning techniques to detect the presence or absence of heart disease in the patient according to 14 features provide for this dataset. The different results are compared based on accuracy performance, confusion matrix and area under the Receiver Operating Characteristics (ROC) …
Factor Analysis Of Mixed Data (Famd) And Multiple Linear Regression In R, Nestor Pereira
Factor Analysis Of Mixed Data (Famd) And Multiple Linear Regression In R, Nestor Pereira
Dissertations
In the previous projects, it has been worked to statistically analysis of the factors to impact the score of the subjects of Mathematics and Portuguese for several groups of the student from secondary school from Portugal.
In this project will be interested in finding a model, hypothetically multiple linear regression, to predict the final score, dependent variable G3, of the student according to some features divide into two groups. One group, analyses the features or predictors which impact in the final score more related to the performance of the students, means variables like study time or past failures. The second …
Social Media Sentiment Analysis With A Deep Neural Network: An Enhanced Approach Using User Behavioral Information, Ahmed Sulaiman M. Alharbi
Social Media Sentiment Analysis With A Deep Neural Network: An Enhanced Approach Using User Behavioral Information, Ahmed Sulaiman M. Alharbi
Dissertations
Sentiment analysis on social media such as Twitter has become a very important and challenging task. Due to the characteristics of such data (including tweet length, spelling errors, abbreviations, and special characters), the sentiment analysis task in such an environment requires a non-traditional approach. Moreover, social media sentiment analysis constitutes a fundamental problem with many interesting applications, such as for Business Intelligence, Medical Monitoring, and National Security. Most current social media sentiment classification methods judge the sentiment polarity primarily according to textual content and neglect other information on these platforms. In this research, we propose deep learning based frameworks that …
Scalable Algorithms And Hybrid Parallelization Strategies For Multivariate Integration With Paradapt And Cuda, Omofolakunmi Elizabeth Olagbemi
Scalable Algorithms And Hybrid Parallelization Strategies For Multivariate Integration With Paradapt And Cuda, Omofolakunmi Elizabeth Olagbemi
Dissertations
The evaluation of numerical integrals finds applications in fields such as High Energy Physics, Bayesian Statistics, Stochastic Geometry, Molecular Modeling and Medical Physics. The erratic behavior of some integrands due to singularities, peaks, or ridges in the integration region suggests the need for reliable algorithms and software that not only provide an estimation of the integral with a level of accuracy acceptable to the user, but also perform this task in a timely manner. We developed ParAdapt, a numerical integration software based on a classic global adaptive strategy, which employs Graphical Processing Units (GPUs) in providing integral evaluations. Specifically, ParAdapt …
Toward Self-Reconfigurable Parametric Systems: Reinforcement Learning Approach, Ting-Yu Mu
Toward Self-Reconfigurable Parametric Systems: Reinforcement Learning Approach, Ting-Yu Mu
Dissertations
For the ongoing advancement of the fields of Information Technology (IT) and Computer Science, machine learning-based approaches are utilized in different ways in order to solve the problems that belong to the Nondeterministic Polynomial time (NP)-hard complexity class or to approximate the problems if there is no known efficient way to find a solution. Problems that determine the proper set of reconfigurable parameters of parametric systems to obtain the near optimal performance are typically classified as NP-hard problems with no efficient mathematical models to obtain the best solutions. This body of work aims to advance the knowledge of machine learning …
Developing A Computational Framework For A Construction Scheduling Decision Support Web Based Expert System, Feroz Ahmed
Developing A Computational Framework For A Construction Scheduling Decision Support Web Based Expert System, Feroz Ahmed
Dissertations
Decision-making is one of the basic cognitive processes of human behaviors by which a preferred option or a course of action is chosen from among a set of alternatives based on certain criteria. Decision-making is the thought process of selecting a logical choice from the available options. When trying to make a good decision, all the positives and negatives of each option should be evaluated. This decision-making process is particularly challenging during the preparation of a construction schedule, where it is difficult for a human to analyze all possible outcomes of each and every situation because, construction of a project …
A Framework For Personalized Content Recommendations To Support Informal Learning In Massively Diverse Information Wikis, Heba M Ismail
A Framework For Personalized Content Recommendations To Support Informal Learning In Massively Diverse Information Wikis, Heba M Ismail
Dissertations
Personalization has proved to achieve better learning outcomes by adapting to specific learners’ needs, interests, and/or preferences. Traditionally, most personalized learning software systems focused on formal learning. However, learning personalization is not only desirable for formal learning, it is also required for informal learning, which is self-directed, does not follow a specified curriculum, and does not lead to formal qualifications. Wikis among other informal learning platforms are found to attract an increasing attention for informal learning, especially Wikipedia. The nature of wikis enables learners to freely navigate the learning environment and independently construct knowledge without being forced to follow a …
Optimal Sampling Paths For Autonomous Vehicles In Uncertain Ocean Flows, Andrew J. De Stefan
Optimal Sampling Paths For Autonomous Vehicles In Uncertain Ocean Flows, Andrew J. De Stefan
Dissertations
Despite an extensive history of oceanic observation, researchers have only begun to build a complete picture of oceanic currents. Sparsity of instrumentation has created the need to maximize the information extracted from every source of data in building this picture. Within the last few decades, autonomous vehicles, or AVs, have been employed as tools to aid in this research initiative. Unmanned and self-propelled, AVs are capable of spending weeks, if not months, exploring and monitoring the oceans. However, the quality of data acquired by these vehicles is highly dependent on the paths along which they collect their observational data. The …
Applied Deep Learning In Intelligent Transportation Systems And Embedding Exploration, Xiaoyuan Liang
Applied Deep Learning In Intelligent Transportation Systems And Embedding Exploration, Xiaoyuan Liang
Dissertations
Deep learning techniques have achieved tremendous success in many real applications in recent years and show their great potential in many areas including transportation. Even though transportation becomes increasingly indispensable in people’s daily life, its related problems, such as traffic congestion and energy waste, have not been completely solved, yet some problems have become even more critical. This dissertation focuses on solving the following fundamental problems: (1) passenger demand prediction, (2) transportation mode detection, (3) traffic light control, in the transportation field using deep learning. The dissertation also extends the application of deep learning to an embedding system for visualization …
Predicting The Complexity Of Locality Patterns In Loop Nests In C Scientific Programs, Nasser M. Alsaedi
Predicting The Complexity Of Locality Patterns In Loop Nests In C Scientific Programs, Nasser M. Alsaedi
Dissertations
On modern computer systems, the performance of an application depends on its locality. Most existing locality measurements performed by compiler static analysis mainly target analyzing regular array references in loop nests. Measurements based on compiler static analysis have limited applicability when the loop bounds are unknown at compile time, when the control flow is dynamic, or when index arrays or pointer operations are used. In addition, compiler static analysis cannot adapt to input change.
Training-based locality analysis predicts the data reuse change across program inputs to provide run-time information. This analysis quantifies the number of unique memory locations accessed between …
Enhancing Scalability In Genetic Programming With Adaptable Constraints, Type Constraints And Automatically Defined Functions, George Gerules
Enhancing Scalability In Genetic Programming With Adaptable Constraints, Type Constraints And Automatically Defined Functions, George Gerules
Dissertations
Genetic Programming is a type of biological inspired machine learning. It is composed of a population of stochastic individuals. Those individuals can exchange portions of themselves with others in the population through the crossover operation that draws its inspiration from biology. Other biologically inspired operations include mutation and reproduction. The form an individual takes can be many things. It, however, is represented most of the time as a computer program. Constructing correct efficient programs can be notoriously difficult. Various grammar, typing, function constraint, or counting mechanisms can guide creation and evolution of those individuals. These mechanisms can reduce search space …
Exploring The Dynamics Of Scientific Research, Shilpa Lakhanpal
Exploring The Dynamics Of Scientific Research, Shilpa Lakhanpal
Dissertations
Scientific research papers present the research endeavors of numerous scientists around the world, and are documented across multitudes of technical conference proceedings, and other such publications. Given the plethora of such research data, if we could automate the extraction of key interesting areas of research, and provide access to this new information, it would make literature searches incredibly easier for researchers. This in turn could be very useful for them in furthering their research agenda. With this goal in mind, we have endeavored to provide such solutions through our research. Specifically, the focus of our research is to design, analyze …
High-Performance Quasi-Monte Carlo Integration And Applications, Ahmed Hassan H. Almulihi
High-Performance Quasi-Monte Carlo Integration And Applications, Ahmed Hassan H. Almulihi
Dissertations
While adaptive integration by region partitioning is generally effective in low dimensions, quasi-Monte Carlo methods can be used for integral approximations in moderate to high dimensions. Important application areas include high-energy physics, statistics, computational finance and stochastic geometry with applications in robotics, tessellations and imaging from medical data using tetrahedral meshes.
Lattice rule integration is a class of quasi-Monte Carlo methods, implemented by an equal-weight cubature formula and suited for fairly smooth functions. Successful methods to construct these rules are the component-by-component (CBC) algorithm by Sloan and Restsov (2001) and the fast algorithm for CBC by Nuyens and Cools (2006). …
Approximate Algorithms For Regulatory Motif Discovery In Dna, Hasnaa Imad Al-Shaikhli
Approximate Algorithms For Regulatory Motif Discovery In Dna, Hasnaa Imad Al-Shaikhli
Dissertations
Motif discovery is the problem of finding common substrings within a set of biological strings. Therefore it can be applied to finding Transcription Factor Binding Sites (TFBS) that have common patterns (motifs). A transcription factor molecule can bind to multiple binding sites in the promoter region of different genes to make these genes co-regulating. The Planted (l, d) Motif Problem (PMP) is a classic version of motif discovery where l is the motif length and d represents the maximum allowed mutation distance. The quorum Planted (l, d, q) Motif Problem (qPMP) is a version of PMP …
Model-Based Deep Autoencoders For Characterizing Discrete Data With Application To Genomic Data Analysis, Tian Tian
Dissertations
Deep learning techniques have achieved tremendous successes in a wide range of real applications in recent years. For dimension reduction, deep neural networks (DNNs) provide a natural choice to parameterize a non-linear transforming function that maps the original high dimensional data to a lower dimensional latent space. Autoencoder is a kind of DNNs used to learn efficient feature representation in an unsupervised manner. Deep autoencoder has been widely explored and applied to analysis of continuous data, while it is understudied for characterizing discrete data. This dissertation focuses on developing model-based deep autoencoders for modeling discrete data. A motivating example of …
Blind Separation For Intermittent Sources Via Sparse Dictionary Learning, Annan Dong
Blind Separation For Intermittent Sources Via Sparse Dictionary Learning, Annan Dong
Dissertations
Radio frequency sources are observed at a fusion center via sensor measurements made over slow flat-fading channels. The number of sources may be larger than the number of sensors, but their activity is sparse and intermittent with bursty transmission patterns. To account for this, sources are modeled as hidden Markov models with known or unknown parameters. The problem of blind source estimation in the absence of channel state information is tackled via a novel algorithm, consisting of a dictionary learning (DL) stage and a per-source stochastic filtering (PSF) stage. The two stages work in tandem, with the latter operating on …
Probabilistic Spiking Neural Networks : Supervised, Unsupervised And Adversarial Trainings, Alireza Bagheri
Probabilistic Spiking Neural Networks : Supervised, Unsupervised And Adversarial Trainings, Alireza Bagheri
Dissertations
Spiking Neural Networks (SNNs), or third-generation neural networks, are networks of computation units, called neurons, in which each neuron with internal analogue dynamics receives as input and produces as output spiking, that is, binary sparse, signals. In contrast, second-generation neural networks, termed as Artificial Neural Networks (ANNs), rely on simple static non-linear neurons that are known to be energy-intensive, hindering their implementations on energy-limited processors such as mobile devices. The sparse event-based characteristics of SNNs for information transmission and encoding have made them more feasible for highly energy-efficient neuromorphic computing architectures. The most existing training algorithms for SNNs are based …
Statistical Machine Learning Methods For Mining Spatial And Temporal Data, Fei Tan
Statistical Machine Learning Methods For Mining Spatial And Temporal Data, Fei Tan
Dissertations
Spatial and temporal dependencies are ubiquitous properties of data in numerous domains. The popularity of spatial and temporal data mining has thus grown with the increasing prevalence of massive data. The presence of spatial and temporal attributes not only provides complementary useful perspectives, but also poses new challenges to the representation and integration into the learning procedure. In this dissertation, the involved spatial and temporal dependencies are explored with three genres: sample-wise, feature-wise, and target-wise. A family of novel methodologies is developed accordingly for the dependency representation in respective scenarios.
First, dependencies among discrete, continuous and repeated observations are studied …
Workload Allocation In Mobile Edge Computing Empowered Internet Of Things, Qiang Fan
Workload Allocation In Mobile Edge Computing Empowered Internet Of Things, Qiang Fan
Dissertations
In the past few years, a tremendous number of smart devices and objects, such as smart phones, wearable devices, industrial and utility components, are equipped with sensors to sense the real-time physical information from the environment. Hence, Internet of Things (IoT) is introduced, where various smart devices are connected with each other via the internet and empowered with data analytics. Owing to the high volume and fast velocity of data streams generated by IoT devices, the cloud that can provision flexible and efficient computing resources is employed as a smart "brain" to process and store the big data generated from …
Towards An Architecture For Secure Privacy-Preserving Opportunistic Resource Utilization Networks, Ahmed A. Al-Gburi
Towards An Architecture For Secure Privacy-Preserving Opportunistic Resource Utilization Networks, Ahmed A. Al-Gburi
Dissertations
The paradigm of Opportunistic Resource Utilization Networks (oppnets) advances technology in the field of ad hoc networks. The salient feature of oppnets is their use of “helpers” to expand opportunistically when the need for more resources or capabilities arises. Like any other pervasive computing systems, oppnets face numerous security and privacy challenges. These challenges are addressed by utilizing two major ideas: Pervasive Trust Foundation (PTF) and Active Data Bundles (ADBs). The PTF paradigm makes trust the basis for security and privacy in pervasive computing systems, including oppnets. The ADBs are self-protecting data constructs that encapsulate together—in an inseparable way—sensitive data, …
High-Performance Reductive Strategies For Big Data From Lc-Ms/Ms Proteomics, Muaaz Gul Awan
High-Performance Reductive Strategies For Big Data From Lc-Ms/Ms Proteomics, Muaaz Gul Awan
Dissertations
Mass Spectrometry (MS)-based proteomics utilizes high performance liquid chromatography in tandem with high-throughput mass spectrometers. These experiments can produce MS data sets with astonishing speed and volume that can easily reach peta-scale level, creating storage and computational problems for large-scale systems biology studies. Each spectrum output by a mass spectrometer may consist of thousands of peaks, which must all be processed to deduce the corresponding peptide. However, only a small percentage of peaks in a spectrum are useful for further processing, as most of the peaks are either noise or are not useful. Our experiments have shown that 90 to …
Forecasting Anomalous Events And Performance Correlation Analysis In Event Data, Sonya Leech [Thesis]
Forecasting Anomalous Events And Performance Correlation Analysis In Event Data, Sonya Leech [Thesis]
Dissertations
Classical and Deep Learning methods are quite common approaches for anomaly detection. Extensive research has been conducted on single point anomalies. Collective anomalies that occur over a set of two or more durations are less likely to happen by chance than that of a single point anomaly. Being able to observe and predict these anomalous events may reduce the risk of a server’s performance. This paper presents a comparative analysis into time-series forecasting of collective anomalous events using two procedures. One is a classical SARIMA model and the other is a deep learning Long-Short Term Memory (LSTM) model. It then …
An Investigation Into The Predictive Capability Of Customer Spending In Modelling Mortgage Default, Donal Finn [Thesis]
An Investigation Into The Predictive Capability Of Customer Spending In Modelling Mortgage Default, Donal Finn [Thesis]
Dissertations
The mortgage arrears crisis in Ireland was and is among the most severe experienced on record and although there has been a decreasing trend in the number of mortgages in default in the past four years, it still continues to cause distress to borrowers and vulnerabilities to lenders. There are indications that one of the main factors associated with mortgage default is loan affordability, of which the level of disposable income is a driver. Additionally, guidelines set out by the European Central Bank instructed financial institutions to adopt measures to further reduce and prevent loans defaulting, including the implementation and …
Multi-Sensory Deep Learning Architectures For Slam Dunk Scene Classification, Paul Minogue
Multi-Sensory Deep Learning Architectures For Slam Dunk Scene Classification, Paul Minogue
Dissertations
Basketball teams at all levels of the game invest a considerable amount of time and effort into collecting, segmenting, and analysing footage from their upcoming opponents previous games. This analysis helps teams identify and exploit the potential weaknesses of their opponents and is commonly cited as one of the key elements required to achieve success in the modern game. The growing importance of this type of analysis has prompted research into the application of computer vision and audio classification techniques to help teams classify scoring sequences and key events using game footage. However, this research tends to focus on classifying …
An Evaluation Of The Information Security Awareness Of University Students, Alan Pike
An Evaluation Of The Information Security Awareness Of University Students, Alan Pike
Dissertations
Between January 2017 and March 2018, it is estimated that more than 1.9 billion personal and sensitive data records were compromised online. The average cost of a data breach in 2018 was reported to be in the region of US$3.62 million. These figures alone highlight the need for computer users to have a high level of information security awareness (ISA). This research was conducted to establish the ISA of students in a university. There were three aspects to this piece of research. The first was to review and analyse the security habits of students in terms of their own personal …
Noise Reduction In Eeg Signals Using Convolutional Autoencoding Techniques, Conor Hanrahan
Noise Reduction In Eeg Signals Using Convolutional Autoencoding Techniques, Conor Hanrahan
Dissertations
The presence of noise in electroencephalography (EEG) signals can significantly reduce the accuracy of the analysis of the signal. This study assesses to what extent stacked autoencoders designed using one-dimensional convolutional neural network layers can reduce noise in EEG signals. The EEG signals, obtained from 81 people, were processed by a two-layer one-dimensional convolutional autoencoder (CAE), whom performed 3 independent button pressing tasks. The signal-to-noise ratios (SNRs) of the signals before and after processing were calculated and the distributions of the SNRs were compared. The performance of the model was compared to noise reduction performance of Principal Component Analysis, with …