Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 39

Full-Text Articles in Engineering

Comparing Anova And Powershap Feature Selection Methods Via Shapley Additive Explanations Of Models Of Mental Workload Built With The Theta And Alpha Eeg Band Ratios, Bujar Raufi, Luca Longo Mar 2024

Comparing Anova And Powershap Feature Selection Methods Via Shapley Additive Explanations Of Models Of Mental Workload Built With The Theta And Alpha Eeg Band Ratios, Bujar Raufi, Luca Longo

Articles

Background: Creating models to differentiate self-reported mental workload perceptions is challenging and requires machine learning to identify features from EEG signals. EEG band ratios quantify human activity, but limited research on mental workload assessment exists. This study evaluates the use of theta-to-alpha and alpha-to-theta EEG band ratio features to distinguish human self-reported perceptions of mental workload. Methods: In this study, EEG data from 48 participants were analyzed while engaged in resting and task-intensive activities. Multiple mental workload indices were developed using different EEG channel clusters and band ratios. ANOVA’s F-score and PowerSHAP were used to extract the statistical features. At …


Using Machine Learning Methods To Develop Person-Centered Models Predicting Stem Major Choice, Marcell Nagy, Joyce Main, Roland Molontay, Amanda Griffith Oct 2023

Using Machine Learning Methods To Develop Person-Centered Models Predicting Stem Major Choice, Marcell Nagy, Joyce Main, Roland Molontay, Amanda Griffith

Research Papers

Understanding the factors that influence the choice of a STEM major is important for developing effective strategies to increase participation in STEM fields and meet the growing demand for skilled workers. This research is based on the nationally representative data of 25,206 students surveyed in the High School Longitudinal Study of 2009 (HSLS:09). The HSLS:09 includes longitudinal data from 9th-grade students through their postsecondary study. First, we use machine learning to predict who is going to opt for a STEM major. Then we use interpretable ML tools, such as SHAP values, to investigate the key factors that influence students' decisions …


Analysing Child Sexual Abuse Activities In The Dark Web Based On An Efficient Csam Detection Algorithm, Vuong Ngo, Christina Thorpe, Susan Mckeever Sep 2023

Analysing Child Sexual Abuse Activities In The Dark Web Based On An Efficient Csam Detection Algorithm, Vuong Ngo, Christina Thorpe, Susan Mckeever

Articles

Abstract: Child sexual abuse material (CSAM) activities are prevalent on the Dark Web to evade detection, posing a global challenge for law enforcement. Our objective is to analyze CSAM discussions in this concealed space using a Support Vector Machine model, achieving an accuracy of 87.6%. Across eight forums, approximately 28.4% of posts contained CSAM, with victim ages most commonly reported as 12, 14, 13, and 11 years old for YouTube, Skype, Instagram, and Facebook, respectively. Additionally, in forums discussing boys, the most frequently mentioned nationalities in CSAM posts were English, German, and American, accounting for 12%, 7.8%, and 6% of …


Investigating The Effects Of Network Dynamics On Quality Of Delivery Prediction And Monitoring For Video Delivery Networks, Obinna C. Izima Jan 2023

Investigating The Effects Of Network Dynamics On Quality Of Delivery Prediction And Monitoring For Video Delivery Networks, Obinna C. Izima

Doctoral

Video streaming over the Internet requires an optimized delivery system given the advances in network architecture, for example, Software Defined Networks. Machine Learning (ML) models have been deployed in an attempt to predict the quality of the video streams. Some of these efforts have considered the prediction of Quality of Delivery (QoD) metrics of the video stream in an effort to measure the quality of the video stream from the network perspective. In most cases, these models have either treated the ML algorithms as black-boxes or failed to capture the network dynamics of the associated video streams.

This PhD investigates …


Exploring Gender Bias In Semantic Representations For Occupational Classification In Nlp: Techniques And Mitigation Strategies, Joseph Michael O'Carroll Jan 2023

Exploring Gender Bias In Semantic Representations For Occupational Classification In Nlp: Techniques And Mitigation Strategies, Joseph Michael O'Carroll

Dissertations

Gender bias in Natural Language Processing (NLP) models is a non-trivial problem that can perpetuate and amplify existing societal biases. This thesis investigates gender bias in occupation classification and explores the effectiveness of different debiasing methods for language models to reduce the impact of bias in the model’s representations. The study employs a data-driven empirical methodology focusing heavily on experimentation and result investigation. The study uses five distinct semantic representations and models with varying levels of complexity to classify the occupation of individuals based on their biographies.


An Evaluation Of The Eeg Alpha-To-Theta And Theta-To-Alpha Band Ratios As Indexes Of Mental Workload, Bujar Raufi, Luca Longo Jan 2023

An Evaluation Of The Eeg Alpha-To-Theta And Theta-To-Alpha Band Ratios As Indexes Of Mental Workload, Bujar Raufi, Luca Longo

Articles

Many research works indicate that EEG bands, specifically the alpha and theta bands, have been potentially helpful cognitive load indicators. However, minimal research exists to validate this claim. This study aims to assess and analyze the impact of the alpha-to-theta and the theta-to-alpha band ratios on supporting the creation of models capable of discriminating self-reported perceptions of mental workload. A dataset of raw EEG data was utilized in which 48 subjects performed a resting activity and an induced task demanding exercise in the form of a multitasking SIMKAP test. Band ratios were devised from frontal and parietal electrode clusters. Building …


Persuasive Communication Systems: A Machine Learning Approach To Predict The Effect Of Linguistic Styles And Persuasion Techniques, Annye Braca, Pierpaolo Dondio Jan 2023

Persuasive Communication Systems: A Machine Learning Approach To Predict The Effect Of Linguistic Styles And Persuasion Techniques, Annye Braca, Pierpaolo Dondio

Articles

Prediction is a critical task in targeted online advertising, where predictions better than random guessing can translate to real economic return. This study aims to use machine learning (ML) methods to identify individuals who respond well to certain linguistic styles/persuasion techniques based on Aristotle’s means of persuasion, rhetorical devices, cognitive theories and Cialdini’s principles, given their psychometric profile.


Probability Expressions In Ai Decision Support: Impacts On Human+Ai Team Performance, Elias Spinn Jan 2023

Probability Expressions In Ai Decision Support: Impacts On Human+Ai Team Performance, Elias Spinn

Dissertations

AI decision support systems aim to assist people in highly complex and consequential domains to make efficient, effective, and high-quality decisions. AI alone cannot be guaranteed to be correct in these complex decision tasks, and a human is often needed to ensure decision accuracy. The ambition is for these human+ AI teams to perform better together than either would individually. To realise this, decision makers must trust their AI partners appropriately, knowing when to rely on their recommendations and when to be sceptical. However, research has shown that decision makers often either mistrust and underutilise these systems, or trust them …


Enhancing Zero‑Shot Action Recognition In Videos By Combining Gans With Text And Images, Kaiqiang Huang, Luis Miralles-Pechuán, Susan Mckeever Jan 2023

Enhancing Zero‑Shot Action Recognition In Videos By Combining Gans With Text And Images, Kaiqiang Huang, Luis Miralles-Pechuán, Susan Mckeever

Articles

Zero-shot action recognition (ZSAR) tackles the problem of recognising actions that have not been seen by the model during the training phase. Various techniques have been used to achieve ZSAR in the field of human action recognition (HAR) in videos. Techniques based on generative adversarial networks (GANs) are the most promising in terms of performance. GANs are trained to generate representations of unseen videos conditioned on information related to the unseen classes, such as class label embeddings. In this paper, we present an approach based on combining information from two different GANs, both of which generate a visual representation of …


Comparing Poor And Favorable Outcome Prediction With Machine Learning After Mechanical Thrombectomy In Acute Ischemic Stroke, Matthias A. Mutke, Vince I. Madai, Adam Hilbert, Esra Zihni, Arne Potreck, Charlotte S. Weyland, Markus A. Mohlenbruch, Sabine Heiland, Peter A. Ringleb, Simon Nagel, Martin Beendszus, Dietmar Frey Jan 2023

Comparing Poor And Favorable Outcome Prediction With Machine Learning After Mechanical Thrombectomy In Acute Ischemic Stroke, Matthias A. Mutke, Vince I. Madai, Adam Hilbert, Esra Zihni, Arne Potreck, Charlotte S. Weyland, Markus A. Mohlenbruch, Sabine Heiland, Peter A. Ringleb, Simon Nagel, Martin Beendszus, Dietmar Frey

Articles

Outcome prediction after mechanical thrombectomy (MT) in patients with acute ischemic stroke (AIS) and large vessel occlusion (LVO) is commonly performed by focusing on favorable outcome (modified Rankin Scale, mRS 0–2) after 3 months but poor outcome representing severe disability and mortality (mRS 5 and 6) might be of equal importance for clinical decision-making.


How Visual Stimuli Evoked P300 Is Transforming The Brain–Computer Interface Landscape: A Prisma Compliant Systematic Review, Jai Kalra, Prashasti Mittal, Nirmiti Mittal, Abhishek Arora, Utkarsh Tewari, Aviral Chharia, Rahul Upadhyay, Vinay Kumar, Luca Longo Jan 2023

How Visual Stimuli Evoked P300 Is Transforming The Brain–Computer Interface Landscape: A Prisma Compliant Systematic Review, Jai Kalra, Prashasti Mittal, Nirmiti Mittal, Abhishek Arora, Utkarsh Tewari, Aviral Chharia, Rahul Upadhyay, Vinay Kumar, Luca Longo

Articles

Non-invasive Visual Stimuli evoked-EEGbased P300 BCIs have gained immense attention in recent years due to their ability to help patients with disability using BCI-controlled assistive devices and applications. In addition to the medical field, P300 BCI has applications in entertainment, robotics, and education. The current article systematically reviews 147 articles that were published between 2006-2021*. Articles that pass the pre-defined criteria are included in the study. Further, classification based on their primary focus, including article orientation, participants’ age groups, tasks given, databases, the EEG devices used in the studies, classification models, and application domain, is performed. The application-based classification considers …


Ml-Based Online Traffic Classification For Sdns, Mohammed Nsaif, Gergely Kovasznai, Mohammed Abboosh, Ali Malik, Ruairí De Fréin May 2022

Ml-Based Online Traffic Classification For Sdns, Mohammed Nsaif, Gergely Kovasznai, Mohammed Abboosh, Ali Malik, Ruairí De Fréin

Articles

Traffic classification is a crucial aspect for Software-Defined Networking functionalities. This paper is a part of an on-going project aiming at optimizing power consumption in the environment of software-defined datacenter networks. We have developed a novel routing strategy that can blindly balance between the power consumption and the quality of service for the incoming traffic flows. In this paper, we demonstrate how to classify the network traffic flows so that the quality of service of each flow-class can be guaranteed efficiently. This is achieved by creating a dataset that encompasses different types of network traffic such as video, VoIP, game …


Analysis Of Rule-Based And Shallow Statistical Models For Covid-19 Cough Detection For A Preliminary Diagnosis, Arshia Arif, Eisa Alanazi, Ayesha Zeb, Waqar Shahid Qureshi May 2022

Analysis Of Rule-Based And Shallow Statistical Models For Covid-19 Cough Detection For A Preliminary Diagnosis, Arshia Arif, Eisa Alanazi, Ayesha Zeb, Waqar Shahid Qureshi

Conference papers

Coronavirus pandemic that has spread all over the world, is one of its kind in the recent past, that has mobilized researchers in areas such as (not limited to) pre-screening solutions, contact tracing, vaccine developments, and crowd estimation. Pre-screening using symptoms identification, cough classification, and contact tracing mobile applications gained significant popularity during the initial outbreak of the pandemic. Audio recordings of coughing individuals are one of the sources that can help in the pre-screening of COVID-19 patients. This research focuses on quantitative analysis of covid cough classification using audio recordings of coughing individuals. For analysis, we used three different …


Detecting Iot Attacks Using An Ensemble Machine Learning Model, Vikas Tomar, Sachin Sharma Mar 2022

Detecting Iot Attacks Using An Ensemble Machine Learning Model, Vikas Tomar, Sachin Sharma

Articles

Malicious attacks are becoming more prevalent due to the growing use of Internet of Things (IoT) devices in homes, offices, transportation, healthcare, and other locations. By incorporating fog computing into IoT, attacks can be detected in a short amount of time, as the distance between IoT devices and fog devices is smaller than the distance between IoT devices and the cloud. Machine learning is frequently used for the detection of attacks due to the huge amount of data available from IoT devices. However, the problem is that fog devices may not have enough resources, such as processing power and memory, …


Age Specific Models To Capture The Change In Risk Factor Contribution By Age To Short Term Primary Ischemic Stroke Risk, Elizabeth Hunter, John D. Kelleher Feb 2022

Age Specific Models To Capture The Change In Risk Factor Contribution By Age To Short Term Primary Ischemic Stroke Risk, Elizabeth Hunter, John D. Kelleher

Articles

Age is one of the most important risk factors when it comes to stroke risk prediction. However, including age as a risk factor in a stroke prediction model can give rise to a number of difficulties. Age often dominates the risk score, and also not all risk factors contribute proportionally to stroke risk by age. In this study we investigate a number of common stroke risk factors, using Framingham heart study data from the NHLBI Biologic Specimen and Data Repository Information Coordinating Center to determine if they appear to contribute proportionally by age to a stroke risk score. As we …


Exploring The Concept Of The Digital Educator During Covid-19, Fernando Jimenez, Gracia Sanchez, Jose Palma, Luis Miralles-Pechuán, Juan A. Botia Jan 2022

Exploring The Concept Of The Digital Educator During Covid-19, Fernando Jimenez, Gracia Sanchez, Jose Palma, Luis Miralles-Pechuán, Juan A. Botia

Articles

T In many machine learning classification problems, datasets are usually of high dimensionality and therefore require efficient and effective methods for identifying the relative importance of their attributes, eliminating the redundant and irrelevant ones. Due to the huge size of the search space of the possible solutions, the attribute subset evaluation feature selection methods are not very suitable, so in these scenarios feature ranking methods are used. Most of the feature ranking methods described in the literature are univariate methods, which do not detect interactions between factors. In this paper, we propose two new multivariate feature ranking methods based on …


A Hybrid Machine Learning Technique For Feature Optimization In Object-Based Classification Of Debris-Covered Glaciers, Shikha Sharda, Mohit Srivastava, Hemendra Singh Gusain, Naveen Kumar Sharma, Kamaljit Singh Bhatia, Mohit Bajaj, Harsimrat Kaur, Hossam Zawbaa, Salah Kamel Jan 2022

A Hybrid Machine Learning Technique For Feature Optimization In Object-Based Classification Of Debris-Covered Glaciers, Shikha Sharda, Mohit Srivastava, Hemendra Singh Gusain, Naveen Kumar Sharma, Kamaljit Singh Bhatia, Mohit Bajaj, Harsimrat Kaur, Hossam Zawbaa, Salah Kamel

Articles

Object-based features like spectral, topographic, and textural are supportive to determine debris-covered glacier classes. The original feature space includes relevant and irrelevant features. The inclusion of all these features increases the complexity and renders the classifier’s performance. Therefore, feature space optimization is requisite for the classification process. Previous studies have shown a rigorous exercise in manually selecting the best combination of features to define the target class and proven to be a time consuming task. The present study proposed a hybrid feature selection technique to automate the selection of the best suitable features. This study aimed to reduce the classifier’s …


A Sinusoidal Signal Reconstruction Method For The Inversion Of The Mel-Spectrogram, Anastasia Natsiou, Sean O'Leary Dec 2021

A Sinusoidal Signal Reconstruction Method For The Inversion Of The Mel-Spectrogram, Anastasia Natsiou, Sean O'Leary

Articles

The synthesis of sound via deep learning methods has recently received much attention. Some problems for deep learning approaches to sound synthesis relate to the amount of data needed to specify an audio signal and the necessity of preserving both the long and short time coherence of the synthesised signal. Visual time-frequency representations such as the log-mel-spectrogram have gained in popularity. The log- mel-spectrogram is a perceptually informed representation of audio that greatly compresses the amount of information required for the description of the sound. However, because of this compression, this representation is not directly invertible. Both signal processing and …


Hybrid Modelling For Stroke Care: Review And Suggestions Of New Approaches For Risk Assessment And Simulation Of Scenarios, Tilda Herrgårdh, Vince I. Madai, John Kelleher, Rasmus Magnusson, Mika Gustafsson, Lili Milani, Peter Gennemark, Gunnar Cedersund Jan 2021

Hybrid Modelling For Stroke Care: Review And Suggestions Of New Approaches For Risk Assessment And Simulation Of Scenarios, Tilda Herrgårdh, Vince I. Madai, John Kelleher, Rasmus Magnusson, Mika Gustafsson, Lili Milani, Peter Gennemark, Gunnar Cedersund

Articles

Stroke is an example of a complex and multi-factorial disease involving multiple organs, timescales, and disease mechanisms. To deal with this complexity, and to realize Precision Medicine of stroke, mathematical models are needed. Such approaches include: 1) machine learning, 2) bioinformatic network models, and 3) mechanistic models. Since these three approaches have complementary strengths and weaknesses, a hybrid modelling approach combining them would be the most beneficial. However, no concrete approach ready to be implemented for a specific disease has been presented to date. In this paper, we both review the strengths and weaknesses of the three approaches, and propose …


Developing An Open-Book Online Exam For Final Year Students, Keith Quille, Keith Nolan, Brett Becker, Sean Mchugh Jan 2021

Developing An Open-Book Online Exam For Final Year Students, Keith Quille, Keith Nolan, Brett Becker, Sean Mchugh

Conference Papers

Like many others, our institution had to adapt our traditional proctored, written examinations to open-book online variants due to the COVID-19 pandemic. This paper describes the process applied to develop open-book online exams for final year (undergraduate) students studying Applied Machine Learning and Applied Artificial Intelligence and Deep Learning courses as part of a four-year BSc in Computer Science. We also present processes used to validate the examinations as well as plagiarism detection methods implemented. Findings from this study highlight positive effects of using open-book online exams, with 85% of students reporting that they either prefer online open-book examinations or …


Are We In The Digital Dark Times? How The Philosophy Of Hannah Arendt Can Illuminate Some Of The Ethical Dilemmas Posed By Modern Digital Technologies, Damian Gordon, Anna Becevel Jan 2021

Are We In The Digital Dark Times? How The Philosophy Of Hannah Arendt Can Illuminate Some Of The Ethical Dilemmas Posed By Modern Digital Technologies, Damian Gordon, Anna Becevel

Conference Papers

Philosophers are not generally credited with being clairvoyant, and yet because they recognise, record and reflect on trends in their society, their observations can often appear prescient. In the field of the ethics of technology, there is, perhaps, no philosopher whose perspective on these issues is worth examining in detail more than that of Hannah Arendt, who can offer real perspective on the challenges we are facing with technologies in the twenty-first century. Arendt, a thinker of Jewish-German origin, student of Martin Heidegger and Karl Jaspers, encountered her life turning point when she was forced into becoming a refugee as …


Moving Targets: Addressing Concept Drift In Supervised Models For Hacker Communication Detection, Susan Mckeever, Brian Keegan, Andrei Quieroz Jun 2020

Moving Targets: Addressing Concept Drift In Supervised Models For Hacker Communication Detection, Susan Mckeever, Brian Keegan, Andrei Quieroz

Conference papers

Abstract—In this paper, we are investigating the presence of concept drift in machine learning models for detection of hacker communications posted in social media and hacker forums. The supervised models in this experiment are analysed in terms of performance over time by different sources of data (Surface web and Deep web). Additionally, to simulate real-world situations, these models are evaluated using time-stamped messages from our datasets, posted over time on social media platforms. We have found that models applied to hacker forums (deep web) presents an accuracy deterioration in less than a 1-year period, whereas models applied to Twitter (surface …


Active Learning For Auditory Hierarchy, William Coleman, Sarah Jane Delany, Charlie Cullen, Ming Yan Jan 2020

Active Learning For Auditory Hierarchy, William Coleman, Sarah Jane Delany, Charlie Cullen, Ming Yan

Conference papers

Much audio content today is rendered as a static stereo mix: fundamentally a fixed single entity. Object-based audio envisages the delivery of sound content using a collection of individual sound ‘objects’ controlled by accompanying metadata. This offers potential for audio to be delivered in a dynamic manner providing enhanced audio for consumers. One example of such treatment is the concept of applying varying levels of data compression to sound objects thereby reducing the volume of data to be transmitted in limited bandwidth situations. This application motivates the ability to accurately classify objects in terms of their ‘hierarchy’. That is, whether …


Qoe Enhancement In Next Generation Wireless Ecosystems: A Machine Learning Approach, Eva Ibarrola, Mark Davis, Camille Voisin, Ciara Close, Leire Cristobo Jan 2019

Qoe Enhancement In Next Generation Wireless Ecosystems: A Machine Learning Approach, Eva Ibarrola, Mark Davis, Camille Voisin, Ciara Close, Leire Cristobo

Articles

Next-generation wireless ecosystems are expected to comprise heterogeneous technologies and diverse deployment scenarios. Ensuring quality of service (QoS) will be one of the major challenges on account of a variety of factors that are beyond the control of network and service providers in these environments. In this context, ITU-T is working on defining new Recommendations related to QoS and users' quality of experience (QoE) for the 5G era. Considering the new ITU-T QoS framework, we propose a methodology to develop a global QoS management model for next generation wireless ecosystems taking advantage of big data and machine learning (ML). The …


Analyzing Twitter Feeds To Facilitate Crises Informatics And Disaster Response During Mass Emergencies, Arshdeep Kaur Jan 2019

Analyzing Twitter Feeds To Facilitate Crises Informatics And Disaster Response During Mass Emergencies, Arshdeep Kaur

Dissertations

It is a common practice these days for general public to use various micro-blogging platforms, predominantly Twitter, to share ideas, opinions and information about things and life. Twitter is also being increasingly used as a popular source of information sharing during natural disasters and mass emergencies to update and communicate the extent of the geographic phenomena, report the affected population and casualties, request or provide volunteering services and to share the status of disaster recovery process initiated by humanitarian-aid and disaster-management organizations. Recent research in this area has affirmed the potential use of such social media data for various disaster …


Examining A Hate Speech Corpus For Hate Speech Detection And Popularity Prediction, Filip Klubicka, Raquel Fernandez Jan 2018

Examining A Hate Speech Corpus For Hate Speech Detection And Popularity Prediction, Filip Klubicka, Raquel Fernandez

Other resources

As research on hate speech becomes more and more relevant every day, most of it is still focused on hate speech detection. By attempting to replicate a hate speech detection experiment performed on an existing Twitter corpus annotated for hate speech, we highlight some issues that arise from doing research in the field of hate speech, which is essentially still in its infancy. We take a critical look at the training corpus in order to understand its biases, while also using it to venture beyond hate speech detection and investigate whether it can be used to shed light on other …


A Lightweight Classification Algorithm For Human Activity Recognition In Outdoor Spaces, Graham Mccalmont, Huiru Zheng, Haiying Wang, S. I. Mcclean, Matteo Zallio, Damon Berry Jan 2018

A Lightweight Classification Algorithm For Human Activity Recognition In Outdoor Spaces, Graham Mccalmont, Huiru Zheng, Haiying Wang, S. I. Mcclean, Matteo Zallio, Damon Berry

Conference Papers

The aim of this paper is to discuss the development of a lightweight classification algorithm for human activity recognition in a defined setting. Current techniques to analyse data such as machine learning are often very resource intensive meaning they can only be implemented on machines or devices that have large amounts of storage or processing power. The lightweight algorithm uses Euclidean distance to measure the difference between two points and predict the class of new records.

The results of the algorithm are largely positive achieving accuracy of 100% when classifying records taken from the same sensor position and accuracy of …


A Machine Learning Management Model For Qoe Enhancement In Next-Generation Wireless Ecosystems, Eva Ibarrola, Mark Davis, Camille Voisin, Ciara Close, Leire Cristobo Jan 2018

A Machine Learning Management Model For Qoe Enhancement In Next-Generation Wireless Ecosystems, Eva Ibarrola, Mark Davis, Camille Voisin, Ciara Close, Leire Cristobo

Conference papers

Next-generation wireless ecosystems are expected to comprise heterogeneous technologies and diverse deployment scenarios. Ensuring a good quality of service (QoS) will be one of the major challenges of next-generation wireless systems on account of a variety of factors that are beyond the control of network and service providers. In this context, ITU-T is working on updating the various Recommendations related to QoS and users' quality of experience (QoE). Considering the ITU-T QoS framework, we propose a methodology to develop a global QoS management model for next-generation wireless ecosystems taking advantage of big data and machine learning. The results from a …


Adapt At Semeval-2018 Task 9: Skip-Gram Word Embeddings For Unsupervised Hypernym Discovery In Specialised Corpora, Alfredo Maldonado, Filip Klubicka Jan 2018

Adapt At Semeval-2018 Task 9: Skip-Gram Word Embeddings For Unsupervised Hypernym Discovery In Specialised Corpora, Alfredo Maldonado, Filip Klubicka

Other resources

This paper describes a simple but competitive unsupervised system for hypernym discovery. The system uses skip-gram word embeddings with negative sampling, trained on specialised corpora. Candidate hypernyms for an input word are predicted based on cosine similar- ity scores. Two sets of word embedding mod- els were trained separately on two specialised corpora: a medical corpus and a music indus- try corpus. Our system scored highest in the medical domain among the competing unsu- pervised systems but performed poorly on the music industry domain. Our approach does not depend on any external data other than raw specialised corpora.


From Business Understanding To Deployment: An Application Of Machine Learning Algorithms To Forecast Customer Visits Per Hour To A Fast-Casual Restaurant In Dublin, Odunayo David Adedeji Jan 2018

From Business Understanding To Deployment: An Application Of Machine Learning Algorithms To Forecast Customer Visits Per Hour To A Fast-Casual Restaurant In Dublin, Odunayo David Adedeji

Dissertations

This research project identifies the significant factors that affects the number of customer visits to a fast-casual restaurant every hour and proceeds to develop several machine learning models to forecast customer visits. The core value proposition of fast-casual restaurants is quality food delivered at speed which means they have to prepare meals in advance of customers visit but the problem with this approach is in forecasting future demand, under estimating demand could lead to inadequate meal preparation which would leave customers unsatisfied while over estimation of demand could lead to wastage especially with restaurants having to comply with food safety …