Open Access. Powered by Scholars. Published by Universities.®

Computer Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 5 of 5

Full-Text Articles in Computer Engineering

Examining A Hate Speech Corpus For Hate Speech Detection And Popularity Prediction, Filip Klubicka, Raquel Fernandez Jan 2018

Examining A Hate Speech Corpus For Hate Speech Detection And Popularity Prediction, Filip Klubicka, Raquel Fernandez

Other resources

As research on hate speech becomes more and more relevant every day, most of it is still focused on hate speech detection. By attempting to replicate a hate speech detection experiment performed on an existing Twitter corpus annotated for hate speech, we highlight some issues that arise from doing research in the field of hate speech, which is essentially still in its infancy. We take a critical look at the training corpus in order to understand its biases, while also using it to venture beyond hate speech detection and investigate whether it can be used to shed light on other …


Adapt At Semeval-2018 Task 9: Skip-Gram Word Embeddings For Unsupervised Hypernym Discovery In Specialised Corpora, Alfredo Maldonado, Filip Klubicka Jan 2018

Adapt At Semeval-2018 Task 9: Skip-Gram Word Embeddings For Unsupervised Hypernym Discovery In Specialised Corpora, Alfredo Maldonado, Filip Klubicka

Other resources

This paper describes a simple but competitive unsupervised system for hypernym discovery. The system uses skip-gram word embeddings with negative sampling, trained on specialised corpora. Candidate hypernyms for an input word are predicted based on cosine similar- ity scores. Two sets of word embedding mod- els were trained separately on two specialised corpora: a medical corpus and a music indus- try corpus. Our system scored highest in the medical domain among the competing unsu- pervised systems but performed poorly on the music industry domain. Our approach does not depend on any external data other than raw specialised corpora.


From Business Understanding To Deployment: An Application Of Machine Learning Algorithms To Forecast Customer Visits Per Hour To A Fast-Casual Restaurant In Dublin, Odunayo David Adedeji Jan 2018

From Business Understanding To Deployment: An Application Of Machine Learning Algorithms To Forecast Customer Visits Per Hour To A Fast-Casual Restaurant In Dublin, Odunayo David Adedeji

Dissertations

This research project identifies the significant factors that affects the number of customer visits to a fast-casual restaurant every hour and proceeds to develop several machine learning models to forecast customer visits. The core value proposition of fast-casual restaurants is quality food delivered at speed which means they have to prepare meals in advance of customers visit but the problem with this approach is in forecasting future demand, under estimating demand could lead to inadequate meal preparation which would leave customers unsatisfied while over estimation of demand could lead to wastage especially with restaurants having to comply with food safety …


Application Of Synthetic Informative Minority Over-Sampling (Simo) Algorithm Leveraging Support Vector Machine (Svm) On Small Datasets With Class Imbalance, Akshatha Fakkeriah Kallappanamatt Jan 2018

Application Of Synthetic Informative Minority Over-Sampling (Simo) Algorithm Leveraging Support Vector Machine (Svm) On Small Datasets With Class Imbalance, Akshatha Fakkeriah Kallappanamatt

Dissertations

Developing predictive models for classification problems considering imbalanced datasets is one of the basic difficulties in data mining and decision-analytics. A classifier’s performance will decline dramatically when applied to an imbalanced dataset. Standard classifiers such as logistic regression, Support Vector Machine (SVM) are appropriate for balanced training sets whereas provides suboptimal classification results when used on unbalanced dataset. Performance metric with prediction accuracy encourages a bias towards the majority class, while the rare instances remain unknown though the model contributes a high overall precision. There are chances where minority instances might be treated as noise and vice versa. (Haixiang et …


Examining A Hate Speech Corpus For Hate Speech Detection And Popularity Prediction, Filip Klubicka, Raquel Fernandez Jan 2018

Examining A Hate Speech Corpus For Hate Speech Detection And Popularity Prediction, Filip Klubicka, Raquel Fernandez

Conference papers

As research on hate speech becomes more and more relevant every day, most of it is still focused on hate speech detection. By attempting to replicate a hate speech detection experiment performed on an existing Twitter corpus annotated for hate speech, we highlight some issues that arise from doing research in the field of hate speech, which is essentially still in its infancy. We take a critical look at the training corpus in order to understand its biases, while also using it to venture beyond hate speech detection and investigate whether it can be used to shed light on other …