Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Journal

Machine learning

2017

Scholarly Horizons: University of Minnesota, Morris Undergraduate Journal

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Identifying Twitter Spam By Utilizing Random Forests, Humza S. Haider Jul 2017

Identifying Twitter Spam By Utilizing Random Forests, Humza S. Haider

Scholarly Horizons: University of Minnesota, Morris Undergraduate Journal

The use of Twitter has rapidly grown since the first tweet in 2006. The number of spammers on Twitter shows a similar increase. Classifying users into spammers and non-spammers has been heavily researched, and new methods for spam detection are developing rapidly. One of these classification techniques is known as random forests. We examine three studies that employ random forests using user based features, geo-tagged features, and time dependent features. Each study showed high accuracy rates and F-measures with the exception of one model that had a test set with a more realistic proportion of spam relative to typical testing …