Open Access. Powered by Scholars. Published by Universities.®

Computer Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

PDF

Dissertations

2017

Enhancement.

Articles 1 - 1 of 1

Full-Text Articles in Computer Engineering

“How Short Is A Piece Of String?”: An Investigation Into The Impact Of Text Length On Short-Text Classification Accuracy, Austin Mccartney Sep 2017

“How Short Is A Piece Of String?”: An Investigation Into The Impact Of Text Length On Short-Text Classification Accuracy, Austin Mccartney

Dissertations

The recent increase in the widespread use of short messages, for example micro-blogs or SMS communications, has created an opportunity to harvest a vast amount of information through machine-based classification. However, traditional classification methods have failed to produce accuracies comparable to those obtained from similar classification of longer texts. Several approaches have been employed to extend traditional methods to overcome this problem, including the enhancement of the original texts through the construction of associations with external data enrichment sources, ranging from thesauri and semantic nets such as Wordnet, to pre-built online taxonomies such as Wikipedia. Other avenues of investigation have …