Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Statistics

2017

Books/Book Chapters

Articles 1 - 1 of 1

Full-Text Articles in Entire DC Network

A Comparison On The Classification Of Short-Text Documents Using Latent Dirichlet Allocation And Formal Concept Analysis, Noel Rogers, Luca Longo Jan 2017

A Comparison On The Classification Of Short-Text Documents Using Latent Dirichlet Allocation And Formal Concept Analysis, Noel Rogers, Luca Longo

Books/Book Chapters

With the increasing amounts of textual data being collected online, automated text classification techniques are becoming increasingly important. However, a lot of this data is in the form of short-text with just a handful of terms per document (e.g. Text messages, tweets or Facebook posts). This data is generally too sparse and noisy to obtain satisfactory classification. Two techniques which aim to alleviate this problem are Latent Dirichlet Allocation (LDA) and Formal Concept Analysis (FCA). Both techniques have been shown to improve the performance of short-text classification by reducing the sparsity of the input data. The relative performance of classifiers …