Full-Text Articles in Engineering
Impact Of Character N-Grams Attention Scores For English And Russian News Articles Authorship Attribution, Liliya Mukhmutova, Robert J. Ross, Giancarlo Salton
Conference papers
Language embeddings are often used as black-box, word-level tools that provide powerful language analysis across many tasks. Yet for tasks such as authorship attribution, access to feature-level information on character n-grams can provide insights that help with model refinement and development. In this paper we investigate and evaluate the importance of character n-grams within an embeddings context in authorship attribution through the use of attention scores. We perform this investigation for both English (Reuters_50_50) and Russian (Taiga) news authorship datasets. Our analysis shows that character n-gram attention scores are higher for n-grams that are considered to be …