Open Access. Powered by Scholars. Published by Universities.®

Data Science Commons

Open Access. Powered by Scholars. Published by Universities.®

Dissertations

2021

Natural Language Processing

Articles 1 - 1 of 1

Full-Text Articles in Data Science

Semantic Classification Of Multidialectal Arabic Social Media, Tom Rishel May 2021

Semantic Classification Of Multidialectal Arabic Social Media, Tom Rishel

Dissertations

Arabic is one of the most widely used languages in the world, but due in part to its morphological and syntactic richness, resources for automated processing of Arabic are relatively rare. Arabic takes three primary forms: Classical Arabic as seen in the Qur’an and other classical texts; Modern Standard Arabic (MSA) as seen in newspapers, formal documents, and other written text intended for widespread distribution; and dialectal Arabic as used in common speech and informal communication. Social media posts are often written in informal language and may include non-standard spellings, abbreviations, emoticons, hashtags, and emojis. Dialectal Arabic is commonly used …