Open Access. Powered by Scholars. Published by Universities.®

Spanish and Portuguese Language and Literature Commons

Open Access. Powered by Scholars. Published by Universities.®

City University of New York (CUNY)

Theses/Dissertations

2021

Computer mediated communication

Articles 1 - 1 of 1

Full-Text Articles in Spanish and Portuguese Language and Literature

A Computational Study In The Detection Of English–Spanish Code-Switches, Yohamy C. Polanco Feb 2021

A Computational Study In The Detection Of English–Spanish Code-Switches, Yohamy C. Polanco

Dissertations, Theses, and Capstone Projects

Code-switching is the linguistic phenomenon where a multilingual person alternates between two or more languages in a conversation, whether that be spoken or written. This thesis studies the automatic detection of code-switching occurring specifically between English and Spanish in two corpora.

Twitter and other social media sites have provided an abundance of linguistic data that is available to researchers to perform countless experiments. Collecting the data is fairly easy if a study is on monolingual text, but if a study requires code-switched data, this becomes a complication as APIs only accept one language as a parameter. This thesis focuses on …