Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

First and Second Language Acquisition

City University of New York (CUNY)

Theses/Dissertations

2016

Part-of-speech

Full-Text Articles in Entire DC Network

An Evaluation of POS Taggers for the CHILDES Corpus, Rui Huang, Sep 2016

Dissertations, Theses, and Capstone Projects

This project evaluates four mainstream taggers on a representative collection of child-adult dialogues from the Child Language Data Exchange System (CHILDES). The nine children’s files from the Valian corpus and part of the Eve corpus were manually labeled and rewritten with the LARC tagset. They served as the gold-standard corpora in the training and testing process. Four taggers (the CLAN MOR tagger, the ACOPOST trigram tagger, the Stanford parser, and version 1.14 of the Brill tagger) were tested with 10-fold cross-validation. By analyzing which of the taggers’ assumptions about category assignment lead to failures, we identify several problematic cases of tagging. By comparing the …