Open Access. Powered by Scholars. Published by Universities.®

Social and Behavioral Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

2010

Brigham Young University

Series

A probabilistic morphological analyzer

Articles 1 - 1 of 1

Full-Text Articles in Social and Behavioral Sciences

A Probabilistic Morphological Analyzer For Syriac, Deryle W. Lonsdale, Peter J. Mcclanahan, George Busby, Robbie A. Haertel, Kristian Heal, Kevin Seppi, Eric Ringger Jan 2010

A Probabilistic Morphological Analyzer For Syriac, Deryle W. Lonsdale, Peter J. Mcclanahan, George Busby, Robbie A. Haertel, Kristian Heal, Kevin Seppi, Eric Ringger

Faculty Publications

We define a probabilistic morphological analyzer using a data-driven approach for Syriac in order to facilitate the creation of an annotated corpus. Syriac is an under-resourced Semitic language for which there are no available language tools such as morphological analyzers. We introduce novel probabilistic models for segmentation, dictionary linkage, and morphological tagging and connect them in a pipeline to create a probabilistic morphological analyzer requiring only labeled data. We explore the performance of models with varying amounts of training data and find that with about 34,500 labeled tokens, we can outperform a reasonable baseline trained on over 99,000 tokens and …