Open Access. Powered by Scholars. Published by Universities.®

Reading and Language Commons

Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences

Faculty & Staff Publications

Series

2023

Articles 1 - 1 of 1

Full-Text Articles in Reading and Language

Querying The Past: Automatic Source Attribution With Language Models, Ryan Muther, Mathew Barber, David Smith Jan 2023

Querying The Past: Automatic Source Attribution With Language Models, Ryan Muther, Mathew Barber, David Smith

Faculty & Staff Publications

This paper explores new methods for locating the sources used to write a text by 昀椀ne-tuning a variety of language models to rerank candidate sources. These methods promise to shed new light on traditions with complex citational practices, such as in medieval Arabic where citations are ambiguous and boundaries of quotation are poorly defined. After retrieving candidates sources using a baseline BM25 retrieval model, a variety of reranking methods are tested to see how effective they are at the task of source attribution. We conduct experiments on two datasets—English Wikipedia and medieval Arabic historical writing—and employ a variety of retrieval- …