Open Access. Powered by Scholars. Published by Universities.®

Social and Behavioral Sciences Commons

Articles 1 - 16 of 16

Full-Text Articles in Social and Behavioral Sciences

Assessing The Prevalence And Archival Rate Of URIs To Git Hosting Platforms In Scholarly Publications, Emily Escamilla Aug 2023

Computer Science Theses & Dissertations

The definition of scholarly content has expanded to include the data and source code that contribute to a publication. While major archiving efforts to preserve conventional scholarly content, typically in PDFs (e.g., LOCKSS, CLOCKSS, Portico), are underway, no analogous effort has yet emerged to preserve the data and code referenced in those PDFs, particularly the scholarly code hosted online on Git Hosting Platforms (GHPs). Similarly, Software Heritage is working to archive public source code, but there is value in archiving the surrounding ephemera that provide important context to the code while maintaining their original URIs. In current implementations, source code …


Avoiding Zombies In Archival Replay Using ServiceWorker, Sawood Alam, Mat Kelly, Michele C. Weigle, Michael L. Nelson Jan 2017

Computer Science Faculty Publications

[First paragraph] A Composite Memento is an archived representation of a web page with all the page requisites such as images and stylesheets. All embedded resources have their own URIs; hence, they are archived independently. For a meaningful archival replay, it is important to load all the page requisites from the archive within the temporal neighborhood of the base HTML page. To achieve this goal, archival replay systems try to rewrite all the resource references to appropriate archived versions before serving HTML, CSS, or JS. However, effective server-side URL rewriting is difficult when URLs are generated dynamically using JavaScript. …


Scripts In A Frame: A Framework For Archiving Deferred Representations, Justin F. Brunelle Apr 2016

Computer Science Theses & Dissertations

Web archives provide a view of the Web as seen by Web crawlers. Because of rapid advancements and adoption of client-side technologies like JavaScript and Ajax, coupled with the inability of crawlers to execute these technologies effectively, Web resources become harder to archive as they become more interactive. At Web scale, we cannot capture client-side representations using the current state-of-the-art toolsets because of the migration from Web pages to Web applications. Web applications increasingly rely on JavaScript and other client-side programming languages to load embedded resources and change client-side state. We demonstrate that Web crawlers and other automatic archival …


When Should I Make Preservation Copies Of Myself?, Charles L. Cartledge, Michael L. Nelson Sep 2014

Computer Science Presentations

PDF of a PowerPoint presentation from the Joint Conference on Digital Libraries (JCDL) 2014 in London, United Kingdom, September 9, 2014. Also available on Slideshare.


Bits Of Research, Michele C. Weigle Jun 2014

Computer Science Presentations

PDF of a PowerPoint presentation that provides an overview of digital preservation, web archiving, and information visualization research; dated June 26, 2014. Also available on Slideshare.


Moved But Not Gone: An Evaluation Of Real-Time Methods For Discovering Replacement Web Pages, Martin Klein, Michael L. Nelson Jan 2014

Computer Science Faculty Publications

Inaccessible Web pages and 404 “Page Not Found” responses are a common Web phenomenon and a detriment to the user’s browsing experience. The rediscovery of missing Web pages is, therefore, a relevant research topic in both the digital preservation and Information Retrieval realms. In this article, we bring these two areas together by analyzing four content- and link-based methods to rediscover missing Web pages. We investigate the retrieval performance of the methods individually as well as in combination and give insight into how effective these methods are over time. As the main result of this work, …


Telling Stories With Web Archives, Michele C. Weigle Nov 2013

Computer Science Presentations

PDF of a PowerPoint presentation from the Southeast Women in Computing Conference in Lake Guntersville State Park, Alabama, November 16, 2013. Also available on Slideshare.


Old Dominion University Computer Science IIPC New Member, Michael L. Nelson Apr 2013

Computer Science Presentations

PDF of a PowerPoint presentation from the International Internet Preservation Consortium (IIPC) 2013 General Assembly in Ljubljana, Slovenia, April 22, 2013. Also available on Slideshare.


Visualizing Digital Collections At Archive-It, Michele C. Weigle, Michael L. Nelson Dec 2012

Computer Science Presentations

PDF of a PowerPoint presentation from an Archive-It Partners Meeting in Annapolis, Maryland, December 3, 2012. Also available on Slideshare.


Why Care About The Past?, Michael L. Nelson, Michele C. Weigle Jan 2012

Computer Science Presentations

A set of slides used in various presentations by the authors to show that replaying an experience via archived web pages is more compelling than reading a summary of the event. Also available on Slideshare.


(Re-) Discovering Lost Web Pages, Martin Klein, Michael L. Nelson Oct 2009

Computer Science Presentations

PDF of a PowerPoint presentation from a Mathematics & Computer Science Seminar at Emory University, Atlanta, Georgia, October 2, 2009. Also available on Slideshare.


Synchronicity: Just-In-Time Discovery Of Lost Web Pages, Martin Klein, Michael L. Nelson Jun 2009

Computer Science Presentations

PDF of a PowerPoint presentation from the National Digital Information Infrastructure and Preservation Program (NDIIPP) Partners Meeting, Washington, D.C., June 24-25, 2009. Also available on Slideshare.


Can't Find Your 404s?, Martin Klein, Frank McCown, Joan Smith, Michael L. Nelson Mar 2009

Computer Science Presentations

PDF of a PowerPoint presentation at the Santa Fe Complex, Santa Fe, New Mexico, March 13, 2009. Also available on Slideshare.


Tools For A Preservation-Ready Web, Joan A. Smith, Michael L. Nelson Jul 2008

Computer Science Presentations

PDF of a PowerPoint presentation from the National Digital Information Infrastructure and Preservation Program (NDIIPP) Partners Meeting, Washington, D.C., July 9, 2008. Also available on Slideshare.


Factors Affecting Website Reconstruction From The Web Infrastructure, Frank McCown, Norou Diawara, Michael L. Nelson Jun 2007

Computer Science Faculty Publications

When a website is suddenly lost without a backup, it may be reconstituted by probing web archives and search engine caches for missing content. In this paper, we describe an experiment in which we crawled and reconstructed 300 randomly selected websites on a weekly basis for 14 weeks. The reconstructions were performed using our web-repository crawler named Warrick, which recovers missing resources from the Web Infrastructure (WI), the collective preservation effort of web archives and search engine caches. We examine several characteristics of the websites over time including birth rate, decay and age of resources. We evaluate the reconstructions when compared …


The Open Archives Initiative, Michael L. Nelson May 2007

Computer Science Presentations

PDF of a PowerPoint presentation from the Open Archives Initiative DRIADE (Digital Repository of Information and Data for Evolution) Workshop, Durham, North Carolina, May 16-17, 2007. Also available on Slideshare.