Open Access. Powered by Scholars. Published by Universities.®
Social and Behavioral Sciences Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Discipline
- Publication
- Publication Type
Articles 1 - 16 of 16
Full-Text Articles in Social and Behavioral Sciences
Assessing The Prevalence And Archival Rate Of Uris To Git Hosting Platforms In Scholarly Publications, Emily Escamilla
Assessing The Prevalence And Archival Rate Of Uris To Git Hosting Platforms In Scholarly Publications, Emily Escamilla
Computer Science Theses & Dissertations
The definition of scholarly content has expanded to include the data and source code that contribute to a publication. While major archiving efforts to preserve conventional scholarly content, typically in PDFs (e.g., LOCKSS, CLOCKSS, Portico), are underway, no analogous effort has yet emerged to preserve the data and code referenced in those PDFs, particularly the scholarly code hosted online on Git Hosting Platforms (GHPs). Similarly, Software Heritage is working to archive public source code, but there is value in archiving the surrounding ephemera that provide important context to the code while maintaining their original URIs. In current implementations, source code …
Avoiding Zombies In Archival Replay Using Serviceworker, Sawood Alam, Mat Kelly, Michele C. Weigle, Michael L. Nelson
Avoiding Zombies In Archival Replay Using Serviceworker, Sawood Alam, Mat Kelly, Michele C. Weigle, Michael L. Nelson
Computer Science Faculty Publications
[First paragraph] A Composite Memento is an archived representation of a web page with all the page requisites such as images and stylesheets. All embedded resources have their own URIs, hence, they are archived independently. For a meaningful archival replay, it is important to load all the page requisites from the archive within the temporal neighborhood of the base HTML page. To achieve this goal, archival replay systems try to rewrite all the resource references to appropriate archived versions before serving HTML, CSS, or JS. However, an effective server-side URL rewriting is difficult when URLs are generated dynamically using JavaScript. …
Scripts In A Frame: A Framework For Archiving Deferred Representations, Justin F. Brunelle
Scripts In A Frame: A Framework For Archiving Deferred Representations, Justin F. Brunelle
Computer Science Theses & Dissertations
Web archives provide a view of the Web as seen by Web crawlers. Because of rapid advancements and adoption of client-side technologies like JavaScript and Ajax, coupled with the inability of crawlers to execute these technologies effectively, Web resources become harder to archive as they become more interactive. At Web scale, we cannot capture client-side representations using the current state-of-the art toolsets because of the migration from Web pages to Web applications. Web applications increasingly rely on JavaScript and other client-side programming languages to load embedded resources and change client-side state. We demonstrate that Web crawlers and other automatic archival …
When Should I Make Preservation Copies Of Myself?, Charles L. Cartledge, Michael L. Nelson
When Should I Make Preservation Copies Of Myself?, Charles L. Cartledge, Michael L. Nelson
Computer Science Presentations
PDF of a powerpoint presentation from the Joint Conference on Digital Libraries (JCDL) 2014 in London, United Kingdom, September 9, 2014. Also available on Slideshare.
Bits Of Research, Michele C. Weigle
Bits Of Research, Michele C. Weigle
Computer Science Presentations
PDF of a powerpoint presentation that provides an overview of digital preservation, web archiving, and information visualization research; dated June 26, 2014. Also available on Slideshare.
Moved But Not Gone: An Evaluation Of Real-Time Methods For Discovering Replacement Web Pages, Martin Klein, Michael L. Nelson
Moved But Not Gone: An Evaluation Of Real-Time Methods For Discovering Replacement Web Pages, Martin Klein, Michael L. Nelson
Computer Science Faculty Publications
Inaccessible Web pages and 404 “Page Not Found” responses are a common Web phenomenon and a detriment to the user’s browsing experience. The rediscovery of missing Web pages is, therefore, a relevant research topic in the digital preservation as well as in the Information Retrieval realm. In this article, we bring these two areas together by analyzing four content- and link-based methods to rediscover missing Web pages. We investigate the retrieval performance of the methods individually as well as their combinations and give an insight into how effective these methods are over time. As the main result of this work, …
Telling Stories With Web Archives, Michele C. Weigle
Telling Stories With Web Archives, Michele C. Weigle
Computer Science Presentations
PDF of a powerpoint presentation from the Southeast Women in Computing Conference in Lake Guntersville State Park, Alabama, November 16, 2013. Also available on Slideshare.
Old Dominion University Computer Science Iipc New Member, Michael L. Nelson
Old Dominion University Computer Science Iipc New Member, Michael L. Nelson
Computer Science Presentations
PDF of a powerpoint presentation from the International Internet Preservation Consortium (IIPC) 2013 General Assembly in Ljubljana, Slovenia, April 22, 2013. Also available on Slideshare.
Visualizing Digital Collections At Archive-It, Michele C. Weigle, Michael L. Nelson
Visualizing Digital Collections At Archive-It, Michele C. Weigle, Michael L. Nelson
Computer Science Presentations
PDF of a powerpoint presentation from a Archive-It Partners Meeting in Annapolis, Maryland, December 3, 2012. Also available on Slideshare.
Why Care About The Past?, Michael L. Nelson, Michele C. Weigle
Why Care About The Past?, Michael L. Nelson, Michele C. Weigle
Computer Science Presentations
A set of slides used in various presentations by the authors to show that replaying an experience via archived web pages is more compelling than reading a summary of the event. Also available on Slideshare.
(Re-) Discovering Lost Web Pages, Martin Klein, Michael L. Nelson
(Re-) Discovering Lost Web Pages, Martin Klein, Michael L. Nelson
Computer Science Presentations
PDF of a powerpoint presentation from a Mathematics & Computer Science Seminar at Emory University, Atlanta, Georgia, October 2, 2009. Also available on Slideshare.
Synchronicity: Just-In-Time Discovery Of Lost Web Pages, Martin Klein, Michael L. Nelson
Synchronicity: Just-In-Time Discovery Of Lost Web Pages, Martin Klein, Michael L. Nelson
Computer Science Presentations
PDF of a powerpoint presentation from the National Digital Information Infrastructure and Preservation Program (NDIIPP) Partners Meeting, Washington D.C., June 24-25, 2009. Also available on Slideshare.
Can't Find Your 404s?, Martin Klein, Frank Mccown, Joan Smith, Michael L. Nelson
Can't Find Your 404s?, Martin Klein, Frank Mccown, Joan Smith, Michael L. Nelson
Computer Science Presentations
PDF of a powerpoint presentation at the Santa Fe Complex, Santa Fe, New Mexico, March 13, 2009. Also available on Slideshare.
Tools For A Preservation-Ready Web, Joan A. Smith, Michael L. Nelson
Tools For A Preservation-Ready Web, Joan A. Smith, Michael L. Nelson
Computer Science Presentations
PDF of a powerpoint presentation from the National Digital Information Infrastructure and Preservation Program (NDIIPP) Partners Meeting, Washington D.C., July 9, 2008. Also available on Slideshare.
Factors Affecting Website Reconstruction From The Web Infrastructure, Frank Mccown, Norou Diawara, Michael L. Nelson
Factors Affecting Website Reconstruction From The Web Infrastructure, Frank Mccown, Norou Diawara, Michael L. Nelson
Computer Science Faculty Publications
When a website is suddenly lost without a backup, it may be reconstituted by probing web archives and search engine caches for missing content. In this paper we describe an experiment where we crawled and reconstructed 300 randomly selected websites on a weekly basis for 14 weeks. The reconstructions were performed using our web-repository crawler named Warrick which recovers missing resources from the Web Infrastructure (WI), the collective preservation effort of web archives and search engine caches. We examine several characteristics of the websites over time including birth rate, decay and age of resources. We evaluate the reconstructions when compared …
The Open Archives Initiative, Michael L. Nelson
The Open Archives Initiative, Michael L. Nelson
Computer Science Presentations
PDF of a powerpoint presentation from the Open Archives Initiative DRIADE ( Digital Repository of Information and Data for Evolution) Workshop, Durham, North Carolina, May 16-17, 2007. Also available on Slideshare.