Open Access. Powered by Scholars. Published by Universities.®

Social and Behavioral Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 18 of 18

Full-Text Articles in Social and Behavioral Sciences

Extracting Information From Twitter Screenshots, Tarannum Zaki, Michael L. Nelson, Michele C. Weigle Apr 2023

Extracting Information From Twitter Screenshots, Tarannum Zaki, Michael L. Nelson, Michele C. Weigle

Modeling, Simulation and Visualization Student Capstone Conference

Screenshots are prevalent on social media as a common approach for information sharing. Users rarely verify before sharing screenshots whether they are fake or real. Information sharing through fake screenshots can be highly responsible for misinformation and disinformation spread on social media. There are services of the live web and web archives that could be used to validate the content of a screenshot. We are going to develop a tool that would automatically provide a probability whether a screenshot is fake by using the services of the live web and web archives.


The Dsa Toolkit Shines Light Into Dark And Stormy Archives, Shawn Morgan Jones, Himarsha R. Jayanetti, Alex Osborne, Paul Koerbin, Klein Martin, Michele C. Weigle, Michael L. Nelson Jan 2022

The Dsa Toolkit Shines Light Into Dark And Stormy Archives, Shawn Morgan Jones, Himarsha R. Jayanetti, Alex Osborne, Paul Koerbin, Klein Martin, Michele C. Weigle, Michael L. Nelson

Computer Science Faculty Publications

Web archive collections are created with a particular purpose in mind. A curator selects seeds, or original resources, which are then captured by an archiving system and stored as archived web pages, or mementos. The systems that build web archive collections are often configured to revisit the same original resource multiple times. This is incredibly useful for understanding an unfolding news story or the evolution of an organization. Unfortunately, over time, some of these original resources can go off-topic and no longer suit the purpose for which the collection was originally created. They can go off-topic due to web site …


Improving Collection Understanding For Web Archives With Storytelling: Shining Light Into Dark And Stormy Archives, Shawn M. Jones Jul 2021

Improving Collection Understanding For Web Archives With Storytelling: Shining Light Into Dark And Stormy Archives, Shawn M. Jones

Computer Science Theses & Dissertations

Collections are the tools that people use to make sense of an ever-increasing number of archived web pages. As collections themselves grow, we need tools to make sense of them. Tools that work on the general web, like search engines, are not a good fit for these collections because search engines do not currently represent multiple document versions well. Web archive collections are vast, some containing hundreds of thousands of documents. Thousands of collections exist, many of which cover the same topic. Few collections include standardized metadata. Too many documents from too many collections with insufficient metadata makes collection understanding …


Web Archives At The Nexus Of Good Fakes And Flawed Originals, Michael L. Nelson Jan 2019

Web Archives At The Nexus Of Good Fakes And Flawed Originals, Michael L. Nelson

Computer Science Faculty Publications

[Summary] The authenticity, integrity, and provenance of resources we encounter on the web are increasingly in question. While many people are inured to the possibility of altered images, the easy accessibility of powerful software tools that synthesize audio and video will unleash a torrent of convincing “deepfakes” into our social discourse. Archives will no longer be monopolized by a countable number of institutions such as governments and publishers, but will become a competitive space filled with social engineers, propagandists, conspiracy theorists, and aspiring Hollywood directors. While the historical record has never been singular nor unmalleable, current technologies empower an unprecedented …


It Is Hard To Compute Fixity On Archived Web Pages, Mohamed Aturban, Michael L. Nelson, Michele C. Weigle Jan 2018

It Is Hard To Compute Fixity On Archived Web Pages, Mohamed Aturban, Michael L. Nelson, Michele C. Weigle

Computer Science Faculty Publications

[Introduction] Checking fixity in web archives is performed to ensure archived resources, or mementos (denoted by URI-M) have remained unaltered since when they were captured. The final report of the PREMIS Working Group [2] defines information used for fixity as "information used to verify whether an object has been altered in an undocumented or unauthorized way." The common technique for checking fixity is to generate a current hash value (i.e., a message digest or a checksum) for a file using a cryptographic hash function (e.g., SHA-256) and compare it to the hash value generated originally. If they have different hash …


Using Web Archives To Enrich The Live Web Experience Through Storytelling, Yasmin Alnoamany Jul 2016

Using Web Archives To Enrich The Live Web Experience Through Storytelling, Yasmin Alnoamany

Computer Science Theses & Dissertations

Much of our cultural discourse occurs primarily on the Web. Thus, Web preservation is a fundamental precondition for multiple disciplines. Archiving Web pages into themed collections is a method for ensuring these resources are available for posterity. Services such as Archive-It exists to allow institutions to develop, curate, and preserve collections of Web resources. Understanding the contents and boundaries of these archived collections is a challenge for most people, resulting in the paradox of the larger the collection, the harder it is to understand. Meanwhile, as the sheer volume of data grows on the Web, "storytelling" is becoming a popular …


Tools Managing Seed Urls (Detecting Off-Topic Pages), Yasmin Alnoamany, Michele C. Weigle, Michael L. Nelson Jun 2015

Tools Managing Seed Urls (Detecting Off-Topic Pages), Yasmin Alnoamany, Michele C. Weigle, Michael L. Nelson

Computer Science Presentations

PDF of a powerpoint presentation from the Columbia University Web Archiving Collaboration: New Tools and Models Conference, in New York, New York, June 4-5, 2015. Also available on Slideshare.


Tools For Managing The Past Web, Michele C. Weigle, Michael L. Nelson, Yasmin Alnoamany, Ahmed Alsum, Justin Brunelle, Mat Kelly, Hany Salaheldeen Nov 2014

Tools For Managing The Past Web, Michele C. Weigle, Michael L. Nelson, Yasmin Alnoamany, Ahmed Alsum, Justin Brunelle, Mat Kelly, Hany Salaheldeen

Computer Science Presentations

PDF of a powerpoint presentation from the Archive-It Partners Meeting in Montgomery, Alabama, November 18, 2014. Also available on Slideshare.


"Archive What I See Now" Bringing Institutional Web Archiving Tools To The Individual Researcher, Michele C. Weigle, Michael L. Nelson, Liza Potts Sep 2014

"Archive What I See Now" Bringing Institutional Web Archiving Tools To The Individual Researcher, Michele C. Weigle, Michael L. Nelson, Liza Potts

Computer Science Presentations

PDF of a powerpoint presentation from the 2014 National Endowment for the Humanities (NEH) Office of Digital Humanities (ODH) Project Directors' Meeting in Washington D. C., September 15, 2014. Also available form Slideshare.


Bits Of Research, Michele C. Weigle Jun 2014

Bits Of Research, Michele C. Weigle

Computer Science Presentations

PDF of a powerpoint presentation that provides an overview of digital preservation, web archiving, and information visualization research; dated June 26, 2014. Also available on Slideshare.


Moved But Not Gone: An Evaluation Of Real-Time Methods For Discovering Replacement Web Pages, Martin Klein, Michael L. Nelson Jan 2014

Moved But Not Gone: An Evaluation Of Real-Time Methods For Discovering Replacement Web Pages, Martin Klein, Michael L. Nelson

Computer Science Faculty Publications

Inaccessible Web pages and 404 “Page Not Found” responses are a common Web phenomenon and a detriment to the user’s browsing experience. The rediscovery of missing Web pages is, therefore, a relevant research topic in the digital preservation as well as in the Information Retrieval realm. In this article, we bring these two areas together by analyzing four content- and link-based methods to rediscover missing Web pages. We investigate the retrieval performance of the methods individually as well as their combinations and give an insight into how effective these methods are over time. As the main result of this work, …


Telling Stories With Web Archives, Michele C. Weigle Nov 2013

Telling Stories With Web Archives, Michele C. Weigle

Computer Science Presentations

PDF of a powerpoint presentation from the Southeast Women in Computing Conference in Lake Guntersville State Park, Alabama, November 16, 2013. Also available on Slideshare.


Why Care About The Past?, Michael L. Nelson, Michele C. Weigle Jan 2012

Why Care About The Past?, Michael L. Nelson, Michele C. Weigle

Computer Science Presentations

A set of slides used in various presentations by the authors to show that replaying an experience via archived web pages is more compelling than reading a summary of the event. Also available on Slideshare.


My Point Of View, Michael L. Nelson Sep 2010

My Point Of View, Michael L. Nelson

Computer Science Presentations

PDF of a powerpoint presentation from the Web Archiving Cooperative (WAC) Meeting, Stanford University, September 9, 2010. Also available on Slideshare.


(Re-) Discovering Lost Web Pages, Martin Klein, Michael L. Nelson Oct 2009

(Re-) Discovering Lost Web Pages, Martin Klein, Michael L. Nelson

Computer Science Presentations

PDF of a powerpoint presentation from a Mathematics & Computer Science Seminar at Emory University, Atlanta, Georgia, October 2, 2009. Also available on Slideshare.


Synchronicity: Just-In-Time Discovery Of Lost Web Pages, Martin Klein, Michael L. Nelson Jun 2009

Synchronicity: Just-In-Time Discovery Of Lost Web Pages, Martin Klein, Michael L. Nelson

Computer Science Presentations

PDF of a powerpoint presentation from the National Digital Information Infrastructure and Preservation Program (NDIIPP) Partners Meeting, Washington D.C., June 24-25, 2009. Also available on Slideshare.


Can't Find Your 404s?, Martin Klein, Frank Mccown, Joan Smith, Michael L. Nelson Mar 2009

Can't Find Your 404s?, Martin Klein, Frank Mccown, Joan Smith, Michael L. Nelson

Computer Science Presentations

PDF of a powerpoint presentation at the Santa Fe Complex, Santa Fe, New Mexico, March 13, 2009. Also available on Slideshare.


Tools For A Preservation-Ready Web, Joan A. Smith, Michael L. Nelson Jul 2008

Tools For A Preservation-Ready Web, Joan A. Smith, Michael L. Nelson

Computer Science Presentations

PDF of a powerpoint presentation from the National Digital Information Infrastructure and Preservation Program (NDIIPP) Partners Meeting, Washington D.C., July 9, 2008. Also available on Slideshare.