Open Access. Powered by Scholars. Published by Universities.®

Social and Behavioral Sciences Commons

Old Dominion University

Library and Information Science

Articles 1 - 30 of 306

Full-Text Articles in Social and Behavioral Sciences

Robots Still Outnumber Humans In Web Archives In 2019, But Less Than In 2015 And 2012, Himarsha R. Jayanetti, Kritika Garg, Sawood Alam, Michael L. Nelson, Michele C. Weigle Jan 2024

Computer Science Faculty Publications

The significance of the web and the crucial role of web archives in its preservation highlight the necessity of understanding how users, both human and robot, access web archive content, and how best to satisfy the disparate needs of both types of users. To identify robots and humans in web archives and analyze their respective access patterns, we used the Internet Archive’s (IA) Wayback Machine access logs from 2012, 2015, and 2019, as well as Arquivo.pt’s (Portuguese Web Archive) access logs from 2019. We identified user sessions in the access logs and classified those sessions as human or robot based …


Book Challenges Popping Up All Over: What Do School Principals Need To Know?, Samantha Laine Hull, Sue Kimmel Jan 2024

STEMPS Faculty Publications

This chapter provides practical advice and reasons for school leaders to support students' intellectual freedom through their support of school libraries and school librarians. The chapter begins with a short but critical literature review that includes case law on the topic of censorship in schools. The concerns of teachers and librarians from a recent study are summarized and help build the foundation for practical, ready-to-use advice that school leaders can follow to uphold the intellectual freedom of all students.


Assessing The Prevalence And Archival Rate Of URIs To Git Hosting Platforms In Scholarly Publications, Emily Escamilla Aug 2023

Computer Science Theses & Dissertations

The definition of scholarly content has expanded to include the data and source code that contribute to a publication. While major archiving efforts to preserve conventional scholarly content, typically in PDFs (e.g., LOCKSS, CLOCKSS, Portico), are underway, no analogous effort has yet emerged to preserve the data and code referenced in those PDFs, particularly the scholarly code hosted online on Git Hosting Platforms (GHPs). Similarly, Software Heritage is working to archive public source code, but there is value in archiving the surrounding ephemera that provide important context to the code while maintaining their original URIs. In current implementations, source code …


FAIR Signposting Profile, Herbert Van De Sompel, Martin Klein, Shawn Jones, Michael L. Nelson, Simeon Warner, Anusuriya Devaraju, Robert Huber, Wilko Steinhoff, Vyacheslav Tykhonov, Luc Boruta, Enno Meijers, Stian Soiland-Reyes, Mark Wilkonson May 2023

Computer Science Faculty Publications

[First paragraph] This page details concrete recipes that platforms that host research outputs (e.g. data repositories, institutional repositories, publisher platforms, etc.) can follow to implement Signposting, a lightweight yet powerful approach to increase the FAIRness of scholarly objects.


Supporting Account-Based Queries For Archived Instagram Posts, Himarsha R. Jayanetti May 2023

Computer Science Theses & Dissertations

Social media has become one of the primary modes of communication in recent times, with popular platforms such as Facebook, Twitter, and Instagram leading the way. Despite its popularity, Instagram has not received as much attention in academic research compared to Facebook and Twitter, and its significant role in contemporary society is often overlooked. Web archives are making efforts to preserve social media content despite the challenges posed by the dynamic nature of these sites. The goal of our research is to facilitate the easy discovery of archived copies, or mementos, of all posts belonging to a specific Instagram account …


Biodiversity Of Philippine Marine Fishes: A DNA Barcode Reference Library Based On Voucher Specimens, Katherine E. Bemis, Matthew G. Girard, Mudjekeewis D. Santos, Kent E. Carpenter, Jonathan R. Deeds, Diane E. Pitassy, Nicko Amor L. Flores, Elizabeth S. Hunter, Amy C. Driskell, Kenneth S. Macdonald III, Lee A. Weigt, Jeffrey T. Williams Jan 2023

Biological Sciences Faculty Publications

Accurate identification of fishes is essential for understanding their biology and to ensure food safety for consumers. DNA barcoding is an important tool because it can verify identifications of both whole and processed fishes that have had key morphological characters removed (e.g., filets, fish meal); however, DNA reference libraries are incomplete, and public repositories for sequence data contain incorrectly identified sequences. During a nine-year sampling program in the Philippines, a global biodiversity hotspot for marine fishes, we developed a verified reference library of cytochrome c oxidase subunit I (COI) sequences for 2,525 specimens representing 984 species. Specimens were primarily purchased …


The 50 Most Cited Papers On Rugby Since 2000 Reveal A Focus Primarily On Strength And Conditioning In Elite Male Players, Katherine J. Hunzinger, Eric Schussler Jan 2023

Rehabilitation Sciences Faculty Publications

We sought to conduct a bibliometric analysis and review of the most cited publications relating to rugby since 2000 in order to identify topics of interest and those that warrant further investigations. Clarivate Web of Science database was used to perform a literature search using the search term "rugby." The top 200 papers by citation count were extracted and reviewed for the inclusion criteria: all subjects were rugby players. The top 50 manuscripts were included for analysis of author, publication year, country of lead authors, institution, journal name and impact factor, topic, participant sex, and level of rugby. The total …


Past Challenges And The Future Of Discrete Event Simulation, Andrew J. Collins, Farinaz Sabz Ali Pour, Craig A. Jordan Jan 2023

Engineering Management & Systems Engineering Faculty Publications

The American scientist Carl Sagan once said: “You have to know the past to understand the present.” We argue that having a meaningful dialogue on the future of simulation requires a baseline understanding of previous discussions on its future. For this paper, we conduct a review of the discrete event simulation (DES) literature that focuses on its future to understand better the path that DES has been following, both in terms of who is using simulation and what directions they think DES should take. Our review involves a qualitative literature review of DES and a quantitative bibliometric analysis of the …


Comparing The "Value Of Information Services" For Providers And Vulnerable Patrons: A Mixed-Methods Study With Academic Libraries And Students With Disabilities, Devendra Potnis, Kevin J. Mallary Jan 2023

STEMPS Faculty Publications

Introduction. This multi-year, mixed-methods study compares (a) the reasons administrators and librarians of academic libraries invest in assistive technology (AT) for delivering information services to students with disabilities, with (b) the benefits that influence these students’ intention to use AT.

Method. In the first phase, 50 library administrators and 22 librarians from 186 public universities across the US shared their top-three reasons for investing in assistive technology through a qualitative survey. In the second phase, 322 students with disabilities from the same institutions completed a quantitative survey, in which respondents shared individual-level benefits that influence their intention to use assistive technology. …


The Trustworthiness Of The Cumulative Knowledge In Industrial/Organizational Psychology: The Current State Of Affairs And A Path Forward, Sheila K. Keener, Sven Kepes, Ann-Kathrin Torka Jan 2023

Management Faculty Publications

The goal of industrial/organizational (IO) psychology is to build and organize trustworthy knowledge about people-related phenomena in the workplace. Unfortunately, as with other scientific disciplines, our discipline may be experiencing a “crisis of confidence” stemming from the lack of reproducibility and replicability of many of our field's research findings, which would suggest that much of our research may be untrustworthy. If a scientific discipline's research is deemed untrustworthy, it can have dire consequences, including the withdrawal of funding for future research. In this focal article, we review the current state of reproducibility and replicability in IO psychology and related fields. …


DeepPatent2: A Large-Scale Benchmarking Corpus For Technical Drawing Understanding, Kehinde Ajayi, Xin Wei, Martin Gryder, Winston Shields, Jian Wu, Shawn M. Jones, Michal Kucer, Diane Oyen Jan 2023

Computer Science Faculty Publications

Recent advances in computer vision (CV) and natural language processing have been driven by exploiting big data on practical applications. However, these research fields are still limited by the sheer volume, versatility, and diversity of the available datasets. CV tasks, such as image captioning, which has primarily been carried out on natural images, still struggle to produce accurate and meaningful captions on sketched images often included in scientific and technical documents. The advancement of other tasks such as 3D reconstruction from 2D images requires larger datasets with multiple viewpoints. We introduce DeepPatent2, a large-scale dataset, providing more than 2.7 million …


Progenitor Cell Isolation From Mouse Epididymal Adipose Tissue And Sequencing Library Construction, Qianglin Liu, Chaoyang Li, Yuxia Li, Leshan Wang, Xujia Zhang, Buhao Deng, Peidong Gao, Mohammad Shiri, Fozi Alkaifi, Junxing Zhao, Jacqueline M. Stephens, Constantine A. Simintiras, Joseph Francis, Jiangwen Sun, Xing Fu Jan 2023

Computer Science Faculty Publications

Here, we present a protocol to isolate progenitor cells from mouse epididymal visceral adipose tissue and construct bulk RNA and assay for transposase-accessible chromatin with sequencing (ATAC-seq) libraries. We describe steps for adipose tissue collection, cell isolation, and cell staining and sorting. We then detail procedures for both ATAC-seq and RNA sequencing library construction. This protocol can also be applied to other tissues and cell types directly or with minor modifications.

For complete details on the use and execution of this protocol, please refer to Liu et al. (2023).1

*1 Liu, Q., Li, C., Deng, B., Gao, P., …


ClaimDistiller: Scientific Claim Extraction With Supervised Contrastive Learning, Xin Wei, Md Reshad Ul Hoque, Jian Wu, Jiang Li Jan 2023

Computer Science Faculty Publications

The growth of scientific papers in the past decades calls for effective claim extraction tools to automatically and accurately locate key claims from unstructured text. Such claims will benefit content-wise aggregated exploration of scientific knowledge beyond the metadata level. One challenge of building such a model is how to effectively use limited labeled training data. In this paper, we compared transfer learning and contrastive learning frameworks in terms of performance, time and training data size. We found contrastive learning has better performance at a lower cost of data across all models. Our contrastive-learning-based model ClaimDistiller has the highest performance, boosting …


"We Collect Tons Of Data... We Report What We Think Our Community Cares The Most About... We Learn So Much From It:" School Librarians' Evidence Collection And Sharing Practices, Jennifer Moore, Maria Cahill, Jeffrey Discala, Wanyi Wang Jan 2023

STEMPS Faculty Publications

Evidence-based practice (EBP) offers school librarians a systematic process for developing, assessing, and revising their school library programs. Two of the seven steps in this process involve collecting and sharing meaningful evidence with appropriate stakeholders, often for advocacy purposes, strategically selecting communication channels and methods aligned with target audiences. Through a survey collecting both quantitative and qualitative data, 161 school librarians in Kentucky, Virginia, and Texas shared their experiences with evidence-based practice. The study reported here focuses on school librarians’ evidence collection and sharing practices. Findings indicate school librarians collect easily obtainable data and share evidence of practice widely; however, …


A Complicated Legacy Defines School Librarians As Teachers, Mary Keeling Jan 2023

STEMPS Faculty Publications

The article analyzes how the legacy of school librarianship informs the future of school librarians as teachers. Topics discussed include lower test scores and fewer opportunities to develop critical thinking and digital literacy skills among students, the need for the federal government to recognize the importance of school libraries, and how a well-staffed school library with a qualified librarian can provide essential services and resources.


Culturally Responsive Librarians: Shifting Perspectives Toward Racial Empathy, Elizabeth A. Burns Jan 2023

STEMPS Faculty Publications

Libraries are charged with being inclusive spaces for all patrons. Library (library and information science [LIS]) preparation programs, by extension, must prepare the next generation of librarians to meet the needs of an increasingly diverse population. It is imperative that today’s librarians are equipped to infuse diversity, equity, and inclusion (DEI) theory with best practice when establishing policy and procedure for the library environment, staff, and programming. With little research and no established protocol in LIS education, it is unclear how pre-service librarians are trained in DEI to meet the needs of all users. This exploratory study used a participatory …


School Library Advocacy: Enhancing Opportunities For All Learners, Elizabeth A. Burns Jan 2023

STEMPS Faculty Publications

There is a lack of consistency in how school librarians understand and engage in advocacy for their programs and the profession (Burns, 2015; Lance & Kachel, 2018). It is important that school librarians demonstrate the positive impact they contribute to improving instruction for student learners and advancing access so that all learners have equal opportunities. The school library and school librarian should be available to each student (Kachel, 2021). The School Library Manifesto (2021) includes language to support a strong school library with a qualified school librarian. Advocacy goals can be highlighted throughout this critical set of school library guidelines. …


Hashes Are Not Suitable To Verify Fixity Of The Public Archived Web, Mohamed Aturban, Martin Klein, Herbert Van De Sompel, Sawood Alam, Michael L. Nelson, Michele C. Weigle Jan 2023

Computer Science Faculty Publications

Web archives, such as the Internet Archive, preserve the web and allow access to prior states of web pages. We implicitly trust their versions of archived pages, but as their role moves from preserving curios of the past to facilitating present day adjudication, we are concerned with verifying the fixity of archived web pages, or mementos, to ensure they have always remained unaltered. A widely used technique in digital preservation to verify the fixity of an archived resource is to periodically compute a cryptographic hash value on a resource and then compare it with a previous hash value. If the …
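
The hash-and-compare technique described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' method: the choice of SHA-256, the function names `sha256_hex` and `fixity_unchanged`, and the sample payload are all assumptions made for the example.

```python
import hashlib


def sha256_hex(content: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as a hex string."""
    return hashlib.sha256(content).hexdigest()


def fixity_unchanged(content: bytes, previous_digest: str) -> bool:
    """Compare a freshly computed digest with one recorded earlier."""
    return sha256_hex(content) == previous_digest


# Hypothetical archived payload; a real check would read the stored bytes.
original = b"<html><body>archived page</body></html>"
recorded = sha256_hex(original)

same_bytes_ok = fixity_unchanged(original, recorded)           # identical bytes
altered_ok = fixity_unchanged(original + b" ", recorded)       # one byte appended
print(same_bytes_ok, altered_ok)
```

Any change to the bytes, even a single appended space, yields a different digest, which is why the comparison only works when the resource is served byte-for-byte identically each time.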


Diffusion Of Public Library Innovations: A Case Study On Parking Lot Wi-Fi Hotspots Diffusion Development, Samantha Laine Hull Aug 2022

STEMPS Theses & Dissertations

Public libraries have begun to provide services well beyond books and online databases. Prior to the pandemic, many libraries expanded their collection to include items like power drills or board games in their circulation. They also started partnering with social service organizations to better serve their patrons’ needs beyond those that are educational and entertainment based. Despite being broadly trusted by most people and having clever and innovative ideas, some public libraries’ budgets and time limits left marketing efforts at a minimum. In order to address the communication problem many public libraries face, in this study I sought to align …


Scholarly Big Data Quality Assessment: A Case Study Of Document Linking And Conflation With S2ORC, Jian Wu, Ryan Hiltabrand, Dominik Soós, C. Lee Giles Jan 2022

Computer Science Faculty Publications

Recently, the Allen Institute for Artificial Intelligence released the Semantic Scholar Open Research Corpus (S2ORC), one of the largest open-access scholarly big datasets with more than 130 million scholarly paper records. S2ORC contains a significant portion of automatically generated metadata. The metadata quality could impact downstream tasks such as citation analysis, citation prediction, and link analysis. In this project, we assess the document linking quality and estimate the document conflation rate for the S2ORC dataset. Using semi-automatically curated ground truth corpora, we estimated that the overall document linking quality is high, with 92.6% of documents correctly linking to six major …


A Gamefied Synthetic Environment For Evaluation Of Counter-Disinformation Solutions, Jesse Richman, Lora Pitman, Girish S. Nandakumar Jan 2022

Political Science & Geography Faculty Publications

This paper presents a simulation-based approach to countering online dis/misinformation. This disruptive technology experiment incorporated a synthetic environment component, based on an adapted SIR epidemiological model, to evaluate and visualize the effectiveness of suggested solutions to the issue. The participants in the simulation were given a realistic scenario depicting a dis/misinformation threat and were asked to select a number of solutions, described in IoS (Ideas-of-Systems) cards. During the event, the qualitative and quantitative characteristics of the IoS cards were tested in a synthetic environment (SEN), built after a Susceptible-Infected-Resistant (SIR) model. The participants, divided into teams, presented and justified their dis/misinformation …
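
For readers unfamiliar with SIR models, the sketch below shows the textbook discrete-time form. The abstract does not specify how the authors adapted the model, so the update rule, the parameter names `beta` and `gamma`, and the starting values here are illustrative assumptions only.

```python
def sir_step(s, i, r, beta, gamma):
    """Advance a textbook discrete-time SIR model by one step.

    s, i, r: susceptible, infected, and resistant counts.
    beta: assumed transmission rate per step; gamma: assumed recovery rate.
    """
    n = s + i + r
    new_infections = beta * s * i / n
    new_recoveries = gamma * i
    return (
        s - new_infections,
        i + new_infections - new_recoveries,
        r + new_recoveries,
    )


def simulate(s, i, r, beta, gamma, steps):
    """Return the list of (s, i, r) states, including the initial state."""
    history = [(s, i, r)]
    for _ in range(steps):
        s, i, r = sir_step(s, i, r, beta, gamma)
        history.append((s, i, r))
    return history


# Hypothetical parameters: 990 susceptible accounts, 10 already "infected".
history = simulate(990.0, 10.0, 0.0, beta=0.3, gamma=0.1, steps=50)
final_s, final_i, final_r = history[-1]
print(round(final_s + final_i + final_r))
```

In a misinformation setting, a counter-measure would typically be modeled as lowering `beta` (less spread) or raising `gamma` (faster loss of belief), and its effect read off the resulting curves.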


The DSA Toolkit Shines Light Into Dark And Stormy Archives, Shawn Morgan Jones, Himarsha R. Jayanetti, Alex Osborne, Paul Koerbin, Klein Martin, Michele C. Weigle, Michael L. Nelson Jan 2022

Computer Science Faculty Publications

Web archive collections are created with a particular purpose in mind. A curator selects seeds, or original resources, which are then captured by an archiving system and stored as archived web pages, or mementos. The systems that build web archive collections are often configured to revisit the same original resource multiple times. This is incredibly useful for understanding an unfolding news story or the evolution of an organization. Unfortunately, over time, some of these original resources can go off-topic and no longer suit the purpose for which the collection was originally created. They can go off-topic due to web site …


Theory Entity Extraction For Social And Behavioral Sciences Papers Using Distant Supervision, Xin Wei, Lamia Salsabil, Jian Wu Jan 2022

Computer Science Faculty Publications

Theories and models, which are common in scientific papers in almost all domains, usually provide the foundations of theoretical analysis and experiments. Understanding the use of theories and models can shed light on the credibility and reproducibility of research works. Compared with metadata, such as title, author, keywords, etc., theory extraction in scientific literature is rarely explored, especially for social and behavioral science (SBS) domains. One challenge of applying supervised learning methods is the lack of a large number of labeled samples for training. In this paper, we propose an automated framework based on distant supervision that leverages entity mentions …


Research In Action: Impacting Library Communities With Field-Based Projects, Elizabeth A. Burns Jan 2022

STEMPS Faculty Publications

Our library and information studies (LIS) program is grounded in the principles of social justice, leadership, and authentic practice. One way candidates of the program meet these ideals is through participation in a required internship. During the internship, students complete an independent project on site at their internship location.

Using Elliot’s (1991) steps of action research, the students in the internship course identify an issue, collect or use data to inform action, analyze the findings, and reflect on the results. An initial needs assessment is conducted. This includes establishing a rationale to inform practice. Students then implement a hands-on response …


Opportunities For Autism Information Shared Through Professional Conferences, Amelia Anderson, Selena Layden, Crystal Stang (Ed.), Jennifer L. Branch-Mueller (Ed.) Jan 2022

STEMPS Faculty Publications

With prevalence most recently reported at 1 in 44 (Maenner et al., 2021) children diagnosed with autism spectrum disorder (ASD) in the United States, school librarians can and should expect to see these children in their schools and in their libraries. However, previous work indicates that school librarians are not being provided with an adequate education about this in their graduate coursework (Layden, Anderson, & Hayden, 2021). This study expands upon previous work to explore the preparation of school librarians about autism by examining the previous five years of state library conference programs.


StreamingHub: Interactive Stream Analysis Workflows, Yasith Jayawardana, Vikas G. Ashok, Sampath Jayarathna Jan 2022

Computer Science Faculty Publications

Reusable data/code and reproducible analyses are foundational to quality research. This aspect, however, is often overlooked when designing interactive stream analysis workflows for time-series data (e.g., eye-tracking data). A mechanism to transmit informative metadata alongside data may allow such workflows to intelligently consume data, propagate metadata to downstream tasks, and thereby auto-generate reusable, reproducible analytic outputs with zero supervision. Moreover, a visual programming interface to design, develop, and execute such workflows may allow rapid prototyping for interdisciplinary research. Capitalizing on these ideas, we propose StreamingHub, a framework to build metadata propagating, interactive stream analysis workflows using visual programming. We conduct …


D-Lib Magazine Pioneered Web-Based Scholarly Communication, Michael L. Nelson, Herbert Van De Sompel Jan 2022

Computer Science Faculty Publications

The web began with a vision of, as stated by Tim Berners-Lee in 1991, “that much academic information should be freely available to anyone”. For many years, the development of the web and the development of digital libraries and other scholarly communications infrastructure proceeded in tandem. A milestone occurred in July, 1995, when the first issue of D-Lib Magazine was published as an online, HTML-only, open access magazine, serving as the focal point for the then emerging digital library research community. In 2017 it ceased publication, in part due to the maturity of the community it served as well as …


This Old Vase: Ancient Art And Primary Source Instruction In The Archives, Laraann Canner Nov 2021

Libraries Faculty & Staff Publications

No abstract provided.


It Started With A Zine And Ended With A Zoom: How We Successfully Created A Virtual Arts Festival During COVID-19 At ODU, Laraann Canner, Gay Acompanado, Jennifer Hoyt Oct 2021

Libraries Faculty & Staff Presentations

No abstract provided.


Improving Collection Understanding For Web Archives With Storytelling: Shining Light Into Dark And Stormy Archives, Shawn M. Jones Jul 2021

Computer Science Theses & Dissertations

Collections are the tools that people use to make sense of an ever-increasing number of archived web pages. As collections themselves grow, we need tools to make sense of them. Tools that work on the general web, like search engines, are not a good fit for these collections because search engines do not currently represent multiple document versions well. Web archive collections are vast, some containing hundreds of thousands of documents. Thousands of collections exist, many of which cover the same topic. Few collections include standardized metadata. Too many documents from too many collections with insufficient metadata makes collection understanding …