Open Access. Powered by Scholars. Published by Universities.®

Library and Information Science Commons

Physical Sciences and Mathematics

Series

2022

Articles 1 - 29 of 29

Full-Text Articles in Library and Information Science

Creating Data From Unstructured Text With Context Rule Assisted Machine Learning (CRAML), Stephen Meisenbacher, Peter Norlander Dec 2022

School of Business: Faculty Publications and Other Works

Popular approaches to building data from unstructured text come with limitations, such as scalability, interpretability, replicability, and real-world applicability. These can be overcome with Context Rule Assisted Machine Learning (CRAML), a method and no-code suite of software tools that builds structured, labeled datasets which are accurate and reproducible. CRAML enables domain experts to access uncommon constructs within a document corpus in a low-resource, transparent, and flexible manner. CRAML produces document-level datasets for quantitative research and makes qualitative classification schemes scalable over large volumes of text. We demonstrate that the method is useful for bibliographic analysis, transparent analysis of proprietary data, …


Safe Sharing For Sensitive Data, Kristi Thompson Dec 2022

Western Libraries Presentations

This workshop focused on the question of when and how human subjects' data can be safely shared. It introduced the basics of data anonymization and discussed how to tell if a dataset has been de-identified. Case studies of successful anonymization and some spectacular failures were shared.


Getting Started Analyzing Data In SPSS, Kristi Thompson Nov 2022

Western Libraries Presentations

SPSS is a popular package for analyzing data. This session will discuss how to get started on a simple quantitative analysis project using SPSS. Topics covered will include getting summary statistics, creating and modifying variables, creating graphs, running simple analyses, and interpreting SPSS output.


Why We Should Remember The Soviet Information Age?, Ksenia Tatarchenko Oct 2022

Research Collection College of Integrative Studies

How do we navigate the rapidly changing digital geopolitics of the world today? How do we make sense of digital transformation and its many social, political, cultural, and environmental implications at different locations around the world?


Developing Research Data Management Services In A Regional Comprehensive University: The Case Of Central Washington University, Ping Fu, Maurice Blackson, Maura Valentino Aug 2022

Library Scholarship

This study aims to analyze the needs of researchers in a regional comprehensive university for research data management services; discuss the options for developing a research data management program at the university; and then propose a phased three-year implementation plan for the university libraries. The method was to design a survey to collect information from researchers and assess and evaluate their needs for research data management services. The results show that researchers’ needs in a regional comprehensive university could be quite different from those of researchers in a research-intensive university. Also, the results verify the hypothesis that researchers in the …


GEO 100: Environmental Geology OER Curation, Chealsye Bowley Apr 2022

Curated OER Collections

This OER curation is an annotated bibliography of prospective OER for the GVSU course GEO 100: Environmental Geology.


ENS 300: Principles Of Sustainability OER Curation, Chealsye Bowley Apr 2022

Curated OER Collections

This OER curation is an annotated bibliography of prospective OER for the GVSU course ENS 300: Principles of Sustainability, assembled by request from the instructor.


A Mathematical Model For The Adoption Of Information And Communication Technology In School Libraries In Nigeria, Helen Olubunmi Jaiyeola Akinade, Jeremiah Ademola Balogun, Peter Adebayo Idowu Apr 2022

Library Philosophy and Practice (e-journal)

This study focused on the development of a mathematical model required for estimating the number of adopters of ICT devices among libraries located in Nigeria. Data for this study were collected from 121 respondents selected through a survey using simple random sampling. Nine ICT devices were identified, namely: PCs, printers/fax machines, search engines, e-library systems, bulk SMS services, library management systems, bar/QR code readers, projectors and video conferencing. The results showed that the earliest ICT devices (PCs, printers/fax machines and search engines) were adopted for use in 1997. The remaining ICT devices were adopted in …


Building Capacity For Data-Driven Scholarship, Jamie Rogers Mar 2022

Works of the FIU Libraries

This talk provides an overview of "dLOC as Data: A Thematic Approach to Caribbean Newspapers," an initiative developed to increase access to digitized Caribbean newspaper text for bulk download, facilitating computational analysis. Because capacity building for future research in Caribbean Studies is a crucial aspect of this initiative, a thematic toolkit was developed to facilitate use of the project data and to provide replicable processes. The toolkit includes sample text analysis projects, as well as tutorials and detailed project documentation. While the toolkit focuses on the history of hurricanes and tropical cyclones of the region, the methodologies and tools used …


Communicating Science With Little (Or No) Budget: Design Rules And Tricks For The Non-Artist, Kiyomi D. Deards Mar 2022

University of Nebraska-Lincoln Libraries: Conference Presentations and Speeches

This presentation is for the self-proclaimed non-artist scientist who wants to communicate science effectively but has little (or no) budget to hire professionals to create and edit images (artwork, tables, graphs), websites, presentation slides, and publications. For this scientist, learning basic easy-to-apply design rules and tricks can facilitate the preparation of scientific material. The speaker has experience designing formal and informal presentations, creating videos and podcasts, working with graphic designers, and designing websites. The speaker will provide tips and suggestions based on her own experiences, collaborations, and acting as a consultant for informal science communication projects. Moreover, strategies for using …


Online Masters In Data Science, Joanna Burkhardt Feb 2022

Library Impact Statements

No abstract provided.


Mathematical Foundations For Data Science AMS/DSP 563, Harrison Dekker Feb 2022

Collection Development Reports and Documents

No abstract provided.


Data Analytics And Visualization DSP 562, Harrison Dekker Feb 2022

Collection Development Reports and Documents

No abstract provided.


Advanced Topics In Machine Learning DSP 566, Harrison Dekker Feb 2022

Collection Development Reports and Documents

No abstract provided.


Advanced Database Concepts, Cloud Computing And Big Data DSP 567, Harrison Dekker Feb 2022

Collection Development Reports and Documents

No abstract provided.


Introduction To Statistical Computing DSP 565, Harrison Dekker Feb 2022

Collection Development Reports and Documents

No abstract provided.


Data Science For Business DSP 568, Harrison Dekker Feb 2022

Collection Development Reports and Documents

No abstract provided.


Applications Of Data Science In Biological Science DSP 569, Harrison Dekker Feb 2022

Collection Development Reports and Documents

No abstract provided.


Environmental Isotope Geochemistry OCG 550, Joanna Burkhardt Jan 2022

Library Impact Statements

No abstract provided.


A Call For The Library Community To Deploy Best Practices Toward A Database For Biocultural Knowledge Relating To Climate Change, Martha B. Lerski Jan 2022

Publications and Research

Abstract

Purpose – This paper presents a call to the library and information science community to support documentation and conservation of cultural and biocultural heritage.

Design/methodology/approach – Based on existing literature, this proposal is generative and descriptive, rather than prescriptive, regarding precisely how libraries should collaborate to employ technical and ethical best practices to provide access to vital data, research and cultural narratives relating to climate.

Findings – COVID-19 and climate destruction signal urgent global challenges. Library best practices are positioned to respond to climate change. Literature indicates how libraries preserve, share and cross-link cultural and scientific knowledge. …


Campus Mobile History Application, Drew Adan, Christine Sears Jan 2022

Summer Community of Scholars (RCEU and HCR) Project Proposals

No abstract provided.


Law Library Blog (January 2022): Legal Beagle's Blog Archive, Roger Williams University School Of Law Jan 2022

Law Library Newsletters/Blog

No abstract provided.


D-Lib Magazine Pioneered Web-Based Scholarly Communication, Michael L. Nelson, Herbert Van De Sompel Jan 2022

Computer Science Faculty Publications

The web began with a vision of, as stated by Tim Berners-Lee in 1991, “that much academic information should be freely available to anyone”. For many years, the development of the web and the development of digital libraries and other scholarly communications infrastructure proceeded in tandem. A milestone occurred in July 1995, when the first issue of D-Lib Magazine was published as an online, HTML-only, open access magazine, serving as the focal point for the then-emerging digital library research community. In 2017 it ceased publication, in part due to the maturity of the community it served as well as …


Scholarly Big Data Quality Assessment: A Case Study Of Document Linking And Conflation With S2ORC, Jian Wu, Ryan Hiltabrand, Dominik Soós, C. Lee Giles Jan 2022

Computer Science Faculty Publications

Recently, the Allen Institute for Artificial Intelligence released the Semantic Scholar Open Research Corpus (S2ORC), one of the largest open-access scholarly big datasets with more than 130 million scholarly paper records. S2ORC contains a significant portion of automatically generated metadata. The metadata quality could impact downstream tasks such as citation analysis, citation prediction, and link analysis. In this project, we assess the document linking quality and estimate the document conflation rate for the S2ORC dataset. Using semi-automatically curated ground truth corpora, we estimated that the overall document linking quality is high, with 92.6% of documents correctly linking to six major …


The DSA Toolkit Shines Light Into Dark And Stormy Archives, Shawn Morgan Jones, Himarsha R. Jayanetti, Alex Osborne, Paul Koerbin, Martin Klein, Michele C. Weigle, Michael L. Nelson Jan 2022

Computer Science Faculty Publications

Web archive collections are created with a particular purpose in mind. A curator selects seeds, or original resources, which are then captured by an archiving system and stored as archived web pages, or mementos. The systems that build web archive collections are often configured to revisit the same original resource multiple times. This is incredibly useful for understanding an unfolding news story or the evolution of an organization. Unfortunately, over time, some of these original resources can go off-topic and no longer suit the purpose for which the collection was originally created. They can go off-topic due to web site …


Theory Entity Extraction For Social And Behavioral Sciences Papers Using Distant Supervision, Xin Wei, Lamia Salsabil, Jian Wu Jan 2022

Computer Science Faculty Publications

Theories and models, which are common in scientific papers in almost all domains, usually provide the foundations of theoretical analysis and experiments. Understanding the use of theories and models can shed light on the credibility and reproducibility of research works. Compared with metadata, such as title, author, keywords, etc., theory extraction in scientific literature is rarely explored, especially for social and behavioral science (SBS) domains. One challenge of applying supervised learning methods is the lack of a large number of labeled samples for training. In this paper, we propose an automated framework based on distant supervision that leverages entity mentions …


StreamingHub: Interactive Stream Analysis Workflows, Yasith Jayawardana, Vikas G. Ashok, Sampath Jayarathna Jan 2022

Computer Science Faculty Publications

Reusable data/code and reproducible analyses are foundational to quality research. This aspect, however, is often overlooked when designing interactive stream analysis workflows for time-series data (e.g., eye-tracking data). A mechanism to transmit informative metadata alongside data may allow such workflows to intelligently consume data, propagate metadata to downstream tasks, and thereby auto-generate reusable, reproducible analytic outputs with zero supervision. Moreover, a visual programming interface to design, develop, and execute such workflows may allow rapid prototyping for interdisciplinary research. Capitalizing on these ideas, we propose StreamingHub, a framework to build metadata propagating, interactive stream analysis workflows using visual programming. We conduct …


Guide To The Dr. L.S. Dederick Papers, 1908-1956, Undated, Orson Kingsley, Patrick Koetsch Jan 2022

Archives & Special Collections Finding Aids

Louis Serle (L.S.) Dederick was born in Chicago in 1883. He received his Ph.D. in Mathematics from Harvard University in 1909. From 1909 to 1917 he was a professor at Princeton University. From 1917 to 1924 he was a professor at the U.S. Naval Academy in Annapolis, Maryland. In 1926 Dederick began working for the U.S. Army, Ordnance. During his time there he was the Associate Director of the Ballistic Research Laboratory at the Aberdeen Proving Grounds in Aberdeen, Maryland, where he focused on ballistics research.

While Dederick worked as a mathematician at the Aberdeen Proving Grounds, he was involved with …


Understanding The Enumerated World: Making Sense Of Data As An Information Source, Kristi Thompson, Elizabeth Hill, Alexandra Cooper Jan 2022

Western Libraries Publications

Chapter in ACRL publication The Data Literacy Cookbook.

This recipe is a guide to preparing an instructional session aimed at postsecondary students in the social or health sciences or related disciplines on locating, evaluating, and using secondary data sources as information resources. Who collects data? Where can you access them? Why are data available on some topics and not others? Why are some statistics available at a detailed level of geography and others only nationally? What are some key limitations of official statistics, and where can information be found to fill in the gaps? This recipe uses these questions to …