Open Access. Powered by Scholars. Published by Universities.®
- Institution
- Publication
-
- University of Nebraska-Lincoln Libraries: Conference Presentations and Speeches (4)
- CDRH Grant Reports (2)
- CSE Conference and Workshop Papers (2)
- Other resources (2)
- Accessus (1)
-
- Bucknell University Digital Scholarship Conference (1)
- CSE Technical Reports (1)
- College of Computing and Digital Media Dissertations (1)
- Computer Ethics - Philosophical Enquiry (CEPE) Proceedings (1)
- Conference papers (1)
- Criticism (1)
- Dissertations, Theses, and Capstone Projects (1)
- Electronic Thesis and Dissertation Repository (1)
- MFA Thesis Exhibit Catalogs (1)
- Publications (1)
- Publications and Research (1)
- Senior Projects Spring 2017 (1)
- UNL Libraries: Faculty Publications (1)
- Publication Type
Articles 1 - 24 of 24
Full-Text Articles in Arts and Humanities
Towards An Experimental Bibliography Of Hemispheric Reconstruction Newspapers, Joshua Ortiz Baco, Benjamin Charles Germain Lee, Jim Casey, Sarah H. Salter
Towards An Experimental Bibliography Of Hemispheric Reconstruction Newspapers, Joshua Ortiz Baco, Benjamin Charles Germain Lee, Jim Casey, Sarah H. Salter
Criticism
Digital collections of newspapers have drawn broader attention to the fragmented and scattered print histories of minoritized communities. Attempts to survey these histories through bibliography, however, quickly meet with a fundamental problem: the practice of bibliographic description calls for creating a static record of social affiliations. Given the overwhelming scholarly consensus that categories such as race, ethnicity, and language are socially constructed, this article introduces an experimental bibliographic method for mapping the vast landscape of historical newspapers. This method extends the machine learning affordances of a recent project called Newspaper Navigator to enumerate the newspapers in Chronicling America according to …
Defining And Detecting Toxicity On Social Media: Context And Knowledge Are Key, Amit Sheth, Valerie Shalin, Ugur Kursuncu
Defining And Detecting Toxicity On Social Media: Context And Knowledge Are Key, Amit Sheth, Valerie Shalin, Ugur Kursuncu
Publications
As the role of online platforms has become increasingly prominent for communication, toxic behaviors, such as cyberbullying and harassment, have been rampant in the last decade. On the other hand, online toxicity is multi-dimensional and sensitive in nature, which makes its detection challenging. As the impact of exposure to online toxicity can lead to serious implications for individuals and communities, reliable models and algorithms are required for detecting and understanding such communications. In this paper We define toxicity to provide a foundation drawing social theories. Then, we provide an approach that identifies multiple dimensions of toxicity and incorporates explicit knowledge …
Pathways To The Native Storyteller: A Method To Enable Computational Story Understanding, Aramide O. Kehinde
Pathways To The Native Storyteller: A Method To Enable Computational Story Understanding, Aramide O. Kehinde
College of Computing and Digital Media Dissertations
The primary objective of this thesis is to develop a method that uses machine learning algorithms to enable computational story understanding. This research is conducted with the aim of establishing a system called the Native Storyteller that plans and creates storytelling experiences for human users. The paper first establishes the desired capabilities of the system and then deep dives into how to enable story understanding, which is the core ability the system needs to function. As such, the research places emphasis on natural language processing and its application to solving key problems in this context. Namely, machine representation of story …
Introducing Gower Shorts, Eve Salisbury, Georgiana Donavin
Introducing Gower Shorts, Eve Salisbury, Georgiana Donavin
Accessus
This essay provides an introduction to a special issue of Accessus, entitled "Gower Shorts," slightly expanded and revised versions of conference papers presented at the International Congress on Medieval Studies at Kalamazoo in 2018-19 and the Modern Language Association Conference in Chicago in 2019.
Final Presentation To The Library Of Congress On Digital Libraries, Intelligent Data Analytics, And Augmented Description, Elizabeth Lorang, Leen-Kiat Soh, Yi Liu, Chulwoo Pack
Final Presentation To The Library Of Congress On Digital Libraries, Intelligent Data Analytics, And Augmented Description, Elizabeth Lorang, Leen-Kiat Soh, Yi Liu, Chulwoo Pack
University of Nebraska-Lincoln Libraries: Conference Presentations and Speeches
This presentation to Library of Congress staff, delivered onsite on January 10, 2020, presents a tour through the demonstration project pursued by the Aida digital libraries research team with the Library of Congress in 2019-2020. In addition to providing an overview and analysis of the specific machine learning projects scoped and explored, this presentation includes a number of high-level take-aways and recommendations designed to influence and inform the Library of Congress's machine learning efforts going forward.
Digital Libraries, Intelligent Data Analytics, And Augmented Description: A Demonstration Project, Elizabeth Lorang, Leen-Kiat Soh, Yi Liu, Chulwoo Pack
Digital Libraries, Intelligent Data Analytics, And Augmented Description: A Demonstration Project, Elizabeth Lorang, Leen-Kiat Soh, Yi Liu, Chulwoo Pack
UNL Libraries: Faculty Publications
From July 16-to November 8, 2019, the Aida digital libraries research team at the University of Nebraska-Lincoln collaborated with the Library of Congress on “Digital Libraries, Intelligent Data Analytics, and Augmented Description: A Demonstration Project.“ This demonstration project sought to (1) develop and investigate the viability and feasibility of textual and image-based data analytics approaches to support and facilitate discovery; (2) understand technical tools and requirements for the Library of Congress to improve access and discovery of its digital collections; and (3) enable the Library of Congress to plan for future possibilities. In pursuit of these goals, we focused our …
Virtual Wrap-Up Presentation: Digital Libraries, Intelligent Data Analytics, And Augmented Description, Elizabeth Lorang, Leen-Kiat Soh, Yi Liu, Chulwoo Pack
Virtual Wrap-Up Presentation: Digital Libraries, Intelligent Data Analytics, And Augmented Description, Elizabeth Lorang, Leen-Kiat Soh, Yi Liu, Chulwoo Pack
CSE Conference and Workshop Papers
Includes framing, overview, and discussion of the explorations pursued as part of the Digital Libraries, Intelligent Data Analytics, and Augmented Description demonstration project, pursued by members of the Aida digital libraries research team at the University of Nebraska-Lincoln through a research services contract with the Library of Congress. This presentation covered: Aida research team and background for the demonstration project; broad outlines of “Digital Libraries, Intelligent Data Analytics, and Augmented Description”; what changed for us as a research team over the collaboration and why; deliverables of our work; thoughts toward “What next”; and deep-dives into the explorations. The machine learning …
Collaborating On Machine Reading: Training Algorithms To Read Complex Collections, Carrie M. Pirmann, Brian R. King, Bhagawat Acharya, Katherine M. Faull
Collaborating On Machine Reading: Training Algorithms To Read Complex Collections, Carrie M. Pirmann, Brian R. King, Bhagawat Acharya, Katherine M. Faull
Bucknell University Digital Scholarship Conference
Interdisciplinary collaboration between two faculty members in the humanities and computer science, a research librarian, and an undergraduate student has led to remarkable results in an ongoing international DH research project that has at its core 18th century manuscripts. The corpus stems from a vast collection of archival materials held by the Moravian Church in the UK, Germany, and the US. The number of pages to be transcribed, differences in handwriting styles, paper quality, and original language pose enormous problems for the feasibility of human transcription. This presentation will review the hypothesis, process, and findings of a summer research project …
Document Images And Machine Learning: A Collaboratory Between The Library Of Congress And The Image Analysis For Archival Discovery (Aida) Lab At The University Of Nebraska, Lincoln, Ne, Yi Liu, Chulwoo Pack, Leen-Kiat Soh, Elizabeth Lorang
Document Images And Machine Learning: A Collaboratory Between The Library Of Congress And The Image Analysis For Archival Discovery (Aida) Lab At The University Of Nebraska, Lincoln, Ne, Yi Liu, Chulwoo Pack, Leen-Kiat Soh, Elizabeth Lorang
CSE Conference and Workshop Papers
This presentation summarized and presented preliminary results from the first weeks of work conducted by the Aida research team in response to Library of Congress funding notice ID 030ADV19Q0274, “The Library of Congress – Pre-processing Pilot.” It includes overviews of projects on historic document segmentation, document classification, document quality assessment, figure and graph extraction from historic documents, text-line extraction from figures, subject and objective quality assesments, and digitization type differentiation.
Rethinking Algorithmic Bias Through Phenomenology And Pragmatism, Johnathan C. Flowers
Rethinking Algorithmic Bias Through Phenomenology And Pragmatism, Johnathan C. Flowers
Computer Ethics - Philosophical Enquiry (CEPE) Proceedings
In 2017, Amazon discontinued an attempt at developing a hiring algorithm which would enable the company to streamline its hiring processes due to apparent gender discrimination. Specifically, the algorithm, trained on over a decade’s worth of resumes submitted to Amazon, learned to penalize applications that contained references to women, that indicated graduation from all women’s colleges, or otherwise indicated that an applicant was not male. Amazon’s algorithm took up the history of Amazon’s applicant pool and integrated it into its present “problematic situation,” for the purposes of future action. Consequently, Amazon declared the project a failure: even after attempting to …
Application Of The Image Analysis For Archival Discovery Team’S First- Generation Methods And Software To The Burney Collection Of British Newspapers, Elizabeth Lorang, Leen-Kiat Soh, Chulwoo Pack, Yi Liu, Delaram Rahimighazikalayeh, John O'Brien
Application Of The Image Analysis For Archival Discovery Team’S First- Generation Methods And Software To The Burney Collection Of British Newspapers, Elizabeth Lorang, Leen-Kiat Soh, Chulwoo Pack, Yi Liu, Delaram Rahimighazikalayeh, John O'Brien
CDRH Grant Reports
The current study, “Application of the Image Analysis for Archival Discovery Team’s First- Generation Methods and Software to the Burney Collection of British Newspapers,” is the first test of our approaches—methods and software—to a different newspaper corpus, specifically the 17th and 18 Century Burney Newspapers Collection. This study stands as the first complete attempt at applying Aida’s software and methods to non-Chronicling America newspapers, as a step toward understanding the potential of our approaches across digitized historic newspapers. In taking this step, our goals were (1) to test how well the software and a classifier model developed on Chronicling America …
Recipe For Disaster, Zac Travis
Recipe For Disaster, Zac Travis
MFA Thesis Exhibit Catalogs
Today’s rapid advances in algorithmic processes are creating and generating predictions through common applications, including speech recognition, natural language (text) generation, search engine prediction, social media personalization, and product recommendations. These algorithmic processes rapidly sort through streams of computational calculations and personal digital footprints to predict, make decisions, translate, and attempt to mimic human cognitive function as closely as possible. This is known as machine learning.
The project Recipe for Disaster was developed by exploring automation in technology, specifically through the use of machine learning and recurrent neural networks. These algorithmic models feed on large amounts of data as a …
Interim Performance Report, Lg‐71‐16‐0152‐16, Extending Intelligent Computational Image Analysis For Archival Discovery, March 2019, Elizabeth Lorang, Leen-Kiat Soh, John O'Brien
Interim Performance Report, Lg‐71‐16‐0152‐16, Extending Intelligent Computational Image Analysis For Archival Discovery, March 2019, Elizabeth Lorang, Leen-Kiat Soh, John O'Brien
CDRH Grant Reports
The primary goal of "Extending Intelligent Computational Image Analysis for Archival Discovery" is to investigate the use of image analysis as a methodology for content identification, description, and information retrieval in digital libraries and other digitized collections. Building on work started under a National Endowment for the Humanities' Office of Digital Humanities Start-up Grant, our IMLS project seeks to 1) analyze and verify our previously developed image analysis approach and extend it so that it is newspaper agnostic, type agnostic, and language agnostic; 2) scale and revise the intelligent image analysis approach and determine the ideal balance between precision and …
Work-In-Progress Reports Submitted To The Library Of Congress As Part Of Digital Libraries, Intelligent Data Analytics, And Augmented Description, Chulwoo Pack, Yi Liu, Leen-Kiat Soh, Elizabeth Lorang
Work-In-Progress Reports Submitted To The Library Of Congress As Part Of Digital Libraries, Intelligent Data Analytics, And Augmented Description, Chulwoo Pack, Yi Liu, Leen-Kiat Soh, Elizabeth Lorang
CSE Technical Reports
This document includes work-in-progress reports submitted to the Library of Congress as part of the Aida digital libraries research team's work on Digital Libraries, Intelligent Data Analytics, and Augmented Description: A Demonstration Project. These work-in-progress reports provide a snapshot glimpse, as well as underlying rationale and decision-making, at various points in the development of the project and its machine learning explorations. Reports cover explorations on historic newspapers, minimally-processed manuscript collections, materials digitized from physical originals and those digitized from microform surrogates, and investigate challenges related to image segmentation and document zoning, classification, document image quality analysis, metadata generation, and more.
Using Chronicling America’S Images To Explore Digitized Historic Newspapers & Imagine Alternative Futures, Elizabeth Lorang, Leen-Kiat Soh
Using Chronicling America’S Images To Explore Digitized Historic Newspapers & Imagine Alternative Futures, Elizabeth Lorang, Leen-Kiat Soh
University of Nebraska-Lincoln Libraries: Conference Presentations and Speeches
This presentation situates the work of the Aida team broadly as well as hinges this work on some very specific challenges for digital libraries. In doing so demonstrate the many types of questions and domains to be explored in digitized newspapers.
Hr500k – A Reference Training Corpus Of Croatian., Nikola Ljubešić, Željko Agić, Filip Klubicka, Vuk Batanović, Tomaž Erjavec
Hr500k – A Reference Training Corpus Of Croatian., Nikola Ljubešić, Željko Agić, Filip Klubicka, Vuk Batanović, Tomaž Erjavec
Conference papers
In this paper we present hr500k, a Croatian reference training corpus of 500 thousand tokens, segmented at document, sentence and word level, and annotated for morphosyntax, lemmas, dependency syntax, named entities, and semantic roles. We present each annotation layer via basic label statistics and describe the final encoding of the resource in CoNLL and TEI formats. We also give a description of the rather turbulent history of the resource and give insights into the topic and genre distribution in the corpus. Finally, we discuss further enrichments of the corpus with additional layers, which are already underway.
Music For Ai Reports: Dual Prospects In Music Production, Achim Koh
Music For Ai Reports: Dual Prospects In Music Production, Achim Koh
Dissertations, Theses, and Capstone Projects
Recent developments in artificial intelligence (AI) technology have led to industrial attempts at applying AI to music making, namely AI music. In the context of the history of music technology, AI music raises the prospect of a new phase that extends digital technology’s role as central mode of music production. The computer has become an essential metamedium in contemporary cultural production, leading in the field of music to the digitization of tools and content and the digitalization of social institutions and relationships. This technological change had the dual effect of decentralizing music production while reinforcing capitalist logic in it. The …
Patterns, Collaboration, Practice: Algorithms As Editing For Historic Periodicals, Elizabeth Lorang
Patterns, Collaboration, Practice: Algorithms As Editing For Historic Periodicals, Elizabeth Lorang
University of Nebraska-Lincoln Libraries: Conference Presentations and Speeches
This presentation positions my recent work on the algorithmic “discovery” of poetic material in historic newspapers within the contexts of my various roles as an editor of periodical literature and also consider how duplicative processes and algorithms encode principles and values and function as editorial acts. Ultimately, I hope to pose a range of questions to prompt discussion around the place (or not) of machine learning in identifying and selecting texts and bodies of work; what ideas we’re actually exploring/are able to explore when we enlist technology in stages of this work; and the stakes of these activities, whether human …
Increasing Our Vision For 21st-Century Digital Libraries, Elizabeth M. Lorang, Leen-Kiat Soh
Increasing Our Vision For 21st-Century Digital Libraries, Elizabeth M. Lorang, Leen-Kiat Soh
University of Nebraska-Lincoln Libraries: Conference Presentations and Speeches
This presentation
- Reads digital library interfaces—or their "main door" interfaces—as glimpses into what we have thus far valued in the development of digital libraries
- Frames a visual way of thinking about textual materials
- Introduces the work of our research team—where we are now, and where we're headed
- Draws some connections between the parts
This presentation is very much a look into thinking in process and work in progress and proposes the following ideas:
- As a community, we can do much more with the digital images we're creating of textual materials than we've heretofore done.
- We aspire to have additional layers …
Examining A Hate Speech Corpus For Hate Speech Detection And Popularity Prediction, Filip Klubicka, Raquel Fernandez
Examining A Hate Speech Corpus For Hate Speech Detection And Popularity Prediction, Filip Klubicka, Raquel Fernandez
Other resources
As research on hate speech becomes more and more relevant every day, most of it is still focused on hate speech detection. By attempting to replicate a hate speech detection experiment performed on an existing Twitter corpus annotated for hate speech, we highlight some issues that arise from doing research in the field of hate speech, which is essentially still in its infancy. We take a critical look at the training corpus in order to understand its biases, while also using it to venture beyond hate speech detection and investigate whether it can be used to shed light on other …
Adapt At Semeval-2018 Task 9: Skip-Gram Word Embeddings For Unsupervised Hypernym Discovery In Specialised Corpora, Alfredo Maldonado, Filip Klubicka
Adapt At Semeval-2018 Task 9: Skip-Gram Word Embeddings For Unsupervised Hypernym Discovery In Specialised Corpora, Alfredo Maldonado, Filip Klubicka
Other resources
This paper describes a simple but competitive unsupervised system for hypernym discovery. The system uses skip-gram word embeddings with negative sampling, trained on specialised corpora. Candidate hypernyms for an input word are predicted based on cosine similar- ity scores. Two sets of word embedding mod- els were trained separately on two specialised corpora: a medical corpus and a music indus- try corpus. Our system scored highest in the medical domain among the competing unsu- pervised systems but performed poorly on the music industry domain. Our approach does not depend on any external data other than raw specialised corpora.
Mouse Vs. Machine: The Game, Cafferty Aiko Frattarelli
Mouse Vs. Machine: The Game, Cafferty Aiko Frattarelli
Senior Projects Spring 2017
Many modern video games built by big name companies are coded by a group of people together using, and possibly modifying, an already designed game engine. These games usually have another group of people creating the artwork. In this project, I coded and designed a video game from scratch, as well as created all the artwork used in the game. The player controls a mouse character who fights a variety of monsters. In order to create the complexity of the game, I implement basic neural networks as the enemy artificial intelligence, i.e. the decision making process of the enemy. It …
Bullipedia: Un Caso De Construcción Social De Conocimiento Gastronómico, Antonio Jimenez-Mavillard
Bullipedia: Un Caso De Construcción Social De Conocimiento Gastronómico, Antonio Jimenez-Mavillard
Electronic Thesis and Dissertation Repository
elBulliFoundation seeks to be a center for creativity and innovation in high cuisine. Originating from elBulli, the 3-stared by Michelin restaurant and voted best restaurant in the world five times by Restaurant magazine, the foundation’s main project, Bullipedia, endeavors to become a hub for gastronomic knowledge held within an online encyclopedia on cuisine. However, this is an idea yet to be developed. Thus, the question to answer at this point is: What should the Bullipedia be like? In this thesis, I have identified several requirements that Bullipedia should meet –mainly, sustainability, creativity, user collaboration, quality contents, and community trust– and …
Topic Modeling And Figurative Language, Lisa M. Rhody
Topic Modeling And Figurative Language, Lisa M. Rhody
Publications and Research
Located at the center of Jorie Graham’s collection The End of Beauty, “Self Portrait as Hurray and Delay” crafts a portrait of the artist, poised at a precarious moment in which thought begins to take shape. Like Penelope, Graham entertains the illusion, if only momentarily, of a choice between bringing a creative impulse into form or allowing it to come undone. A weaver of language, Graham subtly, deftly, but unsuccessfully attempts to delay the inevitable moment in poetic creation in which complexity of thought adopts form through language, and so realized is also reduced. In The End of Beauty, the …