Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 28 of 28

Full-Text Articles in Entire DC Network

A Methodology For Transforming Marc21 Personal Name Authority Metadata Into Linked Open Data With Integration Of Viaf And Lcnaf Datasets: An Experimental Study, Subhendu Kar, Rajesh Das Apr 2021

A Methodology For Transforming Marc21 Personal Name Authority Metadata Into Linked Open Data With Integration Of Viaf And Lcnaf Datasets: An Experimental Study, Subhendu Kar, Rajesh Das

Library Philosophy and Practice (e-journal)

The scope and application of present web technology in the library and information sector have been increasingly transforming in terms of storage, processing, and delivery of services. Libraries, information centers, archives, museums, etc. are being driven to add meaningful and interoperable web-based library services to address the growing information needs of the users. One of the major developments in recent times is the adoption of semantic web technologies in providing web-based library services. Semantic web technology is an advanced web interface that offers structured web-based data and allows organizations or institutions to describe, communicate, retrieve and re-distribute over the web. …


Producing Linked Open Dataset From Bibliographic Data With Integration Of External Data Sources For Academic Libraries, Biswajit Saha, Dr. Rajesh Das Nov 2020

Producing Linked Open Dataset From Bibliographic Data With Integration Of External Data Sources For Academic Libraries, Biswajit Saha, Dr. Rajesh Das

Library Philosophy and Practice (e-journal)

This paper has focused on transformation of bibliographic data to linked open data (LOD) as RDF(Resource Description Framework) triple model with integration of external resources. Library & Information centres and knowledge centres deal with various types of databases like bibliographic databases, full text databases, archival databases, statistical databases, CD/DVD ROM databases and more. Presently, web technology changes storing, processing, and disseminating services rapidly. The semantic web technology is an advance technology of web platform which provides structured data on web for describing and retrieving by the organization or institutions. It may provide more information from other external resources to the …


Developing And Integration Of An Online Thesaurus: With Special Reference To Lgbtqia Related Terms, Sayani Mukherjee Jan 2020

Developing And Integration Of An Online Thesaurus: With Special Reference To Lgbtqia Related Terms, Sayani Mukherjee

Library Philosophy and Practice (e-journal)

No abstract provided.


Lower The Barrier & Be Empowered: Creating And Including Linked Data Vocabularies For Digital Collections, Sai Deng Jan 2019

Lower The Barrier & Be Empowered: Creating And Including Linked Data Vocabularies For Digital Collections, Sai Deng

Faculty Scholarship and Creative Works

Linked data has been explored and adopted by the library and archive community in recent years, but it has remained a relatively high bar to implement for most librarians and catalogers in their daily work. To lower the barrier, the librarians at the University of Central Florida (UCF) Libraries have adopted open source tools and platforms such as OpenRefine and Wikidata to their workflows to include linked data for their collections in the digital repositories as well as the library catalog. This presentation will review digital repositories' capabilities in accommodating linked data and show several cases of adding linked data …


A Step Forward: Adding Linked Data Vocabularies To Digital Repositories, Sai Deng May 2018

A Step Forward: Adding Linked Data Vocabularies To Digital Repositories, Sai Deng

Faculty Scholarship and Creative Works

Linked data has been a recent endeavor in the library community and various libraries have experimented with linked data in their traditional library catalog and digital collections. This presentation will review the different linked data practices especially focusing on those related to digital collections in academic libraries, explore the current digital library systems’ capabilities in accommodating linked data and present how linked data vocabularies have been added to the digital repositories at the University of Central Florida (UCF) Libraries. The UCF Libraries have been adding linked data vocabularies to its institutional repository from the Library of Congress’ authority files and …


Data Extraction From Web Tables: The Devil Is In The Details, George Nagy, Sharad C. Seth, Dongpu Jin, David W. Embley, Spencer Machado, Mukkai Krishnamoorthy Jul 2017

Data Extraction From Web Tables: The Devil Is In The Details, George Nagy, Sharad C. Seth, Dongpu Jin, David W. Embley, Spencer Machado, Mukkai Krishnamoorthy

CSE Conference and Workshop Papers

We present a method based on header paths for efficient and complete extraction of labeled data from tables meant for humans. Although many table configurations yield to the proposed syntactic analysis, some require access to semantic knowledge. Clicking on one or two critical cells per table, through a simple interface, is sufficient to resolve most of these problem tables. Header paths, a purely syntactic representation of visual tables, can be transformed (“factored”) into existing representations of structured data such as category trees, relational tables, and RDF triples. From a random sample of 200 web tables from ten large statistical web …


A Distributed Graph Approach For Pre-Processing Linked Rdf Data Using Supercomputers, Michael J. Lewis, George K. Thiruvathukal, Venkatram Vishwanath, Michael J. Papka, Andrew Johnson May 2017

A Distributed Graph Approach For Pre-Processing Linked Rdf Data Using Supercomputers, Michael J. Lewis, George K. Thiruvathukal, Venkatram Vishwanath, Michael J. Papka, Andrew Johnson

Computer Science: Faculty Publications and Other Works

Efficient RDF, graph based queries are becoming more pertinent based on the increased interest in data analytics and its intersection with large, unstructured but connected data. Many commercial systems have adopted distributed RDF graph systems in order to handle increasing dataset sizes and complex queries. This paper introduces a distribute graph approach to pre-processing linked data. Instead of traversing the memory graph, our system indexes pre-processed join elements that are organized in a graph structure. We analyze the Dbpedia data-set (derived from the Wikipedia corpus) and compare our access method to the graph traversal access approach which we also devise. …


Integrating Distributed Sources Of Information For Construction Cost Estimating Using Semantic Web And Semantic Web Service Technologies, Mehrdad Niknam, Saeed Karshenas Sep 2015

Integrating Distributed Sources Of Information For Construction Cost Estimating Using Semantic Web And Semantic Web Service Technologies, Mehrdad Niknam, Saeed Karshenas

Civil and Environmental Engineering Faculty Research and Publications

A construction project requires collaboration of several organizations such as owner, designer, contractor, and material supplier organizations. These organizations need to exchange information to enhance their teamwork. Understanding the information received from other organizations requires specialized human resources. Construction cost estimating is one of the processes that requires information from several sources including a building information model (BIM) created by designers, estimating assembly and work item information maintained by contractors, and construction material cost data provided by material suppliers. Currently, it is not easy to integrate the information necessary for cost estimating over the Internet.

This paper discusses a new …


Faces: Diversity-Aware Entity Summarization Using Incremental Hierarchical Conceptual Clustering, Kalpa Gunaratna, Krishnaprasad Thirunarayan, Amit P. Sheth Jan 2015

Faces: Diversity-Aware Entity Summarization Using Incremental Hierarchical Conceptual Clustering, Kalpa Gunaratna, Krishnaprasad Thirunarayan, Amit P. Sheth

Kno.e.sis Publications

Semantic Web documents that encode facts about entities on the Web have been growing rapidly in size and evolving over time. Creating summaries on lengthy Semantic Web documents for quick identification of the corresponding entity has been of great contemporary interest. In this paper, we explore automatic summarization techniques that characterize and enable identification of an entity and create summaries that are human friendly. Specifically, we highlight the importance of diversified (faceted) summaries by combining three dimensions: diversity, uniqueness, and popularity. Our novel diversity-aware entity summarization approach mimics human conceptual clustering techniques to group facts, and picks representative facts from …


Modeling Object Relationships In Fedora Commons Using Rdf, Graham Hukill Oct 2014

Modeling Object Relationships In Fedora Commons Using Rdf, Graham Hukill

Library Scholarly Publications

Modeling digital object relationships with RDF statements is perhaps the single most important thing we do in creating our digital collections infrastructure. This presentation provides a brief overview of RDF, "triples", how they are utilized in Fedora Commons for modeling relationships between objects, and some future goals.


Rdfa And Microdata, Bethany Wetherill Jan 2014

Rdfa And Microdata, Bethany Wetherill

Library Philosophy and Practice (e-journal)

This project seeks to explore and observe differences in RDFa and microdata and their ability to retain proper schematization and syntax when converted back to RDF/XML. Online conversion tools were used to transpose existing RDF/XML files from online data dumps to RDFa and microdata, and then back to RDF/XML, offering some insights into RDFa and microdata’s capabilities, as well as a taste of what may happen in the future if major search engines decide to move away from microdata and developers need to convert to a different semantic markup language. Multiple online converters were employed in the conversion process in …


Linked Data And The Library Of Congress, Corinne M. Laurence Dec 2013

Linked Data And The Library Of Congress, Corinne M. Laurence

Library Philosophy and Practice (e-journal)

The Resource Description Framework (RDF) is a machine-processable metadata standard created to link pieces of data from around the World Wide Web. It does this by creating meaningful statements about resources, which are identified by Uniform Resource Identifiers (URIs). The Linked Data (LD) that emerges will be part of the Semantic Web, a new way of linking, searching, and finding information on the Web. Libraries around the world have begun to adopt RDF for their metadata in an attempt to make their metadata more discoverable on the World Wide Web, where the majority of their users are. The Library of …


Comparison Of Clustered Rdf Data Stores, Venkata Patchigolla Jul 2011

Comparison Of Clustered Rdf Data Stores, Venkata Patchigolla

Purdue Polytechnic Masters Theses

Storing data in RDF format helps in simpler data interchange among different researchers compared to present approaches. There has been tremendous increase in the applications that use RDF data. The nature of RDF data is such that it tends to increase explosively. This makes it necessary to consider the time for retrieval and scalability of data while selecting a suitable RDF data store for developing applications. The research concentrates on comparing BigOWLIM. Bigdata, 4store and Virtuoso RDF stores on basis of their scalability and performance of storing and retrieving cancer proteomics and mass spectrometry data using SPARQL queries. In this …


Linked Open Social Signals, Pablo N. Mendes, Alexandre Passant, Pavan Kapanipathi, Amit P. Sheth Jan 2010

Linked Open Social Signals, Pablo N. Mendes, Alexandre Passant, Pavan Kapanipathi, Amit P. Sheth

Kno.e.sis Publications

In this paper we discuss the collection, semantic annotation and analysis of real-time social signals from micro-blogging data. We focus on users interested in analyzing social signals collectively for sensemaking. Our proposal enables flexibility in selecting subsets for analysis, alleviating information overload. We define an architecture that is based on state-of-the-art Semantic Web technologies and a distributed publish subscribe protocol for real time communication. In addition, we discuss our method and application in a scenario related to the health care reform in the United States.


Adding Escience Assets To The Data Web, Herbert H. Van De Sompel, Carl Lagoze, Michael L. Nelson, Simeon Warner, Robert Sanderson, Pete Johnston Apr 2009

Adding Escience Assets To The Data Web, Herbert H. Van De Sompel, Carl Lagoze, Michael L. Nelson, Simeon Warner, Robert Sanderson, Pete Johnston

Computer Science Faculty Publications

Aggregations of Web resources are increasingly important in scholarship as it adopts new methods that are data-centric, collaborative, and networked-based. The same notion of aggregations of resources is common to the mashed-up, socially networked information environment of Web 2.0. We present a mechanism to identify and describe aggregations of Web resources that has resulted from the Open Archives Initiative - Object Reuse and Exchange (OAI-ORE) project. The OAI-ORE specifications are based on the principles of the Architecture of the World Wide Web, the Semantic Web, and the Linked Data effort. Therefore, their incorporation into the cyberinfrastructure that supports eScholarship will …


Tcruzikb: Enabling Complex Queries For Genomic Data Exploration, Pablo N. Mendes, Bobby Mcknight, Amit P. Sheth, Jessica C. Kissinger Aug 2008

Tcruzikb: Enabling Complex Queries For Genomic Data Exploration, Pablo N. Mendes, Bobby Mcknight, Amit P. Sheth, Jessica C. Kissinger

Kno.e.sis Publications

We developed a novel analytical environment to aid in the examination of the extensive amount of interconnected data available for genome projects. Our focus is to enable flexibility and abstraction from implementation details, while retaining the expressivity required for post-genomic research. To achieve this goal, we associated genomics data to ontologies and implemented a query formulation and execution environment with added visualization capabilities. We use ontology schemas to guide the user through the process of building complex queries in a flexible Web interface. Queries are serialized in SPARQL and sent to servers via Ajax. A component for visualization of the …


A Framework To Support Spatial, Temporal And Thematic Analytics Over Semantic Web Data, Matthew Perry, Amit P. Sheth May 2008

A Framework To Support Spatial, Temporal And Thematic Analytics Over Semantic Web Data, Matthew Perry, Amit P. Sheth

Kno.e.sis Publications

Spatial and temporal data are critical components in many applications. This is especially true in analytical applications ranging from scientific discovery to national security and criminal investigation. The analytical process often requires uncovering and analyzing complex thematic relationships between disparate people, places and events. Fundamentally new query operators based on the graph structure of Semantic Web data models, such as semantic associations, are proving useful for this purpose. However, these analysis mechanisms are primarily intended for thematic relationships. In this paper, we describe a framework built around the RDF data model for analysis of thematic, spatial and temporal relationships between …


Rdb2rdf: Incorporating Domain Semantics In Structured Data, Satya S. Sahoo Apr 2008

Rdb2rdf: Incorporating Domain Semantics In Structured Data, Satya S. Sahoo

Kno.e.sis Publications

No abstract provided.


Traveling The Semantic Web Through Space, Theme And Time, Amit P. Sheth, Matthew Perry Jan 2008

Traveling The Semantic Web Through Space, Theme And Time, Amit P. Sheth, Matthew Perry

Kno.e.sis Publications

In this installment of Semantics and Services, we further develop the idea of spatial, temporal, and thematic (STT) processing of semantic Web data and describe the Web infrastructure needed to support it. Starting from Ramesh Jain's vision of the EventWeb as a view of what's possible with a Web that better accommodates all three dimensions of event-related information (thematic, spatial, and temporal), we outline the architecture needed to support it and current research that aims to realize it.


From “Glycosyltransferase” To “Congenital Muscular Dystrophy”: Integrating Knowledge From Ncbi Entrez Gene And The Gene Ontology, Satya S. Sahoo, Kelly Zeng, Olivier Bodenreider, Amit P. Sheth Jan 2007

From “Glycosyltransferase” To “Congenital Muscular Dystrophy”: Integrating Knowledge From Ncbi Entrez Gene And The Gene Ontology, Satya S. Sahoo, Kelly Zeng, Olivier Bodenreider, Amit P. Sheth

Kno.e.sis Publications

Entrez Gene (EG), Online Mendelian Inheritance in Man (OMIM) and the Gene Ontology (GO) are three complementary knowledge resources that can be used to correlate genomic data with disease information. However, bridging between genotype and phenotype through these resources currently requires manual effort or the development of customized software. In this paper, we argue that integrating EG and GO provides a robust and flexible solution to this problem. We demonstrate how the Resource Description Framework (RDF) developed for the Semantic Web can be used to represent and integrate these resources and enable seamless access to them as a unified resource. …


What, Where And When: Supporting Semantic, Spatial And Temporal Queries In A Dbms, Matthew Perry, Amit P. Sheth, Farshad Hakimpour, Prateek Jain Jan 2007

What, Where And When: Supporting Semantic, Spatial And Temporal Queries In A Dbms, Matthew Perry, Amit P. Sheth, Farshad Hakimpour, Prateek Jain

Kno.e.sis Publications

Spatial and temporal data are critical components in many applications. This is especially true in analytical domains such as national security and criminal investigation. The outcome of the analytical process in these applications often hinges on uncovering and analyzing complex relationships between disparate people, places and events. Fundamentally new query operators based on the graph structure of Semantic Web data models, such as semantic associations, are proving useful in these applications. However, these analysis mechanisms are primarily intended for thematic relationships. We describe a framework built around the RDF metadata model for analysis of thematic, spatial and temporal relationships between …


Semantic Analytics Visualization, Leonidas Deligiannidis, Amit P. Sheth, Boanerges Aleman-Meza May 2006

Semantic Analytics Visualization, Leonidas Deligiannidis, Amit P. Sheth, Boanerges Aleman-Meza

Kno.e.sis Publications

In this paper we present a new tool for semantic analytics through 3D visualization called “Semantic Analytics Visualization” (SAV). It has the capability for visualizing ontologies and meta-data including annotated web-documents, images, and digital media such as audio and video clips in a synthetic three-dimensional semi-immersive environment. More importantly, SAV supports visual semantic analytics, whereby an analyst can interactively investigate complex relationships between heterogeneous information. The tool is built using Virtual Reality technology which makes SAV a highly interactive system. The backend of SAV consists of a Semantic Analytics system that supports query processing and semantic association discovery. Using a …


Data Processing In Space, Time, And Semantics Dimensions, Farshad Hakimpour, Boanerges Aleman-Meza, Matthew Perry, Amit P. Sheth Jan 2006

Data Processing In Space, Time, And Semantics Dimensions, Farshad Hakimpour, Boanerges Aleman-Meza, Matthew Perry, Amit P. Sheth

Kno.e.sis Publications

This work presents an experimental system for data processing in space, time and semantics dimensions using current Semantic Web technologies. The paper describes how we obtain geographic and event data from Internet sources and also how we integrate them into an RDF store. We briefly introduce a set of functionalities in space, time and semantics dimensions. These functionalities are implemented based on our existing technology for main-memory based RDF data processing developed in the LSDIS Lab. A number of these functionalities are exposed as REST Web services. We present two sample client side applications that are developed using a combination …


Ontoqa: Metric-Based Ontology Quality Analysis, Samir Tartir, I. Budak Arpinar, Michael Moore, Amit P. Sheth, Boanerges Aleman-Meza Nov 2005

Ontoqa: Metric-Based Ontology Quality Analysis, Samir Tartir, I. Budak Arpinar, Michael Moore, Amit P. Sheth, Boanerges Aleman-Meza

Kno.e.sis Publications

As the Semantic Web gains importance for sharing knowledge on the Internet this has lead to the development and publishing of many ontologies in different domains. When trying to reuse existing ontologies into their applications, users are faced with the problem of determining if an ontology is suitable for their needs. In this paper, we introduce OntoQA, an approach that analyzes ontology schemas and their populations (i.e. knowledgebases) and describes them through a well defined set of metrics. These metrics can highlight key characteristics of an ontology schema as well as its population and enable users to make an informed …


Tontogen: A Synthetic Data Set Generator For Semantic Web Applications, Matthew Perry Jan 2005

Tontogen: A Synthetic Data Set Generator For Semantic Web Applications, Matthew Perry

Kno.e.sis Publications

No abstract provided.


Discovering Informative Subgraphs In Rdf Graphs, William H. Milnor, Cartic Ramakrishnan, Matthew Perry, Amit P. Sheth, John A. Miller, Krzysztof Kochut Jan 2005

Discovering Informative Subgraphs In Rdf Graphs, William H. Milnor, Cartic Ramakrishnan, Matthew Perry, Amit P. Sheth, John A. Miller, Krzysztof Kochut

Kno.e.sis Publications

Discovering patterns in graphs has long been an area of interest. In most contemporary approaches to such pattern discovery either quantitative anomalies or frequency of substructure is used to measure the interestingness of a pattern. In this paper we address the issue of discovering informative sub-graphs within RDF graphs. We motivate our work with an example related to Semantic Search. A user might pose a question of the form: ' What are the most relevant ways in which entity X is related to entity Y?' the response to which is a subgraph connecting X to Y. Relevance of the …


Ρ-Queries: Enabling Querying For Semantic Associations On The Semantic Web, Kemafor Anyanwu, Amit P. Sheth May 2003

Ρ-Queries: Enabling Querying For Semantic Associations On The Semantic Web, Kemafor Anyanwu, Amit P. Sheth

Kno.e.sis Publications

This paper presents the notion of Semantic Associations as complex relationships between resource entities. These relationships capture both a connectivity of entities as well as similarity of entities based on a specific notion of similarity called ρ-isomorphism. It formalizes these notions for the RDF data model, by introducing a notion of a Property Sequence as a type. In the context of a graph model such as that for RDF, Semantic Associations amount to specific certain graph signatures. Specifically, they refer to sequences (i.e. directed paths) here called Property Sequences, between entities, networks of Property Sequences (i.e. undirected paths), or subgraphs …


Logical Information Modeling Of Web-Accessible Heterogeneous Digital Assets, Kshitij Shah, Amit P. Sheth Apr 1998

Logical Information Modeling Of Web-Accessible Heterogeneous Digital Assets, Kshitij Shah, Amit P. Sheth

Kno.e.sis Publications

This paper introduces the MREF framework for representing and correlating information at a higher semantic level than is possible with Web-based information systems today. The role that metadata plays in this framework is described, together with a metadata based infrastructure to support our media independent information correlation paradigm. To keep it consistent with evolving standards, broader acceptance and ease of implementation, MREF abstraction is structured on top of RDF and XML. Its central role in the context of the InfoQuilt system, for exploiting heterogeneous digital media using a federated and scalable architecture, is briefly described.