Open Access. Powered by Scholars. Published by Universities.®

Databases and Information Systems Commons

Open Access. Powered by Scholars. Published by Universities.®

3060 Full-Text Articles 3435 Authors 686988 Downloads 124 Institutions

All Articles in Databases and Information Systems

Faceted Search

3060 full-text articles. Page 1 of 105.

Accuracy And Coverage Of Using The Assigned International Classification Of Diseases, 9th And 10th Revision, Clinical Modification Codes For Detecting Bleeding Events In Electronic Health Record, Victoria J. Wang, David D. McManus, Hong Yu 2017 University of Massachusetts Medical School

Accuracy And Coverage Of Using The Assigned International Classification Of Diseases, 9th And 10th Revision, Clinical Modification Codes For Detecting Bleeding Events In Electronic Health Record, Victoria J. Wang, David D. Mcmanus, Hong Yu

David D. McManus

Background: Hemorrhages are common events that confer significant risk for in-hospital and post-discharge morbidity and mortality among cardiovascular disease (CVD) patients treated with anticoagulation. International Classification of Diseases, 9th and 10th Revision, Clinical Modification (ICD-9-CM, ICD-10-CM) codes have been widely used in CVD research and managements. Objective: To determine the accuracy and coverage of assigned ICD-CM codes for reporting bleeding events. Methods: From the University of Massachusetts Medical School electronic health record (EHR) database we identified 21k patients on anticoagulation with high bleeding risks based on their ICD-9-CM or ICD-10-CM codes. Through manual chart review, we selected one unstructured note ...


Estimating Accuracy Of Personal Identifiable Information In Integrated Data Systems, Amani "Mohammad Jum'h" Amin Shatnawi 2017 Utah State University

Estimating Accuracy Of Personal Identifiable Information In Integrated Data Systems, Amani "Mohammad Jum'h" Amin Shatnawi

All Graduate Theses and Dissertations

Without a valid assessment of accuracy there is a risk of data users coming to incorrect conclusions or making bad decision based on inaccurate data. This dissertation proposes a theoretical method for developing data-accuracy metrics specific for any given person-centric integrated system and how a data analyst can use these metrics to estimate the overall accuracy of person-centric data.

Estimating the accuracy of Personal Identifiable Information (PII) creates a corresponding need to model and formalize PII for both the real-world and electronic data, in a way that supports rigorous reasoning relative to real-world facts, expert opinions, and aggregate knowledge. This ...


Data Insertion In Bitcoin's Blockchain, Andrew Sward, Vecna OP_0, Forrest Stonedahl 2017 Augustana College, Rock Island

Data Insertion In Bitcoin's Blockchain, Andrew Sward, Vecna Op_0, Forrest Stonedahl

Computer Science: Faculty Scholarship & Creative Works

This paper provides the first comprehensive survey of methods for inserting arbitrary data into Bitcoin's blockchain. Historical methods of data insertion are described, along with lesser-known techniques that are optimized for efficiency. Insertion methods are compared on the basis of efficiency, cost, convenience of data reconstruction, permanence, and potentially negative impact on the Bitcoin ecosystem.


Marketing The Mountain State: A Large N Study Of User Engagement On Twitter, Kirk Richardson 2017 Illinois State University

Marketing The Mountain State: A Large N Study Of User Engagement On Twitter, Kirk Richardson

Capstone Projects – Politics and Government

Much of the evolving research on the use of social media in destination marketing emphasizes how information diffusion influences the reputational image of place. The present study uses Twitter data to focus on the relative differences in user engagement across discrete account types. Specifically, this is done to examine how the official destination marketing organization of Montana—the Montana Office of Tourism (MTOT)—performs relative to other account types. Several regression analyses conducted on Twitter data associated with an ongoing MTOT place branding campaign reveal that tweets sent from ‘official’ accounts are more likely to be retweeted, and are estimated ...


Question Type Recognition Using Natural Language Input, Aishwarya Soni 2017 San Jose State University

Question Type Recognition Using Natural Language Input, Aishwarya Soni

Master's Projects

Recently, numerous specialists are concentrating on the utilization of Natural Language Processing (NLP) systems in various domains, for example, data extraction and content mining. One of the difficulties with these innovations is building up a precise Question and Answering (QA) System. Question type recognition is the most significant task in a QA system, for example, chat bots. Organization such as National Institute of Standards (NIST) hosts a conference series called as Text REtrieval Conference (TREC) series which keeps a competition every year to encourage and improve the technique of information retrieval from a large corpus of text. When a user ...


The Use And Effectiveness Of Online Social Media In Volunteer Organizations, Amy J. Connolly 2017 University of South Florida

The Use And Effectiveness Of Online Social Media In Volunteer Organizations, Amy J. Connolly

Amy J Connolly

Volunteer organizations face two challenges not found in non-volunteer organizations: recruiting and retaining volunteers. While social media use is increasing amongst individuals, its use and effectiveness for volunteer recruitment and retention by volunteer organizations is unknown. The dissertation reports the results of three studies to investigate this important question. Using a mixed-methods approach, it addressed the dual nature of social media and its effectiveness by including volunteer organizations and social media users. This dissertation found that although volunteer organizations are not using social media effectively, they could virtualize requirements of the recruitment process by focusing on relatable events instead of ...


The Use And Effectiveness Of Online Social Media In Volunteer Organizations, Amy J. Connolly 2017 University of South Florida

The Use And Effectiveness Of Online Social Media In Volunteer Organizations, Amy J. Connolly

Amy J Connolly

Volunteer organizations face two challenges not found in non-volunteer organizations: recruiting and retaining volunteers. While social media use is increasing amongst individuals, its use and effectiveness for volunteer recruitment and retention by volunteer organizations is unknown. The dissertation reports the results of three studies to investigate this important question. Using a mixed-methods approach, it addressed the dual nature of social media and its effectiveness by including volunteer organizations and social media users. This dissertation found that although volunteer organizations are not using social media effectively, they could virtualize requirements of the recruitment process by focusing on relatable events instead of ...


An Object Oriented Approach To Modeling And Simulation Of Routing In Large Communication Networks, Armin Mikler, Johnny S. Wong, Vasant Honavar 2017 Iowa State University

An Object Oriented Approach To Modeling And Simulation Of Routing In Large Communication Networks, Armin Mikler, Johnny S. Wong, Vasant Honavar

Johnny Wong

The complexity (number of entities, interactions between entities, and resulting emergent dynamic behavior) of large communication environments which contain hundreds of nodes and links make simulation an important tool for the study of such systems. Given the difficulties associated with complete analytical treatment of complex dynamical systems, it is often the only practical tool that is available. This paper presents an example of a flexible, modular, object-oriented toolbox designed to support modeling and experimental analysis of a large family of heuristic knowledge representation and decision functions for adaptive self-managing communication networks with particular emphasis on routing strategies. It discusses in ...


Design And Implementation Of A Media Uploading System, Mu Zhang, Johnny S. Wong, Wallapak Tavanapong 2017 Iowa State University

Design And Implementation Of A Media Uploading System, Mu Zhang, Johnny S. Wong, Wallapak Tavanapong

Johnny Wong

This paper presents the design and performance analysis of an uploading system that automatically uploads multimedia files to a centralized server given client hard deadlines. If not uploaded by the deadlines, existing files may be lost or new files cannot be recorded. The uploading systems with hard deadlines have several important applications in practice. For instance, such systems can be used in hospitals to gather videos generated from medical devices from various operating rooms for post-procedure analysis and in law enforcement to collect video recordings from police cars during routine patrolling. In this paper, we study the uploading system with ...


Eds Usability Testing Final Report, 2017 Selected Works

Eds Usability Testing Final Report

Sally Krash

Usability studies in academic libraries are essential tools to assess functionality and accessibility of library services.  The University of Massachusetts Amherst Libraries recently conducted usability studies on EBSCO’s Discovery search platform, which is to be the default search platform on the UMass Amherst Libraries’ website beginning on July 2017.   During the spring of 2017, Information Resources Management utilized surveys, focus groups, and hands-on testing of students and faculty to assess how library patrons interacted with the new discovery service (EDS) and other related library services.  The following report documents this usability study, findings discovered, and recommendations hitherto.  The following ...


Ipm Information Technology, John K. VanDyk 2017 Iowa State University

Ipm Information Technology, John K. Vandyk

John K. VanDyk

The use of information technology to obtain and manage IPM information will continue to grow. By applying the basic principles of information taxonomies such as tagging information with terms from vocabularies, filtering and aggregation, knowledge workers will have the necessary tools to become increasingly informed about the realm ofiPM.


Selecting Link Resolver And Knowledge Base Software: Implications Of Interoperability, Cyndy Chisare, Jody C. Fagan, David J. Gaines, Michael Trocchia 2017 James Madison University

Selecting Link Resolver And Knowledge Base Software: Implications Of Interoperability, Cyndy Chisare, Jody C. Fagan, David J. Gaines, Michael Trocchia

Libraries

Link resolver software and their associated knowledge bases are essential technologies for modern academic libraries. However, because of the increasing number of possible integrations involving link resolver software and knowledge bases, a library’s vendor relationships, product choices, and consortial arrangements may have the most dramatic effects on the user experience and back-end maintenance workloads. A project team at a large comprehensive university recently investigated link resolver products in an attempt to increase efficiency of back-end workflows while maintaining or improving the patron experience. The methodology used for product comparison may be useful for other libraries.


Analyzing The Keystroke Dynamics Of Web Identifiers, Andrew G. West 2017 University of Pennsylvania

Analyzing The Keystroke Dynamics Of Web Identifiers, Andrew G. West

Dr. Andrew G. West

Web identifiers such as usernames, hashtags, and domain names serve important roles in online navigation, communication, and community building. Therefore the entities that choose such names must ensure that end-users are able to quickly and accurately enter them in applications. Uniqueness requirements, a desire for short strings, and an absence of delimiters often constrain this name selection process.

To gain perspective on the speed and correctness of name entry, we crowdsource the typing of 51,000+ web identifiers. Surface level analysis reveals, for example, that typing speed is generally a linear function of identifier length. Examining keystroke dynamics at finer ...


An Open Source Discussion Group Recommendation System, Sarika Padmashali 2017 San Jose State University

An Open Source Discussion Group Recommendation System, Sarika Padmashali

Master's Projects

A recommendation system analyzes user behavior on a website to make suggestions about what a user should do in the future on the website. It basically tries to predict the “rating” or “preference” a user would have for an action. Yioop is an open source search engine, wiki system, and user discussion group system managed by Dr. Christopher Pollett at SJSU. In this project, we have developed a recommendation system for Yioop where users are given suggestions about the threads and groups they could join based on their user history. We have used collaborative filtering techniques to make recommendations and ...


Adding Differential Privacy In An Open Board Discussion Board System, Pragya Rana 2017 San Jose State University

Adding Differential Privacy In An Open Board Discussion Board System, Pragya Rana

Master's Projects

This project implements a privacy system for statistics generated by the Yioop search and discussion board system. Statistical data for such a system consists of various counts, sums, and averages that might be displayed for groups, threads, etc. When statistical data is made publicly available, there is no guarantee of preserving the privacy of an individual. Ideally, any data extracted should not reveal any sensitive information about an individual. In order to help achieve this, we implemented a Differential Privacy mechanism for Yioop. Differential privacy preserves privacy up to some controllable parameters of the number of items or individuals being ...


Document Classification Using Machine Learning, Ankit Basarkar 2017 San Jose State University

Document Classification Using Machine Learning, Ankit Basarkar

Master's Projects

To perform document classification algorithmically, documents need to be represented such that it is understandable to the machine learning classifier. The report discusses the different types of feature vectors through which document can be represented and later classified. The project aims at comparing the Binary, Count and TfIdf feature vectors and their impact on document classification. To test how well each of the three mentioned feature vectors perform, we used the 20-newsgroup dataset and converted the documents to all the three feature vectors. For each feature vector representation, we trained the Naïve Bayes classifier and then tested the generated ...


Headline Generation Using Deep Neural Networks, Dhruven Vora 2017 San Jose State University

Headline Generation Using Deep Neural Networks, Dhruven Vora

Master's Projects

News headline generation is one of the important text summarization tasks. Human generated news headlines are generally intended to catch the eye rather than provide useful information. There have been many approaches to generate meaningful headlines by either using neural networks or using linguistic features. In this report, we are proposing a novel approach based on integrating Hedge Trimmer, which is a grammar based extractive summarization system with a deep neural network abstractive summarization system to generate meaningful headlines. We analyze the results against current recurrent neural network based headline generation system.


Reducing Query Latency For Information Retrieval, Swapnil Satish Kamble 2017 San Jose State University

Reducing Query Latency For Information Retrieval, Swapnil Satish Kamble

Master's Projects

As the world is moving towards Big Data, NoSQL (Not only SQL) databases are gaining much more popularity. Among the other advantages of NoSQL databases, one of their key advantage is that they facilitate faster retrieval for huge volumes of data, as compared to traditional relational databases. This project deals with one such popular NoSQL database, Apache HBase. It performs quite efficiently in cases of retrieving information using the rowkey (similar to a primary key in a SQL database). But, in cases where one needs to get information based on non-rowkey columns, the response latency is higher than what we ...


A Chatbot Framework For Yioop, Harika Nukala 2017 San Jose State University

A Chatbot Framework For Yioop, Harika Nukala

Master's Projects

Over the past few years, messaging applications have become more popular than Social networking sites. Instead of using a specific application or website to access some service, chatbots are created on messaging platforms to allow users to interact with companies’ products and also give assistance as needed. In this project, we designed and implemented a chatbot Framework for Yioop. The goal of the Chatbot Framework for Yioop project is to provide a platform for developers in Yioop to build and deploy chatbot applications. A chatbot is a web service that can converse with users using artificial intelligence in messaging platforms ...


Named Entity Recognition And Classification For Natural Language Inputs At Scale, Shreeraj Dabholkar 2017 San Jose State University

Named Entity Recognition And Classification For Natural Language Inputs At Scale, Shreeraj Dabholkar

Master's Projects

Natural language processing (NLP) is a technique by which computers can analyze, understand, and derive meaning from human language. Phrases in a body of natural text that represent names, such as those of persons, organizations or locations are referred to as named entities. Identifying and categorizing these named entities is still a challenging task, research on which, has been carried out for many years. In this project, we build a supervised learning based classifier which can perform named entity recognition and classification (NERC) on input text and implement it as part of a chatbot application. The implementation is then scaled ...


Digital Commons powered by bepress