Open Access. Powered by Scholars. Published by Universities.®

Databases and Information Systems Commons

Open Access. Powered by Scholars. Published by Universities.®

2021

Discipline
Institution
Keyword
Publication
Publication Type

Articles 1 - 30 of 439

Full-Text Articles in Databases and Information Systems

On Performance Optimization And Prediction Of Parallel Computing Frameworks In Big Data Systems, Haifa Alquwaiee Dec 2021

On Performance Optimization And Prediction Of Parallel Computing Frameworks In Big Data Systems, Haifa Alquwaiee

Dissertations

A wide spectrum of big data applications in science, engineering, and industry generate large datasets, which must be managed and processed in a timely and reliable manner for knowledge discovery. These tasks are now commonly executed in big data computing systems exemplified by Hadoop based on parallel processing and distributed storage and management. For example, many companies and research institutions have developed and deployed big data systems on top of NoSQL databases such as HBase and MongoDB, and parallel computing frameworks such as MapReduce and Spark, to ensure timely data analyses and efficient result delivery for decision making and business …


Design And Development Of Alumni Career Information System Using Php Mysql, Mustofa Abi Hamid, Didik Aribowo, Rini Anggraini Dec 2021

Design And Development Of Alumni Career Information System Using Php Mysql, Mustofa Abi Hamid, Didik Aribowo, Rini Anggraini

Elinvo (Electronics, Informatics, and Vocational Education)

Alumni data collection at the Electrical Engineering Vocational Education Universitas Sultan Ageng Tirtayasa was still performed manually and there were no career information media about soft skills training and development, tracer studies, and job vacancies information. Therefore, media is needed to accommodate career information and alumni data collection quickly and effectively. The web-based information system using PHP MySQL was developed and tested for feasibility as an information medium for soft skills training and development, tracer studies, job vacancies information, as well as career counseling and consulting. This study used a Modify R&D as a development method and the waterfall method …


Evaluating Technology-Mediated Collaborative Workflows For Telehealth, Christopher Bondy Ph.D., Pengcheng Shi, Pamela Grover Md, Vicki Hanson, Linlin Chen, Rui Li Dec 2021

Evaluating Technology-Mediated Collaborative Workflows For Telehealth, Christopher Bondy Ph.D., Pengcheng Shi, Pamela Grover Md, Vicki Hanson, Linlin Chen, Rui Li

Articles

Goals: This paper discusses the need for a predictable method to evaluate gains and gaps of collaborative technology-mediated workflows and introduces an evaluation framework to address this need. Methods: The Collaborative Space Analysis Framework (CS-AF), introduced in this research, is a cross-disciplinary evaluation method designed to evaluate technology-mediated collaborative workflows. The 5-step CS-AF approach includes: (1) current-state workflow definition, (2) current-state (baseline) workflow assessment, (3) technology-mediated workflow development and deployment, (4) technology-mediated workflow assessment, (5) analysis, and conclusions. For this research, a comprehensive, empirical study of hypertension exam workflow for telehealth was conducted using the CS-AF approach. Results: The CS-AF …


An Open Source Direct Messaging And Enhanced Recommendation System For Yioop, Aniruddha Dinesh Mallya Dec 2021

An Open Source Direct Messaging And Enhanced Recommendation System For Yioop, Aniruddha Dinesh Mallya

Master's Projects

Recommendation systems and direct messaging systems are two popular components of web portals. A recommendation system is an information filtering system that seeks to predict the "rating" or "preference" a user would give to an item and a direct messaging system allows private communication between users of any platform. Yioop, is an open source, PHP search engine and web portal that can be configured to allow users to create discussion groups, blogs, wikis etc.

In this project, we expanded on Yioop’s group system so that every user now has a personal group. Personal groups were then used to add user …


High Performance Document Store Implementation In Rust, Ishaan Aggarwal Dec 2021

High Performance Document Store Implementation In Rust, Ishaan Aggarwal

Master's Projects

Databases are a core part of any application which requires persistence of data. The performance of applications involving the use of database systems is directly proportional to how fast their database read-write operations are. The aim of this project was to build a high- performance document store which can support variety of applications which require data storage and retrieval of some kind. This document store can be used as an independently running backend service which can be utilized by search engines, applications which deal with keeping records, etc. We used Rust to make this document store which is fast, robust, …


Deep Convolutional Neural Networks For Accurate Diagnosis Of Covid-19 Patients Using Chest X-Ray Image Databases From Italy, Canada, And The Usa, Amgad A. Salama, Samy H. Darwish, Samir M. Abdel-Mageed, Radwa A. Meshref, Ehab I. Mohamed Dec 2021

Deep Convolutional Neural Networks For Accurate Diagnosis Of Covid-19 Patients Using Chest X-Ray Image Databases From Italy, Canada, And The Usa, Amgad A. Salama, Samy H. Darwish, Samir M. Abdel-Mageed, Radwa A. Meshref, Ehab I. Mohamed

The University of Louisville Journal of Respiratory Infections

Introduction: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), famously known as COVID-19, has quickly become a global pandemic. Chest X-ray (CXR) imaging has proven reliable, fast, and cost-effective for identifying COVID-19 infections, which proceeds to display atypical unilateral patchy infiltration in the lungs like typical pneumonia. We employed the deep convolutional neural network (DCNN) ResNet-34 to detect and classify CXR images from patients with COVID-19 and Viral Pneumonia and Normal Controls.

Methods: We created a single database containing 781 source CXR images from four different international sub-databases: the Società Italiana di Radiologia Medica e Interventistica (SIRM), the GitHub Database, the …


Comparison Of Major Cloud Providers, Justin Berman Dec 2021

Comparison Of Major Cloud Providers, Justin Berman

Other Student Works

This paper will compare the following major cloud providers: Microsoft Azure, Amazon AWS, Google Cloud, and IBM Cloud. An introduction to the companies and their history, fundamentals and services, strengths and weaknesses, costs, and their security will be discussed throughout this writing.


Node.Js Based Document Store For Web Crawling, David Bui Dec 2021

Node.Js Based Document Store For Web Crawling, David Bui

Master's Projects

WARC files are central to internet preservation projects. They contain the raw resources of web crawled data and can be used to create windows into the past of web pages at the time they were accessed. Yet there are few tools that manipulate WARC files outside of basic parsing. The creation of our tool WARC-KIT gives users in the Node.js JavaScript environment, a tool kit to interact with and manipulate WARC files.

Included with WARC-KIT is a WARC parsing tool known as WARCFilter that can be used standalone tool to parse, filter, and create new WARC files. WARCFilter can also, …


Using Parallel Primary Caches To Improve Capacity And Bandwidth, John Rubena Wani Dec 2021

Using Parallel Primary Caches To Improve Capacity And Bandwidth, John Rubena Wani

Archived Theses and Dissertations

No abstract provided.


Moment-Preserving Piecewise Approximation For 1-D And 2-D Signals, Soha M. A. A. Seif Dec 2021

Moment-Preserving Piecewise Approximation For 1-D And 2-D Signals, Soha M. A. A. Seif

Archived Theses and Dissertations

No abstract provided.


Shape Similarity By Deformation Using Polynomial Transformation, Hanan M. Moussa Dec 2021

Shape Similarity By Deformation Using Polynomial Transformation, Hanan M. Moussa

Archived Theses and Dissertations

No abstract provided.


Messiness: Automating Iot Data Streaming Spatial Analysis, Christopher White, Atilio Barreda Ii Dec 2021

Messiness: Automating Iot Data Streaming Spatial Analysis, Christopher White, Atilio Barreda Ii

Publications and Research

The spaces we live in go through many transformations over the course of a year, a month, or a day; My room has seen tremendous clutter and pristine order within the span of a few hours. My goal is to discover patterns within my space and formulate an understanding of the changes that occur. This insight will provide actionable direction for maintaining a cleaner environment, as well as provide some information about the optimal times for productivity and energy preservation.

Using a Raspberry Pi, I will set up automated image capture in a room in my home. These images will …


Ready, Willing, And Able, Gerry Boyle Dec 2021

Ready, Willing, And Able, Gerry Boyle

Colby Magazine

So what gives? How, after four years on Mayflower Hill, do these Colby alumni have an outsized impact in a fintech company that is focused on, for example, changing the way municipal bonds are traded? What makes them able to dive in and figure it out? “That’s part of the liberal arts education,” said Associate Professor of History John Turner, who taught Tagg Martin ’13, history major turned MarketAxess go-to analyst. “You’re always learning. … You are always going to be mastering something, as opposed to having mastered.”


Integration Of Blockchain Technology Into Automobiles To Prevent And Study The Causes Of Accidents, John Kim Dec 2021

Integration Of Blockchain Technology Into Automobiles To Prevent And Study The Causes Of Accidents, John Kim

Electronic Theses, Projects, and Dissertations

Automobile collisions occur daily. We now live in an information-driven world, one where technology is quickly evolving. Blockchain technology can change the automotive industry, the safety of the motoring public and its surrounding environment by incorporating this vast array of information. It can place safety and efficiency at the forefront to pedestrians, public establishments, and provide public agencies with pertinent information securely and efficiently. Other industries where Blockchain technology has been effective in are as follows: supply chain management, logistics, and banking. This paper reviews some statistical information regarding automobile collisions, Blockchain technology, Smart Contracts, Smart Cities; assesses the feasibility …


Data Of The Constructivist Practices In The Learning Environment Survey From Engineering Undergraduates: An Exploratory Factor Analysis, Chengcheng Li, Shaoan Zhang, Tiberio Garza, Yingtao Jiang Dec 2021

Data Of The Constructivist Practices In The Learning Environment Survey From Engineering Undergraduates: An Exploratory Factor Analysis, Chengcheng Li, Shaoan Zhang, Tiberio Garza, Yingtao Jiang

Teaching and Learning Faculty Research

This paper presents the dataset of a questionnaire on first-year engineering undergraduates’ perceptions of constructivist practices in the learning environment. The questionnaire with a 5-Likert scale was adapted from previous research. The sample consisted of 293 first-year engineering undergraduates in the southwest region of the United States. The online questionnaire was sent to participants who completed it voluntarily at the end of Fall 2019. A total of 274 of 293 participants completed the questionnaire with a response rate of 93.515%. Exploratory factor analysis was conducted to test the underlying factor structure of the questionnaire, which serves as a good reference …


Examining The Effects Of Information And Communication Technologies In The Legal Representation Of Latin American Asylum Seekers, Victor M. Portillo Ochoa Dec 2021

Examining The Effects Of Information And Communication Technologies In The Legal Representation Of Latin American Asylum Seekers, Victor M. Portillo Ochoa

Open Access Theses & Dissertations

The purpose of this thesis was to explore how legal defense nonprofit organizations (NPO) are using Information and Communication Technologies (ICT) to provide legal defense for asylum seekers and improve the conditions of immigrants at detention centers. In addition, this research explored the impact of ICTs on legal defense NPOs, bottlenecks, and security implications when supporting vulnerable communities. ICTs profoundly impacted the way we interact in a post-pandemic world, and it presents new challenges and possibilities for legal defense nonprofit organizations that are helping vulnerable communities. This study consists of staff and volunteers from different legal defense nonprofit organizations NPOs …


Towards Non-Intrusive Camera-Based Heart Rate Variability Estimation In The Car Under Naturalistic Condition, Shu Liu, Kevin Koch, Zimu Zhou, Martin Maritsch, Xiaoxi He, Elgar Fleisch, Felix Wortmann Dec 2021

Towards Non-Intrusive Camera-Based Heart Rate Variability Estimation In The Car Under Naturalistic Condition, Shu Liu, Kevin Koch, Zimu Zhou, Martin Maritsch, Xiaoxi He, Elgar Fleisch, Felix Wortmann

Research Collection School Of Computing and Information Systems

Driver status monitoring systems are a vital component of smart cars in the future, especially in the era when an increasing amount of time is spent in the vehicle. The heart rate (HR) is one of the most important physiological signals of driver status. To infer HR of drivers, the mainstream of existing research focused on capturing subtle heartbeat-induced vibration of the torso or leveraged photoplethysmography (PPG) that detects cardiac cycle-related blood volume changes in the microvascular. However, existing approaches rely on dedicated sensors that are expensive and cumbersome to be integrated or are vulnerable to ambient noise. Moreover, their …


Efficient Data Structures For Text Processing Applications, Paniz Abedin Dec 2021

Efficient Data Structures For Text Processing Applications, Paniz Abedin

Electronic Theses and Dissertations, 2020-

This thesis is devoted to designing and analyzing efficient text indexing data structures and associated algorithms for processing text data. The general problem is to preprocess a given text or a collection of texts into a space-efficient index to quickly answer various queries on this data. Basic queries such as counting/reporting a given pattern's occurrences as substrings of the original text are useful in modeling critical bioinformatics applications. This line of research has witnessed many breakthroughs, such as the suffix trees, suffix arrays, FM-index, etc. In this work, we revisit the following problems: 1. The Heaviest Induced Ancestors problem 2. …


Imon: Appearance-Based Gaze Tracking System On Mobile Devices, Sinh Huynh, Rajesh Krishna Balan, Jeonggil Ko Dec 2021

Imon: Appearance-Based Gaze Tracking System On Mobile Devices, Sinh Huynh, Rajesh Krishna Balan, Jeonggil Ko

Research Collection School Of Computing and Information Systems

Gaze tracking is a key building block used in many mobile applications including entertainment, personal productivity, accessibility, medical diagnosis, and visual attention monitoring. In this paper, we present iMon, an appearance-based gaze tracking system that is both designed for use on mobile phones and has significantly greater accuracy compared to prior state-of-the-art solutions. iMon achieves this by comprehensively considering the gaze estimation pipeline and then overcoming three different sources of errors. First, instead of assuming that the user's gaze is fixed to a single 2D coordinate, we construct each gaze label using a probabilistic 2D heatmap gaze representation input to …


Fair And Diverse Group Formation Based On Multidimensional Features, Mohammed Saad A Alqahtani Dec 2021

Fair And Diverse Group Formation Based On Multidimensional Features, Mohammed Saad A Alqahtani

Graduate Theses and Dissertations

The goal of group formation is to build a team to accomplish a specific task. Algorithms are being developed to improve the team's effectiveness so formed and the efficiency of the group selection process. However, there is concern that team formation algorithms could be biased against minorities due to the algorithms themselves or the data on which they are trained. Hence, it is essential to build fair team formation systems that incorporate demographic information into the process of building the group. Although there has been extensive work on modeling individuals’ expertise for expert recommendation and/or team formation, there has been …


Integration Of Internet Of Things And Health Recommender Systems, Moonkyung Yang Dec 2021

Integration Of Internet Of Things And Health Recommender Systems, Moonkyung Yang

Electronic Theses, Projects, and Dissertations

The Internet of Things (IoT) has become a part of our lives and has provided many enhancements to day-to-day living. In this project, IoT in healthcare is reviewed. IoT-based healthcare is utilized in remote health monitoring, observing chronic diseases, individual fitness programs, helping the elderly, and many other healthcare fields. There are three main architectures of smart IoT healthcare: Three-Layer Architecture, Service-Oriented Based Architecture (SoA), and The Middleware-Based IoT Architecture. Depending on the required services, different IoT architecture are being used. In addition, IoT healthcare services, IoT healthcare service enablers, IoT healthcare applications, and IoT healthcare services focusing on Smartwatch …


Methods And Applications Of Synthetic Data Generation, Jason Anderson Dec 2021

Methods And Applications Of Synthetic Data Generation, Jason Anderson

All Dissertations

The advent of data mining and machine learning has highlighted the value of large and varied sources of data, while increasing the demand for synthetic data captures the structural and statistical characteristics of the original data without revealing personal or proprietary information contained in the original dataset.

In this dissertation, we use examples from original research to show that, using appropriate models and input parameters, synthetic data that mimics the characteristics of real data can be generated with sufficient rate and quality to address the volume, structural complexity, and statistical variation requirements of research and development of digital information processing …


Curriculum Complexity And Graduation Rates At Utah State University, Hayden Hoopes Dec 2021

Curriculum Complexity And Graduation Rates At Utah State University, Hayden Hoopes

Undergraduate Honors Capstone Projects

This study utilizes a curricular analytics framework developed by Heileman et al. (2018) to examine the relationship between curriculum complexity and graduation rates in academic programs at Utah State University. The goal in quantifying the complexity of curricula is to determine whether or not prerequisite courses and other factors of curricula structure impacts graduation from the university. To accomplish this goal, curriculum complexity spreadsheets were developed for 96 degree programs at the university, which facilitated the assignment of curriculum complexity scores to the 6,337 students who qualified for the quasi-experimental study. Logistic regression was then applied to the resulting data …


Rmm: Reinforced Memory Management For Class-Incremental Learning, Yaoyao Liu, Qianru Sun, Qianru Sun Dec 2021

Rmm: Reinforced Memory Management For Class-Incremental Learning, Yaoyao Liu, Qianru Sun, Qianru Sun

Research Collection School Of Computing and Information Systems

Class-Incremental Learning (CIL) [38] trains classifiers under a strict memory budget: in each incremental phase, learning is done for new data, most of which is abandoned to free space for the next phase. The preserved data are exemplars used for replaying. However, existing methods use a static and ad hoc strategy for memory allocation, which is often sub-optimal. In this work, we propose a dynamic memory management strategy that is optimized for the incremental phases and different object classes. We call our method reinforced memory management (RMM), leveraging reinforcement learning. RMM training is not naturally compatible with CIL as the …


Spurring Digital Transformation In Singapore's Legal Industry, Xin Juan Chua, Steven M. Miller Dec 2021

Spurring Digital Transformation In Singapore's Legal Industry, Xin Juan Chua, Steven M. Miller

Research Collection School Of Computing and Information Systems

COVID-19 has transformed the way we live and work. It has caused the processes and operations of businesses and organisations to be restructured, as well as transformed business models. A 2020 McKinsey Global survey reported that companies all over the world claim they have accelerated the digitalisation of their customer and supply-chain interactions, as well as their internal operations, by three to four years. They also said they thought the share of digital or digitally enabled products in their portfolios has advanced by seven years. While technology transformation is not new to the legal profession, COVID-19 has cemented the importance …


Fine-Grained Generalization Analysis Of Inductive Matrix Completion, Antoine Ledent, Rodrigo Alves, Yunwen Lei, Marius Kloft Dec 2021

Fine-Grained Generalization Analysis Of Inductive Matrix Completion, Antoine Ledent, Rodrigo Alves, Yunwen Lei, Marius Kloft

Research Collection School Of Computing and Information Systems

In this paper, we bridge the gap between the state-of-the-art theoretical results for matrix completion with the nuclear norm and their equivalent in \textit{inductive matrix completion}: (1) In the distribution-free setting, we prove bounds improving the previously best scaling of \widetilde{O}(rd2) to \widetilde{O}(d3/2√r), where d is the dimension of the side information and rr is the rank. (2) We introduce the (smoothed) \textit{adjusted trace-norm minimization} strategy, an inductive analogue of the weighted trace norm, for which we show guarantees of the order \widetilde{O}(dr) under arbitrary sampling. In the inductive case, a similar rate was previously achieved only under uniform sampling …


Canita: Faster Rates For Distributed Convex Optimization With Communication Compression, Zhize Li, Peter Richtarik Dec 2021

Canita: Faster Rates For Distributed Convex Optimization With Communication Compression, Zhize Li, Peter Richtarik

Research Collection School Of Computing and Information Systems

Due to the high communication cost in distributed and federated learning, methods relying on compressed communication are becoming increasingly popular. Besides, the best theoretically and practically performing gradient-type methods invariably rely on some form of acceleration/momentum to reduce the number of communications (faster convergence), e.g., Nesterov's accelerated gradient descent (Nesterov, 1983, 2004) and Adam (Kingma and Ba, 2014). In order to combine the benefits of communication compression and convergence acceleration, we propose a \emph{compressed and accelerated} gradient method based on ANITA (Li, 2021) for distributed optimization, which we call CANITA. Our CANITA achieves the \emph{first accelerated rate} $O\bigg(\sqrt{\Big(1+\sqrt{\frac{\omega^3}{n}}\Big)\frac{L}{\epsilon}} + \omega\big(\frac{1}{\epsilon}\big)^{\frac{1}{3}}\bigg)$, …


Learning Large Neighborhood Search Policy For Integer Programming, Yaoxin Wu, Wen Song, Zhiguang Cao, Jie Zhang Dec 2021

Learning Large Neighborhood Search Policy For Integer Programming, Yaoxin Wu, Wen Song, Zhiguang Cao, Jie Zhang

Research Collection School Of Computing and Information Systems

We propose a deep reinforcement learning (RL) method to learn large neighborhood search (LNS) policy for integer programming (IP). The RL policy is trained as the destroy operator to select a subset of variables at each step, which is reoptimized by an IP solver as the repair operator. However, the combinatorial number of variable subsets prevents direct application of typical RL algorithms. To tackle this challenge, we represent all subsets by factorizing them into binary decisions on each variable. We then design a neural network to learn policies for each variable in parallel, trained by a customized actor-critic algorithm. We …


Strategic Behavior And Market Inefficiency In Blockchain-Based Auctions, Ping Fan Ke, Jianqing Chen, Zhiling Guo Dec 2021

Strategic Behavior And Market Inefficiency In Blockchain-Based Auctions, Ping Fan Ke, Jianqing Chen, Zhiling Guo

Research Collection School Of Computing and Information Systems

Blockchain-based auctions play a key role in decentralized finance, such as liquidation of collaterals in crypto-lending. In this research, we show that a Blockchain-based auction is subject to the threat to availability because of the characteristics of the Blockchain platform, which could lead to auction inefficiency or even market failure. Specifically, an adversary could occupy all of the transaction capacity of an auction by sending transactions with sufficiently high transaction fees, and then win the item in an auction with a nearly zero bid price as there are no competitors available. We discuss how to prevent this kind of strategic …


Self-Supervised Learning Disentangled Group Representation As Feature, Tan Wang, Zhongqi Yue, Jianqiang Huang, Qianru Sun, Hanwang Zhang Dec 2021

Self-Supervised Learning Disentangled Group Representation As Feature, Tan Wang, Zhongqi Yue, Jianqiang Huang, Qianru Sun, Hanwang Zhang

Research Collection School Of Computing and Information Systems

A good visual representation is an inference map from observations (images) to features (vectors) that faithfully reflects the hidden modularized generative factors (semantics). In this paper, we formulate the notion of “good” representation from a group-theoretic view using Higgins’ definition of disentangled representation [38], and show that existing Self-Supervised Learning (SSL) only disentangles simple augmentation features such as rotation and colorization, thus unable to modularize the remaining semantics. To break the limitation, we propose an iterative SSL algorithm: Iterative Partition-based Invariant Risk Minimization (IP-IRM), which successfully grounds the abstract semantics and the group acting on them into concrete contrastive learning. …