Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

Databases and Information Systems

2021

Institution
Keyword
Publication

Articles 1 - 30 of 128

Full-Text Articles in Physical Sciences and Mathematics

On Performance Optimization And Prediction Of Parallel Computing Frameworks In Big Data Systems, Haifa Alquwaiee Dec 2021

On Performance Optimization And Prediction Of Parallel Computing Frameworks In Big Data Systems, Haifa Alquwaiee

Dissertations

A wide spectrum of big data applications in science, engineering, and industry generate large datasets, which must be managed and processed in a timely and reliable manner for knowledge discovery. These tasks are now commonly executed in big data computing systems exemplified by Hadoop based on parallel processing and distributed storage and management. For example, many companies and research institutions have developed and deployed big data systems on top of NoSQL databases such as HBase and MongoDB, and parallel computing frameworks such as MapReduce and Spark, to ensure timely data analyses and efficient result delivery for decision making and business …


An Open Source Direct Messaging And Enhanced Recommendation System For Yioop, Aniruddha Dinesh Mallya Dec 2021

An Open Source Direct Messaging And Enhanced Recommendation System For Yioop, Aniruddha Dinesh Mallya

Master's Projects

Recommendation systems and direct messaging systems are two popular components of web portals. A recommendation system is an information filtering system that seeks to predict the "rating" or "preference" a user would give to an item and a direct messaging system allows private communication between users of any platform. Yioop, is an open source, PHP search engine and web portal that can be configured to allow users to create discussion groups, blogs, wikis etc.

In this project, we expanded on Yioop’s group system so that every user now has a personal group. Personal groups were then used to add user …


High Performance Document Store Implementation In Rust, Ishaan Aggarwal Dec 2021

High Performance Document Store Implementation In Rust, Ishaan Aggarwal

Master's Projects

Databases are a core part of any application which requires persistence of data. The performance of applications involving the use of database systems is directly proportional to how fast their database read-write operations are. The aim of this project was to build a high- performance document store which can support variety of applications which require data storage and retrieval of some kind. This document store can be used as an independently running backend service which can be utilized by search engines, applications which deal with keeping records, etc. We used Rust to make this document store which is fast, robust, …


Node.Js Based Document Store For Web Crawling, David Bui Dec 2021

Node.Js Based Document Store For Web Crawling, David Bui

Master's Projects

WARC files are central to internet preservation projects. They contain the raw resources of web crawled data and can be used to create windows into the past of web pages at the time they were accessed. Yet there are few tools that manipulate WARC files outside of basic parsing. The creation of our tool WARC-KIT gives users in the Node.js JavaScript environment, a tool kit to interact with and manipulate WARC files.

Included with WARC-KIT is a WARC parsing tool known as WARCFilter that can be used standalone tool to parse, filter, and create new WARC files. WARCFilter can also, …


Using Parallel Primary Caches To Improve Capacity And Bandwidth, John Rubena Wani Dec 2021

Using Parallel Primary Caches To Improve Capacity And Bandwidth, John Rubena Wani

Archived Theses and Dissertations

No abstract provided.


Moment-Preserving Piecewise Approximation For 1-D And 2-D Signals, Soha M. A. A. Seif Dec 2021

Moment-Preserving Piecewise Approximation For 1-D And 2-D Signals, Soha M. A. A. Seif

Archived Theses and Dissertations

No abstract provided.


Shape Similarity By Deformation Using Polynomial Transformation, Hanan M. Moussa Dec 2021

Shape Similarity By Deformation Using Polynomial Transformation, Hanan M. Moussa

Archived Theses and Dissertations

No abstract provided.


Examining The Effects Of Information And Communication Technologies In The Legal Representation Of Latin American Asylum Seekers, Victor M. Portillo Ochoa Dec 2021

Examining The Effects Of Information And Communication Technologies In The Legal Representation Of Latin American Asylum Seekers, Victor M. Portillo Ochoa

Open Access Theses & Dissertations

The purpose of this thesis was to explore how legal defense nonprofit organizations (NPO) are using Information and Communication Technologies (ICT) to provide legal defense for asylum seekers and improve the conditions of immigrants at detention centers. In addition, this research explored the impact of ICTs on legal defense NPOs, bottlenecks, and security implications when supporting vulnerable communities. ICTs profoundly impacted the way we interact in a post-pandemic world, and it presents new challenges and possibilities for legal defense nonprofit organizations that are helping vulnerable communities. This study consists of staff and volunteers from different legal defense nonprofit organizations NPOs …


Efficient Data Structures For Text Processing Applications, Paniz Abedin Dec 2021

Efficient Data Structures For Text Processing Applications, Paniz Abedin

Electronic Theses and Dissertations, 2020-

This thesis is devoted to designing and analyzing efficient text indexing data structures and associated algorithms for processing text data. The general problem is to preprocess a given text or a collection of texts into a space-efficient index to quickly answer various queries on this data. Basic queries such as counting/reporting a given pattern's occurrences as substrings of the original text are useful in modeling critical bioinformatics applications. This line of research has witnessed many breakthroughs, such as the suffix trees, suffix arrays, FM-index, etc. In this work, we revisit the following problems: 1. The Heaviest Induced Ancestors problem 2. …


Human Capital In The Knowledge Economy : A 3-Country Case Study In Healthcare, James Scott Mccallum Dec 2021

Human Capital In The Knowledge Economy : A 3-Country Case Study In Healthcare, James Scott Mccallum

Theses and Dissertations

During the present knowledge economy there appear to be labor shortages at the same time and in the same regions in which there is an excess of labor supply. Such a pattern would run counter to previous major economic disruptions, as well as questioning traditional free market economic theory of supply and demand principles. Implications for policy where there are global labor shortages along with surplus labor availability in a market economy, are significant. It will likely indicate a drag on economic growth for business sectors, for regions and perhaps globally. It would indicate an accompanying growing disparity of income. …


Fair And Diverse Group Formation Based On Multidimensional Features, Mohammed Saad A Alqahtani Dec 2021

Fair And Diverse Group Formation Based On Multidimensional Features, Mohammed Saad A Alqahtani

Graduate Theses and Dissertations

The goal of group formation is to build a team to accomplish a specific task. Algorithms are being developed to improve the team's effectiveness so formed and the efficiency of the group selection process. However, there is concern that team formation algorithms could be biased against minorities due to the algorithms themselves or the data on which they are trained. Hence, it is essential to build fair team formation systems that incorporate demographic information into the process of building the group. Although there has been extensive work on modeling individuals’ expertise for expert recommendation and/or team formation, there has been …


Integration Of Internet Of Things And Health Recommender Systems, Moonkyung Yang Dec 2021

Integration Of Internet Of Things And Health Recommender Systems, Moonkyung Yang

Electronic Theses, Projects, and Dissertations

The Internet of Things (IoT) has become a part of our lives and has provided many enhancements to day-to-day living. In this project, IoT in healthcare is reviewed. IoT-based healthcare is utilized in remote health monitoring, observing chronic diseases, individual fitness programs, helping the elderly, and many other healthcare fields. There are three main architectures of smart IoT healthcare: Three-Layer Architecture, Service-Oriented Based Architecture (SoA), and The Middleware-Based IoT Architecture. Depending on the required services, different IoT architecture are being used. In addition, IoT healthcare services, IoT healthcare service enablers, IoT healthcare applications, and IoT healthcare services focusing on Smartwatch …


Curriculum Complexity And Graduation Rates At Utah State University, Hayden Hoopes Dec 2021

Curriculum Complexity And Graduation Rates At Utah State University, Hayden Hoopes

Undergraduate Honors Capstone Projects

This study utilizes a curricular analytics framework developed by Heileman et al. (2018) to examine the relationship between curriculum complexity and graduation rates in academic programs at Utah State University. The goal in quantifying the complexity of curricula is to determine whether or not prerequisite courses and other factors of curricula structure impacts graduation from the university. To accomplish this goal, curriculum complexity spreadsheets were developed for 96 degree programs at the university, which facilitated the assignment of curriculum complexity scores to the 6,337 students who qualified for the quasi-experimental study. Logistic regression was then applied to the resulting data …


Integration Of Blockchain Technology Into Automobiles To Prevent And Study The Causes Of Accidents, John Kim Dec 2021

Integration Of Blockchain Technology Into Automobiles To Prevent And Study The Causes Of Accidents, John Kim

Electronic Theses, Projects, and Dissertations

Automobile collisions occur daily. We now live in an information-driven world, one where technology is quickly evolving. Blockchain technology can change the automotive industry, the safety of the motoring public and its surrounding environment by incorporating this vast array of information. It can place safety and efficiency at the forefront to pedestrians, public establishments, and provide public agencies with pertinent information securely and efficiently. Other industries where Blockchain technology has been effective in are as follows: supply chain management, logistics, and banking. This paper reviews some statistical information regarding automobile collisions, Blockchain technology, Smart Contracts, Smart Cities; assesses the feasibility …


Methods And Applications Of Synthetic Data Generation, Jason Anderson Dec 2021

Methods And Applications Of Synthetic Data Generation, Jason Anderson

All Dissertations

The advent of data mining and machine learning has highlighted the value of large and varied sources of data, while increasing the demand for synthetic data captures the structural and statistical characteristics of the original data without revealing personal or proprietary information contained in the original dataset.

In this dissertation, we use examples from original research to show that, using appropriate models and input parameters, synthetic data that mimics the characteristics of real data can be generated with sufficient rate and quality to address the volume, structural complexity, and statistical variation requirements of research and development of digital information processing …


Managing Incomplete Data In The Patient Discharge Summary To Support Correct Hospital Reimbursements, Fadi Naser Eddin Nov 2021

Managing Incomplete Data In The Patient Discharge Summary To Support Correct Hospital Reimbursements, Fadi Naser Eddin

USF Tampa Graduate Theses and Dissertations

The patient discharge summary is a document that conveys the patient's story to other healthcare practitioners, external users, and, most importantly from a financial perspective, health insurers. A defect or incompleteness in the patient's discharge summary will result in delays in the collection process through denial of the entire or partial reimbursement claim or, in the best-case scenario, delay until the discharge summary issue is resolved. The purpose of this project is to address the issue of the incompleteness of discharge summary from the perspective of healthcare providers, with the goal of understanding, diagnosing, and intervening in the research problem. …


Transfer-Learned Pruned Deep Convolutional Neural Networks For Efficient Plant Classification In Resource-Constrained Environments, Martinson Ofori Nov 2021

Transfer-Learned Pruned Deep Convolutional Neural Networks For Efficient Plant Classification In Resource-Constrained Environments, Martinson Ofori

Masters Theses & Doctoral Dissertations

Traditional means of on-farm weed control mostly rely on manual labor. This process is time-consuming, costly, and contributes to major yield losses. Further, the conventional application of chemical weed control can be economically and environmentally inefficient. Site-specific weed management (SSWM) counteracts this by reducing the amount of chemical application with localized spraying of weed species. To solve this using computer vision, precision agriculture researchers have used remote sensing weed maps, but this has been largely ineffective for early season weed control due to problems such as solar reflectance and cloud cover in satellite imagery. With the current advances in artificial …


Can We Make It Better? Assessing And Improving Quality Of Github Repositories, Gede Artha Azriadi Prana Nov 2021

Can We Make It Better? Assessing And Improving Quality Of Github Repositories, Gede Artha Azriadi Prana

Dissertations and Theses Collection (Open Access)

The code hosting platform GitHub has gained immense popularity worldwide in recent years, with over 200 million repositories hosted as of June 2021. Due to its popularity, it has great potential to facilitate widespread improvements across many software projects. Naturally, GitHub has attracted much research attention, and the source code in the various repositories it hosts also provide opportunity to apply techniques and tools developed by software engineering researchers over the years. However, much of existing body of research applicable to GitHub focuses on code quality of the software projects and ways to improve them. Fewer work focus on potential …


Residential Curbside Recycle Context Analysis, Ntchanang Mpafe Oct 2021

Residential Curbside Recycle Context Analysis, Ntchanang Mpafe

USF Tampa Graduate Theses and Dissertations

Curbside recycling as a preferred mode of residential and municipal sustainability goals seems to have an overwhelming acceptance and adoption in the US. About 69.8 million out of 97.3 million (72%) single-family households in the United States have access to curbside recycling services (State of Curbside Recycling Report, 2020). Collectively, the programs divert about nine million tons of recyclables from landfill disposal each year (Cottom, 2019).

For a design that started in the 1980s in the US, its rapid universal adoption seems to have precluded a concerted effort in examining the coproduced nature (Households: service receptors and Municipalities: service providers) …


Informing Complexity: The Business Case For Managing Digital Twins Of Complex Process Facilities As A Valuable Asset, William Randell Mcnair Oct 2021

Informing Complexity: The Business Case For Managing Digital Twins Of Complex Process Facilities As A Valuable Asset, William Randell Mcnair

USF Tampa Graduate Theses and Dissertations

The Digital Twins of complex facilities, specifically 3D models created during their design, is a potentially valuable information asset. This three- article dissertation explores the business case for firms in the petrochemical process industry to manage throughout the facility lifecycle. A maturity model is provided to illustrate the stages of digital twin evolution and serves as a tool to help communicate each of the five levels of digital twin maturity achievable in various use cases. An industry analysis reviews existing literature and proposes a model to assess informing or insight value of digital twins from three perspectives. Next, an empirical …


Enhancing Usability And Explainability Of Data Systems, Anna Fariha Oct 2021

Enhancing Usability And Explainability Of Data Systems, Anna Fariha

Doctoral Dissertations

The recent growth of data science expanded its reach to an ever-growing user base of nonexperts, increasing the need for usability, understandability, and explainability in these systems. Enhancing usability makes data systems accessible to people with different skills and backgrounds alike, leading to democratization of data systems. Furthermore, proper understanding of data and data-driven systems is necessary for the users to trust the function of the systems that learn from data. Finally, data systems should be transparent: when a data system behaves unexpectedly or malfunctions, the users deserve proper explanation of what caused the observed incident. Unfortunately, …


History Modeling For Conversational Information Retrieval, Chen Qu Oct 2021

History Modeling For Conversational Information Retrieval, Chen Qu

Doctoral Dissertations

Conversational search is an embodiment of an iterative and interactive approach to information retrieval (IR) that has been studied for decades. Due to the recent rise of intelligent personal assistants, such as Siri, Alexa, AliMe, Cortana, and Google Assistant, a growing part of the population is moving their information-seeking activities to voice- or text-based conversational interfaces. One of the major challenges of conversational search is to leverage the conversation history to understand and fulfill the users' information needs. In this dissertation work, we investigate history modeling approaches for conversational information retrieval. We start from history modeling for user intent prediction. …


Enabling Declarative And Scalable Prescriptive Analytics In Relational Data, Matteo Brucato Oct 2021

Enabling Declarative And Scalable Prescriptive Analytics In Relational Data, Matteo Brucato

Doctoral Dissertations

Constrained optimization problems are at the heart of significant applications in a broad range of domains, including finance, transportation, manufacturing, and healthcare. They are often found at the final step of business analytics, namely prescriptive analytics, to allow businesses to transform a rich understanding of data, typically provided by advanced predictive models, into actionable decisions. Modeling and solving these problems has relied on application-specific solutions, which are often complex, error-prone, and do not generalize. Our goal is to create a domain-independent, declarative approach, supported and powered by the system where the data relevant to these problems typically resides: the database. …


Neural Approaches To Feedback In Information Retrieval, Keping Bi Oct 2021

Neural Approaches To Feedback In Information Retrieval, Keping Bi

Doctoral Dissertations

Relevance feedback on search results indicates users' search intent and preferences. Extensive studies have shown that incorporating relevance feedback (RF) on the top k (usually 10) ranked results significantly improves the performance of re-ranking. However, most existing research on user feedback focuses on words-based retrieval models. Recently, neural retrieval models have shown their efficacy in capturing relevance matching in retrieval but little research has been conducted on neural approaches to feedback. This leads us to study different aspects of feedback with neural approaches in the dissertation. RF techniques are seldom used in real search scenarios since they can require significant …


Employees Breaking Bad With Technology: An Exploratory Analysis Of Human Factors That Drive Cyberspace Insider Threats, Marcus L. Green Oct 2021

Employees Breaking Bad With Technology: An Exploratory Analysis Of Human Factors That Drive Cyberspace Insider Threats, Marcus L. Green

USF Tampa Graduate Theses and Dissertations

As implementation of computer systems has continued to grow in business contexts, employee-driven cyberspace infractions have also grown in number. Employee cyberspace behaviors have continued to have detrimental effects on company computer systems. Actions that violate company cybersecurity policies can be either malicious or unmalicious. Solutions, by and large, have been electronic and centered on hardware and software. Those proposing solutions have begun to shift their focus to human risk vulnerabilities.

This study was novel in that its focus was identification of individual, cultural, and technological risk factors that drive cyberspace insider threat activities. Identifying factors that reduce insider threat …


Managing Health Locus Of Control In Patient-Provider Relationships, James Wallace Sep 2021

Managing Health Locus Of Control In Patient-Provider Relationships, James Wallace

USF Tampa Graduate Theses and Dissertations

Patient locus of control is a strong determinant of health outcomes, yet health care professionals do not typically address it in care plans. In fact, management of most medical conditions is hindered because the treating physician has little information about the patient’s locus of control. This research addresses the question “How can locus of control be used to enable health care practitioners to improve medical outcomes?”

Research Methodology. Using an engaged scholarship approach incorporating the Elaborated Action Design Research methodology, the research drives the guided, emergent design of a novel protocol and two separate artifacts for management of health locus …


Exploring, Understanding, Then Designing: Twitter Users’ Sharing Behavior For Minor Safety Incidents, Mashael Yousef Almoqbel Aug 2021

Exploring, Understanding, Then Designing: Twitter Users’ Sharing Behavior For Minor Safety Incidents, Mashael Yousef Almoqbel

Dissertations

Social media has become an integral part of human lives. Social media users resort to these platforms for various reasons. Users of these platforms spend a lot of time creating, reading, and sharing content, therefore, providing a wealth of available information for everyone to use. The research community has taken advantage of this and produced many publications that allow us to better understand human behavior. An important subject that is sometimes discussed and shared on social media is public safety. In the past, Twitter users have used the platform to share incidents, share information about incidents, victims and perpetrators, and …


Participatory Learning: Measuring Learning And Educational Technology Acceptance, Erick Sanchez Suasnabar Aug 2021

Participatory Learning: Measuring Learning And Educational Technology Acceptance, Erick Sanchez Suasnabar

Dissertations

Participatory Learning (PL) integrates several learning approaches, engaging students throughout the entire assignment process for both online and face-to-face courses. Beyond simply providing a solution, students also craft a problem (problem-based learning), grade each other (peer assessment and feedback), evaluate themselves (self-assessment), and can view others’ work (learning by example). This dissertation research explores the resulting learning effects. Contributions to both educational and Information Systems research include extending an early PL model and experiments that applied the PL approach to examinations, by validating and testing new constructs based on user activity and critical thinking. In addition, the study explores a …


Data-Driven Based Automatic Routing Planning For Mass, Qingwu Wang Aug 2021

Data-Driven Based Automatic Routing Planning For Mass, Qingwu Wang

Maritime Safety & Environment Management Dissertations (Dalian)

No abstract provided.


Exploratory Search With Archetype-Based Language Models, Brent D. Davis Aug 2021

Exploratory Search With Archetype-Based Language Models, Brent D. Davis

Electronic Thesis and Dissertation Repository

This dissertation explores how machine learning, natural language processing and information retrieval may assist the exploratory search task. Exploratory search is a search where the ideal outcome of the search is unknown, and thus the ideal language to use in a retrieval query to match it is unavailable. Three algorithms represent the contribution of this work. Archetype-based Modeling and Search provides a way to use previously identified archetypal documents relevant to an archetype to form a notion of similarity and find related documents that match the defined archetype. This is beneficial for exploratory search as it can generalize beyond standard …