Open Access. Powered by Scholars. Published by Universities.®

Databases and Information Systems Commons

Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

Database

Discipline
Institution
Publication Year
Publication

Articles 1 - 30 of 31

Full-Text Articles in Databases and Information Systems

Elevating Academic Administration: A Comprehensive Faculty Dashboard For Tracking Student Evaluations And Research, Musa M. Azeem Apr 2024

Elevating Academic Administration: A Comprehensive Faculty Dashboard For Tracking Student Evaluations And Research, Musa M. Azeem

Senior Theses

The USC Faculty Dashboard is a web application designed to revolutionize how department heads, professors, and instructors monitor progress and make decisions, providing a centralized hub for efficient data storage and analysis. Currently, there’s a gap in tools tailored for department heads to concisely manage the performance of their department, which our platform aims to fill. The USC Faculty Dashboard offers easy access to upload and view student evaluation and research information, empowering department heads to evaluate the performance of faculty members and seamlessly track their research grants, publications, and expenditures. Furthermore, professors and instructors gain personalized performance analysis tools, …


Creation Of A Digital Storage System For Genome Sequencing Metadata, Jacquelin W. Olexa Jan 2024

Creation Of A Digital Storage System For Genome Sequencing Metadata, Jacquelin W. Olexa

Undergraduate Theses, Professional Papers, and Capstone Artifacts

As the field of computational genomics continues to expand in both potential and application, it is now more imperative than ever to ensure that massive genetic sequencing datasets are properly stored in an accessible manner. This project sought to establish a practical, user-friendly, secure system for a genomics research lab (the Good Lab; thegoodlab.org) at the University of Montana. A MySQL database and connected web application was ruled the best configuration to maximize utility and accessibility for the lab’s researchers. Building the logical framework for the database, creating the server, and sourcing data occurred over several months. The dataset ranged …


Managing Inventory With A Database, David Bartlett Jan 2024

Managing Inventory With A Database, David Bartlett

Williams Honors College, Honors Research Projects

Large commercial companies often use warehouses to store and organize their product inventory. However, manually keeping track of inventory through physical means can be a tedious process and is at risk for a variety of potential issues. It is very easy for records to be inaccurate or duplicated, especially if large reorganizations are undertaken, as this can cause issues such as duplicate product ID numbers. Therefore, it was decided that an inventory management system utilizing a SQL database should be created. The system needed to have capabilities including allowing the entry of product information, the ability to search database records …


College Of Education Filemaker Extraction And End-User Database Development, Andrew Tran May 2022

College Of Education Filemaker Extraction And End-User Database Development, Andrew Tran

Electronic Theses, Projects, and Dissertations

The College of Education (CoE) at the California State University San Bernardino (CSUSB) developed a system to keep track of both state and national accreditation requirements using FileMaker 5, a database system. This accreditation data is crucial for reporting and record-keeping for the CSU Chancellor’s Office as well as the State of California. However, the database system was developed several decades ago, and software support has long since been dropped, causing the CoE’s legacy accreditation data to be at risk of being lost should the software or hardware suffer permanent failure. The purpose of this project was to perform extraction …


Data Processing In A Database Management System Using Parallel Processing, Stephen Shears Jan 2022

Data Processing In A Database Management System Using Parallel Processing, Stephen Shears

Williams Honors College, Honors Research Projects

This research project will be focused on parallel processing as it is used with database management systems to process data. Specifically, the goal is to see if creating a database management system with parallel processing at the forefront of its data processing can offer enough of an efficiency increase to warrant using it against a sequential database management system and is it possible to make that system just as reliable as those databases without parallel processing. A parallel processed database will be created with a focus on monitoring its data reliability and consistency. It will then be compared to two …


Enhancing Usability And Explainability Of Data Systems, Anna Fariha Oct 2021

Enhancing Usability And Explainability Of Data Systems, Anna Fariha

Doctoral Dissertations

The recent growth of data science expanded its reach to an ever-growing user base of nonexperts, increasing the need for usability, understandability, and explainability in these systems. Enhancing usability makes data systems accessible to people with different skills and backgrounds alike, leading to democratization of data systems. Furthermore, proper understanding of data and data-driven systems is necessary for the users to trust the function of the systems that learn from data. Finally, data systems should be transparent: when a data system behaves unexpectedly or malfunctions, the users deserve proper explanation of what caused the observed incident. Unfortunately, …


Multilateration Index., Chip Lynch Aug 2021

Multilateration Index., Chip Lynch

Electronic Theses and Dissertations

We present an alternative method for pre-processing and storing point data, particularly for Geospatial points, by storing multilateration distances to fixed points rather than coordinates such as Latitude and Longitude. We explore the use of this data to improve query performance for some distance related queries such as nearest neighbor and query-within-radius (i.e. “find all points in a set P within distance d of query point q”). Further, we discuss the problem of “Network Adequacy” common to medical and communications businesses, to analyze questions such as “are at least 90% of patients living within 50 miles of a covered emergency …


Metaxmorph: Hierarchical Transformation Of Data With Metadata, Shubham Airan Aug 2021

Metaxmorph: Hierarchical Transformation Of Data With Metadata, Shubham Airan

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

This research is about transforming data. Data comes in different shapes; it can be structured as a graph, a tree, a collection of tables, or some other shape. In this thesis, we focus on data structured as a tree, which is known as hierarchical data. The same data could be structured in many different tree shapes. Previously it was shown how to transform data from one tree shape, one hierarchy to another without losing any information. But sometimes the pieces of the hierarchy are annotated or associated with metadata, that is, with data about the data itself. The metadata can …


Accelerating Aggregation Efficiency: Using Postgres As A Cache With Mongodb, Mason Q. Porter-Brown Jan 2021

Accelerating Aggregation Efficiency: Using Postgres As A Cache With Mongodb, Mason Q. Porter-Brown

Senior Projects Fall 2021

For decades now relational databases, namely SQL, have been the industry standard. However, recently MongoDB and other NoSQL databases have been growing in popularity due to their flexibility and scalability. SQL still has the upper hand in a variety of areas, including data consistency, advanced and established analytics functions, and efficient “JOIN” functions. This project focuses on MongoDB’s shortcomings when it comes to replicating “JOIN” operations using MongoDB’s aggregate functions. “JOIN” operations refer to the action of comparing data from one or more collections of data and joining similar data together in order to analyze and draw statistics from the …


A Deductive Database For Knot Colourings, Dong Hyun Han Jan 2021

A Deductive Database For Knot Colourings, Dong Hyun Han

Senior Projects Spring 2021

This work constitutes progress toward the development of a knowledge base for braids, knots, and their colourings. The main result of this development is the creation of a logical model for storing data pertaining to braids, two-dimensional projections of three dimensional knots, finite quandles, and colorings of braids and knots by quandles. It uses the Entity Relationship data reference model as its starting point and makes the original design there. In addition, it includes a conversion of the Entity Relationship Diagram (ERD) to SQL queries that define tables corresponding to the ERD entity sets. Finally this work demonstrates how to …


Football’S Future: An Analytical Interpretation Of The Premier League, Hunter Witeof Jan 2021

Football’S Future: An Analytical Interpretation Of The Premier League, Hunter Witeof

Williams Honors College, Honors Research Projects

This project looks to take the statistics of soccer players and run them through an algorithm to determine how well a player is performing. The system that will be designed in the project will look to accomplish 3 main goals: allow the user to enter new statistics, store the data for all 38 game weeks for all 20 teams, and compute a score for each player’s performance for each game as well as the average of all of the player's scores.


Translating Counting Problems Into Computable Language Expressions, Zach Prescott Jun 2020

Translating Counting Problems Into Computable Language Expressions, Zach Prescott

Theses

The realm of automated problem solving is a relatively new field, even in the context of natural language processing. One area where this is often demonstrated is that of creating a program that can solve word problems. The program must understand the problem, perform some processing, and then convey this information to a user in a way that is accessible and understandable. There has been quite a lot of progress in this area with simpler problems. However, when it comes to understanding problems that involve a level of NLP, the results are not conclusive. In this paper, we would like …


Vzwam Web-Based Lookup, Ruben Claudio May 2020

Vzwam Web-Based Lookup, Ruben Claudio

Masters Theses & Doctoral Dissertations

This web-based lookup will allow V employees to find territory sales rep much faster. It will simplify the process and eliminate manual processes.

At the moment, a combination of multiple manual processes is needed to find territory sales reps. The company’s CRM does not allow to find rep sales quickly. When an in-house sales representative is talking to a prospect, this sales rep has to go through a few series of steps to find an outside or territory sales rep --which is usually needed to schedule in-person meetings, that results in delays while doing transactions with the prospects. Besides, because …


Technical Strategies Database Managers Use To Protect Systems From Security Breaches, Leonard Ogbonna Jan 2020

Technical Strategies Database Managers Use To Protect Systems From Security Breaches, Leonard Ogbonna

Walden Dissertations and Doctoral Studies

Healthcare organizations generate massive amounts of data through their databases that may be vulnerable to data breaches due to extensive user privileges, unpatched databases, standardized query language injections, weak passwords/usernames, and system weaknesses. The purpose of this qualitative multiple case study was to explore technical strategies database managers in Southeast/North Texas used to protect database systems from data breaches. The target population consisted of database managers from 2 healthcare organizations in this region. The integrated system theory of information security management was the conceptual framework. The data collection process included semistructured interviews with 9 database managers, including a review of …


Telecommunications Database, Tristan Hess Jan 2020

Telecommunications Database, Tristan Hess

Williams Honors College, Honors Research Projects

The final goal of this project was to create a web application that is specifically tailored to make management of data for the Telecommunications department easier. The software that the department uses for phone service generates monthly reports that cannot be stored into their current database system. The Telecommunications Database project is a four-tier web application that was developed to store the monthly report information and alleviate the burden of manually searching through the reports for information. The web application implements basic database functionality for searching, inserting, updating, and deleting data contained in the monthly reports. Advanced searching functionality and …


Relate Promotion Tool, David Nutt Jan 2019

Relate Promotion Tool, David Nutt

Williams Honors College, Honors Research Projects

This paper contains the documentation of a standalone application used to expedite promotion creation and reflections of the development process.


Environmental Restoration Database, Joao Nascimento Apr 2018

Environmental Restoration Database, Joao Nascimento

Graduate Theses & Non-Theses

Environmental restoration projects face many challenges. Public awareness, funding constraints, unpredictable weather, unknown biological/chemical factors and the uncertainty about how the targeted ecosystem will develop work against the planned and ideal restoration.

One way the projects’ efficiency can be improved is by using software tools for data and quality management systems, in order to share information, make field practice follow rules, keep track of maintenance tasks, measure results and, therefore, increase the rate of success by the amount of resources invested.

Since the conception of every project, all resources involved need to be focused and coherent to the final restoration …


Supporting Scientific Analytics Under Data Uncertainty And Query Uncertainty, Liping Peng Mar 2018

Supporting Scientific Analytics Under Data Uncertainty And Query Uncertainty, Liping Peng

Doctoral Dissertations

Data management is becoming increasingly important in many applications, in particular, in large scientific databases where (1) data can be naturally modeled by continuous random variables, and (2) queries can involve complex predicates and/or be difficult for users to express explicitly. My thesis work aims to provide efficient support to both the "data uncertainty" and the "query uncertainty". When data is uncertain, an important class of queries requires query answers to be returned if their existence probabilities pass a threshold. I start with optimizing such threshold query processing for continuous uncertain data in the relational model by (i) expediting selections …


Database Usability Enhancement In Data Exploration, Yue Wang Nov 2017

Database Usability Enhancement In Data Exploration, Yue Wang

Doctoral Dissertations

Database usability has become an important research topic over the last decade. In the early days, database management systems were maintained by sophisticated users like database administrators. Today, due to the availability of data and computing resources, more non-expert users are involved in database computation. From their point of view, database systems lack ease of use. So researchers believe that usability is as important as the performance and functionality of databases and therefore developed many techniques such as natural language interface to enhance the ease of use of databases. In this thesis, we find some deeper technical issues in database …


High-Performance Complex Event Processing For Decision Analytics, Haopeng Zhang Jul 2017

High-Performance Complex Event Processing For Decision Analytics, Haopeng Zhang

Doctoral Dissertations

Complex Event Processing (CEP) systems are becoming increasingly popular in do- mains for decision analytics such as financial services, transportation, cluster monitoring, supply chain management, business process management, and health care. These systems collect or create high volumes event streams, and often require such event streams to be processed in real-time. To this end, CEP queries are applied for filtering, correlation, ag- gregation, and transformation, to derive high-level, actionable information. Tasks for CEP systems fall into two categories: passive monitoring and proactive monitoring. For passive monitoring, users know their exact needs and express them in CEP queries, then CEP engines …


Oasis - Identifying The Core Attributes For Rdbms Alternatives, Benjamin P. Mcpherson Jan 2017

Oasis - Identifying The Core Attributes For Rdbms Alternatives, Benjamin P. Mcpherson

Electronic Theses and Dissertations

Since their introduction in the 1970s, relational database management systems have served as the dominate data storage technology. However, the demands of big data and Web 2.0 necessitated a change in the market, sparking the beginning of the NoSQL movement in the late 2000s. NoSQL databases exchanged the relational model and the guaranteed consistency of ACID transactions for improved performance and massive scalability [1]. While the benefits NoSQL provided proved useful, the lack of sufficient SQL functionality presented a major hurdle for organizations which require it to properly operate. It was clear that new RDBMS solutions which did not compromise …


Aiddata Gis International Fellowship: Ghana West-Africa, Jason N. Ready Dec 2016

Aiddata Gis International Fellowship: Ghana West-Africa, Jason N. Ready

Sustainability and Social Justice

My internship, or fellowship as it was commonly referred to, was funded by a non-profit organization out of Williamsburg Virginia called AidData. This fellowship took place in in the country of Ghana, West-Africa beginning in May of 2016 and continued for 14 weeks with 40 hours each week. The objective of this internship was to provide in-depth training on the use of geographic Information Systems to Private and Public sectors within the country to allow for increased efficiency, and transparency through data visualization. In accordance with the requirement of Clark Universities GISDE master’s program this paper will delve into the …


Building And Safety Department Android Mobile Application, Nary Simms Jan 2016

Building And Safety Department Android Mobile Application, Nary Simms

Mathematics and Computer Science Capstones

Research has found that Americans spend 4.5 hours watching television, 1.5 hours listening to the radio, about half an hour reading print and spend a whooping five plus hours per day in digital media (online, mobile, other). Out of these five hours, two hours and twenty minutes are spent on a mobile device (phone or tablet), which is a massive increase of about 575 percent from the twenty-four minutes that was reported in 2010. Flurry, an analytic app company, released data about their tracking of more than 300,000 apps in 2013, and they found the average time spent per day …


Cleaning Up Minnesota's Archeological Record With Maid: The Minnesota Archeological Integrated Database, Andrew Allen Brown Jan 2016

Cleaning Up Minnesota's Archeological Record With Maid: The Minnesota Archeological Integrated Database, Andrew Allen Brown

All Graduate Theses, Dissertations, and Other Capstone Projects

Minnesota archeologists face many difficulties in conducting archeological research and managing the state's cultural resources such as a lack of standardized data formats and field/lab procedures, a lack of a centralized data repository, and insufficient existing databases. The purpose of this thesis is to build the foundation for a database system that addresses these difficulties along with being efficient and effective for entering, managing, and analyzing archeological data produced in the field and in the lab. The Minnesota Archeological Integrated Database is being built to be a long-lasting, constantly evolving system to be used by archeologists and cultural resource managers …


Privacy-Preserving Sanitization In Data Sharing, Wentian Lu Nov 2014

Privacy-Preserving Sanitization In Data Sharing, Wentian Lu

Doctoral Dissertations

In the era of big data, the prospect of analyzing, monitoring and investigating all sources of data starts to stand out in every aspect of our life. The benefit of such practices becomes concrete only when analysts or investigators have the information shared from data owners. However, privacy is one of the main barriers that disrupt the sharing behavior, due to the fear of disclosing sensitive information. This dissertation describes data sanitization methods that disguise the sensitive information before sharing a dataset and our criteria are always protecting privacy while preserving utility as much as possible. In particular, we provide …


Hydrographic Surface Modeling Through A Raster Based Spline Creation Method, Julie G. Alexander May 2014

Hydrographic Surface Modeling Through A Raster Based Spline Creation Method, Julie G. Alexander

University of New Orleans Theses and Dissertations

The United States Army Corp of Engineers relies on accurate and detailed surface models for various construction projects and preventative measures. To aid in these efforts, it is necessary to work for advancements in surface model creation. Current methods for model creation include Delaunay triangulation, raster grid interpolation, and Hydraulic Spline grid generation. While these methods produce adequate surface models, attempts for improved methods can still be made.

A method for raster based spline creation is presented as a variation of the Hydraulic Spline algorithm. By implementing Hydraulic Splines in raster data instead of vector data, the model creation process …


Filestream Enhancement To The Courseweb Grading System, Derrick Massey Jul 2013

Filestream Enhancement To The Courseweb Grading System, Derrick Massey

All Capstone Projects

Data is sometimes unstructured, such as text documents, images, and videos. This unstructured data is regularly stored outside the database, different from its structured data. This separation can cause data management problems. Or, if the data is connected with regulated storage, the performance and the file streaming capabilities and can degrade.

FILESTREAM incorporates the SQL Server Database Engine with an NTFS file system by loading varbinary(max) BLOB data as files. And make them available to the file system. Transact-SQL statements can update, query, search, insert, and or back up FILESTREAM data. The Win32 file system interface affords streaming access to …


An Analysis Of Generational Caching Implemented In A Production Website, Marc E. Zych Jun 2013

An Analysis Of Generational Caching Implemented In A Production Website, Marc E. Zych

Master's Theses

Website scaling has been an issue since the inception of the web. The demand for user generated content and personalized web pages requires the use of a database for a storage engine. Unfortunately, scaling the database to handle large amounts of traffic is still a problem many companies face. One such company is iFixit, a provider of free, publicly-editable, online repair manuals. Like many websites, iFixit uses Memcached to decrease database load and improve response time. However, the caching strategy used is a very ad hoc one and therefore can be greatly improved.

Most research regarding web application caching focuses …


Employee Time Scheduling, Mark Peter Smith Jun 2010

Employee Time Scheduling, Mark Peter Smith

Computer Science and Software Engineering

Small business managers face the common problem of employee time scheduling. There is a solution to this problem in the form of an application called Lemming Scheduler. Lemming Scheduler is a Java based employee time scheduling program. Its features include a desktop based application that stores employee and business information as well as a web interface for employees to view schedules and update availability. The desktop application uses employee and shift information to automatically generate schedules. The generated schedules are viewable by employees outside of work by way of the web interface. Lemming Scheduler provides a light weight interface for …


A User-Centric Tabular Multi-Column Sorting Interface For Intact Transposition Of Columnar Data, David B. L. Miles Jan 2006

A User-Centric Tabular Multi-Column Sorting Interface For Intact Transposition Of Columnar Data, David B. L. Miles

Theses and Dissertations

Many usability features designed in software applications are not procedurally intuitive for software users. A good example of software usability involves tabular sorting in a spreadsheet. Single-column sorting, activated with a mouse click to a column header or toolbar button, often produces rearranged listings that reduce cognitive organization beyond the sorted column. Multi-column sorting, generated through menu-driven processes, provides derived organization, however, locating feature options through menu-based systems can be confusing. A means to overcome this confusion is prioritized selection of database arrays issued to columnar displays for the purpose of intact transposition of data. This is a unique process …