Open Access. Powered by Scholars. Published by Universities.®

Computer Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Database

Discipline
Institution
Publication Year
Publication
Publication Type

Articles 1 - 30 of 35

Full-Text Articles in Computer Engineering

Information-Theoretic Model Diagnostics (Infomod), Armin Esmaeilzadeh May 2023

Information-Theoretic Model Diagnostics (Infomod), Armin Esmaeilzadeh

UNLV Theses, Dissertations, Professional Papers, and Capstones

Model validation is a critical step in the development, deployment, and governance of machine learning models. During the validation process, the predictive power of a model is measured on unseen datasets with a variety of metrics such as Accuracy and F1-Scores for classification tasks. Although the most used metrics are easy to implement and understand, they are aggregate measures over all the segments of heterogeneous datasets, and therefore, they do not identify the performance variation of a model among different data segments. The lack of insight into how the model performs over segments of unseen datasets has raised significant challenges …


Preprocessing Of Astronomical Images From The Neowise Survey For Near-Earth Asteroid Detection, Rachel Meyer Apr 2022

Preprocessing Of Astronomical Images From The Neowise Survey For Near-Earth Asteroid Detection, Rachel Meyer

Scholar Week 2016 - present

Asteroid detection is a common field in astronomy for planetary defense which requires observations from survey telescopes to detect and classify different objects. The amount of data collected each night is increasing as better designed telescopes are created each year. This amount is quickly becoming unmanageable and many researchers are looking for ways to better process this data. The dominant solution is to implement computer algorithms to automatically detect these sources and to use Machine Learning in order to create a more efficient and accurate classifier. In the past there has been a focus on larger asteroids that create streaks …


Magic: The Gathering Card Virtualizer, Vincent Garbonick, Jacen C. Conlan, Jaret A. Varn Jan 2022

Magic: The Gathering Card Virtualizer, Vincent Garbonick, Jacen C. Conlan, Jaret A. Varn

Williams Honors College, Honors Research Projects

Any well-versed Magic: The Gathering (MTG) player or collector knows how difficult it can be to keep track of all cards in their collection. Some spend hours searching for that one specific card, and others are constantly scouring the internet for how much their collection costs. However, this issue does not only affect casual fans. Resale companies spend hours a day determining the costs of cards, and tournament judges painstakingly check players’ decks to ensure they are not cheating. To assist with these struggles, the design team proposed to create the MTG Card Virtualizer. This device scans MTG playing cards …


Synthesizing Realistic Substitute Data For A Law Enforcement Database Using A Python Library, Anthony Carrola Jan 2022

Synthesizing Realistic Substitute Data For A Law Enforcement Database Using A Python Library, Anthony Carrola

Graduate Theses, Dissertations, and Problem Reports

In many databases, there is private or sensitive data that should not be accessible to any but a few individuals, such as HIPAA (Health Insurance Portability and Accountability Act) protected or LE (law enforcement) data. However, there is often a need to work with the data or change it for proper and thorough testing, especially for the developers . In some cases, the developers may be authorized to access and view the data, but it is rarely allowable for that data to be changed. Further, it is unlikely, especially on a large project, that all of the developers will have …


Multilateration Index., Chip Lynch Aug 2021

Multilateration Index., Chip Lynch

Electronic Theses and Dissertations

We present an alternative method for pre-processing and storing point data, particularly for Geospatial points, by storing multilateration distances to fixed points rather than coordinates such as Latitude and Longitude. We explore the use of this data to improve query performance for some distance related queries such as nearest neighbor and query-within-radius (i.e. “find all points in a set P within distance d of query point q”). Further, we discuss the problem of “Network Adequacy” common to medical and communications businesses, to analyze questions such as “are at least 90% of patients living within 50 miles of a covered emergency …


Effect Of Information Technology Capital: Technology Infrastructure, Database, Software, And Brainware Toward Optimize The Use Of Information Technology (Case Study : Uin Sunan Ampel Of Surabaya), Rismawati Br Sitepu, Ilham M.Said, Tanti Handriana, Praptini Yulianti Jan 2021

Effect Of Information Technology Capital: Technology Infrastructure, Database, Software, And Brainware Toward Optimize The Use Of Information Technology (Case Study : Uin Sunan Ampel Of Surabaya), Rismawati Br Sitepu, Ilham M.Said, Tanti Handriana, Praptini Yulianti

Library Philosophy and Practice (e-journal)

This research was conducted to determine the extent of the influence of technology infrastructure costs, software costs, database costs and brainware costs to increase the information technology budget of the Sunan Ampel State Islamic University in Surabaya and efficient use of the budget. The purpose of this study is to prove that there is a positive and significant influence of technology infrastructure costs, software costs, database costs and brainware costs to increase information technology budgets by using validity and reliability tests and classic tests such as the Normality test, Multicollinearity test, autocorrelation test, Heteroskedasticity test , and Linearity test. This …


Guitar Store Inventory, Alexander Didonato Jan 2021

Guitar Store Inventory, Alexander Didonato

Williams Honors College, Honors Research Projects

This project displays my senior programming project for a made up Guitar store inventory.


Semantic, Integrated Keyword Search Over Structured And Loosely Structured Databases, Xinge Lu Dec 2020

Semantic, Integrated Keyword Search Over Structured And Loosely Structured Databases, Xinge Lu

Dissertations

Keyword search has been seen in recent years as an attractive way for querying data with some form of structure. Indeed, it allows simple users to extract information from databases without mastering a complex structured query language and without having knowledge of the schema of the data. It also allows for integrated search of heterogeneous data sources. However, as keyword queries are ambiguous and not expressive enough, keyword search cannot scale satisfactorily on big datasets and the answers are, in general, of low accuracy. Therefore, flat keyword search alone cannot efficiently return high quality results on large data with structure. …


Dbknot: A Transparent And Seamless, Pluggable Tamper Evident Database, Islam Khalil Oct 2020

Dbknot: A Transparent And Seamless, Pluggable Tamper Evident Database, Islam Khalil

Theses and Dissertations

Database integrity is crucial to organizations that rely on databases of important data. They suffer from the vulnerability to internal fraud. Database tampering by internal malicious employees with high technical authorization to their infrastructure or even compromised by externals is one of the important attack vectors.

This thesis addresses such challenge in a class of problems where data is appended only and is immutable. Examples of operations where data does not change is a) financial institutions (banks, accounting systems, stock market, etc., b) registries and notary systems where important data is kept but is never subject to change, and c) …


Room Management Web Application And Movement And Temperature Sensors, Visalbotr Chan, Huy Anh Duong Mar 2020

Room Management Web Application And Movement And Temperature Sensors, Visalbotr Chan, Huy Anh Duong

Computer Engineering

There are three main parts of this system: micro-controller, database, and website. Micro-controller detects motion of people walking in and out and It also measures room temperature and humidity in a confined space then updates collected data to the database. Our system’s database contains 6 main columns: room number, room capacity, number of students, temperature in Celsius, humidity in percent and date created. Finally, this database is queried by the website to display the information on the webpage. Users could also navigate on our site to check the most and least occupy rooms, and they can also search for a …


A Database For Indexable Carbide Inserts, Andrew Yoder Jun 2019

A Database For Indexable Carbide Inserts, Andrew Yoder

Computer Engineering

The indexable inserts project is a collaborative effort to aggregate into a single database as many indexable carbide inserts from as many manufacturers as possible. Inserts are generally labeled with a part number following a specific standard determined by shapes and measurements, however specifications for certain aspects of carbide inserts—such as which materials they can cut—can vary by manufacturer. There currently is not a way to search a comprehensive database containing tools from multiple manufacturers for a handful of inserts that would satisfy some necessary parameters, making finding the correct tool in a shop a much more time-consuming process than …


Improving And Understanding Data Quality In Large-Scale Data Systems, Xiaolan Wang Mar 2019

Improving And Understanding Data Quality In Large-Scale Data Systems, Xiaolan Wang

Doctoral Dissertations

Systems and applications rely heavily on data, which makes data quality a critical factor for their function. In turn, low quality data can be incredibly costly and disruptive, leading to loss of revenue, incorrect conclusions, and misguided policy decisions. Improving data quality is far more than purging datasets of errors; it is more important to improve the processes that produce the data, to collect good data sources that are used for generating the data, and to truly understand the quality of the data. Therefore, the objective of this thesis is to improve and understand data quality from the above aspects. …


A-Z Database Discovery Using Alma: Eliminate Redundancy And Simplify Your Workflow, Travis Clamon Feb 2019

A-Z Database Discovery Using Alma: Eliminate Redundancy And Simplify Your Workflow, Travis Clamon

Travis Clamon

Frustrated by having to maintain an A-Z databases list separately on our library website and in Alma/Primo, East Tennessee State University embarked on a goal to eliminate redundancy by using Alma as our primary source of metadata for eResources. This presentation will cover our entire workflow and the issues we encountered along the way. I'll first go over our process in Alma including MARC record creation, electronic collection setup, and the top level collection module. Next, I'll cover our workflow in Primo including normalization rules, scoping, and PNX display. The last section will cover the Alma API's and how they …


Predicting Co And Nox Emissions From Gas Turbines: Novel Data And A Benchmark Pems, Heysem Kaya, Pinar Tüfekci̇, Erdi̇nç Uzun Jan 2019

Predicting Co And Nox Emissions From Gas Turbines: Novel Data And A Benchmark Pems, Heysem Kaya, Pinar Tüfekci̇, Erdi̇nç Uzun

Turkish Journal of Electrical Engineering and Computer Sciences

Predictive emission monitoring systems (PEMS) are important tools for validation and backing up of costly continuous emission monitoring systems used in gas-turbine-based power plants. Their implementation relies on the availability of appropriate and ecologically valid data. In this paper, we introduce a novel PEMS dataset collected over five years from a gas turbine for the predictive modeling of the CO and NOx emissions. We analyze the data using a recent machine learning paradigm, and present useful insights about emission predictions. Furthermore, we present a benchmark experimental procedure for comparability of future works on the data


Logging, Visualization, And Analysis Of Network And Power Data Of Iot Devices, Neal Huynh Nguyen Dec 2018

Logging, Visualization, And Analysis Of Network And Power Data Of Iot Devices, Neal Huynh Nguyen

Master's Theses

There are approximately 23.14 billion IoT(Internet of Things) devices currently in use worldwide. This number is projected to grow to over 75 billion by 2025. Despite their ubiquity little is known about the security and privacy implications of IoT devices. Several large-scale attacks against IoT devices have already been recorded.

To help address this knowledge gap, we have collected a year’s worth of network traffic and power data from 16 common IoT devices. From this data, we show that we can identify different smart speakers, like the Echo Dot, from analyzing one minute of power data on a shared power …


Performance Analysis Of Java Persistence Api Providers, Besart Pllana Oct 2018

Performance Analysis Of Java Persistence Api Providers, Besart Pllana

UBT International Conference

Nowadays, fast and accurate access to data is very important. Usually data is managed and processed through software applications. In recent years, the most preferred programming model by most application developers is Object Oriented Programming (OOP) where data is represented through objects. These data must be persistent and therefore needs to be stored, and storage can be done on a variety of databases. The most common databases are Relational Database Management Systems (RDBMS). While persistence of objects in RDBMS is limited by object-relational mismatch which is the inconsistency of the direct interaction between two components based on different approaches, OOP …


Sort Vs. Hash Join On Knights Landing Architecture, Victor L. Pan, Felix Lin Aug 2018

Sort Vs. Hash Join On Knights Landing Architecture, Victor L. Pan, Felix Lin

The Summer Undergraduate Research Fellowship (SURF) Symposium

With the increasing amount of information stored, there is a need for efficient database algorithms. One of the most important database operations is “join”. This involves combining columns from two tables and grouping common values in the same row in order to minimize redundant data. The two main algorithms used are hash join and sort merge join. Hash join builds a hash table to allow for faster searching. Sort merge join first sorts the two tables to make it more efficient when comparing values. There has been a lot of debate over which approach is superior. At first, hash join …


An Embarrassment Of Riches: Data Integration In Vr Pompeii, Adam Schoelz May 2018

An Embarrassment Of Riches: Data Integration In Vr Pompeii, Adam Schoelz

Computer Science and Computer Engineering Undergraduate Honors Theses

It is fair to say that Pompeii is the most studied archaeological site in the world. Beyond the extensive remains of the city itself, the timing of its rediscovery and excavation place it in a unique historiographical position. The city has been continuously studied since the 18th century, with historians and archaeologists constantly reevaluating older sources as our knowledge of the ancient world expands. While several studies have approached the city from a data driven perspective, no studies of the city have taken a quantitative holistic approach on the scale of the VR Pompeii project. Hyper-specificity has been the order …


Chronic Risk And Disease Management Model Using Structured Query Language And Predictive Analysis, Mamata Ojha Jan 2018

Chronic Risk And Disease Management Model Using Structured Query Language And Predictive Analysis, Mamata Ojha

Electronic Theses and Dissertations

Individuals with chronic conditions are the ones who use health care most frequently and more than 50% of top ten causes of death are chronic diseases in United States and these members always have health high risk scores. In the field of population health management, identifying high risk members is very important in terms of patient health care, disease management and cost management. Disease management program is very effective way of monitoring and preventing chronic disease and health related complications and risk management allows physicians and healthcare companies to reduce patient’s health risk, help identifying members for care/disease management along …


Querying And Visualization Of Moving Objects Using Constraint Databases, Semere M. Woldemariam Jul 2017

Querying And Visualization Of Moving Objects Using Constraint Databases, Semere M. Woldemariam

Department of Computer Science and Engineering: Dissertations, Theses, and Student Research

Good querying and visualization of moving objects and their trajectories is still an open problem. This thesis investigates three types of moving objects. First, projectiles, whose parabolic motion is difficult to represent. Second, moving objects that slide down a slope. The representation of these objects is challenging because of their accelerating motion. Third, the motion of migrating animals. The motion of migrating animals is challenging because it also involves some spatio-temporal interpolation. The thesis shows a solution to these problems using ideas from physics and an implementation in the MLPQ constraint databases system. The MLPQ implementation enables several complex spatio-temporal …


A-Z Database Discovery Using Alma: Eliminate Redundancy And Simplify Your Workflow, Travis Clamon Jun 2017

A-Z Database Discovery Using Alma: Eliminate Redundancy And Simplify Your Workflow, Travis Clamon

ETSU Faculty Works

Frustrated by having to maintain an A-Z databases list separately on our library website and in Alma/Primo, East Tennessee State University embarked on a goal to eliminate redundancy by using Alma as our primary source of metadata for eResources. This presentation will cover our entire workflow and the issues we encountered along the way. I'll first go over our process in Alma including MARC record creation, electronic collection setup, and the top level collection module. Next, I'll cover our workflow in Primo including normalization rules, scoping, and PNX display. The last section will cover the Alma API's and how they …


Gyrus Higher Learning Management System, Nicholas Turnquist, Bailey Kingsley, Ryan Schnarre Jan 2017

Gyrus Higher Learning Management System, Nicholas Turnquist, Bailey Kingsley, Ryan Schnarre

Capstone Design Expo Posters

Our project was to develop a prototype learning management system for use of higher education for our sponsor, Gyrus Systems. This consisted of creating a MySQL relational database to store user and class information, to design and code a user interface that emphasized user experience, and to implement functionalities for each user role.

Early in the design phase we outlined which features were must haves, in order to demonstrate an adequate prototype, and had this list approved by our sponsor. They were then divided into two roles. The role of “student” has the ability to submit assignments, get information from …


Forget-Me-Not, Daniel Barber-Cironi, Shawn Nicholson, Jake Kruse, Nicole Dent Jan 2017

Forget-Me-Not, Daniel Barber-Cironi, Shawn Nicholson, Jake Kruse, Nicole Dent

Williams Honors College, Honors Research Projects

The purpose of Forget-Me-Not is to provide another level of care and comfort to those suffering from mild dementia, as well as provide further assistance for a friend, family member, or caretaker who may look after them. Research shows that timely reminders and persistent information can greatly improve the quality of life for those afflicted with mild dementia (Mokhtari et al.). Forget-Me-Not’s persistent display and wearable smart-bracelet offer a customizable and well connected system to provide these reminders. For the caretaker, a mobile application is provided in order to maintain the display and notify them of emergencies or critical events …


Modelling Approach For A Pcb Inventory In Our Environment, Gerlinde Knetsch Jul 2016

Modelling Approach For A Pcb Inventory In Our Environment, Gerlinde Knetsch

International Congress on Environmental Modelling and Software

The data of more than 100 monitoring programs in Germany for the substance group of persistent organic pollutants (POPs), including the polychlorinated biphenyls (PCBs) are to be used for a modelling approach for a PCB inventory. The application landscape of the Federation/Laender POP-DioxinDatabase, which is operated in the German Environment Agency, pursues an interdisciplinary approach and includes all these data. A cross-media evaluation and assessment of environmental data are necessary and relevant to the target system, compiling an inventory of PCBs in our environment. The question arises how the integrated modelling concept can control the knowledge-based methods, with the aim …


Wearable Ekg, Cale Hopkins, Tanner Papenfuss, Travis E. Michael Jun 2016

Wearable Ekg, Cale Hopkins, Tanner Papenfuss, Travis E. Michael

Computer Engineering

No abstract provided.


Content-Based Image Analysis With Applications To The Multifunction Printer Imaging Pipeline And Image Databases, Cheng Lu Apr 2016

Content-Based Image Analysis With Applications To The Multifunction Printer Imaging Pipeline And Image Databases, Cheng Lu

Open Access Dissertations

Image understanding is one of the most important topics for various applications. Most of image understanding studies focus on content-based approach while some others also rely on meta data of images. Image understanding includes several sub-topics such as classification, segmentation, retrieval and automatic annotation etc., which are heavily studied recently. This thesis proposes several new methods and algorithms for image classification, retrieval and automatic tag generation. The proposed algorithms have been tested and verified in multiple platforms. For image classification, our proposed method can complete classification in real-time under hardware constraints of all-in-one printer and adaptively improve itself by online …


Welcome To Willowtree: Come Take A Closer Look, Shahim-Abdul Satar, Kent White, Ayesha Zafar Jan 2016

Welcome To Willowtree: Come Take A Closer Look, Shahim-Abdul Satar, Kent White, Ayesha Zafar

Capstone Design Expo Posters

The demand for more workers in tech related fields has given students the opportunity of choosing between multiple job offers and students can now be more deliberate about the jobs that they accept. Because of this employers also need to set themselves apart from other companies. Many companies do this is by sending out brochures, PDFs, and other information that may be outdated by the time it goes out. Our solution to this problem was to make an application which would provide relevant data to the user as well as provide insight into the company. The end result is a …


Database For Online Accreditation Process In Directorate For Accreditation Of Kosovo (Dak-Mis), Ibush Luzha Nov 2015

Database For Online Accreditation Process In Directorate For Accreditation Of Kosovo (Dak-Mis), Ibush Luzha

UBT International Conference

Directorate for Accreditation of Kosovo (DAK) is only National Accreditation Body in Republic of Kosovo, recognized by Government, which in accordance with international standards, assesses technical competences of the Conformity Assessment Bodies (CAB)s that deal with activities such as: testing, calibration, certification and inspection both in public and private sector. Till now application for accreditation and all other procedures for accreditation of CABs are carried out so that all the documents of the application and the receipt of the certificate of accreditation are conducted in the offices of DAK. By Management Information System (MIS), database of DAK (DAK-MIS), customers will …


An Integrative Modeling Framework To Evaluate Wheat Production Systems: Fusarium Head Blight, Willingthon Pavan, José Maurício Cunha Fernandes, Alexandre Lazzaretti, Josué Toebe, Jorge Luis Bavaresco, Alex C. Ruane, Rodrigo Yoiti Tsukahara Jun 2014

An Integrative Modeling Framework To Evaluate Wheat Production Systems: Fusarium Head Blight, Willingthon Pavan, José Maurício Cunha Fernandes, Alexandre Lazzaretti, Josué Toebe, Jorge Luis Bavaresco, Alex C. Ruane, Rodrigo Yoiti Tsukahara

International Congress on Environmental Modelling and Software

This paper describes a practical, integrated, web-based and user friendly analysis tool for crop model users that provides quality control of input data, tracks user selections in model parameterization, and enables visual analysis of model outcomes using a single graphical user interface. This allows the user to undertake numerous steps in crop modeling and analysis in a seamless and integrated environment. The analysis and visualization components of the system were enabled utilizing R (pl/r) and the robustness of the underlying data structures and coupling point between crop and disease models were achieved through use of PostgreSQL database management system. The …


Data Mining Of Protein Databases, Christopher Assi Jul 2012

Data Mining Of Protein Databases, Christopher Assi

Department of Computer Science and Engineering: Dissertations, Theses, and Student Research

Data mining of protein databases poses special challenges because many protein databases are non-relational whereas most data mining and machine learning algorithms assume the input data to be a relational database. Protein databases are non-relational mainly because they often contain set data types. We developed new data mining algorithms that can restructure non-relational protein databases so that they become relational and amenable for various data mining and machine learning tools. We applied the new restructuring algorithms to a pancreatic protein database. After the restructuring, we also applied two classification methods, such as decision tree and SVM classifiers and compared their …