Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Data

Discipline
Institution
Publication Year
Publication
Publication Type
File Type

Articles 1 - 30 of 643

Full-Text Articles in Physical Sciences and Mathematics

Data Engineering: Building Software Efficiency In Medium To Large Organizations, Alessandro De La Torre Apr 2024

Data Engineering: Building Software Efficiency In Medium To Large Organizations, Alessandro De La Torre

Whittier Scholars Program

The introduction of PoetHQ, a mobile application, offers an economical strategy for colleges, potentially ushering in significant cost savings. These savings could be redirected towards enhancing academic programs and services, enriching the educational landscape for students. PoetHQ aims to democratize access to crucial software, effectively removing financial barriers and facilitating a richer educational experience. By providing an efficient software solution that reduces organizational overhead while maximizing accessibility for students, the project highlights the essential role of equitable education and resource optimization within academic institutions.


Citdet, Jordan A. James, Heather K. Manching, Matthew R. Mattia, Kim D. Bowman, Amanda M. Hulse-Kemp, William J. Beksi Apr 2024

Citdet, Jordan A. James, Heather K. Manching, Matthew R. Mattia, Kim D. Bowman, Amanda M. Hulse-Kemp, William J. Beksi

Computer Science and Engineering Datasets

The CitDet dataset is composed of images captured at the USDA Agricultural Research Service Subtropical Insects and Horticulture Research Unit in Fort Pierce, FL, USA. Data was collected between October 2021 and October 2022. 579 images were captured from different sections of the orchard using the open-source application Field Book on Android tablets. While collecting images, we faced the camera in a portrait orientation directly centered on the tree of interest. All images were taken at the edge of the soil in the tree row to simulate a ground-based robot imaging the tree while moving between two rows of trees. …


Demographic Data Analysis For Measuring Economic Impact Of The Branch Of Nashville, Tessa Pendleton, Annie Wardroup, Nicole Speyrer, Kimberly Amaya Hernandez Apr 2024

Demographic Data Analysis For Measuring Economic Impact Of The Branch Of Nashville, Tessa Pendleton, Annie Wardroup, Nicole Speyrer, Kimberly Amaya Hernandez

Belmont University Research Symposium (BURS)

As part of the Global Honors Scholars Collaborative, researchers aggregated data from The Belmont Data Collaborative to analyze the three primary ZIP codes (37211, 37013, 37217) served by The Branch of Nashville. These communities include immigrant and refugee populations, whom The Branch supports through its food bank, English classes, and further comprehensive care. Future program development will rely on the analysis of the current client base and eventual assessment of The Branch’s economic impact on the surrounding community. The goal of this research for The Branch of Nashville is twofold: (1) analyze the existing demographics within the above ZIP codes …


Identifying Rural Health Clinics Within The Transformed Medicaid Statistical Information System (T-Msis) Analytic Files, Katherine Ahrens Mph, Phd, Zachariah Croll, Yvonne Jonk Phd, John Gale Ms, Heidi O'Connor Ms Mar 2024

Identifying Rural Health Clinics Within The Transformed Medicaid Statistical Information System (T-Msis) Analytic Files, Katherine Ahrens Mph, Phd, Zachariah Croll, Yvonne Jonk Phd, John Gale Ms, Heidi O'Connor Ms

Rural Health Clinics

Researchers at the Maine Rural Health Research Center describe a methodology for identifying Rural Health Clinic encounters within the Medicaid claims data using Transformed Medicaid Statistical Information System (T-MSIS) Analytic Files.

Background: There is limited information on the extent to which Rural Health Clinics (RHC) provide pediatric and pregnancy-related services to individuals enrolled in state Medicaid/CHIP programs. In part this is because methods to identify RHC encounters within Medicaid claims data are outdated.

Methods: We used a 100% sample of the 2018 Medicaid Demographic and Eligibility and Other Services Transformed Medicaid Statistical Information System (T-MSIS) Analytic Files for 20 states …


Infusing Machine Learning And Computational Linguistics Into Clinical Notes, Funke V. Alabi, Onyeka Omose, Omotomilola Jegede Jan 2024

Infusing Machine Learning And Computational Linguistics Into Clinical Notes, Funke V. Alabi, Onyeka Omose, Omotomilola Jegede

Mathematics & Statistics Faculty Publications

Entering free-form text notes into Electronic Health Records (EHR) systems takes a lot of time from clinicians. A large portion of this paper work is viewed as a burden, which cuts into the amount of time doctors spend with patients and increases the risk of burnout. We will see how machine learning and computational linguistics can be infused in the processing of taking clinical notes. We are presenting a new language modeling task that predicts the content of notes conditioned on historical data from a patient's medical record, such as patient demographics, lab results, medications, and previous notes, with the …


Ethical Data Considerations For Engaging In Reparative Archival Practice, Jamie Rogers, Rhia Rae Nov 2023

Ethical Data Considerations For Engaging In Reparative Archival Practice, Jamie Rogers, Rhia Rae

Works of the FIU Libraries

Archival textually-rich materials--such as warranty deeds, mortgages, legal documents, and letter correspondence--can provide valuable historical insights, and if transcribed and analyzed, can produce data points in the form of unstructured text, tabular data, and geospatial assets. This presentation will provide an overview of the process Florida International University librarians went through to turn the papers of Dana A. Dorsey, Miami's first Black Millionaire, into data. Their work is guided by the concept of "collections as data" as a form of reparative archival practice, enabling the elevation of marginalized individuals' histories. The goal of reparative archival practice is to create a …


Delivering Healthcare To The Underserved, Edward Booty Nov 2023

Delivering Healthcare To The Underserved, Edward Booty

Asian Management Insights

Non-profits, governments, and businesses need to come together and use a data-driven approach to improve local basic healthcare access.


Data Ethics And Privacy For Researchers, Kelley F. Rowan Sep 2023

Data Ethics And Privacy For Researchers, Kelley F. Rowan

Works of the FIU Libraries

This workshop addresses specific data privacy and anonymization standards and techniques for researchers that are collecting personally identifiable information as well as sensitive information. The workshop covers federal, state, and international laws and regulations governing data privacy, the development of an impact assessment and privacy policy. The second half of the workshop focuses on ethical workflows, anonymization techniques and related resources.


Using Geographic Information To Explore Player-Specific Movement And Its Effects On Play Success In The Nfl, Hayley Horn, Eric Laigaie, Alexander Lopez, Shravan Reddy Aug 2023

Using Geographic Information To Explore Player-Specific Movement And Its Effects On Play Success In The Nfl, Hayley Horn, Eric Laigaie, Alexander Lopez, Shravan Reddy

SMU Data Science Review

American Football is a billion-dollar industry in the United States. The analytical aspect of the sport is an ever-growing domain, with open-source competitions like the NFL Big Data Bowl accelerating this growth. With the amount of player movement during each play, tracking data can prove valuable in many areas of football analytics. While concussion detection, catch recognition, and completion percentage prediction are all existing use cases for this data, player-specific movement attributes, such as speed and agility, may be helpful in predicting play success. This research calculates player-specific speed and agility attributes from tracking data and supplements them with descriptive …


Cybersecurity Safeguards: What Cybersecurity Safeguards Could Have Prevented The Intelligence/Data Breach By A Member Of The Air National Guard, Christopher Curtis Royal Aug 2023

Cybersecurity Safeguards: What Cybersecurity Safeguards Could Have Prevented The Intelligence/Data Breach By A Member Of The Air National Guard, Christopher Curtis Royal

Cyber Operations and Resilience Program Graduate Projects

Jack Teixeira, a 21-year-old IT specialist Air National Guard found himself on the wrong side of the US law after sharing what is considered classified and extremely sensitive information about USA's operations and role in Ukraine and Russia war. Like other previous cases of leakage of classified intelligence, the case of Teixeira raises concerns about the weaknesses and vulnerability of federal agencies' IT systems and security protocols governing accessibility to classified documents. Internal leakages of such classified documents hurt national security and can harm the country, especially when such secretive intelligence finds its way into the hands of enemies. Unauthorized …


Fintech Data Infrastructure For Esg Disclosure Compliance, Randall E. Duran, Peter Tierney Aug 2023

Fintech Data Infrastructure For Esg Disclosure Compliance, Randall E. Duran, Peter Tierney

Research Collection School Of Computing and Information Systems

Regulations related to the disclosure of environmental, governance, and social (ESG) factors are evolving rapidly and are a major concern for financial compliance worldwide. Information technology has the potential to reduce the effort and cost of ESG disclosure compliance. However, comprehensive and accurate ESG data are necessary for disclosures. Currently, the availability and quality of underlying data for ESG disclosures vary widely and are often deficient. The process involved with obtaining ESG data is also often inefficient and prone to error. This paper compares the models used and the evolution of Fintech data infrastructure developed to support financial services with …


A Bayesian Spatial Scan Statistic For Normal Data, Laasya Velamakanni Jul 2023

A Bayesian Spatial Scan Statistic For Normal Data, Laasya Velamakanni

Theses and Dissertations

Scan statistics are useful methods for detecting spatial clustering. While they were initially developed to detect regions with an excess of binomial or Poisson events, spatial scan statistics have been extended to detect hotspots in other types of data including continuous data. They have many applications in different fields such as epidemiology (e.g. detecting disease outbreaks), sociology (e.g. detecting crime hotspots), and environmental health (e.g. detecting high-pollution areas). Spatial scan statistics identify a ‘most likely cluster’ and then use a likelihood ratio test to determine if this cluster is statistically significant. Spatial scan statistics have been extended to the Bayesian …


Gis Data: Prince George’S County, Maryland – Shoreline Inventory Data 2023, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill Jun 2023

Gis Data: Prince George’S County, Maryland – Shoreline Inventory Data 2023, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill

Data

The shoreline inventory files have been generated to support the application of the Maryland Shoreline Stabilization Model (SSM), developed by the Center for Coastal Resources Management (CCRM), Virginia Institute of Marine Science (VIMS), to enhance and streamline regulatory decision making in Maryland. This shoreline inventory includes the features needed as inputs to run the SSM.

The data developed for the Shoreline Inventory is based on a three-tiered shoreline assessment approach. This assessment characterizes conditions by using observations made remotely at the desktop using high resolution imagery. The three-tiered shoreline assessment approach divides the shore zone into three regions:

1) the …


Gis Data: Harford County, Maryland – Shoreline Inventory Data 2023, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill Jun 2023

Gis Data: Harford County, Maryland – Shoreline Inventory Data 2023, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill

Data

The shoreline inventory files have been generated to support the application of the Maryland Shoreline Stabilization Model (SSM), developed by the Center for Coastal Resources Management (CCRM), Virginia Institute of Marine Science (VIMS), to enhance and streamline regulatory decision making in Maryland. This shoreline inventory includes the features needed as inputs to run the SSM.

The data developed for the Shoreline Inventory is based on a three-tiered shoreline assessment approach. This assessment characterizes conditions by using observations made remotely at the desktop using high resolution imagery. The three-tiered shoreline assessment approach divides the shore zone into three regions:

1) the …


Gis Data: Cecil County, Maryland – Shoreline Inventory Data 2023, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill Jun 2023

Gis Data: Cecil County, Maryland – Shoreline Inventory Data 2023, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill

Data

The shoreline inventory files have been generated to support the application of the Maryland Shoreline Stabilization Model (SSM), developed by the Center for Coastal Resources Management (CCRM), Virginia Institute of Marine Science (VIMS), to enhance and streamline regulatory decision making in Maryland. This shoreline inventory includes the features needed as inputs to run the SSM.

The data developed for the Shoreline Inventory is based on a three-tiered shoreline assessment approach. This assessment characterizes conditions by using observations made remotely at the desktop using high resolution imagery. The three-tiered shoreline assessment approach divides the shore zone into three regions:

1) the …


Gis Data: Caroline County, Maryland – Shoreline Inventory Data 2023, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill Jun 2023

Gis Data: Caroline County, Maryland – Shoreline Inventory Data 2023, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill

Data

The shoreline inventory files have been generated to support the application of the Maryland Shoreline Stabilization Model (SSM), developed by the Center for Coastal Resources Management (CCRM), Virginia Institute of Marine Science (VIMS), to enhance and streamline regulatory decision making in Maryland. This shoreline inventory includes the features needed as inputs to run the SSM.

The data developed for the Shoreline Inventory is based on a three-tiered shoreline assessment approach. This assessment characterizes conditions by using observations made remotely at the desktop using high resolution imagery. The three-tiered shoreline assessment approach divides the shore zone into three regions:

1) the …


Phantom Shootings, Allan Ambris Jun 2023

Phantom Shootings, Allan Ambris

Dissertations, Theses, and Capstone Projects

This capstone is a website designed to critique NYC Open Data reporting with respect to shootings through a series of visualizations and discoveries. The NYPD Shooting Incidents datasets (Historic and Year to Date) introduce themselves to the user by claiming to be a “list of every shooting incident that occurred in NYC.” The supplied documentation reveals that this is not the case.

After understanding the supporting materials, there are still undisclosed truths. My exploration of the data revealed that a single victim may be represented across multiple entries. Additionally, multiple victims may be represented by a single entry. It is …


Data-Optimized Spatial Field Predictions For Robotic Adaptive Sampling: A Gaussian Process Approach, Zachary Nathan May 2023

Data-Optimized Spatial Field Predictions For Robotic Adaptive Sampling: A Gaussian Process Approach, Zachary Nathan

Computer Science Senior Theses

We introduce a framework that combines Gaussian Process models, robotic sensor measurements, and sampling data to predict spatial fields. In this context, a spatial field refers to the distribution of a variable throughout a specific area, such as temperature or pH variations over the surface of a lake. Whereas existing methods tend to analyze only the particular field(s) of interest, our approach optimizes predictions through the effective use of all available data. We validated our framework on several datasets, showing that errors can decline by up to two-thirds through the inclusion of additional colocated measurements. In support of adaptive sampling, …


Baseball’S Evolution In The 21st Century, And How It Exemplifies Human Response To Change, Jonathan Sharpe May 2023

Baseball’S Evolution In The 21st Century, And How It Exemplifies Human Response To Change, Jonathan Sharpe

Honors Projects

The game of baseball has changed a lot in the past twenty years. It can be primarily attributed to the explosion in data analytics and how they are used to evaluate baseball players. This led to different player profiles being preferred and eventually led to the development of players changing. As a result, the strategies employed have also evolved and turned into a different game than seen only a couple of decades ago. This paper will explore the changes that the game has seen. On the other hand, Major League Baseball has also implemented its own changes to try and …


Nviz: Unraveling Neural Networks Through Visualization, Kevin Hoffman Apr 2023

Nviz: Unraveling Neural Networks Through Visualization, Kevin Hoffman

Mathematics and Computer Science Presentations

The growing utility of artificial intelligence (AI) is attributed to the development of neural networks. These networks are a class of models that make predictions based on previously observed data. While the inferential power of neural networks is great, the ability to explain their results is difficult because the underlying model is automatically generated. The AI community commonly refers to neural networks as black boxes because the patterns they learn from the data are not easily understood. This project aims to improve the visibility of patterns that neural networks identify in data. Through an interactive web application, NVIZ affords the …


Pore Connectivity Influences Mass Transport In Natural Rocks: Pore Structure, Gas Diffusion And Batch Sorption Studies, Qinhong Hu Feb 2023

Pore Connectivity Influences Mass Transport In Natural Rocks: Pore Structure, Gas Diffusion And Batch Sorption Studies, Qinhong Hu

Earth & Environmental Sciences Datasets

The Excel file contains supplementary dataset, in various sheets, for a manuscript entitled "Pore connectivity influences mass transport in natural rocks: Pore structure, gas diffusion and batch sorption studies" The Word file of "Data Dictonary" briefly explained the data in each sheet of the Excel file, from different measurements as detailed in the manuscript. Authors: Xiaoqing Yuan, Qinhong Hu (maxhu@uta.edu), Xiang Lin, Chen Zhao, Qiming Wang, Yukio Tachi, Yuta Fukatsu, Shoichiro Hamamoto, Marja Siitari-Kauppia, and Xiaodong Li


Social Impacts Of Robotics On The Labor And Employment Market, Kelvin Espinal Feb 2023

Social Impacts Of Robotics On The Labor And Employment Market, Kelvin Espinal

Dissertations, Theses, and Capstone Projects

Robotics have been introduced into the workplace to perform tasks that human beings have traditionally fulfilled. Complementing or substituting human labor with robotics eliminates human involvement in functions attributable to hazardous environments, heavy lifting, toxic substances, and repetitive low-level tasks. On the other hand, they are meant to be more efficient and cost-effective, saving money, time, and labor. However, since the introduction of robotics in the workforce, societal opposition has been towards this branch of technology in fear of losing employment, wages, and purpose.

Previous studies have reported an overarching societal fear that adopting robotics in the workplace and industry …


Data Curation For Modeling Tall Fescue Biomass Dynamics With Dssat-Csm, M. B. Hanson, P. D. Alderman, T. J. Butler, A. Caldeira Rocateli Jan 2023

Data Curation For Modeling Tall Fescue Biomass Dynamics With Dssat-Csm, M. B. Hanson, P. D. Alderman, T. J. Butler, A. Caldeira Rocateli

IGC Proceedings (1997-2023)

While models for predicting forage production are available to aid management decisions for some forage crops, there is limited research for a yield model designed specifically for tall fescue (Schedonorus arundinaceus). Therefore, our objective was to adapt an existing perennial forage model, the Decision Support System for Agrotechnology Transfer Cropping Systems Model (DSSAT-CSM) for predicting forage biomass of tall fescue in the southern Great Plains. To evaluate model performance, there must first be a high level of data manipulation and cleaning. In this project, a cohesive dataset combining biomass, weather, soil, and management data were structured into DSSAT …


Development Of Sensing And Programming Activities For Engineering Technology Pathways Using A Virtual Arduino Simulation Platform, Murat Kuzlu, Vukica Jovanovic, Otilia Popescu, Salih Sarp Jan 2023

Development Of Sensing And Programming Activities For Engineering Technology Pathways Using A Virtual Arduino Simulation Platform, Murat Kuzlu, Vukica Jovanovic, Otilia Popescu, Salih Sarp

Engineering Technology Faculty Publications

The Arduino platform has long been an efficient tool in teaching electrical engineering technology, electrical engineering, and computer science concepts in schools and universities and introducing new learners to programming and microcontrollers. Numerous Arduino projects are widely available through the open-source community, and they can help students to have hands-on experience in building circuits and programming electronics with a wide variety of topics that can make learning electrical prototyping fun. The educational fields of electrical engineering and electrical engineering technology need continuous updating to keep up with the continuous evolution of the computer system. Although the traditional Arduino platform has …


Development Of A Data Science Curriculum For An Engineering Technology Program, Salih Sarp, Murat Kuzlu, Otilia Popescu, Vukica M. Jovanovic, Zafer Acar Jan 2023

Development Of A Data Science Curriculum For An Engineering Technology Program, Salih Sarp, Murat Kuzlu, Otilia Popescu, Vukica M. Jovanovic, Zafer Acar

Engineering Technology Faculty Publications

Data science has gained the attention of various industries, educators, parents, and students thinking about their future careers. Statistics departments have traditionally offered data science courses for a long time. The main objective of these courses is to examine the fundamental concepts and theories. However, teaching data science courses has also expanded to other disciplines due to the vast amount of data being collected by numerous modern applications. Also, someone needs to learn how to collect and process data, especially from industrial devices, because of the recent development of Internet of Things (IoT) technologies. Hence, integrating data science into the …


Big Data Analytics Of Medical Data, Ashwin Rajasankar Dec 2022

Big Data Analytics Of Medical Data, Ashwin Rajasankar

Culminating Experience Projects

Data has become a huge part of modern decision making. With the improvements in computing performance and storage in the past two decades, storing large amounts of data has become much easier. Analyzing large amounts of data and creating data models with them can help organizations obtain insights and information which helps their decision making. Big data analytics has become an integral part of many fields such as retail, real estate, education, and medicine. In the project, the goal is to understand the working of Apache Spark and its different storage methods and create a data warehouse to analyze data. …


Safe Sharing For Sensitive Data, Kristi Thompson Dec 2022

Safe Sharing For Sensitive Data, Kristi Thompson

Western Libraries Presentations

This workshop focused on the question of when and how human subjects' data can be safely shared. It introduced the basics of data anonymization and discussed how to tell if a dataset has been de-identified. Case studies of successful anonymization and some spectacular failures were shared


Gis Data: Charles County, Maryland – Shoreline Inventory Data 2021, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill Nov 2022

Gis Data: Charles County, Maryland – Shoreline Inventory Data 2021, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill

Data

The shoreline inventory files have been generated to support the application of the Maryland Shoreline Stabilization Model (SSM), developed by the Center for Coastal Resources Management (CCRM), Virginia Institute of Marine Science (VIMS), to enhance and streamline regulatory decision making in Maryland. This shoreline inventory includes the features needed as inputs to run the SSM.

The data developed for the Shoreline Inventory is based on a three-tiered shoreline assessment approach. This assessment characterizes conditions by using observations made remotely at the desktop using high resolution imagery. The three-tiered shoreline assessment approach divides the shore zone into three regions:

1) the …


Getting Started Analyzing Data In Spss, Kristi Thompson Nov 2022

Getting Started Analyzing Data In Spss, Kristi Thompson

Western Libraries Presentations

SPSS is a popular package for analyzing data. This session will discuss how to get started on a simple quantitative analysis project using SPSS. Topics covered will include getting summary statistics, creating and modifying variables, creating graphs, running simple analyses, and interpreting SPSS output.


Gis Data: Worcester County, Maryland – Shoreline Inventory Data 2021, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill Nov 2022

Gis Data: Worcester County, Maryland – Shoreline Inventory Data 2021, Karinna Nunez, Tamia Rudnicky, Sharon Killeen, Jessica Hendricks, Catherine R. Duning, Evan Hill

Data

The shoreline inventory files have been generated to support the application of the Maryland Shoreline Stabilization Model (SSM), developed by the Center for Coastal Resources Management (CCRM), Virginia Institute of Marine Science (VIMS), to enhance and streamline regulatory decision making in Maryland. This shoreline inventory includes the features needed as inputs to run the SSM.

The data developed for the Shoreline Inventory is based on a three-tiered shoreline assessment approach. This assessment characterizes conditions by using observations made remotely at the desktop using high resolution imagery. The three-tiered shoreline assessment approach divides the shore zone into three regions: 1) the …