Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Database

2021

Discipline
Institution
Publication
Publication Type

Articles 1 - 12 of 12

Full-Text Articles in Physical Sciences and Mathematics

The Sedimentary Geochemistry And Paleoenvironments Project, Úna C. Farrell, Rifaat Samawi, Savitha Anjanappa, Roman Klykov, Oyeleye O. Adeboye, Heda Agic, Anne Sofie C. Ahm, Thomas H. Boag, Fred Bowyer, Jochen J. Brocks, Tessa N. Brunoir, Donald E. Canfield, Xiaoyan Chen, Meng Cheng, Matthew O. Clarkson, Devon B. Cole, David R. Cordie, Peter W. Crockford, Huan Cui, Tais W. Dahl, Lucas D. Mouro, Keith Dewing, Stephen Q. Dornbos, Nadja Drabon, Julie A. Dumoulin, Joseph F. Emmings, Cecilia R. Endriga, Tiffani A. Fraser, Robert R. Gaines, Richard M. Gaschnig, Timothy M. Gibson, Geoffrey J. Gilleaudeau Nov 2021

The Sedimentary Geochemistry And Paleoenvironments Project, Úna C. Farrell, Rifaat Samawi, Savitha Anjanappa, Roman Klykov, Oyeleye O. Adeboye, Heda Agic, Anne Sofie C. Ahm, Thomas H. Boag, Fred Bowyer, Jochen J. Brocks, Tessa N. Brunoir, Donald E. Canfield, Xiaoyan Chen, Meng Cheng, Matthew O. Clarkson, Devon B. Cole, David R. Cordie, Peter W. Crockford, Huan Cui, Tais W. Dahl, Lucas D. Mouro, Keith Dewing, Stephen Q. Dornbos, Nadja Drabon, Julie A. Dumoulin, Joseph F. Emmings, Cecilia R. Endriga, Tiffani A. Fraser, Robert R. Gaines, Richard M. Gaschnig, Timothy M. Gibson, Geoffrey J. Gilleaudeau

Geosciences: Faculty Publications

No abstract provided.


Information Extraction And Classification On Journal Papers, Lei Yu Nov 2021

Information Extraction And Classification On Journal Papers, Lei Yu

Department of Computer Science and Engineering: Dissertations, Theses, and Student Research

The importance of journals for diffusing the results of scientific research has increased considerably. In the digital era, Portable Document Format (PDF) became the established format of electronic journal articles. This structured form, combined with a regular and wide dissemination, spread scientific advancements easily and quickly. However, the rapidly increasing numbers of published scientific articles requires more time and effort on systematic literature reviews, searches and screens. The comprehension and extraction of useful information from the digital documents is also a challenging task, due to the complex structure of PDF.

To help a soil science team from the United States …


Enhancing Usability And Explainability Of Data Systems, Anna Fariha Oct 2021

Enhancing Usability And Explainability Of Data Systems, Anna Fariha

Doctoral Dissertations

The recent growth of data science expanded its reach to an ever-growing user base of nonexperts, increasing the need for usability, understandability, and explainability in these systems. Enhancing usability makes data systems accessible to people with different skills and backgrounds alike, leading to democratization of data systems. Furthermore, proper understanding of data and data-driven systems is necessary for the users to trust the function of the systems that learn from data. Finally, data systems should be transparent: when a data system behaves unexpectedly or malfunctions, the users deserve proper explanation of what caused the observed incident. Unfortunately, …


Using Methanol Masers To Probe High Mass Star Forming Regions, Naomi S. Shechter, Anuj P. Sarma Aug 2021

Using Methanol Masers To Probe High Mass Star Forming Regions, Naomi S. Shechter, Anuj P. Sarma

DePaul Discoveries

Compared to low mass stars, the formation of high mass stars is not well understood. To understand better how high mass stars form, we can utilize masers, naturally amplified point sources of microwave radiation. One example is the methanol maser, which falls into two categories. Class I methanol masers form in the bipolar outflows from the protostar, and Class II masers form in the accretion disk. Their compact size and intensity make them an excellent source of information about the process of high mass star formation. We compiled a modest database of Class I and II methanol masers through a …


Multilateration Index., Chip Lynch Aug 2021

Multilateration Index., Chip Lynch

Electronic Theses and Dissertations

We present an alternative method for pre-processing and storing point data, particularly for Geospatial points, by storing multilateration distances to fixed points rather than coordinates such as Latitude and Longitude. We explore the use of this data to improve query performance for some distance related queries such as nearest neighbor and query-within-radius (i.e. “find all points in a set P within distance d of query point q”). Further, we discuss the problem of “Network Adequacy” common to medical and communications businesses, to analyze questions such as “are at least 90% of patients living within 50 miles of a covered emergency …


Plug-And-Play Sql, Shubham Swami Aug 2021

Plug-And-Play Sql, Shubham Swami

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

We present an efficient model to retrieve data from a database by implementing plug-and-play queries using the query guards. The model is efficient in the sense that it saves time when writing a query and promotes query portability and reuse. A plug-and-play query is a freestanding query that can couple to any data socket and self determine whether it can be evaluated reliably on the data. We use hierarchies to improve SQL querying in a way that eliminates the need to write a view to construct virtual tables or a set of tables to run a query. The hierarchy is …


Metaxmorph: Hierarchical Transformation Of Data With Metadata, Shubham Airan Aug 2021

Metaxmorph: Hierarchical Transformation Of Data With Metadata, Shubham Airan

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

This research is about transforming data. Data comes in different shapes; it can be structured as a graph, a tree, a collection of tables, or some other shape. In this thesis, we focus on data structured as a tree, which is known as hierarchical data. The same data could be structured in many different tree shapes. Previously it was shown how to transform data from one tree shape, one hierarchy to another without losing any information. But sometimes the pieces of the hierarchy are annotated or associated with metadata, that is, with data about the data itself. The metadata can …


Using Big Data Analytics To Optimize Practical Large Databases, Po-Chun Lu Jul 2021

Using Big Data Analytics To Optimize Practical Large Databases, Po-Chun Lu

Master of Science in Computer Science Theses

Big data analytics is gaining popularity for enterprises in optimizing their business processes ranging from retailers, supply chains, to online shopping stores. Existing practical raw data are far from usable to achieve the goal. Therefore, a good data pre-processing approach is required and is a key step to success. We propose to research on the effectiveness of data pre-processing and the business process based on a real world database. Our methodology involves natural language processing. Our key goal is to study appropriate methods with big data analysis techniques that can handle errors, ambiguity, and repeated descriptions caused by human languages. …


Beyond Streamflow: Call For A National Data Repository Of Streamflow Presence For Streams And Rivers In The United States, Kendra E. Kaiser Jun 2021

Beyond Streamflow: Call For A National Data Repository Of Streamflow Presence For Streams And Rivers In The United States, Kendra E. Kaiser

Geosciences Faculty Publications and Presentations

Observations of the presence or absence of surface water in streams are useful for characterizing streamflow permanence, which includes the frequency, duration, and spatial extent of surface flow in streams and rivers. Such data are particularly valuable for headwater streams, which comprise the vast majority of channel length in stream networks, are often non-perennial, and are frequently the most data deficient. Datasets of surface water presence exist across multiple data collection groups in the United States but are not well aligned for easy integration. Given the value of these data, a unified approach for organizing information on surface water presence …


Accelerating Aggregation Efficiency: Using Postgres As A Cache With Mongodb, Mason Q. Porter-Brown Jan 2021

Accelerating Aggregation Efficiency: Using Postgres As A Cache With Mongodb, Mason Q. Porter-Brown

Senior Projects Fall 2021

For decades now relational databases, namely SQL, have been the industry standard. However, recently MongoDB and other NoSQL databases have been growing in popularity due to their flexibility and scalability. SQL still has the upper hand in a variety of areas, including data consistency, advanced and established analytics functions, and efficient “JOIN” functions. This project focuses on MongoDB’s shortcomings when it comes to replicating “JOIN” operations using MongoDB’s aggregate functions. “JOIN” operations refer to the action of comparing data from one or more collections of data and joining similar data together in order to analyze and draw statistics from the …


Football’S Future: An Analytical Interpretation Of The Premier League, Hunter Witeof Jan 2021

Football’S Future: An Analytical Interpretation Of The Premier League, Hunter Witeof

Williams Honors College, Honors Research Projects

This project looks to take the statistics of soccer players and run them through an algorithm to determine how well a player is performing. The system that will be designed in the project will look to accomplish 3 main goals: allow the user to enter new statistics, store the data for all 38 game weeks for all 20 teams, and compute a score for each player’s performance for each game as well as the average of all of the player's scores.


A Deductive Database For Knot Colourings, Dong Hyun Han Jan 2021

A Deductive Database For Knot Colourings, Dong Hyun Han

Senior Projects Spring 2021

This work constitutes progress toward the development of a knowledge base for braids, knots, and their colourings. The main result of this development is the creation of a logical model for storing data pertaining to braids, two-dimensional projections of three dimensional knots, finite quandles, and colorings of braids and knots by quandles. It uses the Entity Relationship data reference model as its starting point and makes the original design there. In addition, it includes a conversion of the Entity Relationship Diagram (ERD) to SQL queries that define tables corresponding to the ERD entity sets. Finally this work demonstrates how to …