Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 17 of 17

Full-Text Articles in Physical Sciences and Mathematics

Messiness: Automating Iot Data Streaming Spatial Analysis, Christopher White, Atilio Barreda Ii Dec 2021

Messiness: Automating Iot Data Streaming Spatial Analysis, Christopher White, Atilio Barreda Ii

Publications and Research

The spaces we live in go through many transformations over the course of a year, a month, or a day; My room has seen tremendous clutter and pristine order within the span of a few hours. My goal is to discover patterns within my space and formulate an understanding of the changes that occur. This insight will provide actionable direction for maintaining a cleaner environment, as well as provide some information about the optimal times for productivity and energy preservation.

Using a Raspberry Pi, I will set up automated image capture in a room in my home. These images will …


Addressing The Learning Loss During The Covid-19 Pandemic Through The Adaptation Of Virtual Platforms, Nazrul I. Khandaker, Anika Nawar Mayeesha, Violeta Escandon Correa, Toralv Munro, Andrew Singh, Matthew Khargie, Ality Aghedo, Jasmin Budhan, Krishna Mahabir, Belal A. Sayeed Oct 2021

Addressing The Learning Loss During The Covid-19 Pandemic Through The Adaptation Of Virtual Platforms, Nazrul I. Khandaker, Anika Nawar Mayeesha, Violeta Escandon Correa, Toralv Munro, Andrew Singh, Matthew Khargie, Ality Aghedo, Jasmin Budhan, Krishna Mahabir, Belal A. Sayeed

Publications and Research

The York College-hosted NASA MAA (MUREP AEROSPACE ACADEMY) has always played a pivotal role in minimizing the learning loss during the summer months, which was heightened during the pandemic. Support from AT&T, Con Edison and NASA enabled the MAA program at York College to offer a virtual STEM education with an earth science concentration to 1000 plus underserved K1-12 students from the community last summer, including 160 high school students. Two factors made this endeavor fruitful: allowing additional time to engage in STEM lessons and increasing self-motivation to successfully accomplish assigned tasks. Students built partnerships and resolved technical issues with …


Leveraging The Popularity Of Virtual Conferencing Due To The Covid-19 Pandemic To Create New Opportunities For Stem Education, Andrew Singh, Nazrul I. Khandaker, Violeta Escandon Correa, Omadevi Singh, Ariel Skobelsky, Farhan Tanvir, Brian Sukhnandan, Matthew Khargie, Elton Selby, Masud Ahmed Oct 2021

Leveraging The Popularity Of Virtual Conferencing Due To The Covid-19 Pandemic To Create New Opportunities For Stem Education, Andrew Singh, Nazrul I. Khandaker, Violeta Escandon Correa, Omadevi Singh, Ariel Skobelsky, Farhan Tanvir, Brian Sukhnandan, Matthew Khargie, Elton Selby, Masud Ahmed

Publications and Research

Due to the COVID-19 pandemic, virtual learning has become a necessity for K9-16 education. Virtual classwork has been administered through platforms such as Google Classroom, Clever, and iReady. During the summer of 2021, the City University of New York (C.U.N.Y) York College campus hosted its NASA MAA MUREP (Minority University Research and Education Project Aerospace Academy) program virtually using a combination of Zoom, Google Docs, and even Canva, which some students requested as a more intuitive alternative to Microsoft PowerPoint. Students were mentored to use the scientific method to explore their interests in the STEM field, with a geoscience or …


Piecewise Linear Manifold Clustering, Artyom Diky Sep 2021

Piecewise Linear Manifold Clustering, Artyom Diky

Dissertations, Theses, and Capstone Projects

This work studies the application of topological analysis to non-linear manifold clustering. A novel method, that exploits the data clustering structure, allows to generate a topological representation of the point dataset. An analysis of topological construction under different simulated conditions is performed to explore the capabilities and limitations of the method, and demonstrated statistically significant improvements in performance. Furthermore, we introduce a new information-theoretical validation measure for clustering, that exploits geometrical properties of clusters to estimate clustering compressibility, for evaluation of the clustering goodness-of-fit without any prior information about true class assignments. We show how the new validation measure, when …


Making Space For Unquantifiable Data: Hand-Drawn Data Visualization, Eva Sibinga Sep 2021

Making Space For Unquantifiable Data: Hand-Drawn Data Visualization, Eva Sibinga

Dissertations, Theses, and Capstone Projects

This project makes space for personal “data” around labor and care, prompting users to consider the concrete and abstract (quantifiable and unquantifiable) forms labor and care take in their lives. The interactive, subjective data visualization uses hand-drawn visual elements to foreground that data about care and human interaction will always be ambiguous and complex, that they may never be satisfactorily or universally quantified, and that they will always be out of reach of perfect categorization.

The project provides an alternative to prescriptive truth-telling with data. Instead of using a dataset to provide data-driven answers and insights to users, the interactive …


Detecting Stance On Covid-19 Vaccine In A Polarized Media, Rodica Ceslov Sep 2021

Detecting Stance On Covid-19 Vaccine In A Polarized Media, Rodica Ceslov

Dissertations, Theses, and Capstone Projects

The growing polarization in the United States has been widely reported. Media coverage plays an important role in shaping public opinion and influences public debates on complex and unfamiliar topics. There are some benefits to individuals and society from political polarization and conflict between opposing viewpoints. However, recent research has primarily highlighted the negative consequences of polarization which reached an all-time high. One such topic is the Covid-19 vaccine which was developed in record time, and the public learned about its safety and possible risks through the media coverage.

In this capstone, we examine U.S. news media coverage on the …


Teaching Machine Learning For The Physical Sciences: A Summary Of Lessons Learned And Challenges, Viviana Acquaviva Aug 2021

Teaching Machine Learning For The Physical Sciences: A Summary Of Lessons Learned And Challenges, Viviana Acquaviva

Publications and Research

This paper summarizes some challenges encountered and best practices established in several years of teaching Machine Learning for the Physical Sciences at the undergraduate and graduate level. I discuss motivations for teaching ML to physicists, desirable properties of pedagogical materials, such as accessibility, relevance, and likeness to real-world research problems, and give examples of components of teaching units.


Content Analysis Of Two-Year And Four-Year Data Science Programs In The United States, Elizabeth Milonas, Duo Li, Qiping Zhang Jul 2021

Content Analysis Of Two-Year And Four-Year Data Science Programs In The United States, Elizabeth Milonas, Duo Li, Qiping Zhang

Publications and Research

Data has grown exponentially in the last decade, and this growth has resulted in vast challenges for both business and IT domains (Hassan & Liu, 2019). This growth has given rise to the Data Science field, which has also grown exponentially in the last few years (Hassan & Liu, 2019; Song & Zhu, 2016). The Data Science field has its origins in the statistics and mathematics domain (Cao, 2017b), but is now considered a multidisciplinary field (Aasheim et al., 2015). Data Science warrants knowledge of data analytics, programming, systems, applications, informatics, computing, communication, management, and sociology (Aasheim et al., 2015; …


Using Data Science To Create An Impact On A City Life And To Encourage Students From Underserved Communities To Get Into Stem, Elena Filatova, Deborah Hecht Jul 2021

Using Data Science To Create An Impact On A City Life And To Encourage Students From Underserved Communities To Get Into Stem, Elena Filatova, Deborah Hecht

Publications and Research

In this paper, we introduce a novel methodology for teaching Data Science. Our methodology relies on the outlook of the student body in our college. Our college is an urban, commuter, HSI (Hispanic Serving Institution) school with 34% Hispanic and 29% Black students. 61% of our students come from households with an income of less than $30,000+. Thus, many students in our college come from the communities that are underrepresented in the STEM fields and in the decision-making positions in the government (on the city level, state level, country level). However, in our methodology, we want to flip the situation …


Learn Biologically Meaningful Representation With Transfer Learning, Di He Jun 2021

Learn Biologically Meaningful Representation With Transfer Learning, Di He

Dissertations, Theses, and Capstone Projects

Machine learning has made significant contributions to bioinformatics and computational biol­ogy. In particular, supervised learning approaches have been widely used in solving problems such as bio­marker identification, drug response prediction, and so on. However, because of the limited availability of comprehensively labeled and clean data, constructing predictive models in super­ vised settings is not always desirable or possible, especially when using data­hunger, red­hot learning paradigms such as deep learning methods. Hence, there are urgent needs to develop new approaches that could leverage more readily available unlabeled data in driving successful machine learning ap­ plications in this area.

In my dissertation, …


An Empirical Study Of Refactorings And Technical Debt In Machine Learning Systems, Yiming Tang, Raffi T. Khatchadourian, Mehdi Bagherzadeh, Rhia Singh, Ajani Stewart, Anita Raja May 2021

An Empirical Study Of Refactorings And Technical Debt In Machine Learning Systems, Yiming Tang, Raffi T. Khatchadourian, Mehdi Bagherzadeh, Rhia Singh, Ajani Stewart, Anita Raja

Publications and Research

Machine Learning (ML), including Deep Learning (DL), systems, i.e., those with ML capabilities, are pervasive in today’s data-driven society. Such systems are complex; they are comprised of ML models and many subsystems that support learning processes. As with other complex systems, ML systems are prone to classic technical debt issues, especially when such systems are long-lived, but they also exhibit debt specific to these systems. Unfortunately, there is a gap of knowledge in how ML systems actually evolve and are maintained. In this paper, we fill this gap by studying refactorings, i.e., source-to-source semantics-preserving program transformations, performed in real-world, open-source …


Estimation Of The Planetary Boundary Layer Height: Part 1: Global Radar Wind Profiler Network Data; Part 2: A Comparison To Ceilometer Data, Holly Josephs May 2021

Estimation Of The Planetary Boundary Layer Height: Part 1: Global Radar Wind Profiler Network Data; Part 2: A Comparison To Ceilometer Data, Holly Josephs

Theses and Dissertations

Two methods for estimating the planetary boundary layer, an algorithm to identify a maximum in the backscatter and a covariance wavelet transform method, are explored and applied to global radar wind profiler network data and ceilometer data respectively. The objective of the study is to establish that the data sources and algorithms can be used to estimate planetary boundary layer heights so that global studies can make use of these estimates. Data from the global network of wind profilers required significant restructuring and quality control in order to be used for the present study. The maximum backscatter identification algorithm was …


Application Of Randomness In Finance, Jose Sanchez, Daanial Ahmad, Satyanand Singh May 2021

Application Of Randomness In Finance, Jose Sanchez, Daanial Ahmad, Satyanand Singh

Publications and Research

Brownian Motion which is also considered to be a Wiener process and can be thought of as a random walk. In our project we had briefly discussed the fluctuations of financial indices and related it to Brownian Motion and the modeling of Stock prices.


Discovering Kepler’S Third Law From Planetary Data, Boyan Kostadinov, Satyanand Singh May 2021

Discovering Kepler’S Third Law From Planetary Data, Boyan Kostadinov, Satyanand Singh

Publications and Research

In this data-inspired project, we illustrate how Kepler’s Third Law of Planetary Motion can be discovered from fitting a power model to real planetary data obtained from NASA, using regression modeling. The power model can be linearized, thus we can use linear regression to fit the model parameters to the data, but we also show how a non-linear regression can be implemented, using the R programming language. Our work also illustrates how the linear least squares used for fitting the power model can be implemented in Desmos, which could serve as the computational foundation for this project at a lower …


A New Feature Selection Method Based On Class Association Rule, Sami A. Al-Dhaheri Feb 2021

A New Feature Selection Method Based On Class Association Rule, Sami A. Al-Dhaheri

Dissertations, Theses, and Capstone Projects

Feature selection is a key process for supervised learning algorithms. It involves discarding irrelevant attributes from the training dataset from which the models are derived. One of the vital feature selection approaches is Filtering, which often uses mathematical models to compute the relevance for each feature in the training dataset and then sorts the features into descending order based on their computed scores. However, most Filtering methods face several challenges including, but not limited to, merely considering feature-class correlation when defining a feature’s relevance; additionally, not recommending which subset of features to retain. Leaving this decision to the end-user may …


Public Interest Technology – Exploring Covid-19 Health Data, Sarah Zelikovitz Jan 2021

Public Interest Technology – Exploring Covid-19 Health Data, Sarah Zelikovitz

Open Educational Resources

This module is part of a Introduction to Data Science course that covers the different parts of the data science process: data acquisition, cleaning, exploratory data analysis, and modeling. The COVID-19 pandemic has created much interest in public health data, as well as interest in visualization of all types of data. Public health data has a set of challenges that is unique to health data, with HIPAA laws, and real time collection of data. With COVID-19, the challenges are particularly amplified, as data collection and statistics collected are constantly changing in response to feedback from labs, hospitals, drug companies, and …


Goes-R Supervised Machine Learning, Ronald Adomako Jan 2021

Goes-R Supervised Machine Learning, Ronald Adomako

Dissertations and Theses

The GOES-R series is a product line of four satellite, with two currently on-orbit (GOES-16 “East” and GOES-17 “West”). GOES-17 is susceptible to a Loop-Heat-Pipe (LHP) phenomenon where during Fall and Spring seasons, there are times of day where some of the infrared bands records inaccurate readings from the Advanced Baseline Imager (ABI). This occurs from joint astronomical behavior and position of the GOES-17. This calibration issue occurs when the LHP instrument fails to radiate the heat of the sun out of ABI. Predictive Calibration (pCal) is an algorithm developed by instrument vendors for the National Oceanic Atmospheric Agency (NOAA) …