Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 31 - 60 of 93

Full-Text Articles in Physical Sciences and Mathematics

Solving Multiple Inference In Graphical Models, Cong Chen Sep 2021

Solving Multiple Inference In Graphical Models, Cong Chen

Dissertations, Theses, and Capstone Projects

For inference problems in graphical models, much effort has been directed at algorithms for obtaining one single optimal prediction. In practice, the data is often noisy or incomplete, which makes one single optimal solution unreliable. To address this problem, multiple Inference is proposed to find several best solutions, M-Best, where multiple hypotheses are preferred for advanced reasoning. People use oracle accuracy as an evaluation criterion expecting one of the solutions has high accuracy with the ground truth. It has been shown that it is beneficial for the top solutions to be diverse. Approaches for solving diverse multiple inference are proposed …


Adversarial Training For Skill Learning In A Mobile Robot, Todd W. Flyr Sep 2021

Adversarial Training For Skill Learning In A Mobile Robot, Todd W. Flyr

Dissertations, Theses, and Capstone Projects

Machine Learning in mobile robotics is sometimes hampered by the difficulties associated with the creation of a large corpus of labeled data that most neural network based learning algorithms demand. In recent years, advances in the field of machine learning have been facilitated via the creation of large collaboratively-created labeled training datasets that researchers can use as the basis for experiments to validate and improve their candidate neural network architectures. For the field of robotics, however, tasks are so disparate and the physical devices so varied that in most cases the creation of collaborative benchmark datasets are impractical. Obtaining data …


Teaching Machine Learning For The Physical Sciences: A Summary Of Lessons Learned And Challenges, Viviana Acquaviva Aug 2021

Teaching Machine Learning For The Physical Sciences: A Summary Of Lessons Learned And Challenges, Viviana Acquaviva

Publications and Research

This paper summarizes some challenges encountered and best practices established in several years of teaching Machine Learning for the Physical Sciences at the undergraduate and graduate level. I discuss motivations for teaching ML to physicists, desirable properties of pedagogical materials, such as accessibility, relevance, and likeness to real-world research problems, and give examples of components of teaching units.


Learn Biologically Meaningful Representation With Transfer Learning, Di He Jun 2021

Learn Biologically Meaningful Representation With Transfer Learning, Di He

Dissertations, Theses, and Capstone Projects

Machine learning has made significant contributions to bioinformatics and computational biol­ogy. In particular, supervised learning approaches have been widely used in solving problems such as bio­marker identification, drug response prediction, and so on. However, because of the limited availability of comprehensively labeled and clean data, constructing predictive models in super­ vised settings is not always desirable or possible, especially when using data­hunger, red­hot learning paradigms such as deep learning methods. Hence, there are urgent needs to develop new approaches that could leverage more readily available unlabeled data in driving successful machine learning ap­ plications in this area.

In my dissertation, …


Metareasoning, Opportunistic Exploration, And Explanations For Autonomous Indoor Navigation, Raj Korpan Jun 2021

Metareasoning, Opportunistic Exploration, And Explanations For Autonomous Indoor Navigation, Raj Korpan

Dissertations, Theses, and Capstone Projects

Autonomous indoor navigation is an important task for mobile robots deployed without a map in real-world environments, such as museums or offices. While it travels, an autonomous robot navigator must contend with lack of prior knowledge, sensor noise, actuator error, and inquisitive people. This dissertation addresses these challenges with a cognitively-based hierarchical reasoning architecture that incorporates learning, exploration, reactivity, planning, heuristics, and explanations. Evaluation by simulation in large, complex, indoor environments shows that a robot controller can successfully navigate without a detailed map of every obstruction's location when it performs limited initial global exploration and plans in its learned spatial …


Deep Learning Predicts Chromosomal Instability From Histopathology Images, Zhuoran Xu, Akanksha Verma, Uska Naveed, Samuel F. Bakhoum, Pegah Khosravi, Olivier Elemento May 2021

Deep Learning Predicts Chromosomal Instability From Histopathology Images, Zhuoran Xu, Akanksha Verma, Uska Naveed, Samuel F. Bakhoum, Pegah Khosravi, Olivier Elemento

Publications and Research

Chromosomal instability (CIN) is a hallmark of human cancer yet not readily testable for patients with cancer in routine clinical setting. In this study, we sought to explore whether CIN status can be predicted using ubiquitously available hematoxylin and eosin histology through a deep learning-based model. When applied to a cohort of 1,010 patients with breast cancer (Training set: n = 858, Test set: n = 152) from The Cancer Genome Atlas where 485 patients have high CIN status, our model accurately classified CIN status, achieving an area under the curve of 0.822 with 81.2% sensitivity and 68.7% specificity in …


Survey On Quantum Circuit Compilation For Noisy Intermediate-Scale Quantum Computers: Artificial Intelligence To Heuristics, Janusz Kusyk, Samah Mohamed Saeed, Muharrem Umit Uyar Mar 2021

Survey On Quantum Circuit Compilation For Noisy Intermediate-Scale Quantum Computers: Artificial Intelligence To Heuristics, Janusz Kusyk, Samah Mohamed Saeed, Muharrem Umit Uyar

Publications and Research

Computationally expensive applications, including machine learning, chemical simulations, and financial modeling, are promising candidates for noisy intermediate scale quantum (NISQ) computers. In these problems, one important challenge is mapping a quantum circuit onto NISQ hardware while satisfying physical constraints of an underlying quantum architecture. Quantum circuit compilation (QCC) aims to generate feasible mappings such that a quantum circuit can be executed in a given hardware platform with acceptable confidence in outcomes. Physical constraints of a NISQ computer change frequently, requiring QCC process to be repeated often. When a circuit cannot directly be executed on a quantum hardware due to its …


A Deep Learning Approach To Diagnostic Classification Of Prostate Cancer Using Pathology–Radiology Fusion, Pegah Khosravi, Maria Lysandrou, Mahmoud Eljalby, Qianzi Li, Ehsan Kazemi, Pantelis Zisimopoulos, Alexandros Sigaras, Matthew Brendel, Josue Barnes, Camir Ricketts, Dmitry Meleshko, Andy Yat, Timothy D. Mcclure, Brian D. Robinson, Andrea Sboner, Olivier Elemento, Bilal Chughtai, Iman Hajirasouliha Mar 2021

A Deep Learning Approach To Diagnostic Classification Of Prostate Cancer Using Pathology–Radiology Fusion, Pegah Khosravi, Maria Lysandrou, Mahmoud Eljalby, Qianzi Li, Ehsan Kazemi, Pantelis Zisimopoulos, Alexandros Sigaras, Matthew Brendel, Josue Barnes, Camir Ricketts, Dmitry Meleshko, Andy Yat, Timothy D. Mcclure, Brian D. Robinson, Andrea Sboner, Olivier Elemento, Bilal Chughtai, Iman Hajirasouliha

Publications and Research

Background

A definitive diagnosis of prostate cancer requires a biopsy to obtain tissue for pathologic analysis, but this is an invasive procedure and is associated with complications.

Purpose

To develop an artificial intelligence (AI)-based model (named AI-biopsy) for the early diagnosis of prostate cancer using magnetic resonance (MR) images labeled with histopathology information.

Study Type

Retrospective.

Population

Magnetic resonance imaging (MRI) data sets from 400 patients with suspected prostate cancer and with histological data (228 acquired in-house and 172 from external publicly available databases).

Field Strength/Sequence

1.5 to 3.0 Tesla, T2-weighted image pulse sequences.

Assessment

MR images reviewed and selected …


3d Object Detection, Instance Segmentation And Classification From 3d Range And 2d Color Images, Xiaoke Shen Feb 2021

3d Object Detection, Instance Segmentation And Classification From 3d Range And 2d Color Images, Xiaoke Shen

Dissertations, Theses, and Capstone Projects

We address the problem of 3D object detection and instance segmentation by proposing a novel object segmentation and detection system. First, we detect 2D objects based on RGB, Depth only, or RGB-D images. A 3D convolutional-based system, named Frustum VoxNet, is proposed. This system 1) generates frustums from 2D detection results, 2) proposes 3D candidate voxelized images for each frustum, and uses a 3D convolutional neural network (CNN) based on these candidates voxelized images to perform the 3D instance segmentation and object detection. Although the volumetric data representation is widely used for 3D object classification, there are fewer works on …


A New Feature Selection Method Based On Class Association Rule, Sami A. Al-Dhaheri Feb 2021

A New Feature Selection Method Based On Class Association Rule, Sami A. Al-Dhaheri

Dissertations, Theses, and Capstone Projects

Feature selection is a key process for supervised learning algorithms. It involves discarding irrelevant attributes from the training dataset from which the models are derived. One of the vital feature selection approaches is Filtering, which often uses mathematical models to compute the relevance for each feature in the training dataset and then sorts the features into descending order based on their computed scores. However, most Filtering methods face several challenges including, but not limited to, merely considering feature-class correlation when defining a feature’s relevance; additionally, not recommending which subset of features to retain. Leaving this decision to the end-user may …


Speech Enhancement Using Speech Synthesis Techniques, Soumi Maiti Feb 2021

Speech Enhancement Using Speech Synthesis Techniques, Soumi Maiti

Dissertations, Theses, and Capstone Projects

Traditional speech enhancement systems reduce noise by modifying the noisy signal to make it more like a clean signal, which suffers from two problems: under-suppression of noise and over-suppression of speech. These problems create distortions in enhanced speech and hurt the quality of the enhanced signal. We propose to utilize speech synthesis techniques for a higher quality speech enhancement system. Synthesizing clean speech based on the noisy signal could produce outputs that are both noise-free and high quality. We first show that we can replace the noisy speech with its clean resynthesis from a previously recorded clean speech dictionary from …


A Semi-Automated Approach To Medical Image Segmentation Using Conditional Random Field Inference, Yu-Chi Hu Sep 2020

A Semi-Automated Approach To Medical Image Segmentation Using Conditional Random Field Inference, Yu-Chi Hu

Dissertations, Theses, and Capstone Projects

Medical image segmentation plays a crucial role in delivering effective patient care in various diagnostic and treatment modalities. Manual delineation of target volumes and all critical structures is a very tedious and highly time-consuming process and introduce uncertainties of treatment outcomes of patients. Fully automatic methods holds great promise for reducing cost and time, while at the same time improving accuracy and eliminating expert variability, yet there are still great challenges. Legally and ethically, human oversight must be integrated with ”smart tools” favoring a semi-automatic technique which can leverage the best aspects of both human and computer.

In this work …


Machine Learning Applications For Drug Repurposing, Hansaim Lim Sep 2020

Machine Learning Applications For Drug Repurposing, Hansaim Lim

Dissertations, Theses, and Capstone Projects

The cost of bringing a drug to market is astounding and the failure rate is intimidating. Drug discovery has been of limited success under the conventional reductionist model of one-drug-one-gene-one-disease paradigm, where a single disease-associated gene is identified and a molecular binder to the specific target is subsequently designed. Under the simplistic paradigm of drug discovery, a drug molecule is assumed to interact only with the intended on-target. However, small molecular drugs often interact with multiple targets, and those off-target interactions are not considered under the conventional paradigm. As a result, drug-induced side effects and adverse reactions are often neglected …


Topics In Artifical Intelligence, Hunter Mcnichols, Nyc Tech-In-Residence Corps Apr 2020

Topics In Artifical Intelligence, Hunter Mcnichols, Nyc Tech-In-Residence Corps

Open Educational Resources

Syllabus for the course "CSC 59974: Special Topics in Artificial Intelligence" delivered at the City College of New York in Spring 2020 by Hunter McNichols as part of the Tech-in-Residence Corps program.


Does Applying Deep Learning In Financial Sentiment Analysis Lead To Better Classification Performance?, Tao Wang, Changhe Yuan, Cuiyuan Wang Apr 2020

Does Applying Deep Learning In Financial Sentiment Analysis Lead To Better Classification Performance?, Tao Wang, Changhe Yuan, Cuiyuan Wang

Publications and Research

Using a unique data set from Seeking Alpha, we compare the deep learning approach with traditional machine learning approaches in classifying financial text. We apply the long short-term memory (LSTM) as the deep learning method and Naive Bayes, SVM, Logistic Regression, XGBoost as the traditional machine learning approaches. The results suggest that the LSTM model outperforms the conventional machine learning methods on all metrics. Based on the tSNE graph, the success of the LSTM model is partially explained as the high-accuracy LSTM model distinguishes between positive and negative important sentiment words while those words are chosen based on SHAP values …


Artificial Intelligence: A New Paradigm In Obstetrics And Gynecology Research And Clinical Practice, Pulwasha Iftikhar, Marcela V. Kuijpers, Azadeh Khayyat, Aqsa Iftikhar, Maribel Degouvia De Sa Feb 2020

Artificial Intelligence: A New Paradigm In Obstetrics And Gynecology Research And Clinical Practice, Pulwasha Iftikhar, Marcela V. Kuijpers, Azadeh Khayyat, Aqsa Iftikhar, Maribel Degouvia De Sa

Publications and Research

Artificial intelligence (AI) is growing exponentially in various fields, including medicine. This paper reviews the pertinent aspects of AI in obstetrics and gynecology (OB/GYN) and how these can be applied to improve patient outcomes and reduce the healthcare costs and workload for clinicians.

Herein, we will address current AI uses in OB/GYN, and the use of AI as a tool to interpret fetal heart rate (FHR) and cardiotocography (CTG) to aid in the detection of preterm labor, pregnancy complications, and review discrepancies in its interpretation between clinicians to reduce maternal and infant morbidity and mortality. AI systems can be used …


Robust Neural Machine Translation, Abdul Rafae Khan Feb 2020

Robust Neural Machine Translation, Abdul Rafae Khan

Dissertations, Theses, and Capstone Projects

This thesis aims for general robust Neural Machine Translation (NMT) that is agnostic to the test domain. NMT has achieved high quality on benchmarks with closed datasets such as WMT and NIST but can fail when the translation input contains noise due to, for example, mismatched domains or spelling errors. The standard solution is to apply domain adaptation or data augmentation to build a domain-dependent system. However, in real life, the input noise varies in a wide range of domains and types, which is unknown in the training phase. This thesis introduces five general approaches to improve NMT accuracy and …


Amazon Alexa + Linked Open Data: Theorizing Concerning Relationships Between (Surveillant) Smart-Home Voice Assistants And Linked Open Data, Michelle Nitto Sep 2019

Amazon Alexa + Linked Open Data: Theorizing Concerning Relationships Between (Surveillant) Smart-Home Voice Assistants And Linked Open Data, Michelle Nitto

Publications and Research

No abstract provided.


Do It Like A Syntactician: Using Binary Gramaticality Judgements To Train Sentence Encoders And Assess Their Sensitivity To Syntactic Structure, Pablo Gonzalez Martinez Sep 2019

Do It Like A Syntactician: Using Binary Gramaticality Judgements To Train Sentence Encoders And Assess Their Sensitivity To Syntactic Structure, Pablo Gonzalez Martinez

Dissertations, Theses, and Capstone Projects

The binary nature of grammaticality judgments and their use to access the structure of syntax are a staple of modern linguistics. However, computational models of natural language rarely make use of grammaticality in their training or application. Furthermore, developments in modern neural NLP have produced a myriad of methods that push the baselines in many complex tasks, but those methods are typically not evaluated from a linguistic perspective. In this dissertation I use grammaticality judgements with artificially generated ungrammatical sentences to assess the performance of several neural encoders and propose them as a suitable training target to make models learn …


Semi-Supervised Regression With Generative Adversarial Networks Using Minimal Labeled Data, Greg Olmschenk Sep 2019

Semi-Supervised Regression With Generative Adversarial Networks Using Minimal Labeled Data, Greg Olmschenk

Dissertations, Theses, and Capstone Projects

This work studies the generalization of semi-supervised generative adversarial networks (GANs) to regression tasks. A novel feature layer contrasting optimization function, in conjunction with a feature matching optimization, allows the adversarial network to learn from unannotated data and thereby reduce the number of labels required to train a predictive network. An analysis of simulated training conditions is performed to explore the capabilities and limitations of the method. In concert with the semi-supervised regression GANs, an improved label topology and upsampling technique for multi-target regression tasks are shown to reduce data requirements. Improvements are demonstrated on a wide variety of vision …


Going Big: A Large-Scale Study On What Big Data Developers Ask, Mehdi Bagherzadeh, Raffi T. Khatchadourian Aug 2019

Going Big: A Large-Scale Study On What Big Data Developers Ask, Mehdi Bagherzadeh, Raffi T. Khatchadourian

Publications and Research

Software developers are increasingly required to write big data code. However, they find big data software development challenging. To help these developers it is necessary to understand big data topics that they are interested in and the difficulty of finding answers for questions in these topics. In this work, we conduct a large-scale study on Stackoverflow to understand the interest and difficulties of big data developers. To conduct the study, we develop a set of big data tags to extract big data posts from Stackoverflow; use topic modeling to group these posts into big data topics; group similar topics into …


Module: Robot Senses, Mohammad Azhar May 2019

Module: Robot Senses, Mohammad Azhar

Open Educational Resources

Learning Objectives:

Students will be able to:

  • Describe the basics of Sensors

  • Learn how to program the LEGO Robot to make decision using touch sensors


Module: Robot Locomotion Mini Hackathon, Mohammad Azhar May 2019

Module: Robot Locomotion Mini Hackathon, Mohammad Azhar

Open Educational Resources

Learning Objectives:

Students will be able to:

  • Describe the basics of Robots.

  • Describe basic hardware and software of the LEGO Robot.

  • Write sequential code for LEGO Robot to move.


Online Learning And Planning For Crowd-Aware Service Robot Navigation, Anoop Aroor May 2019

Online Learning And Planning For Crowd-Aware Service Robot Navigation, Anoop Aroor

Dissertations, Theses, and Capstone Projects

Mobile service robots are increasingly used in indoor environments (e.g., shopping malls or museums) among large crowds of people. To efficiently navigate in these environments, such a robot should be able to exhibit a variety of behaviors. It should avoid crowded areas, and not oppose the flow of the crowd. It should be able to identify and avoid specific crowds that result in additional delays (e.g., children in a particular area might slow down the robot). and to seek out a crowd if its task requires it to interact with as many people as possible. These behaviors require the ability …


Deep Learning Enables Robust Assessment And Selection Of Human Blastocysts After In Vitro Fertilization, Pegah Khosravi, Ehsan Kazemi, Qiansheng Zhan, Jonas E. Malmsten, Marco Toschi, Pantelis Zisimopoulos, Alexandros Sigaras, Stuart Lavery, Lee A. D. Cooper, Cristina Hickman, Marcos Meseguer, Zev Rosenwaks, Olivier Elemento, Nikica Zaninovic, Iman Hajirasouliha Apr 2019

Deep Learning Enables Robust Assessment And Selection Of Human Blastocysts After In Vitro Fertilization, Pegah Khosravi, Ehsan Kazemi, Qiansheng Zhan, Jonas E. Malmsten, Marco Toschi, Pantelis Zisimopoulos, Alexandros Sigaras, Stuart Lavery, Lee A. D. Cooper, Cristina Hickman, Marcos Meseguer, Zev Rosenwaks, Olivier Elemento, Nikica Zaninovic, Iman Hajirasouliha

Publications and Research

Visual morphology assessment is routinely used for evaluating of embryo quality and selecting human blastocysts for transfer after in vitro fertilization (IVF). However, the assessment produces different results between embryologists and as a result, the success rate of IVF remains low. To overcome uncertainties in embryo quality, multiple embryos are often implanted resulting in undesired multiple pregnancies and complications. Unlike in other imaging fields, human embryology and IVF have not yet leveraged artificial intelligence (AI) for unbiased, automated embryo assessment. We postulated that an AI approach trained on thousands of embryos can reliably predict embryo quality without human intervention. We …


Cs04all: Machine Learning Module, Hunter R. Johnson Feb 2019

Cs04all: Machine Learning Module, Hunter R. Johnson

Open Educational Resources

These are materials that may be used in a CS0 course as a light introduction to machine learning.

The materials are mostly Jupyter notebooks which contain a combination of labwork and lecture notes. There are notebooks on Classification, An Introduction to Numpy, and An Introduction to Pandas.

There are also two assessments that could be assigned to students. One is an essay assignment in which students are asked to read and respond to an article on machine bias. The other is a lab-like exercise in which students use pandas and numpy to extract useful information about subway ridership in NYC. …


Cs04all: Natural Language Processing Project, Hunter R. Johnson Feb 2019

Cs04all: Natural Language Processing Project, Hunter R. Johnson

Open Educational Resources

In this archive there are two activities/assignments suitable for use in a CS0 or Intro course which uses Python.

In the first activity, students are asked to "fill in the code" in a series of short programs that compute a similarity metric (cosine similarity) for text documents. This involves string tokenization, and frequency counting using Python string methods and datatypes.

https://cocalc.com/share/bde99afd-76c8-493d-9608-db9019bcd346/171/Proj1?viewer=share/

In the second activity (taken directly from Think Python 2e) students use a pronunciation dictionary to solve a riddle involving homophones.

https://cocalc.com/share/bde99afd-76c8-493d-9608-db9019bcd346/171/Dicts2?viewer=share/

This OER material was produced as a result of the CS04ALL CUNY OER project


Culture Clubs: Processing Speech By Deriving And Exploiting Linguistic Subcultures, David Guy Brizan Feb 2019

Culture Clubs: Processing Speech By Deriving And Exploiting Linguistic Subcultures, David Guy Brizan

Dissertations, Theses, and Capstone Projects

Spoken language understanding systems are error-prone for several reasons, including individual speech variability. This is manifested in many ways, among which are differences in pronunciation, lexical inventory, grammar and disfluencies. There is, however, a lot of evidence pointing to stable language usage within subgroups of a language population. We call these subgroups linguistic subcultures.

The two broad problems are defined and a survey of the work in this space is performed. The two broad problems are: linguistic subculture detection, commonly performed via Language Identification, Accent Identification or Dialect Identification approaches; and speech and language processing tasks taken which may see …


Deep Learning Based Medical Image Analysis With Limited Data, Jiaxing Tan Feb 2019

Deep Learning Based Medical Image Analysis With Limited Data, Jiaxing Tan

Dissertations, Theses, and Capstone Projects

Deep Learning Methods have shown its great effort in the area of Computer Vision. However, when solving the problems of medical imaging, deep learning’s power is confined by limited data available. We present a series of novel methodologies for solving medical imaging analysis problems with limited Computed tomography (CT) scans available. Our method, based on deep learning, with different strategies, including using Generative Adversar- ial Networks, two-stage training, infusing the expert knowledge, voting based or converting to other space, solves the data set limitation issue for the cur- rent medical imaging problems, specifically cancer detection and diagnosis, and shows very …


Towards Improving Accuracy And Interpretability Of Deep Learning Based On Satellite Image Classification, Yamile Patino Vargas Jan 2019

Towards Improving Accuracy And Interpretability Of Deep Learning Based On Satellite Image Classification, Yamile Patino Vargas

Dissertations and Theses

ABSTRACT

The study of satellite images provides a way to monitor changes in the surface of the Earth and the atmosphere. Convolutional Neural Networks (CNN) have shown accurate results in solving practical problems in multiple fields. Some of the more recognized fields using CNNs are satellite imagery processing, medicine, communication, transportation, and computer vision. Despite the success of CNNs, there remains a need to explain the network predictions further and understand what the network is determining as valuable information.

There are several frameworks and methodologies developed to explain how CNNs predict outputs and what their internal representations are [1, 4, …