Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 12 of 12

Full-Text Articles in Physical Sciences and Mathematics

Amazon Alexa + Linked Open Data: Theorizing Concerning Relationships Between (Surveillant) Smart-Home Voice Assistants And Linked Open Data, Michelle Nitto Sep 2019

Amazon Alexa + Linked Open Data: Theorizing Concerning Relationships Between (Surveillant) Smart-Home Voice Assistants And Linked Open Data, Michelle Nitto

Publications and Research

No abstract provided.


Semi-Supervised Regression With Generative Adversarial Networks Using Minimal Labeled Data, Greg Olmschenk Sep 2019

Semi-Supervised Regression With Generative Adversarial Networks Using Minimal Labeled Data, Greg Olmschenk

Dissertations, Theses, and Capstone Projects

This work studies the generalization of semi-supervised generative adversarial networks (GANs) to regression tasks. A novel feature layer contrasting optimization function, in conjunction with a feature matching optimization, allows the adversarial network to learn from unannotated data and thereby reduce the number of labels required to train a predictive network. An analysis of simulated training conditions is performed to explore the capabilities and limitations of the method. In concert with the semi-supervised regression GANs, an improved label topology and upsampling technique for multi-target regression tasks are shown to reduce data requirements. Improvements are demonstrated on a wide variety of vision …


Do It Like A Syntactician: Using Binary Gramaticality Judgements To Train Sentence Encoders And Assess Their Sensitivity To Syntactic Structure, Pablo Gonzalez Martinez Sep 2019

Do It Like A Syntactician: Using Binary Gramaticality Judgements To Train Sentence Encoders And Assess Their Sensitivity To Syntactic Structure, Pablo Gonzalez Martinez

Dissertations, Theses, and Capstone Projects

The binary nature of grammaticality judgments and their use to access the structure of syntax are a staple of modern linguistics. However, computational models of natural language rarely make use of grammaticality in their training or application. Furthermore, developments in modern neural NLP have produced a myriad of methods that push the baselines in many complex tasks, but those methods are typically not evaluated from a linguistic perspective. In this dissertation I use grammaticality judgements with artificially generated ungrammatical sentences to assess the performance of several neural encoders and propose them as a suitable training target to make models learn …


Going Big: A Large-Scale Study On What Big Data Developers Ask, Mehdi Bagherzadeh, Raffi T. Khatchadourian Aug 2019

Going Big: A Large-Scale Study On What Big Data Developers Ask, Mehdi Bagherzadeh, Raffi T. Khatchadourian

Publications and Research

Software developers are increasingly required to write big data code. However, they find big data software development challenging. To help these developers it is necessary to understand big data topics that they are interested in and the difficulty of finding answers for questions in these topics. In this work, we conduct a large-scale study on Stackoverflow to understand the interest and difficulties of big data developers. To conduct the study, we develop a set of big data tags to extract big data posts from Stackoverflow; use topic modeling to group these posts into big data topics; group similar topics into …


Module: Robot Senses, Mohammad Azhar May 2019

Module: Robot Senses, Mohammad Azhar

Open Educational Resources

Learning Objectives:

Students will be able to:

  • Describe the basics of Sensors

  • Learn how to program the LEGO Robot to make decision using touch sensors


Module: Robot Locomotion Mini Hackathon, Mohammad Azhar May 2019

Module: Robot Locomotion Mini Hackathon, Mohammad Azhar

Open Educational Resources

Learning Objectives:

Students will be able to:

  • Describe the basics of Robots.

  • Describe basic hardware and software of the LEGO Robot.

  • Write sequential code for LEGO Robot to move.


Online Learning And Planning For Crowd-Aware Service Robot Navigation, Anoop Aroor May 2019

Online Learning And Planning For Crowd-Aware Service Robot Navigation, Anoop Aroor

Dissertations, Theses, and Capstone Projects

Mobile service robots are increasingly used in indoor environments (e.g., shopping malls or museums) among large crowds of people. To efficiently navigate in these environments, such a robot should be able to exhibit a variety of behaviors. It should avoid crowded areas, and not oppose the flow of the crowd. It should be able to identify and avoid specific crowds that result in additional delays (e.g., children in a particular area might slow down the robot). and to seek out a crowd if its task requires it to interact with as many people as possible. These behaviors require the ability …


Cs04all: Machine Learning Module, Hunter R. Johnson Feb 2019

Cs04all: Machine Learning Module, Hunter R. Johnson

Open Educational Resources

These are materials that may be used in a CS0 course as a light introduction to machine learning.

The materials are mostly Jupyter notebooks which contain a combination of labwork and lecture notes. There are notebooks on Classification, An Introduction to Numpy, and An Introduction to Pandas.

There are also two assessments that could be assigned to students. One is an essay assignment in which students are asked to read and respond to an article on machine bias. The other is a lab-like exercise in which students use pandas and numpy to extract useful information about subway ridership in NYC. …


Cs04all: Natural Language Processing Project, Hunter R. Johnson Feb 2019

Cs04all: Natural Language Processing Project, Hunter R. Johnson

Open Educational Resources

In this archive there are two activities/assignments suitable for use in a CS0 or Intro course which uses Python.

In the first activity, students are asked to "fill in the code" in a series of short programs that compute a similarity metric (cosine similarity) for text documents. This involves string tokenization, and frequency counting using Python string methods and datatypes.

https://cocalc.com/share/bde99afd-76c8-493d-9608-db9019bcd346/171/Proj1?viewer=share/

In the second activity (taken directly from Think Python 2e) students use a pronunciation dictionary to solve a riddle involving homophones.

https://cocalc.com/share/bde99afd-76c8-493d-9608-db9019bcd346/171/Dicts2?viewer=share/

This OER material was produced as a result of the CS04ALL CUNY OER project


Culture Clubs: Processing Speech By Deriving And Exploiting Linguistic Subcultures, David Guy Brizan Feb 2019

Culture Clubs: Processing Speech By Deriving And Exploiting Linguistic Subcultures, David Guy Brizan

Dissertations, Theses, and Capstone Projects

Spoken language understanding systems are error-prone for several reasons, including individual speech variability. This is manifested in many ways, among which are differences in pronunciation, lexical inventory, grammar and disfluencies. There is, however, a lot of evidence pointing to stable language usage within subgroups of a language population. We call these subgroups linguistic subcultures.

The two broad problems are defined and a survey of the work in this space is performed. The two broad problems are: linguistic subculture detection, commonly performed via Language Identification, Accent Identification or Dialect Identification approaches; and speech and language processing tasks taken which may see …


Deep Learning Based Medical Image Analysis With Limited Data, Jiaxing Tan Feb 2019

Deep Learning Based Medical Image Analysis With Limited Data, Jiaxing Tan

Dissertations, Theses, and Capstone Projects

Deep Learning Methods have shown its great effort in the area of Computer Vision. However, when solving the problems of medical imaging, deep learning’s power is confined by limited data available. We present a series of novel methodologies for solving medical imaging analysis problems with limited Computed tomography (CT) scans available. Our method, based on deep learning, with different strategies, including using Generative Adversar- ial Networks, two-stage training, infusing the expert knowledge, voting based or converting to other space, solves the data set limitation issue for the cur- rent medical imaging problems, specifically cancer detection and diagnosis, and shows very …


Towards Improving Accuracy And Interpretability Of Deep Learning Based On Satellite Image Classification, Yamile Patino Vargas Jan 2019

Towards Improving Accuracy And Interpretability Of Deep Learning Based On Satellite Image Classification, Yamile Patino Vargas

Dissertations and Theses

ABSTRACT

The study of satellite images provides a way to monitor changes in the surface of the Earth and the atmosphere. Convolutional Neural Networks (CNN) have shown accurate results in solving practical problems in multiple fields. Some of the more recognized fields using CNNs are satellite imagery processing, medicine, communication, transportation, and computer vision. Despite the success of CNNs, there remains a need to explain the network predictions further and understand what the network is determining as valuable information.

There are several frameworks and methodologies developed to explain how CNNs predict outputs and what their internal representations are [1, 4, …