Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Articles 1 - 30 of 34
Full-Text Articles in Physical Sciences and Mathematics
Information Extraction From Biomedical Text Using Machine Learning, Deepti Garg
Master's Projects
Inadequate drug experimental data and the use of unlicensed drugs may cause adverse drug reactions, especially in pediatric populations. Every year the U.S. Food and Drug Administration approves human prescription drugs for marketing. The labels associated with these drugs include information about clinical trials and drug response in pediatric populations. In order for doctors to make an informed decision about the safety and effectiveness of these drugs for children, there is a need to analyze complex and often unstructured drug labels. In this work, first, an exploratory analysis of drug labels using a Natural Language Processing pipeline is performed. Second, …
Ordinal Hyperplane Loss, Bob Vanderheyden
Doctor of Data Science and Analytics Dissertations
This research presents the development of a new framework for analyzing ordered class data, commonly called “ordinal class” data. The focus of the work is the development of classifiers (predictive models) that predict classes from available data. Rating scales, medical classification scales, socio-economic scales, meaningful groupings of continuous data, facial emotional intensity and facial age estimation are examples of ordinal data for which data scientists may be asked to develop predictive classifiers. It is possible to treat ordinal classification like any other classification problem that has more than two classes. Specifying a model with this strategy does not fully utilize …
Finding A Viable Neural Network Architecture For Use With Upper Limb Prosthetics, Maxwell Lavin
Master of Science in Computer Science Theses
This paper attempts to answer the question of whether it is possible to produce a simple, quick, and accurate neural network for use in upper-limb prosthetics. Through the implementation of convolutional and artificial neural networks and feature extraction on electromyographic data, different possible architectures are examined with regard to processing time, complexity, and accuracy. It is found that the most accurate architecture is a multi-entry categorical cross entropy convolutional neural network with 100% accuracy. The issue is that it is also the slowest method, requiring 9 minutes to run. The next best method found was a single-entry binary cross entropy …
Classifying Fiction And Non-Fiction Works Using Machine Learning, Rachna Gupta '21
Student Publications & Research
The objective of this project was to create a program that can determine whether an unknown text is a work of fiction or non-fiction using machine learning. Various datasets of speeches, ebooks, poems, scientific papers, and texts from Project Gutenberg and the Wolfram Example Data were utilized to train and test a Markov Chain machine learning model. A microsite was deployed with the final product that returns a probability of fictionality based on input from the user with 95% accuracy.
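The abstract above does not say how its Markov chain model is built. As a rough illustration only, one common way to classify text with Markov chains is to fit a word-transition table per class and compare the likelihood each table assigns to a new text. The function names, the add-one smoothing, the equal class priors, and the toy sentences below are all assumptions made for this sketch, not details from the project.

```python
import math
from collections import Counter, defaultdict


def train_markov(texts):
    """Count word-to-word transitions over a list of training texts."""
    transitions = defaultdict(Counter)
    for text in texts:
        words = text.lower().split()
        for prev, nxt in zip(words, words[1:]):
            transitions[prev][nxt] += 1
    return transitions


def log_likelihood(model, text, vocab_size):
    """Log-probability of the text's transitions, with add-one smoothing."""
    words = text.lower().split()
    total = 0.0
    for prev, nxt in zip(words, words[1:]):
        row = model.get(prev, {})
        row_total = sum(row.values())
        total += math.log((row.get(nxt, 0) + 1) / (row_total + vocab_size))
    return total


def fiction_probability(fiction_model, nonfiction_model, text, vocab_size):
    """Posterior probability of 'fiction', assuming equal class priors."""
    lf = log_likelihood(fiction_model, text, vocab_size)
    ln = log_likelihood(nonfiction_model, text, vocab_size)
    # Normalize the two likelihoods in a numerically stable way.
    m = max(lf, ln)
    pf, pn = math.exp(lf - m), math.exp(ln - m)
    return pf / (pf + pn)


# Hypothetical toy corpora, far smaller than any real training set.
fiction = ["the dragon flew over the dark castle",
           "the knight fought the dragon"]
nonfiction = ["the experiment measured the reaction rate",
              "the results confirm the hypothesis"]
vocab = {w for t in fiction + nonfiction for w in t.lower().split()}

f_model = train_markov(fiction)
n_model = train_markov(nonfiction)
p = fiction_probability(f_model, n_model,
                        "the dragon fought the knight", len(vocab))
```

With these toy corpora, `p` comes out above 0.5 for the sample sentence, because its word transitions are more frequent in the fiction table.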
Automatic Inference Of Causal Reasoning Chains From Student Essays, Simon Mark Hughes
College of Computing and Digital Media Dissertations
While there has been an increasing focus on higher-level thinking skills arising from the Common Core Standards, many high-school and middle-school students struggle to combine and integrate information from multiple sources when writing essays. Writing is an important learning skill, and there is increasing evidence that writing about a topic develops a deeper understanding in the student. However, grading essays is time consuming for teachers, resulting in an increasing focus on shallower forms of assessment that are easier to automate, such as multiple-choice tests. Existing essay grading software has attempted to ease this burden but relies on shallow lexico-syntactic features …
Enhancing Scalability In Genetic Programming With Adaptable Constraints, Type Constraints And Automatically Defined Functions, George Gerules
Dissertations
Genetic Programming is a type of biologically inspired machine learning. It is composed of a population of stochastic individuals. Those individuals can exchange portions of themselves with others in the population through the crossover operation that draws its inspiration from biology. Other biologically inspired operations include mutation and reproduction. The form an individual takes can be many things. It, however, is represented most of the time as a computer program. Constructing correct, efficient programs can be notoriously difficult. Various grammar, typing, function constraint, or counting mechanisms can guide creation and evolution of those individuals. These mechanisms can reduce search space …
Identifying Hourly Traffic Patterns With Python Deep Learning, Christopher L. Leavitt
Computer Engineering
This project was designed to explore and analyze the potential abilities and usefulness of applying machine learning models to data collected by parking sensors at a major metro shopping mall. By examining patterns in the rates at which customers enter and exit parking garages on the campus of the Bellevue Collection shopping mall in Bellevue, Washington, a recurrent neural network that uses data points from the previous hours will be trained to forecast future trends.
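Forecasting from "data points from the previous hours" usually starts by slicing the hourly series into fixed-length input windows, each paired with the value that follows it. The sketch below shows only that data-preparation step. The window length and the sample counts are hypothetical, and the project's actual preprocessing is not described in the abstract.

```python
def make_windows(counts, window):
    """Pair each run of `window` hourly counts with the count that follows it."""
    inputs, targets = [], []
    for start in range(len(counts) - window):
        inputs.append(counts[start:start + window])
        targets.append(counts[start + window])
    return inputs, targets


# Hypothetical hourly vehicle-entry counts for one garage.
hourly_entries = [12, 18, 25, 40, 38, 30]
X, y = make_windows(hourly_entries, window=3)
# X holds three windows of three hours each; y holds the hour after each window.
```

Each row of `X` would be one input sequence for a recurrent network, and the matching entry of `y` its training target.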
Classifying Classic Ciphers Using Machine Learning, Nivedhitha Ramarathnam Krishna
Master's Projects
We consider the problem of identifying the classic cipher that was used to generate a given ciphertext message. We assume that the plaintext is English and we restrict our attention to ciphertext consisting only of alphabetic characters. Among the classic ciphers considered are the simple substitution, Vigenère cipher, Playfair cipher, and column transposition cipher. The problem of classification is approached in two ways. The first method uses support vector machines (SVM) trained directly on ciphertext to classify the ciphers. In the second approach, we train hidden Markov models (HMM) on each ciphertext message, then use these trained HMMs as features …
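A classifier such as an SVM needs a fixed-length numeric vector for each message. The abstract does not state how the ciphertext is encoded, so the sketch below is an assumption: it uses two classical statistics, the 26 relative letter frequencies and the index of coincidence, which are often used to tell cipher families apart.

```python
import string
from collections import Counter


def _letters(ciphertext):
    """Keep only A-Z, upper-cased."""
    return [c for c in ciphertext.upper() if c in string.ascii_uppercase]


def letter_frequencies(ciphertext):
    """Relative frequency of each letter A..Z (a 26-element feature vector)."""
    letters = _letters(ciphertext)
    if not letters:
        raise ValueError("ciphertext contains no alphabetic characters")
    counts = Counter(letters)
    return [counts[ch] / len(letters) for ch in string.ascii_uppercase]


def index_of_coincidence(ciphertext):
    """Probability that two letters drawn without replacement are equal."""
    letters = _letters(ciphertext)
    n = len(letters)
    if n < 2:
        raise ValueError("need at least two letters")
    counts = Counter(letters)
    return sum(c * (c - 1) for c in counts.values()) / (n * (n - 1))


def caesar_shift(text, shift):
    """Toy monoalphabetic cipher, used only to demonstrate the statistics."""
    return "".join(
        chr((ord(c) - ord("A") + shift) % 26 + ord("A")) for c in _letters(text)
    )


plain = "ATTACKATDAWN"
cipher = caesar_shift(plain, 3)
features = letter_frequencies(cipher) + [index_of_coincidence(cipher)]
```

A monoalphabetic substitution only relabels letters, so it leaves the index of coincidence unchanged. Polyalphabetic ciphers such as Vigenère tend to flatten the letter distribution and lower the index, which is why it is a useful discriminating feature.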
Emulation Vs Instrumentation For Android Malware Detection, Anukriti Sinha
Master's Projects
In resource constrained devices, malware detection is typically based on offline analysis using emulation. In previous work it has been claimed that such emulation fails for a significant percentage of Android malware because well-designed malware detects that the code is being emulated. An alternative to emulation is malware analysis based on code that is executing on an actual Android device. In this research, we collect features from a corpus of Android malware using both emulation and on-phone instrumentation. We train machine learning models based on emulated features and also train models based on features collected via instrumentation, and we compare …
Differential Estimation Of Audiograms Using Gaussian Process Active Model Selection, Trevor Larsen
McKelvey School of Engineering Theses & Dissertations
Classical methods for psychometric function estimation either require excessive resources to perform, as in the method of constants, or produce only a low-resolution approximation of the target psychometric function, as in adaptive staircase or up-down procedures. This thesis makes two primary contributions to the estimation of the audiogram, a clinically relevant psychometric function estimated by querying a patient for audibility of a collection of tones. First, it covers the implementation of a Gaussian process model for learning an audiogram using another audiogram as a prior belief to speed up the learning procedure. Second, it implements a use case of …
Machine Learning Pipeline For Exoplanet Classification, George Clayton Sturrock, Brychan Manry, Sohail Rafiqi
SMU Data Science Review
Planet identification has typically been a task performed exclusively by teams of astronomers and astrophysicists using methods and tools accessible only to those with years of academic education and training. NASA’s Exoplanet Exploration program has introduced modern satellites capable of capturing a vast array of data regarding celestial objects of interest to assist with researching these objects. The availability of satellite data has opened up the task of planet identification to individuals capable of writing and interpreting machine learning models. In this study, several classification models and datasets are utilized to assign a probability of an observation being an exoplanet. …
Supervised Machine Learning Models For Fake News Detection, Gofaas Group, Andrea Lopez, Adelo Vieira, Zafar Ahsan, Farooq Saqib, Shirley Marinho
ICT
Fake news or the distribution of disinformation has become one of the most challenging issues in society. News and information are churned out across online websites and platforms in real-time, with little or no way for the viewing public to determine what is real or manufactured. But an awareness of what we are consuming online is becoming apparent and efforts are underway to explore how we separate fake content from genuine and truthful information.
The most challenging part of fake news is determining how to spot it. In technology, there are ways to help us do this. Supervised machine learning …
Teaching Computers To Teach Themselves: Synthesizing Training Data Based On Human-Perceived Elements, James Little
Honors Projects
Isolation-Based Scene Generation (IBSG) is a process for creating synthetic datasets made to train machine learning detectors and classifiers. In this project, we formalize the IBSG process and describe the scenarios—object detection and object classification given audio or image input—in which it can be useful. We then look at the Stanford Street View House Number (SVHN) dataset and build several different IBSG training datasets based on existing SVHN data. We try to improve the compositing algorithm used to build the IBSG dataset so that models trained with synthetic data perform as well as models trained with the original SVHN training …
Machine Learning Methods For Personalized Health Monitoring Using Wearable Sensors, Annamalai Natarajan
Doctoral Dissertations
Mobile health is an emerging field that allows for real-time monitoring of individuals between routine clinical visits. Among other things, it makes it possible to remotely gather health signals, track disease progression and provide just-in-time interventions. Consumer-grade wearable sensors can remotely gather health signals and other time series data. While wearable sensors can be readily deployed on individuals, there are significant challenges in converting raw sensor data into actionable insights. In this dissertation, we develop machine learning methods and models for personalized health monitoring using wearables. Specifically, we address three challenges that arise in these settings. First, data gathered from …
Neural Machine Translation, Quinn M. Lanners, Thomas Laurent
Honors Thesis
Neural Machine Translation is the primary algorithm used in industry to perform machine translation. This state-of-the-art algorithm is an application of deep learning in which massive datasets of translated sentences are used to train a model capable of translating between any two languages. The architecture behind neural machine translation is composed of two recurrent neural networks used in tandem to create an encoder-decoder structure. Attention mechanisms have recently been developed to further increase the accuracy of these models. In this senior thesis, the various parts of Neural Machine Translation are explored towards the eventual creation of a tutorial …
A Study Of Face Embedding In Face Recognition, Khanh Duc Le
Master's Theses
Face Recognition has been a long-standing topic in the computer vision and pattern recognition fields because of its wide and important applications in our daily lives, such as surveillance systems, access control, and so on. The current modern face recognition model, which keeps only a couple of images per person in the database, can now recognize a face with high accuracy. Moreover, the model does not need to be retrained every time a new person is added to the database.
By using the face dataset from Digital Democracy, the thesis will explore the capability of this model by comparing it with …
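The claim that no retraining is needed when a person is added follows from how embedding-based recognition typically works: a fixed network maps each face to a vector, and identification is a nearest-neighbor lookup against stored vectors. The sketch below illustrates only that lookup. The three-dimensional vectors, the cosine-similarity measure, and the 0.8 threshold are placeholders assumed for illustration, not values from the thesis.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def identify(gallery, probe, threshold=0.8):
    """Return the best-matching name, or None if nothing is similar enough."""
    best_name, best_score = None, -1.0
    for name, embeddings in gallery.items():
        for emb in embeddings:
            score = cosine_similarity(emb, probe)
            if score > best_score:
                best_name, best_score = name, score
    return best_name if best_score >= threshold else None


# Hypothetical embeddings; a real model would output hundreds of dimensions.
gallery = {
    "alice": [[0.9, 0.1, 0.0]],
    "bob": [[0.0, 0.2, 0.9]],
}
# Enrolling a new person only adds a vector; the embedding model is untouched.
gallery["carol"] = [[0.1, 0.9, 0.1]]

match = identify(gallery, [0.85, 0.15, 0.05])
unknown = identify(gallery, [1.0, 1.0, 1.0])
```

Here `match` resolves to the stored identity closest to the probe, while `unknown` is `None` because no stored vector clears the threshold.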
Cs04all: Machine Learning Module, Hunter R. Johnson
Open Educational Resources
These are materials that may be used in a CS0 course as a light introduction to machine learning.
The materials are mostly Jupyter notebooks which contain a combination of labwork and lecture notes. There are notebooks on Classification, An Introduction to Numpy, and An Introduction to Pandas.
There are also two assessments that could be assigned to students. One is an essay assignment in which students are asked to read and respond to an article on machine bias. The other is a lab-like exercise in which students use pandas and numpy to extract useful information about subway ridership in NYC. …
Dish: Democracy In State Houses, Nicholas A. Russo
Master's Theses
In our current political climate, state level legislators have become increasingly important. Due to cuts in funding and growing focus at the national level, public oversight for these legislators has drastically decreased. This makes it difficult for citizens and activists to understand the relationships and commonalities between legislators. This thesis provides three contributions to address this issue. First, we created a data set containing over 1200 features focused on a legislator’s activity on bills. Second, we created embeddings that represented a legislator’s level of activity and engagement for a given bill using a custom model called Democracy2Vec. Third, we …
Comparative Study Of Sentiment Analysis With Product Reviews Using Machine Learning And Lexicon-Based Approaches, Heidi Nguyen, Aravind Veluchamy, Mamadou Diop, Rashed Iqbal
SMU Data Science Review
In this paper, we present a comparative study of text sentiment classification models using term frequency inverse document frequency vectorization in both supervised machine learning and lexicon-based techniques. There have been multiple promising machine learning and lexicon-based techniques, but the relative goodness of each approach on specific types of problems is not well understood. In order to offer researchers comprehensive insights, we compare a total of six algorithms to each other. The three machine learning algorithms are: Logistic Regression (LR), Support Vector Machine (SVM), and Gradient Boosting. The three lexicon-based algorithms are: Valence Aware Dictionary and Sentiment Reasoner (VADER), Pattern, …
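Term frequency inverse document frequency (TF-IDF) vectorization, mentioned above, weights a term by how often it occurs in a document and discounts terms that occur in many documents. The sketch below uses the plain textbook definition, tf = count / document length and idf = ln(N / document frequency). Library implementations often add smoothing and normalization, so their numbers will differ. The sample reviews are invented.

```python
import math
from collections import Counter


def tfidf(documents):
    """Return one {term: weight} dict per document (unsmoothed TF-IDF)."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return vectors


# Hypothetical product reviews.
reviews = ["great phone great battery", "terrible battery", "great screen"]
vectors = tfidf(reviews)
```

In the first review, "phone" receives the largest idf factor because it occurs in no other review, and "great" outweighs "battery" because it occurs twice. A term present in every document would receive weight zero under this definition.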
Improving Vix Futures Forecasts Using Machine Learning Methods, James Hosker, Slobodan Djurdjevic, Hieu Nguyen, Robert Slater
SMU Data Science Review
The problem of forecasting market volatility is a difficult task for most fund managers. Volatility forecasts are used for risk management, alpha (risk) trading, and the reduction of trading friction. Improving the forecasts of future market volatility assists fund managers in adding or reducing risk in their portfolios as well as in increasing hedges to protect their portfolios in anticipation of a market sell-off event. Our analysis compares three existing financial models that forecast future market volatility using the Chicago Board Options Exchange Volatility Index (VIX) to six machine/deep learning supervised regression methods. This analysis determines which models provide best …
Explainable Neural Networks Based Anomaly Detection For Cyber-Physical Systems, Kasun Amarasinghe
Theses and Dissertations
Cyber-Physical Systems (CPSs) are the core of modern critical infrastructure (e.g. power-grids) and securing them is of paramount importance. Anomaly detection in data is crucial for CPS security. While Artificial Neural Networks (ANNs) are strong candidates for the task, they are seldom deployed in safety-critical domains due to the perception that ANNs are black-boxes. Therefore, to leverage ANNs in CPSs, cracking open the black box through explanation is essential.
The main objective of this dissertation is developing explainable ANN-based Anomaly Detection Systems for Cyber-Physical Systems (CP-ADS). The main objective was broken down into three sub-objectives: 1) Identifying key-requirements that an …
Dedicated Hardware For Machine/Deep Learning: Domain Specific Architectures, Angel Izael Solis
Open Access Theses & Dissertations
Artificial intelligence has come a very long way from being a mere spectacle on the silver screen in the 1920s [Hml18]. As artificial intelligence continues to evolve, and we begin to develop more sophisticated Artificial Neural Networks, the need for specialized and more efficient machines (less computational strain while maintaining the same performance results) becomes increasingly evident. Though these new techniques, such as Multilayer Perceptrons, Convolutional Neural Networks and Recurrent Neural Networks, may seem as if they are on the cutting edge of technology, many of these ideas are over 60 years old! However, many of these earlier models, at …
A Dual State Hierarchical Ensemble Kalman Filter Algorithm, William J. Cook, Jesse Johnson, Marko Maneta, Doug Brinkerhoff
Graduate Student Theses, Dissertations, & Professional Papers
Dynamic models that simulate processes across large geographic locations, such as hydrologic models, are often informed by empirical parameters that are distributed across a geographical area and segmented by geological features such as watersheds. These parameters may be referred to as spatially distributed parameters. Spatially distributed parameters are frequently spatially correlated and any techniques utilized in their calibration ideally incorporate existing spatial hierarchical relationships into their structure. In this paper, a parameter estimation method based on the Dual State Ensemble Kalman Filter called the Dual State Hierarchical Ensemble Kalman Filter (DSHEnKF) is presented. This modified filter is innovative in that …
Rule Mining And Sequential Pattern Based Predictive Modeling With Emr Data, Orhan Abar
Theses and Dissertations--Computer Science
Electronic medical record (EMR) data is collected on a daily basis at hospitals and other healthcare facilities to track patients’ health situations including conditions, treatments (medications, procedures), diagnostics (labs) and associated healthcare operations. Besides being useful for individual patient care and hospital operations (e.g., billing, triaging), EMRs can also be exploited for secondary data analyses to glean discriminative patterns that hold across patient cohorts for different phenotypes. These patterns in turn can yield high level insights into disease progression with interventional potential. In this dissertation, using a large scale realistic EMR dataset of over one million patients visiting University of …
Relation Prediction Over Biomedical Knowledge Bases For Drug Repositioning, Mehmet Bakal
Theses and Dissertations--Computer Science
Identifying new potential treatment options for medical conditions that cause human disease burden is a central task of biomedical research. Since all candidate drugs cannot be tested with animal and clinical trials, in vitro approaches are first attempted to identify promising candidates. Likewise, identifying other essential relations (e.g., causation, prevention) between biomedical entities is also critical to understand biomedical processes. Hence, it is crucial to develop automated relation prediction systems that can yield plausible biomedical relations to expedite the discovery process. In this dissertation, we demonstrate three approaches to predict treatment relations between biomedical entities for the drug repositioning task …
Abso2luteu-Net: Tissue Oxygenation Calculation Using Photoacoustic Imaging And Convolutional Neural Networks, Kevin Hoffer-Hawlik, Geoffrey P. Luke
ENGS 88 Honors Thesis (AB Students)
Photoacoustic (PA) imaging uses incident light to generate ultrasound signals within tissues. Using PA imaging to accurately measure hemoglobin concentration and calculate oxygenation (sO2) requires prior tissue knowledge and costly computational methods. However, this thesis shows that machine learning algorithms can accurately and quickly estimate sO2. absO2luteU-Net, a convolutional neural network, was trained on Monte Carlo simulated multispectral PA data and predicted sO2 with higher accuracy compared to simple linear unmixing, suggesting machine learning can solve the fluence estimation problem. This project was funded by the Kaminsky Family Fund and the Neukom Institute.
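For context, the "simple linear unmixing" baseline named above can be sketched as follows. If the signal at each wavelength is assumed to be proportional to absorption, and absorption is a weighted sum of oxygenated (HbO2) and deoxygenated (Hb) hemoglobin, then two wavelengths give a 2x2 linear system for the two concentrations, and sO2 is the oxygenated fraction. The extinction coefficients below are made-up placeholders, and the sketch ignores wavelength-dependent fluence, which is the simplification the abstract says the network addresses.

```python
def unmix_so2(signal, eps_hbo2, eps_hb):
    """Estimate sO2 from signals at two wavelengths via Cramer's rule.

    signal[i]   : measured absorption-like signal at wavelength i
    eps_hbo2[i] : extinction coefficient of HbO2 at wavelength i
    eps_hb[i]   : extinction coefficient of Hb at wavelength i
    """
    a, b = eps_hbo2[0], eps_hb[0]
    c, d = eps_hbo2[1], eps_hb[1]
    det = a * d - b * c
    if det == 0:
        raise ValueError("wavelengths do not separate HbO2 from Hb")
    c_hbo2 = (signal[0] * d - b * signal[1]) / det
    c_hb = (a * signal[1] - c * signal[0]) / det
    return c_hbo2 / (c_hbo2 + c_hb)


# Placeholder coefficients, NOT real hemoglobin spectra.
eps_hbo2 = (0.5, 1.0)
eps_hb = (1.5, 0.8)

# Simulate a noiseless measurement for known concentrations, then recover sO2.
true_hbo2, true_hb = 0.7, 0.3
signal = (
    eps_hbo2[0] * true_hbo2 + eps_hb[0] * true_hb,
    eps_hbo2[1] * true_hbo2 + eps_hb[1] * true_hb,
)
so2 = unmix_so2(signal, eps_hbo2, eps_hb)
```

On this noiseless synthetic input the estimate matches the true oxygenated fraction. In tissue, the light fluence varies with wavelength and depth, so the proportionality assumption fails and the estimate degrades.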
Exploring Cyber-Physical Systems, Misbah Uddin Mohammed
Graduate Research Theses & Dissertations
The advances in IoT, Computer Vision, AI and Machine Learning have made these technologies ubiquitous in our daily lives. From Smart Phones to Connected Vehicles, Cyber-Physical Systems have been interspersed into everything we interact with in today’s world. The aim of this thesis was to explore these advances in Cyber-Physical Systems and analyze the different sectors they were affecting. We then hand-picked certain domains and explored further by carrying out practical projects using some of the latest software and hardware resources available. Technologies like Amazon Alexa services, NVIDIA Jetson boards, TensorFlow, OpenCV, NodeJS were heavily employed in our various …
The Use Of Cultural Algorithms To Learn The Impact Of Climate On Local Fishing Behavior In Cerro Azul, Peru, Khalid Kattan
Wayne State University Dissertations
Recently it has been found that the earth’s oceans are warming at a pace that is 40% faster than predicted by a United Nations panel a few years ago. As a result, 2019 has become the warmest year on record for the earth’s oceans. That is because the oceans have acted as a buffer by absorbing 93% of the heat produced by the greenhouse gases [40].
The impact of the oceanic warming has already been felt in terms of the periodic warming of the Pacific Ocean as an effect of the ENSO process. The ENSO process is a cycle of …
Capso: A Multi-Objective Cultural Algorithm System To Predict Locations Of Ancient Sites, Samuel Dustin Stanley
Wayne State University Dissertations
August 2019. Advisor: Dr. Robert Reynolds. Major: Computer Science. Degree: Doctor of Philosophy.
The recent archaeological discovery by Dr. John O’Shea at University of Michigan of prehistoric caribou remains and Paleo-Indian structures underneath the Great Lakes has opened up an opportunity for Computer Scientists to develop dynamic systems modelling these ancient caribou routes and hunter-gatherer settlement systems as well as the prehistoric environments that they existed in. The Wayne State University Cultural Algorithm team has been interested in assisting Dr. O’Shea’s archaeological team by …
Towards Improving Accuracy And Interpretability Of Deep Learning Based On Satellite Image Classification, Yamile Patino Vargas
Dissertations and Theses
The study of satellite images provides a way to monitor changes in the surface of the Earth and the atmosphere. Convolutional Neural Networks (CNN) have shown accurate results in solving practical problems in multiple fields. Some of the more recognized fields using CNNs are satellite imagery processing, medicine, communication, transportation, and computer vision. Despite the success of CNNs, there remains a need to explain the network predictions further and understand what the network is determining as valuable information.
There are several frameworks and methodologies developed to explain how CNNs predict outputs and what their internal representations are [1, 4, …