Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 16 of 16

Full-Text Articles in Entire DC Network

Salience-Aware Adaptive Resonance Theory For Large-Scale Sparse Data Clustering, Lei Meng, Ah-Hwee Tan, Chunyan Miao Dec 2019

Salience-Aware Adaptive Resonance Theory For Large-Scale Sparse Data Clustering, Lei Meng, Ah-Hwee Tan, Chunyan Miao

Research Collection School Of Computing and Information Systems

Sparse data is known to pose challenges to cluster analysis, as the similarity between data tends to be ill-posed in the high-dimensional Hilbert space. Solutions in the literature typically extend either k-means or spectral clustering with additional steps on representation learning and/or feature weighting. However, adding these usually introduces new parameters and increases computational cost, thus inevitably lowering the robustness of these algorithms when handling massive ill-represented data. To alleviate these issues, this paper presents a class of self-organizing neural networks, called the salience-aware adaptive resonance theory (SA-ART) model. SA-ART extends Fuzzy ART with measures for cluster-wise salient feature modeling. …


Pixel-Level Deep Multi-Dimensional Embeddings For Homogeneous Multiple Object Tracking, Mateusz Mittek Dec 2019

Pixel-Level Deep Multi-Dimensional Embeddings For Homogeneous Multiple Object Tracking, Mateusz Mittek

Department of Electrical and Computer Engineering: Dissertations, Theses, and Student Research

The goal of Multiple Object Tracking (MOT) is to locate multiple objects and keep track of their individual identities and trajectories given a sequence of (video) frames. A popular approach to MOT is tracking by detection consisting of two processing components: detection (identification of objects of interest in individual frames) and data association (connecting data from multiple frames). This work addresses the detection component by introducing a method based on semantic instance segmentation, i.e., assigning labels to all visible pixels such that they are unique among different instances. Modern tracking methods often built around Convolutional Neural Networks (CNNs) and additional, …


Topicsummary: A Tool For Analyzing Class Discussion Forums Using Topic Based Summarizations, Swapna Gottipati, Venky Shankararaman, Renjini Ramesh Oct 2019

Topicsummary: A Tool For Analyzing Class Discussion Forums Using Topic Based Summarizations, Swapna Gottipati, Venky Shankararaman, Renjini Ramesh

Research Collection School Of Computing and Information Systems

This Innovative Practice full paper, describes the application of text mining techniques for extracting insights from a course based online discussion forum through generation of topic based summaries. Discussions, either in classroom or online provide opportunity for collaborative learning through exchange of ideas that leads to enhanced learning through active participation. Online discussions offer a number of benefits namely providing additional time to reflect and synthesize information before writing, providing a natural platform for students to voice their ideas without any one student dominating the conversation, and providing a record of the student’s thoughts. An online discussion forum provides a …


Redpc: A Residual Error-Based Density Peak Clustering Algorithm, Milan Parmar, Di Wang, Xiaofeng Zhang, Ah-Hwee Tan, Chunyan Miao, You Zhou Jul 2019

Redpc: A Residual Error-Based Density Peak Clustering Algorithm, Milan Parmar, Di Wang, Xiaofeng Zhang, Ah-Hwee Tan, Chunyan Miao, You Zhou

Research Collection School Of Computing and Information Systems

The density peak clustering (DPC) algorithm was designed to identify arbitrary-shaped clusters by finding density peaks in the underlying dataset. Due to its aptitudes of relatively low computational complexity and a small number of control parameters in use, DPC soon became widely adopted. However, because DPC takes the entire data space into consideration during the computation of local density, which is then used to generate a decision graph for the identification of cluster centroids, DPC may face difficulty in differentiating overlapping clusters and in dealing with low-density data points. In this paper, we propose a residual error-based density peak clustering …


Genotype Combinations Linked To Phenotype Subgroups In Autism Spectrum Disorders, Junya Zhao, Thy Nguyen, Jonathan Kopel, Perry B. Koob, Donald A. Adieroh, Tayo Obafemi-Ajayi Jul 2019

Genotype Combinations Linked To Phenotype Subgroups In Autism Spectrum Disorders, Junya Zhao, Thy Nguyen, Jonathan Kopel, Perry B. Koob, Donald A. Adieroh, Tayo Obafemi-Ajayi

Electrical and Computer Engineering Faculty Research & Creative Works

This paper investigates a computational model that allows for systematic comparison of phenotype data with genotype (Single Nucleotide Polymorphisms (SNPs)) data based on machine learning techniques to identify discriminant genotype markers associated with the phenotypic subgroups. The proposed discriminant SNP identifier model is empirically evaluated using Autism Spectrum Disorder (ASD) simplex sample. Six phenotype markers were selected to cluster the sample in a hexagonal lattice format yielding five multidimensional subgroups based on extremities of the phenotype markers. The SNP selection model includes random subspace selection of SNPs in conjunction with feature selection algorithms to determine which set of SNPs were …


Cure: Flexible Categorical Data Representation By Hierarchical Coupling Learning, Songlei Jian, Guansong Pang, Longbing Cao, Kai Lu, Hang Gao May 2019

Cure: Flexible Categorical Data Representation By Hierarchical Coupling Learning, Songlei Jian, Guansong Pang, Longbing Cao, Kai Lu, Hang Gao

Research Collection School Of Computing and Information Systems

The representation of categorical data with hierarchical value coupling relationships (i.e., various value-to-value cluster interactions) is very critical yet challenging for capturing complex data characteristics in learning tasks. This paper proposes a novel and flexible coupled unsupervised categorical data representation (CURE) framework, which not only captures the hierarchical couplings but is also flexible enough to be instantiated for contrastive learning tasks. CURE first learns the value clusters of different granularities based on multiple value coupling functions and then learns the value representation from the couplings between the obtained value clusters. With two complementary value coupling functions, CURE is instantiated into …


Gsepd: A Bioconductor Package For Rna-Seq Gene Set Enrichment And Projection Display, Karl D. Stamm, Aoy Tomita-Mitchell, Serdar Bozdag Mar 2019

Gsepd: A Bioconductor Package For Rna-Seq Gene Set Enrichment And Projection Display, Karl D. Stamm, Aoy Tomita-Mitchell, Serdar Bozdag

Computer Science Faculty Research and Publications

Background: RNA-seq, wherein RNA transcripts expressed in a sample are sequenced and quantified, has become a widely used technique to study disease and development. With RNA-seq, transcription abundance can be measured, differential expression genes between groups and functional enrichment of those genes can be computed. However, biological insights from RNA-seq are often limited by computational analysis and the enormous volume of resulting data, preventing facile and meaningful review and interpretation of gene expression profiles. Particularly, in cases where the samples under study exhibit uncontrolled variation, deeper analysis of functional enrichment would be necessary to visualize samples’ gene expression activity under …


Evolutionary Trends In The Collaborative Review Process Of A Large Software System, Subhajit Datta, Poulami Sarkar Feb 2019

Evolutionary Trends In The Collaborative Review Process Of A Large Software System, Subhajit Datta, Poulami Sarkar

Research Collection School Of Computing and Information Systems

In this paper, we study the evolutionary trends in the collaborative review process of a large open source software system. As expected, the number of reviews, the number of reviews commented on, as well as the number of reviewers, and the interactions between them show increasing trends over time. But unexpectedly, levels of clustering between developers in their interaction networks show a decreasing trend, even as connections between them increase. In the context of our study, clustering is an indicator of developer collaboration, whereas connection points to how intensely developers work together. Thus the trends we observe can inform how …


Deep Learning Based Analysis Of Histopathological Images Of Breast Cancer, Juanying Xie, Ran Liu, Joseph Luttrell Iv, Chaoyang Zhang Jan 2019

Deep Learning Based Analysis Of Histopathological Images Of Breast Cancer, Juanying Xie, Ran Liu, Joseph Luttrell Iv, Chaoyang Zhang

Faculty Publications

Breast cancer is associated with the highest morbidity rates for cancer diagnoses in the world and has become a major public health issue. Early diagnosis can increase the chance of successful treatment and survival. However, it is a very challenging and time-consuming task that relies on the experience of pathologists. The automatic diagnosis of breast cancer by analyzing histopathological images plays a significant role for patients and their prognosis. However, traditional feature extraction methods can only extract some low-level features of images, and prior knowledge is necessary to select useful features, which can be greatly affected by humans. Deep learning …


Smart Control Of Buck Converters Using A Switching-Based Clustering Algorithm, Brook Abegaz, M. Cmiel Jan 2019

Smart Control Of Buck Converters Using A Switching-Based Clustering Algorithm, Brook Abegaz, M. Cmiel

Engineering Science Faculty Publications

This paper proposes a new approach to the control of switching voltage regulators (buck converters). The method is performed using a switching-based clustering algorithm. The implementations of competing approaches such as a fuzzy-logic controller, proportional integral derivative controller and a neural network based controller are presented in order to compare and evaluate the performance of the switching-based clustering algorithm. The results of the approach show that the proposed method could improve the stability and the performance of the buck converter system by 2.7% in terms of settling time and by 0.6% in terms of the overshoot value as compared to …


Smart Control Of Automatic Voltage Regulators Using K-Means Clustering, Brook Abegaz, J. Kueber Jan 2019

Smart Control Of Automatic Voltage Regulators Using K-Means Clustering, Brook Abegaz, J. Kueber

Engineering Science Faculty Publications

The future cyber physical systems consist of voltage regulators distributed across wide geographical areas. In this paper, a smart control approach of voltage regulators is presented for cyber physical system applications. The approach is implemented using K-means clustering algorithms that use data from voltage and current sensors, compute the correlation of changes across the regulators and generate a proportional feedback. Advanced estimation methods are used in cases where the data from the sensors was not available. The results show that the approach could be used to improve the performance of networked, power dependent systems by 94.5% in terms of overshoot …


Transfer Learning For Detecting Unknown Network Attacks, Juan Zhao, Sachin Shetty, Jan Wei Pan, Charles Kamhoua, Kevin Kwiat Jan 2019

Transfer Learning For Detecting Unknown Network Attacks, Juan Zhao, Sachin Shetty, Jan Wei Pan, Charles Kamhoua, Kevin Kwiat

VMASC Publications

Network attacks are serious concerns in today’s increasingly interconnected society. Recent studies have applied conventional machine learning to network attack detection by learning the patterns of the network behaviors and training a classification model. These models usually require large labeled datasets; however, the rapid pace and unpredictability of cyber attacks make this labeling impossible in real time. To address these problems, we proposed utilizing transfer learning for detecting new and unseen attacks by transferring the knowledge of the known attacks. In our previous work, we have proposed a transfer learning-enabled framework and approach, called HeTL, which can find the common …


Genomic Prediction Using Canopy Coverage Image And Genotypic Information In Soybean Via A Hybrid Model, Reka Howard, Diego Jarquin Jan 2019

Genomic Prediction Using Canopy Coverage Image And Genotypic Information In Soybean Via A Hybrid Model, Reka Howard, Diego Jarquin

Department of Statistics: Faculty Publications

Prediction techniques are important in plant breeding as they provide a tool for selection that is more efficient and economical than traditional phenotypic and pedigree based selection. The conventional genomic prediction models include molecular marker information to predict the phenotype. With the development of new phenomics techniques we have the opportunity to collect image data on the plants, and extend the traditional genomic prediction models where we incorporate diverse set of information collected on the plants. In our research, we developed a hybrid matrix model that incorporates molecular marker and canopy coverage information as a weighted linear combination to predict …


Exploring The Impact Of (Not) Changing Default Settings In Algorithmic Crime Mapping - A Case Study Of Milwaukee, Wisconsin, Md Romael Haque, Katy Weathington, Shion Guha Jan 2019

Exploring The Impact Of (Not) Changing Default Settings In Algorithmic Crime Mapping - A Case Study Of Milwaukee, Wisconsin, Md Romael Haque, Katy Weathington, Shion Guha

Computer Science Faculty Research and Publications

Policing decisions, allocations and outcomes are determined by mapping historical crime data geo-spatially using popular algorithms. In this extended abstract, we present early results from a mixed-methods study of the practices, policies, and perceptions of algorithmic crime mapping in the city of Milwaukee, Wisconsin. We investigate this differential by visualizing potential demographic biases from publicly available crime data over 12 years (2005-2016) and conducting semi-structured interviews of 19 city stakeholders and provide future research directions from this study.


Intelligent Intrusion Detection Using Radial Basis Function Neural Network, Alia Abughazleh, Muder Almiani, Basel Magableh, Abdul Razaque Jan 2019

Intelligent Intrusion Detection Using Radial Basis Function Neural Network, Alia Abughazleh, Muder Almiani, Basel Magableh, Abdul Razaque

Conference papers

Recently we witness a booming and ubiquity evolving of internet connectivity all over the world leading to dramatic amount of network activities and large amount of data and information transfer. Massive data transfer composes a fertile ground to hackers and intruders to launch cyber-attacks and various types of penetrations. As a consequence, researchers around the globe have devoted a large room for researches that can handle different types of attacks efficiently through building various types of intrusion detection systems capable to handle different types of attacks, known and unknown (novel) ones as well as have the capability to deal with …


Towards An Efficient Data Fragmentation, Allocation, And Clustering Approach In A Distributed Environment, Hassan Abdalla, Abdel Monim Artoli Jan 2019

Towards An Efficient Data Fragmentation, Allocation, And Clustering Approach In A Distributed Environment, Hassan Abdalla, Abdel Monim Artoli

All Works

© 2019 by the authors. Data fragmentation and allocation has for long proven to be an efficient technique for improving the performance of distributed database systems' (DDBSs). A crucial feature of any successful DDBS design revolves around placing an intrinsic emphasis on minimizing transmission costs (TC). This work; therefore, focuses on improving distribution performance based on transmission cost minimization. To do so, data fragmentation and allocation techniques are utilized in this work along with investigating several data replication scenarios. Moreover, site clustering is leveraged with the aim of producing a minimum possible number of highly balanced clusters. By doing so, …