Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Institution
-
- China Simulation Federation (315)
- Singapore Management University (110)
- San Jose State University (34)
- Old Dominion University (26)
- MBZUAI (25)
-
- University of Massachusetts Amherst (17)
- University of Arkansas, Fayetteville (16)
- University of Windsor (15)
- Western University (14)
- Technological University Dublin (13)
- City University of New York (CUNY) (11)
- New Jersey Institute of Technology (11)
- University of Kentucky (8)
- California Polytechnic State University, San Luis Obispo (7)
- Dartmouth College (7)
- Air Force Institute of Technology (6)
- The University of Maine (6)
- University of Central Florida (6)
- University of South Florida (6)
- West Virginia University (6)
- American Dental Association (5)
- Florida International University (5)
- University of Nevada, Las Vegas (5)
- University of Wisconsin Milwaukee (5)
- Virginia Commonwealth University (5)
- Washington University in St. Louis (5)
- Michigan Technological University (4)
- Missouri State University (4)
- Missouri University of Science and Technology (4)
- University of Louisville (4)
- Keyword
-
- Machine learning (58)
- Deep learning (50)
- Machine Learning (50)
- Artificial intelligence (37)
- Deep Learning (30)
-
- Artificial Intelligence (28)
- Computer vision (18)
- Simulation (15)
- AI (14)
- Reinforcement learning (13)
- Neural Networks (12)
- Optimization (12)
- Neural networks (11)
- Computer Vision and Pattern Recognition (cs.CV) (9)
- Reinforcement Learning (9)
- COVID-19 (8)
- Classification (8)
- Computer Vision (8)
- Natural Language Processing (8)
- Natural language processing (8)
- Technology (7)
- Computer Science (6)
- Computer generated forces (6)
- Sliding mode control (6)
- Visualization (6)
- Anomaly detection (5)
- Clustering (5)
- Convolutional Neural Network (5)
- Convolutional Neural Networks (5)
- Convolutional neural network (5)
- Publication
-
- Journal of System Simulation (315)
- Research Collection School Of Computing and Information Systems (99)
- Master's Projects (30)
- Electronic Theses and Dissertations (22)
- Doctoral Dissertations (20)
-
- Computer Vision Faculty Publications (19)
- Theses and Dissertations (17)
- Electronic Thesis and Dissertation Repository (12)
- Dissertations (11)
- Master's Theses (9)
- Dissertations, Theses, and Capstone Projects (8)
- Graduate Theses and Dissertations (8)
- Articles (7)
- Conference papers (7)
- Honors Theses (6)
- USF Tampa Graduate Theses and Dissertations (6)
- Computer Science Faculty Publications (5)
- Computer Science and Computer Engineering Undergraduate Honors Theses (5)
- Electronic Theses and Dissertations, 2020- (5)
- FIU Electronic Theses and Dissertations (5)
- Graduate Theses, Dissertations, and Problem Reports (5)
- Machine Learning Faculty Publications (5)
- The Journal of the Michigan Dental Association (5)
- Theses and Dissertations--Computer Science (5)
- UNLV Theses, Dissertations, Professional Papers, and Capstones (5)
- Dartmouth College Undergraduate Theses (4)
- General University of Maine Publications (4)
- MSU Graduate Theses (4)
- Masters Theses (4)
- McKelvey School of Engineering Theses & Dissertations (4)
- Publication Type
Articles 1 - 30 of 832
Full-Text Articles in Physical Sciences and Mathematics
Private And Federated Deep Learning: System, Theory, And Applications For Social Good, Han Hu
Private And Federated Deep Learning: System, Theory, And Applications For Social Good, Han Hu
Dissertations
During the past decade, drug abuse continues to accelerate towards becoming the most severe public health problem in the United States. The ability to detect drugabuse risk behavior at a population scale, such as among the population of Twitter users, can help to monitor the trend of drugabuse incidents. However, traditional methods do not effectively detect drugabuse risk behavior in tweets, mainly due to the sparsity of such tweets and the noisy nature of tweets. In the first part of this dissertation work, the task of classifying tweets as containing drugabuse risk behavior or not, is studied. Millions of public …
A Practical Approach To Automated Software Correctness Enhancement, Aleksandr Zakharchenko
A Practical Approach To Automated Software Correctness Enhancement, Aleksandr Zakharchenko
Dissertations
To repair an incorrect program does not mean to make it correct; it only means to make it more-correct, in some sense, than it is. In the absence of a concept of relative correctness, i.e. the property of a program to be more-correct than another with respect to a specification, the discipline of program repair has resorted to various approximations of absolute (traditional) correctness, with varying degrees of success. This shortcoming is concealed by the fact that most program repair tools are tested on basic cases, whence making them absolutely correct is not clearly distinguishable from making them relatively more-correct. …
Machine Learning And Computer Vision In Solar Physics, Haodi Jiang
Machine Learning And Computer Vision In Solar Physics, Haodi Jiang
Dissertations
In the recent decades, the difficult task of understanding and predicting violent solar eruptions and their terrestrial impacts has become a strategic national priority, as it affects the life of human beings, including communication, transportation, the power grid, national defense, space travel, and more. This dissertation explores new machine learning and computer vision techniques to tackle this difficult task. Specifically, the dissertation addresses four interrelated problems in solar physics: magnetic flux tracking, fibril tracing, Stokes inversion and vector magnetogram generation.
First, the dissertation presents a new deep learning method, named SolarUnet, to identify and track solar magnetic flux elements in …
Energy Planning Model Design For Forecasting The Final Energy Consumption Using Artificial Neural Networks, Haidy Eissa
Energy Planning Model Design For Forecasting The Final Energy Consumption Using Artificial Neural Networks, Haidy Eissa
Theses and Dissertations
“Energy Trilemma” has recently received an increasing concern among policy makers. The trilemma conceptual framework is based on three main dimensions: environmental sustainability, energy equity, and energy security. Energy security reflects a nation’s capability to meet current and future energy demand. Rational energy planning is thus a fundamental aspect to articulate energy policies. The energy system is huge and complex, accordingly in order to guarantee the availability of energy supply, it is necessary to implement strategies on the consumption side. Energy modeling is a tool that helps policy makers and researchers understand the fluctuations in the energy system. Over the …
Explaining Deep Learning Models For Tabular Data Using Layer-Wise Relevance Propagation, Ihsan Ullah, Andre Rios, Vaibhov Gala, Susan Mckeever
Explaining Deep Learning Models For Tabular Data Using Layer-Wise Relevance Propagation, Ihsan Ullah, Andre Rios, Vaibhov Gala, Susan Mckeever
Articles
Trust and credibility in machine learning models are bolstered by the ability of a model to explain its decisions. While explainability of deep learning models is a well-known challenge, a further challenge is clarity of the explanation itself for relevant stakeholders of the model. Layer-wise Relevance Propagation (LRP), an established explainability technique developed for deep models in computer vision, provides intuitive human-readable heat maps of input images. We present the novel application of LRP with tabular datasets containing mixed data (categorical and numerical) using a deep neural network (1D-CNN), for Credit Card Fraud detection and Telecom Customer Churn prediction use …
Comparative Analysis Of Rgb-Based Eye-Tracking For Large-Scale Human-Machine Applications, Brett Thaman, Trung Cao
Comparative Analysis Of Rgb-Based Eye-Tracking For Large-Scale Human-Machine Applications, Brett Thaman, Trung Cao
Posters-at-the-Capitol
Gaze tracking has become an established technology that enables using an individual’s gaze as an input signal to support a variety of applications in the context of Human-Computer Interaction. Gaze tracking primarily relies on sensing devices such as infrared (IR) cameras. Nevertheless, in the recent years, several attempts have been realized at detecting gaze by acquiring and processing images acquired from standard RGB cameras. Nowadays, there are only a few publicly available open-source libraries and they have not been tested extensively. In this paper, we present the result of a comparative analysis that studied a commercial eye-tracking device using IR …
Nitrogenase Iron Protein Classification Using Cnn Neural Network, Amer Rez
Nitrogenase Iron Protein Classification Using Cnn Neural Network, Amer Rez
Master's Projects
The nitrogenase iron protein (NifH) is extensively used to study nitrogen fixation, the ecologically vital process of reducing atmospheric nitrogen to a bioavailable form. The discovery rate of novel NifH sequences is high, and there is an ongoing need for software tools to mine NifH records from the GenBank repository. Since record annotations are unreliable, because they contain errors, classifiers based on sequence alone are required. The ARBitrator classifier is highly successful but must be initialized by extensive manual effort. A Deep Learning approach could substantially reduce manual intervention. However, attempts to build a character-based Deep Learning NifH classifier were …
Privacy Preserving For Multiple Computer Vision Tasks, Amala Varghese Wilson
Privacy Preserving For Multiple Computer Vision Tasks, Amala Varghese Wilson
Master's Projects
Privacy-preserving visual recognition is an important area of research that is gaining momentum in the field of computer vision. In a production environment, it is critical to have neural network models learn continually from user data. However, sharing raw user data with a server is less desirable from a regulatory, security and privacy perspective. Federated learning addresses the problem of privacy- preserving visual recognition. More specifically, we closely examine and dissect a framework known as Dual User Adaptation (DUA) presented by Lange et al. at CVPR 2020, due to its novel idea of bringing about user-adaptation on both the server-side …
Task Classification During Visual Search Using Classic Machine Learning And Deep Learning, Devangi Vilas Chinchankar
Task Classification During Visual Search Using Classic Machine Learning And Deep Learning, Devangi Vilas Chinchankar
Master's Projects
In an average human life, the eyes not only passively scan visual scenes, but most times end up actively performing tasks including, but not limited to, searching, comparing, and counting. As a result of the advances in technology, we are observing a boost in the average screen time. Humans are now looking at an increasing number of screens and in turn images and videos. Understanding what scene a user is looking at and what type of visual task is being performed can be useful in developing intelligent user interfaces, and in virtual reality and augmented reality devices. In this research, …
The Impact Of Programming Language’S Type On Probabilistic Machine Learning Models, Sherif Elsaid
The Impact Of Programming Language’S Type On Probabilistic Machine Learning Models, Sherif Elsaid
Master's Projects
Software development is an expensive and difficult process. Mistakes can be easily made, and without extensive review process, those mistakes can make it to the production code and may have unintended disastrous consequences.
This is why various automated code review services have arisen in the recent years. From AWS’s CodeGuro and Microsoft’s Code Analysis to more integrated code assistants, like IntelliCode and auto completion tools. All of which are designed to help and assist the developers with their work and help catch overlooked bugs.
Thanks to recent advances in machine learning, these services have grown tremen- dously in sophistication to …
Robotic Olfactory-Based Navigation With Mobile Robots, Lingxiao Wang
Robotic Olfactory-Based Navigation With Mobile Robots, Lingxiao Wang
Doctoral Dissertations and Master's Theses
Robotic odor source localization (OSL) is a technology that enables mobile robots or autonomous vehicles to find an odor source in unknown environments. It has been viewed as challenging due to the turbulent nature of airflows and the resulting odor plume characteristics. The key to correctly finding an odor source is designing an effective olfactory-based navigation algorithm, which guides the robot to detect emitted odor plumes as cues in finding the source. This dissertation proposes three kinds of olfactory-based navigation methods to improve search efficiency while maintaining a low computational cost, incorporating different machine learning and artificial intelligence methods.
A. …
Predicting Stocks With Lstm-Based Drnn And Gan, Duy Ngo
Predicting Stocks With Lstm-Based Drnn And Gan, Duy Ngo
Master's Projects
Trading equities can be very lucrative for some and a gamble for others. Professional traders and retail traders are constantly amassing information to be a step ahead of the market to profit off the value of stocks on the market. Some of the tools in their arsenal include different types of calculations based on a variety of data collected on a stock. Technical analysis is a technique for traders to analyze the data of equities presented on charts. Often, the way the price changes over time can be used as an indicator for traders to predict how future prices will …
An Open Source Direct Messaging And Enhanced Recommendation System For Yioop, Aniruddha Dinesh Mallya
An Open Source Direct Messaging And Enhanced Recommendation System For Yioop, Aniruddha Dinesh Mallya
Master's Projects
Recommendation systems and direct messaging systems are two popular components of web portals. A recommendation system is an information filtering system that seeks to predict the "rating" or "preference" a user would give to an item and a direct messaging system allows private communication between users of any platform. Yioop, is an open source, PHP search engine and web portal that can be configured to allow users to create discussion groups, blogs, wikis etc.
In this project, we expanded on Yioop’s group system so that every user now has a personal group. Personal groups were then used to add user …
Deep Convolutional Neural Networks For Accurate Diagnosis Of Covid-19 Patients Using Chest X-Ray Image Databases From Italy, Canada, And The Usa, Amgad A. Salama, Samy H. Darwish, Samir M. Abdel-Mageed, Radwa A. Meshref, Ehab I. Mohamed
Deep Convolutional Neural Networks For Accurate Diagnosis Of Covid-19 Patients Using Chest X-Ray Image Databases From Italy, Canada, And The Usa, Amgad A. Salama, Samy H. Darwish, Samir M. Abdel-Mageed, Radwa A. Meshref, Ehab I. Mohamed
The University of Louisville Journal of Respiratory Infections
Introduction: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), famously known as COVID-19, has quickly become a global pandemic. Chest X-ray (CXR) imaging has proven reliable, fast, and cost-effective for identifying COVID-19 infections, which proceeds to display atypical unilateral patchy infiltration in the lungs like typical pneumonia. We employed the deep convolutional neural network (DCNN) ResNet-34 to detect and classify CXR images from patients with COVID-19 and Viral Pneumonia and Normal Controls.
Methods: We created a single database containing 781 source CXR images from four different international sub-databases: the Società Italiana di Radiologia Medica e Interventistica (SIRM), the GitHub Database, the …
Analysis Of Camera Trap Footage Through Subject Recognition, Nirnayak Bhardwaj
Analysis Of Camera Trap Footage Through Subject Recognition, Nirnayak Bhardwaj
Master's Projects
Motion-sensitive cameras, otherwise known as camera traps, have become increasingly popular amongst ecologists for studying wildlife. These cameras allow scientists to remotely observe animals through an inexpensive and non-invasive approach. Due to the lenient nature of motion cameras, studies involving them often generate excessive amounts of footage with many photographs not containing any animal subjects. Thus, there is a need for a system that is capable of analyzing camera trap footage to determine if a picture holds value for researchers. While research into automated image recognition is well documented, it has had limited applications in the field of ecology. This …
Identifying Bots On Twitter With Benford’S Law, Sanmesh Bhosale
Identifying Bots On Twitter With Benford’S Law, Sanmesh Bhosale
Master's Projects
Over time Online Social Networks (OSNs) have grown exponentially in terms of active users and have now become an influential factor in the formation of public opinions. Due to this, the use of bots and botnets for spreading misinformation on OSNs has become a widespread concern. The biggest example of this was during the 2016 American Presidential Elections, where Russian bots on Twitter pumped out fake news to influence the election results.
Identifying bots and botnets on Twitter is not just based on visual analysis and can require complex statistical methods to score a profile based on multiple features and …
Employee Churn Prediction Using Logistic Regression And Support Vector Machine, Rajendra Maharjan
Employee Churn Prediction Using Logistic Regression And Support Vector Machine, Rajendra Maharjan
Master's Projects
It is a challenge for Human Resource (HR) team to retain their existing employees than to hire a new one. For any company, losing their valuable employees is a loss in terms of time, money, productivity, and trust, etc. This loss could be possibly minimized if HR could beforehand find out their potential employees who are planning to quit their job hence, we investigated solving the employee churn problem through the machine learning perspective. We have designed machine learning models using supervised and classification-based algorithms like Logistic Regression and Support Vector Machine (SVM). The models are trained with the IBM …
Spatio-Temporal Relation Modeling For Few-Shot Action Recognition, Anirudh Thatipelli, Sanath Narayan, Salman Hameed Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, Bernard Ghanem
Spatio-Temporal Relation Modeling For Few-Shot Action Recognition, Anirudh Thatipelli, Sanath Narayan, Salman Hameed Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, Bernard Ghanem
Computer Vision Faculty Publications
We propose a novel few-shot action recognition framework, STRM, which enhances class-specific feature discriminability while simultaneously learning higher-order temporal representations. The focus of our approach is a novel spatio-temporal enrichment module that aggregates spatial and temporal contexts with dedicated local patch-level and global frame-level feature enrichment sub-modules. Local patch-level enrichment captures the appearance-based characteristics of actions. On the other hand, global framelevel enrichment explicitly encodes the broad temporal context, thereby capturing the relevant object features over time. The resulting spatio-temporally enriched representations are then utilized to learn the relational matching between query and support action sub-sequences. We further introduce a …
Analyzing And Detecting Android Malware And Deepfake, Md Shohel Rana
Analyzing And Detecting Android Malware And Deepfake, Md Shohel Rana
Dissertations
Rapid advances in artificial intelligence (AI), machine learning (ML), and deep learning (DL) over the past several decades have produced a variety of technologies and tools that, among numerous cybersecurity issues, have enticed cybercriminals and hackers to design malware for the Android operating systems and/or manipulate multimedia. For example, high-quality and realistic fake videos, images, or audios have been created to spread misinformation and propaganda, foment political discord and hate, or even harass and blackmail people; these manipulated, high-quality and realistic videos became known recently as Deepfake. There has been much work done in recent years on malware analysis and …
Prediction Of Iraqi Stock Exchange Using Optimized Based-Neural Network, Ameer Al-Haq Al-Shamery, Prof. Dr. Eman Salih Al-Shamery
Prediction Of Iraqi Stock Exchange Using Optimized Based-Neural Network, Ameer Al-Haq Al-Shamery, Prof. Dr. Eman Salih Al-Shamery
Karbala International Journal of Modern Science
Stock market prediction is an interesting financial topic that has attracted the attention of researchers for the last years. This paper aims at improving the prediction of the Iraq-Stock-Exchange (ISX) using a developed method of feedforward Neural-Networks based on the Quasi-Newton optimization approach. The proposed method reduces the error factor depending on the Jacobian vector and Lagrange multiplier. This improvement has led to accelerating convergence during the learning process. A sample of companies listed on ISX was selected. This includes twenty-six banks for the years from 2010 to 2020. To evaluate the proposed model, the research findings are compared with …
The Detection Of Sexual Harassment And Chat Predators Using Artificial Neural Network, Noor Amer Hamzah, Ban N. Dhannoon
The Detection Of Sexual Harassment And Chat Predators Using Artificial Neural Network, Noor Amer Hamzah, Ban N. Dhannoon
Karbala International Journal of Modern Science
The vast increase in using social media sites like Twitter and Facebook led to frequent sexual_harassment on the Internet, which is considered a major societal problem. This paper aims to detect sexual_harassment and cyber_predators in early phase. We used deeplearning like Bidirectionally-long-short-term memory. Word representations are carefully reviewed in text specific to mapping to real number vectors. The chat sexual predators Detection_approach with the proposed_model. The best results obtained by the performance measured with F0.5-score were the result is_0.927 with proposed_models. The accuracy measured is_97.27% in the proposed_model. The comments sexual_harassment Detection_approach the result is_0.925 F0.5-score, and accuracy measured is_99.12%.
Ow-Detr: Open-World Detection Transformer, Akshita Gupta, Sanath Narayan, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah
Ow-Detr: Open-World Detection Transformer, Akshita Gupta, Sanath Narayan, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah
Computer Vision Faculty Publications
Open-world object detection (OWOD) is a challenging computer vision problem, where the task is to detect a known set of object categories while simultaneously identifying unknown objects. Additionally, the model must incrementally learn new classes that become known in the next training episodes. Distinct from standard object detection, the OWOD setting poses significant challenges for generating quality candidate proposals on potentially unknown objects, separating the unknown objects from the background and detecting diverse unknown objects. Here, we introduce a novel end-to-end transformer-based framework, OW-DETR, for open-world object detection. The proposed OW-DETR comprises three dedicated components namely, attention-driven pseudo-labeling, novelty classification …
Auto-Curation Of Large Evolving Image Datasets, Sara Mousavicheshmehkaboodi
Auto-Curation Of Large Evolving Image Datasets, Sara Mousavicheshmehkaboodi
Doctoral Dissertations
Large image collections are becoming common in many fields and offer tantalizing opportunities to transform how research, work, and education are conducted if the information and associated insights could be extracted from them. However, major obstacles to this vision exist. First, image datasets with associated metadata contain errors and need to be cleaned and organized to be easily explored and utilized. Second, such collections typically lack the necessary context or may have missing attributes that need to be recovered. Third, such datasets are domain-specific and require human expert involvement to make the right interpretation of the image content. Fourth, the …
Predicting Occurrence Of The Term Sarcopenia With Semi-Supervised Machine Learning, Kevin Flasch
Predicting Occurrence Of The Term Sarcopenia With Semi-Supervised Machine Learning, Kevin Flasch
Theses and Dissertations
Sarcopenia is a medical condition that involves loss of muscle mass. It has been difficult todefine and only recently assigned an official medical code, leading to many medical records lacking a coded diagnosis although the clinical note text may discuss it or symptoms of it. This thesis investigates the application of machine learning and natural language processing to analyze clinical note text to see how well the term ’sarcopenia’ can be predicted in clinical note text from records concerning the condition.
A variety of machine learning models combined with different features and text processingare tested against training data that mentions …
Book Review: Is Law Computable?: Critical Perspectives On Law And Artificial Intelligence, F. Tim Knight
Book Review: Is Law Computable?: Critical Perspectives On Law And Artificial Intelligence, F. Tim Knight
Librarian Publications & Presentations
No abstract provided.
A Human-Embodied Drone For Dexterous Aerial Manipulation, Dongbin Kim
A Human-Embodied Drone For Dexterous Aerial Manipulation, Dongbin Kim
UNLV Theses, Dissertations, Professional Papers, and Capstones
Current drones perform a wide variety of tasks in surveillance, photography, agriculture, package delivery, etc. However, these tasks are performed passively without the use of human interaction. Aerial manipulation shifts this paradigm and implements drones with robotic arms that allow interaction with the environment rather than simply sensing it. For example, in construction, aerial manipulation in conjunction with human interaction could allow operators to perform several tasks, such as hosing decks, drill into surfaces, and sealing cracks via a drone. This integration with drones will henceforth be known as dexterous aerial manipulation.
Our recent work integrated the worker’s experience into …
Analysis Of Residual Neural Networks For Marine Mammal Classification Using Multi-Channel Spectrograms, Daniel T. Murphy
Analysis Of Residual Neural Networks For Marine Mammal Classification Using Multi-Channel Spectrograms, Daniel T. Murphy
University of New Orleans Theses and Dissertations
Surveys of marine mammal populations are an essential part of monitoring the welfare of these animals and their ecosystems. Marine mammal vocalizations provide a reliable method of identifying most species, but passive acoustic monitoring of underwater audio may generate large quantities of data that exceed the capacity of human classifiers. Preprocessing and machine learning techniques provide a method of automating the classification process. In this study, we explore machine learning approaches to vocalization classification using convolutional neural networks with residual learning. Optimal parameters for noise-removal, spectrographic window functions, preprocessing augmentations, and multi-channel spectrogram generation are derived through a series of …
Respiratory Compensated Robot For Liver Cancer Treatment: Design, Fabrication, And Benchtop Characterization, Mishek Jair Musa
Respiratory Compensated Robot For Liver Cancer Treatment: Design, Fabrication, And Benchtop Characterization, Mishek Jair Musa
Graduate Theses and Dissertations
Hepatocellular carcinoma (HCC) is one of the leading causes of cancer-related death in the world. Radiofrequency ablation (RFA) is an effective method for treating tumors less than 5 cm. However, manually placing the RFA needle at the site of the tumor is challenging due to the complicated respiratory induced motion of the liver. This paper presents the design, fabrication, and benchtop characterization of a patient mounted, respiratory compensated robotic needle insertion platform to perform percutaneous needle interventions. The robotic platform consists of a 4-DoF dual-stage cartesian platform used to control the pose of a 1-DoF needle insertion module. The active …
Local Feature Selection For Multiple Instance Learning With Applications., Aliasghar Shahrjooihaghighi
Local Feature Selection For Multiple Instance Learning With Applications., Aliasghar Shahrjooihaghighi
Electronic Theses and Dissertations
Feature selection is a data processing approach that has been successfully and effectively used in developing machine learning algorithms for various applications. It has been proven to effectively reduce the dimensionality of the data and increase the accuracy and interpretability of machine learning algorithms. Conventional feature selection algorithms assume that there is an optimal global subset of features for the whole sample space. Thus, only one global subset of relevant features is learned. An alternative approach is based on the concept of Local Feature Selection (LFS), where each training sample can have its own subset of relevant features. Multiple Instance …
Hsva: Hierarchical Semantic-Visual Adaptation For Zero-Shot Learning, Shiming Chen, Guo Sen Xie, Yang Liu, Qinmu Peng, Baigui Sun, Hao Li, Xinge You, Ling Shao
Hsva: Hierarchical Semantic-Visual Adaptation For Zero-Shot Learning, Shiming Chen, Guo Sen Xie, Yang Liu, Qinmu Peng, Baigui Sun, Hao Li, Xinge You, Ling Shao
Machine Learning Faculty Publications
Zero-shot learning (ZSL) tackles the unseen class recognition problem, transferring semantic knowledge from seen classes to unseen ones. Typically, to guarantee desirable knowledge transfer, a common (latent) space is adopted for associating the visual and semantic domains in ZSL. However, existing common space learning methods align the semantic and visual domains by merely mitigating distribution disagreement through one-step adaptation. This strategy is usually ineffective due to the heterogeneous nature of the feature representations in the two domains, which intrinsically contain both distribution and structure variations. To address this and advance ZSL, we propose a novel hierarchical semantic-visual adaptation (HSVA) framework. …