Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Artificial Intelligence and Robotics

PDF

Series

2020

Institution
Keyword
Publication

Articles 31 - 60 of 171

Full-Text Articles in Physical Sciences and Mathematics

The Future Of Work Now: Ai-Driven Transaction Surveillance At Dbs Bank, Thomas H. Davenport, Steven M. Miller Oct 2020

The Future Of Work Now: Ai-Driven Transaction Surveillance At Dbs Bank, Thomas H. Davenport, Steven M. Miller

Research Collection School Of Computing and Information Systems

One of the most frequently-used phrases at business events these days is “the future of work.” It’s increasingly clear that artificial intelligence and other new technologies will bring substantial changes in work tasks and business processes. But while these changes are predicted for the future, they’re already present in many organizations for many different jobs. The job and incumbents described below are an example of this phenomenon. Steve Miller of Singapore Management University and I co-authored the story.


Reinforcement Learning For Zone Based Multiagent Pathfinding Under Uncertainty, Jiajing Ling, Tarun Gupta, Akshat Kumar Oct 2020

Reinforcement Learning For Zone Based Multiagent Pathfinding Under Uncertainty, Jiajing Ling, Tarun Gupta, Akshat Kumar

Research Collection School Of Computing and Information Systems

We address the problem of multiple agents finding their paths from respective sources to destination nodes in a graph (also called MAPF). Most existing approaches assume that all agents move at fixed speed, and that a single node accommodates only a single agent. Motivated by the emerging applications of autonomous vehicles such as drone traffic management, we present zone-based path finding (or ZBPF) where agents move among zones, and agents' movements require uncertain travel time. Furthermore, each zone can accommodate multiple agents (as per its capacity). We also develop a simulator for ZBPF which provides a clean interface from the …


Dual-Slam: A Framework For Robust Single Camera Navigation, Huajian Huang, Wen-Yan Lin, Siying Liu, Dong Zhang, Sai-Kit Yeung Oct 2020

Dual-Slam: A Framework For Robust Single Camera Navigation, Huajian Huang, Wen-Yan Lin, Siying Liu, Dong Zhang, Sai-Kit Yeung

Research Collection School Of Computing and Information Systems

SLAM (Simultaneous Localization And Mapping) seeks to provide a moving agent with real-time self-localization. To achieve real-time speed, SLAM incrementally propagates position estimates. This makes SLAM fast but also makes it vulnerable to local pose estimation failures. As local pose estimation is ill-conditioned, local pose estimation failures happen regularly, making the overall SLAM system brittle. This paper attempts to correct this problem. We note that while local pose estimation is ill-conditioned, pose estimation over longer sequences is well-conditioned. Thus, local pose estimation errors eventually manifest themselves as mapping inconsistencies. When this occurs, we save the current map and activate two …


Modular Neural Networks For Low-Power Image Classification On Embedded Devices, Abhinav Goel, Sara Aghajanzadeh, Caleb Tung, Shuo-Han Chen, George K. Thiruvathukal, Yung-Hisang Lu Oct 2020

Modular Neural Networks For Low-Power Image Classification On Embedded Devices, Abhinav Goel, Sara Aghajanzadeh, Caleb Tung, Shuo-Han Chen, George K. Thiruvathukal, Yung-Hisang Lu

Computer Science: Faculty Publications and Other Works

Embedded devices are generally small, battery-powered computers with limited hardware resources. It is difficult to run deep neural networks (DNNs) on these devices, because DNNs perform millions of operations and consume significant amounts of energy. Prior research has shown that a considerable number of a DNN’s memory accesses and computation are redundant when performing tasks like image classification. To reduce this redundancy and thereby reduce the energy consumption of DNNs, we introduce the Modular Neural Network Tree architecture. Instead of using one large DNN for the classifier, this architecture uses multiple smaller DNNs (called modules) to progressively classify images …


Co-Design And Evaluation Of An Intelligent Decision Support System For Stroke Rehabilitation Assessment, Min Hun Lee, Daniel P. Siewiorek, Asim Smailagic, Alexandre Bernardino, Sergi Badia Oct 2020

Co-Design And Evaluation Of An Intelligent Decision Support System For Stroke Rehabilitation Assessment, Min Hun Lee, Daniel P. Siewiorek, Asim Smailagic, Alexandre Bernardino, Sergi Badia

Research Collection School Of Computing and Information Systems

Clinical decision support systems have the potential to improve work flows of experts in practice (e.g. therapist's evidence-based rehabilitation assessment). However, the adoption of these systems is challenging, and the gains of these systems have not fully demonstrated yet. In this paper, we identified the needs of therapists to assess patient's functional abilities (e.g. alternative perspectives with quantitative information on patient's exercise motions). As a result, we co-designed and developed an intelligent decision support system that automatically identifies salient features of assessment using reinforcement learning to assess the quality of motion and generate patient-specific analysis. We evaluated this system with …


Knowledge Enhanced Neural Fashion Trend Forecasting, Yunshan Ma, Yujuan Ding, Xun Yang, Lizi Liao, Wai Keung Wong, Tat-Seng Chua Oct 2020

Knowledge Enhanced Neural Fashion Trend Forecasting, Yunshan Ma, Yujuan Ding, Xun Yang, Lizi Liao, Wai Keung Wong, Tat-Seng Chua

Research Collection School Of Computing and Information Systems

Fashion trend forecasting is a crucial task for both academia and industry. Although some efforts have been devoted to tackling this challenging task, they only studied limited fashion elements with highly seasonal or simple patterns, which could hardly reveal the real fashion trends. Towards insightful fashion trend forecasting, this work focuses on investigating fine-grained fashion element trends for specific user groups. We first contribute a large-scale fashion trend dataset (FIT) collected from Instagram with extracted time series fashion element records and user information. Furthermore, to effectively model the time series data of fashion elements with rather complex patterns, we propose …


The Future Of Work Now: Automl At 84.51°And Kroger, Thomas H. Davenport, Steven M. Miller Oct 2020

The Future Of Work Now: Automl At 84.51°And Kroger, Thomas H. Davenport, Steven M. Miller

Research Collection School Of Computing and Information Systems

One of the most frequently-used phrases at business events these days is “the future of work.” It’s increasingly clear that artificial intelligence and other new technologies will bring substantial changes in work tasks and business processes. But while these changes are predicted for the future, they’re already present in many organizations for many different jobs. The job and incumbents described below are an example of this phenomenon.


Foodbot: A Goal-Oriented Just-In-Time Healthy Eating Interventions Chatbot, Philips Kokoh Prasetyo, Palakorn Achananuparp, Ee-Peng Lim Oct 2020

Foodbot: A Goal-Oriented Just-In-Time Healthy Eating Interventions Chatbot, Philips Kokoh Prasetyo, Palakorn Achananuparp, Ee-Peng Lim

Research Collection School Of Computing and Information Systems

Recent research has identified a few design flaws in popular mobile health (mHealth) applications for promoting healthy eating lifestyle, such as mobile food journals. These include tediousness of manual food logging, inadequate food database coverage, and a lack of healthy dietary goal setting. To address these issues, we present Foodbot, a chatbot-based mHealth application for goal-oriented just-in-time (JIT) healthy eating interventions. Powered by a large-scale food knowledge graph, Foodbot utilizes automatic speech recognition and mobile messaging interface to record food intake. Moreover, Foodbot allows users to set goals and guides their behavior toward the goals via JIT notification prompts, interactive …


Topology-Guided Roadmap Construction With Dynamic Region Sampling, Read Sandström, Diane Uwacu, Jory Denny, Nancy M. Amato Oct 2020

Topology-Guided Roadmap Construction With Dynamic Region Sampling, Read Sandström, Diane Uwacu, Jory Denny, Nancy M. Amato

Department of Math & Statistics Faculty Publications

Many types of planning problems require discovery of multiple pathways through the environment, such as multi-robot coordination or protein ligand binding. The Probabilistic Roadmap (PRM) algorithm is a powerful tool for this case, but often cannot efficiently connect the roadmap in the presence of narrow passages. In this letter, we present a guidance mechanism that encourages the rapid construction of well-connected roadmaps with PRM methods. We leverage a topological skeleton of the workspace to track the algorithm's progress in both covering and connecting distinct neighborhoods, and employ this information to focus computation on the uncovered and unconnected regions. We demonstrate …


Asymptotically-Optimal Topological Nearest-Neighbor Filtering, Read Sandström, Jory Denny, Nancy M. Amato Oct 2020

Asymptotically-Optimal Topological Nearest-Neighbor Filtering, Read Sandström, Jory Denny, Nancy M. Amato

Department of Math & Statistics Faculty Publications

Nearest-neighbor finding is a major bottleneck for sampling-based motion planning algorithms. The cost of finding nearest neighbors grows with the size of the roadmap, leading to a significant computational bottleneck for problems which require many configurations to find a solution. In this work, we develop a method of mapping configurations of a jointed robot to neighborhoods in the workspace that supports fast search for configurations in nearby neighborhoods. This expedites nearest-neighbor search by locating a small set of the most likely candidates for connecting to the query with a local plan. We show that this filtering technique can preserve asymptotically-optimal …


Chess As A Testing Grounds For The Oracle Approach To Ai Safety, James D. Miller, Roman Yampolskiy, Olle Häggström, Stuart Armstrong Sep 2020

Chess As A Testing Grounds For The Oracle Approach To Ai Safety, James D. Miller, Roman Yampolskiy, Olle Häggström, Stuart Armstrong

Faculty Scholarship

To reduce the danger of powerful super-intelligent AIs, we might make the first such AIs oracles that can only send and receive messages. This paper proposes a possibly practical means of using machine learning to create two classes of narrow AI oracles that would provide chess advice: those aligned with the player's interest, and those that want the player to lose and give deceptively bad advice. The player would be uncertain which type of oracle it was interacting with. As the oracles would be vastly more intelligent than the player in the domain of chess, experience with these oracles might …


Trainable Structure Tensors For Autonomous Baggage Threat Detection Under Extreme Occlusion, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi Sep 2020

Trainable Structure Tensors For Autonomous Baggage Threat Detection Under Extreme Occlusion, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi

Computer Vision Faculty Publications

Detecting baggage threats is one of the most difficult tasks, even for expert officers. Many researchers have developed computer-aided screening systems to recognize these threats from the baggage X-ray scans. However, all of these frameworks are limited in identifying the contraband items under extreme occlusion. This paper presents a novel instance segmentation framework that utilizes trainable structure tensors to highlight the contours of the occluded and cluttered contraband items (by scanning multiple predominant orientations), while simultaneously suppressing the irrelevant baggage content. The proposed framework has been extensively tested on four publicly available X-ray datasets where it outperforms the state-of-the-art frameworks …


F-Measure Optimisation And Label Regularisation For Energy-Based Neural Dialogue State Tracking Models, Anh Duong Trinh, Robert J. Ross, John D. Kelleher Sep 2020

F-Measure Optimisation And Label Regularisation For Energy-Based Neural Dialogue State Tracking Models, Anh Duong Trinh, Robert J. Ross, John D. Kelleher

Conference papers

In recent years many multi-label classification methods have exploited label dependencies to improve performance of classification tasks in various domains, hence casting the tasks to structured prediction problems. We argue that multi-label predictions do not always satisfy domain constraint restrictions. For example when the dialogue state tracking task in task-oriented dialogue domains is solved with multi-label classification approaches, slot-value constraint rules should be enforced following real conversation scenarios.

To address these issues we propose an energy-based neural model to solve the dialogue state tracking task as a structured prediction problem. Furthermore we propose two improvements over previous methods with respect …


Tag: Automated Image Captioning, Nathan Funckes Sep 2020

Tag: Automated Image Captioning, Nathan Funckes

McNair Scholars Manuscripts

Many websites remain non-ADA compliant, containing images which lack accompanying textual descriptions. This leaves sight-impaired individuals unable to fully enjoy the rich wonders of the web. To address this inequity, our research aims to create an autonomous system capable of generating semantically accurate descriptions of images. This problem involves two tasks: recognizing an image and linguistically describing it. Our solution uses state-of-the-art deep learning: employing a convolutional neural network that "learns" to understand images and extracts their salient features, and a recurrent neural network that learns to generate structured, coherent sentences. These two networks are merged to create a single …


Measuring The Perceived Social Intelligence Of Robots, Kimberly A. Barchard, Leiszle Lapping-Carr, R. Shane Westfall, Andrea Fink-Armold, Santosh Balajee Banisetty, David Feil-Seifer Sep 2020

Measuring The Perceived Social Intelligence Of Robots, Kimberly A. Barchard, Leiszle Lapping-Carr, R. Shane Westfall, Andrea Fink-Armold, Santosh Balajee Banisetty, David Feil-Seifer

Psychology Faculty Research

Robotic social intelligence is increasingly important. However, measures of human social intelligence omit basic skills, and robot-specific scales do not focus on social intelligence. We combined human robot interaction concepts of beliefs, desires, and intentions with psychology concepts of behaviors, cognitions, and emotions to create 20 Perceived Social Intelligence (PSI) Scales to comprehensively measure perceptions of robots with a wide range of embodiments and behaviors. Participants rated humanoid and non-humanoid robots interacting with people in five videos. Each scale had one factor and high internal consistency, indicating each measures a coherent construct. Scales capturing perceived social information processing skills (appearing …


The Future Of Work Now: The Multi-Faceted Mall Security Guard At A Multi-Faceted Jewel, Thomas H. Davenport, Steven M. Miller Sep 2020

The Future Of Work Now: The Multi-Faceted Mall Security Guard At A Multi-Faceted Jewel, Thomas H. Davenport, Steven M. Miller

Research Collection School Of Computing and Information Systems

One of the most frequently-used phrases at business events these days is “the future of work.” It’s increasingly clear that artificial intelligence and other new technologies will bring substantial changes in work tasks and business processes. But while these changes are predicted for the future, they’re already present in many organizations for many different jobs. The job and incumbents described below are an example of this phenomenon. Steve Miller of Singapore Management University and I co-authored the story.


Zone Path Construction (Zac) Based Approaches For Effective Real-Time Ridesharing, Meghna Lowalekar, Pradeep Varakantham, Patrick Jaillet Sep 2020

Zone Path Construction (Zac) Based Approaches For Effective Real-Time Ridesharing, Meghna Lowalekar, Pradeep Varakantham, Patrick Jaillet

Research Collection School Of Computing and Information Systems

Real-time ridesharing systems such as UberPool, Lyft Line, GrabShare have become hugely popular as they reduce the costs for customers, improve per trip revenue for drivers and reduce traffic on the roads by grouping customers with similar itineraries. The key challenge in these systems is to group the "right" requests to travel together in the "right" available vehicles in real-time, so that the objective (e.g., requests served, revenue or delay) is optimized. This challenge has been addressed in existing work by: (i) generating as many relevant feasible (with respect to the available delay for customers) combinations of requests as possible …


Rethinking Mistake In The Age Of Algorithms: Quoine Pte Ltd V B2c2 Ltd, Vincent Ooi, Kian Peng Soh Sep 2020

Rethinking Mistake In The Age Of Algorithms: Quoine Pte Ltd V B2c2 Ltd, Vincent Ooi, Kian Peng Soh

Research Collection Yong Pung How School Of Law

Good traders remove emotion from the decision-making process. Automated trading algorithms have enabled this, allowing one to trade round the clock, and without the constant need to monitor one’s investments. But software has gremlins. Given the vast amounts of money involved in such trades, it was only a matter of time before disputes involving automated trading software came before the courts. The decision in Quoine v B2C2 (“Quoine”) represents the first time an apex court in the Commonwealth has ruled on the applicability of contractual principles to situations involving automated trading software.


Deep Learning Techniques For Structural Response Prediction During Strong Ground Motions, Ahmed A. Torky, Susumu Ohno Prof., Toshide Kashima Prof. Sep 2020

Deep Learning Techniques For Structural Response Prediction During Strong Ground Motions, Ahmed A. Torky, Susumu Ohno Prof., Toshide Kashima Prof.

Civil Engineering

In this paper, deep learning techniques are applied to predict a building’s structural response to strong ground motions. Data from sensors near and inside structures measure accelerations during strong ground motions. The Building Research Institute (BRI) ANX building, an eight-story structure, has experienced major earthquakes since 1998. Sensors in the building provide and accumulate big data of historic events. Changes of the natural frequency of the ANX building from the big data is initially quantified. The time-series data of the historic events can be used to predict future response to future events using deep learning models rapidly. Although previous literature …


Research 4.0: Research In The Age Of Automation, Rob Procter, Ben Glover, Elliot Jones Sep 2020

Research 4.0: Research In The Age Of Automation, Rob Procter, Ben Glover, Elliot Jones

Copyright, Fair Use, Scholarly Communication, etc.

Executive Summary

There is a growing consensus that we are at the start of a fourth industrial revolution, driven by developments in Artificial Intelligence, machine learning, robotics, the Internet of Things, 3-D printing, nanotechnology, biotechnology, 5G, new forms of energy storage and quantum computing. This wave of technical innovations is already having a significant impact on how research is conducted, with dramatic change across research methods in recent years within some disciplines, as this project’s interim report set out.

Whilst there are a wide range of technologies associated with the fourth industrial revolution, this report primarily seeks to understand what …


Bus Frequency Optimization: When Waiting Time Matters In User Satisfaction, Songsong Mo, Zhifeng Bao, Baihua Zheng, Zhiyong Peng Sep 2020

Bus Frequency Optimization: When Waiting Time Matters In User Satisfaction, Songsong Mo, Zhifeng Bao, Baihua Zheng, Zhiyong Peng

Research Collection School Of Computing and Information Systems

Reorganizing bus frequency to cater for the actual travel demand can save the cost of the public transport system significantly. Many, if not all, existing studies formulate this as a bus frequency optimization problem which tries to minimize passengers’ average waiting time. However, many investigations have confirmed that the user satisfaction drops faster as the waiting time increases. Consequently, this paper studies the bus frequency optimization problem considering the user satisfaction. Specifically, for the first time to our best knowledge, we study how to schedule the buses such that the total number of passengers who could receive their bus services …


Pricing And Equilibrium In On-Demand Ride-Pooling Markets, Jintao Ke, Hai Yang, Xinwei Li, Hai Wang, Jieping Ye Sep 2020

Pricing And Equilibrium In On-Demand Ride-Pooling Markets, Jintao Ke, Hai Yang, Xinwei Li, Hai Wang, Jieping Ye

Research Collection School Of Computing and Information Systems

With the recent rapid growth of technology-enabled mobility services, ride-sourcing platforms, such as Uber and DiDi, have launched commercial on-demand ride-pooling programs that allow drivers to serve more than one passenger request in each ride. Without requiring the prearrangement of trip schedules, these programs match on-demand passenger requests with vehicles that have vacant seats. Ride-pooling programs are expected to offer benefits for both individual passengers in the form of cost savings and for society in the form of traffic alleviation and emission reduction. In addition to some exogenous variables and environments for ride-sourcing market, such as city size and population …


An Ecosystem Approach To Ethical Ai And Data Use: Experimental Reflections, Mark Findlay, Josephine Seah Sep 2020

An Ecosystem Approach To Ethical Ai And Data Use: Experimental Reflections, Mark Findlay, Josephine Seah

Research Collection Yong Pung How School Of Law

While we have witnessed a rapid growth of ethics documents meant to guide artificial intelligence (AI) development, the promotion of AI ethics has nonetheless proceeded with little input from AI practitioners themselves. Given the proliferation of AI for Social Good initiatives, this is an emerging gap that needs to be addressed in order to develop more meaningful ethical approaches to AI use and development. This paper offers a methodology-a 'shared fairness' approach-aimed at identifying AI practitioners' needs when it comes to confronting and resolving ethical challenges and to find a third space where their operational language can be married with …


Off-Policy Reinforcement Learning For Efficient And Effective Gan Architecture Search, Tian Yuan, Wang Qin, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang, Olga Fink Aug 2020

Off-Policy Reinforcement Learning For Efficient And Effective Gan Architecture Search, Tian Yuan, Wang Qin, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang, Olga Fink

Research Collection School Of Computing and Information Systems

In this paper, we introduce a new reinforcement learning (RL) based neural architecture search (NAS) methodology for effective and efficient generative adversarial network (GAN) architecture search. The key idea is to formulate the GAN architecture search problem as a Markov decision process (MDP) for smoother architecture sampling, which enables a more effective RL-based search algorithm by targeting the potential global optimal architecture. To improve efficiency, we exploit an off-policy GAN architecture search algorithm that makes efficient use of the samples generated by previous policies. Evaluation on two standard benchmark datasets (i.e., CIFAR-10 and STL-10) demonstrates that the proposed method is …


Feature Pyramid Transformer, Dong Zhang, Hanwang Zhang, Jinhui Tang, Meng Wang, Xian-Sheng Hua, Qianru Sun Aug 2020

Feature Pyramid Transformer, Dong Zhang, Hanwang Zhang, Jinhui Tang, Meng Wang, Xian-Sheng Hua, Qianru Sun

Research Collection School Of Computing and Information Systems

Feature interactions across space and scales underpin modern visual recognition systems because they introduce beneficial visual contexts. Conventionally, spatial contexts are passively hidden in the CNN’s increasing receptive fields or actively encoded by non-local convolution. Yet, the non-local spatial interactions are not across scales, and thus they fail to capture the non-local contexts of objects (or parts) residing in different scales. To this end, we propose a fully active feature interaction across both space and scales, called Feature Pyramid Transformer (FPT). It transforms any feature pyramid into another feature pyramid of the same size but with richer contexts, by using …


Intersentiment: Combining Deep Neural Models On Interaction And Sentiment For Review Rating Prediction, Shi Feng, Kaisong Song, Daling Wang, Wei Gao, Yifei Zhang Aug 2020

Intersentiment: Combining Deep Neural Models On Interaction And Sentiment For Review Rating Prediction, Shi Feng, Kaisong Song, Daling Wang, Wei Gao, Yifei Zhang

Research Collection School Of Computing and Information Systems

Review rating prediction is commonly approached from the perspective of either Collaborative Filtering (CF) or Sentiment Classification (SC). CF-based approach usually resorts to matrix factorization based on user–item interaction, and does not fully utilize the valuable review text features. In contrast, SC-based approach is focused on mining review content, but can just incorporate some user- and product-level features, and fails to capture sufficient interactions between them represented typically in a sparse matrix as CF can do. In this paper, we propose a novel, extensible review rating prediction model called InterSentiment by bridging the user-product interaction model and the sentiment model …


Commanding And Re-Dictation: Developing Eyes-Free Voice-Based Interaction For Editing Dictated Text, Debjyoti Ghosh, Can Liu, Shengdong Zhao, Kotaro Hara Aug 2020

Commanding And Re-Dictation: Developing Eyes-Free Voice-Based Interaction For Editing Dictated Text, Debjyoti Ghosh, Can Liu, Shengdong Zhao, Kotaro Hara

Research Collection School Of Computing and Information Systems

Existing voice-based interfaces have limited support for text editing, especially when seeing the text is difficult, e.g., while walking or cooking. This research develops voice interaction techniques for eyes-free text editing. First, with a Wizard-of-Oz study, we identified two primary user strategies: using commands, e.g., “replace go with goes” and re-dictating over an erroneous portion, e.g., correcting “he go there” by saying “he goes there.” To support these user strategies with an actual system implementation, we developed two eyes-free voice interaction techniques, Commanding and Re-dictation, and evaluated them with a controlled experiment. Results showed that while Re-dictation performs significantly better …


A Multicut Outer-Approximation Approach For Competitive Facility Location Under Random Utilities, Tien Mai, Andrea Lodi Aug 2020

A Multicut Outer-Approximation Approach For Competitive Facility Location Under Random Utilities, Tien Mai, Andrea Lodi

Research Collection School Of Computing and Information Systems

This work concerns the maximum capture facility location problem with random utilities, i.e., the problem of seeking to locate new facilities in a competitive market such that the captured demand of users is maximized, assuming that each individual chooses among all available facilities according to a random utility maximization model. The main challenge lies in the nonlinearity of the objective function. Motivated by the convexity and separable structure of such an objective function, we propose an enhanced implementation of the outer approximation scheme. Our algorithm works in a cutting plane fashion and allows to separate the objective function into a …


An Analysis Of Sketched Irls For Accelerated Sparse Residual Regression, Daichi Iwata, Michael Waechter, Wen-Yan Lin, Yasuyuki Matsushita Aug 2020

An Analysis Of Sketched Irls For Accelerated Sparse Residual Regression, Daichi Iwata, Michael Waechter, Wen-Yan Lin, Yasuyuki Matsushita

Research Collection School Of Computing and Information Systems

This paper studies the problem of sparse residual regression, i.e., learning a linear model using a norm that favors solutions in which the residuals are sparsely distributed. This is a common problem in a wide range of computer vision applications where a linear system has a lot more equations than unknowns and we wish to find the maximum feasible set of equations by discarding unreliable ones. We show that one of the most popular solution methods, iteratively reweighted least squares (IRLS), can be significantly accelerated by the use of matrix sketching. We analyze the convergence behavior of the proposed method …


An Ensemble Of Epoch-Wise Empirical Bayes For Few-Shot Learning, Yaoyao Liu, Bernt Schiele, Qianru Sun Aug 2020

An Ensemble Of Epoch-Wise Empirical Bayes For Few-Shot Learning, Yaoyao Liu, Bernt Schiele, Qianru Sun

Research Collection School Of Computing and Information Systems

Few-shot learning aims to train efficient predictive models with a few examples. The lack of training data leads to poor models that perform high-variance or low-confidence predictions. In this paper, we propose to meta-learn the ensemble of epoch-wise empirical Bayes models (E3BM) to achieve robust predictions. “Epoch-wise'' means that each training epoch has a Bayes model whose parameters are specifically learned and deployed. ”Empirical'' means that the hyperparameters, e.g., used for learning and ensembling the epoch-wise models, are generated by hyperprior learners conditional on task-specific data. We introduce four kinds of hyperprior learners by considering inductive vs. transductive, and epoch-dependent …