Open Access. Powered by Scholars. Published by Universities.®

Machine Learning Faculty Publications

Learning strategy

Publication Year

Articles 1 - 2 of 2

Full-Text Articles in Artificial Intelligence and Robotics

Learning To Generalize Dispatching Rules On The Job Shop Scheduling, Zangir Iklassov, Dmitrii Medvedev, Ruben Solozabal, Martin Takac Jun 2022

Learning To Generalize Dispatching Rules On The Job Shop Scheduling, Zangir Iklassov, Dmitrii Medvedev, Ruben Solozabal, Martin Takac

Machine Learning Faculty Publications

This paper introduces a Reinforcement Learning approach to better generalize heuristic dispatching rules on the Job-shop Scheduling Problem (JSP). Current models on the JSP do not focus on generalization, although, as we show in this work, this is key to learning better heuristics on the problem. A well-known technique to improve generalization is to learn on increasingly complex instances using Curriculum Learning (CL). However, as many works in the literature indicate, this technique might suffer from catastrophic forgetting when transferring the learned skills between different problem sizes. To address this issue, we introduce a novel Adversarial Curriculum Learning (ACL) strategy, …


Learning From Mistakes - A Framework For Neural Architecture Search, Bhanu Garg, Li Zhang, Pradyumna Sridhara, Ramtin Hosseini, Eric P. Xing, Pengtao Xie Nov 2021

Learning From Mistakes - A Framework For Neural Architecture Search, Bhanu Garg, Li Zhang, Pradyumna Sridhara, Ramtin Hosseini, Eric P. Xing, Pengtao Xie

Machine Learning Faculty Publications

Learning from one's mistakes is an effective human learning technique where the learners focus more on the topics where mistakes were made, so as to deepen their understanding. In this paper, we investigate if this human learning strategy can be applied in machine learning. We propose a novel machine learning method called Learning From Mistakes (LFM), wherein the learner improves its ability to learn by focusing more on the mistakes during revision. We formulate LFM as a three-stage optimization problem: 1) learner learns; 2) learner re-learns focusing on the mistakes, and; 3) learner validates its learning. We develop an efficient …