Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 7 of 7

Full-Text Articles in Physical Sciences and Mathematics

Lightlearn: Occupant Centered Lighting Controller Using Reinforcement Learning To Adapt Systems To Humans, June Young Park, Thomas Dougherty, Hagen Fritz, Zoltan Nagy Sep 2018

Lightlearn: Occupant Centered Lighting Controller Using Reinforcement Learning To Adapt Systems To Humans, June Young Park, Thomas Dougherty, Hagen Fritz, Zoltan Nagy

International Building Physics Conference 2018

Humans spend up to 90% of their time indoors, thus building systems should maintain the indoor environment within the comfort range. In this paper, we present LightLearn, a reinforcement learning based occupant centered lighting controller. The control agent interacts with the occupant non-intrusively, learns her/his preferences, and determines actions for achieving both human comfort and energy saving. We present system hardware, control algorithm, and experimental results of LightLearn for an office space. Compared to the full (9am-5pm) and occupancy based control, LightLearn reduced 83% and 63% of operation time, respectively, by adapting to the occupant.


Adopt: Combining Parameter Tuning And Adaptive Operator Ordering For Solving A Class Of Orienteering Problems, Aldy Gunawan, Hoong Chuin Lau, Kun Lu Jul 2018

Adopt: Combining Parameter Tuning And Adaptive Operator Ordering For Solving A Class Of Orienteering Problems, Aldy Gunawan, Hoong Chuin Lau, Kun Lu

Research Collection School Of Computing and Information Systems

Two fundamental challenges in local search based metaheuristics are how to determine parameter configurations and design the underlying Local Search (LS) procedure. In this paper, we propose a framework in order to handle both challenges, called ADaptive OPeraTor Ordering (ADOPT). In this paper, The ADOPT framework is applied to two metaheuristics, namely Iterated Local Search (ILS) and a hybridization of Simulated Annealing and ILS (SAILS) for solving two variants of the Orienteering Problem: the Team Dependent Orienteering Problem (TDOP) and the Team Orienteering Problem with Time Windows (TOPTW). This framework consists of two main processes. The Design of Experiment (DOE) …


Probabilistic Guided Exploration For Reinforcement Learning In Self-Organizing Neural Networks, Peng Wang, Weigui Jair Zhou, Di Wang, Ah-Hwee Tan Jul 2018

Probabilistic Guided Exploration For Reinforcement Learning In Self-Organizing Neural Networks, Peng Wang, Weigui Jair Zhou, Di Wang, Ah-Hwee Tan

Research Collection School Of Computing and Information Systems

Exploration is essential in reinforcement learning, which expands the search space of potential solutions to a given problem for performance evaluations. Specifically, carefully designed exploration strategy may help the agent learn faster by taking the advantage of what it has learned previously. However, many reinforcement learning mechanisms still adopt simple exploration strategies, which select actions in a pure random manner among all the feasible actions. In this paper, we propose novel mechanisms to improve the existing knowledgebased exploration strategy based on a probabilistic guided approach to select actions. We conduct extensive experiments in a Minefield navigation simulator and the results …


Bounding Box Improvement With Reinforcement Learning, Andrew Lewis Cleland Jun 2018

Bounding Box Improvement With Reinforcement Learning, Andrew Lewis Cleland

Dissertations and Theses

In this thesis, I explore a reinforcement learning technique for improving bounding box localizations of objects in images. The model takes as input a bounding box already known to overlap an object and aims to improve the fit of the box through a series of transformations that shift the location of the box by translation, or change its size or aspect ratio. Over the course of these actions, the model adapts to new information extracted from the image. This active localization approach contrasts with existing bounding-box regression methods, which extract information from the image only once. I implement, train, and …


Pseudorehearsal In Actor-Critic Agents With Neural Network Function Approximation, Vladimir Marochko, Leonard Johard, Manuel Mazzara, Luca Longo Jan 2018

Pseudorehearsal In Actor-Critic Agents With Neural Network Function Approximation, Vladimir Marochko, Leonard Johard, Manuel Mazzara, Luca Longo

Articles

Catastrophic forgetting has a significant negative impact in reinforcement learning. The purpose of this study is to investigate how pseudorehearsal can change performance of an actor-critic agent with neural-network function approximation. We tested agent in a pole balancing task and compared different pseudorehearsal approaches. We have found that pseudorehearsal can assist learning and decrease forgetting.


Towards A Physio-Cognitive Model Of The Exploration Exploitation Trade-Off., David M. Schwartz, Christopher L. Dancy Jan 2018

Towards A Physio-Cognitive Model Of The Exploration Exploitation Trade-Off., David M. Schwartz, Christopher L. Dancy

Faculty Conference Papers and Presentations

Managing the exploration vs exploitation trade-off is an important part of our everyday lives. It occurs in minor decisions such as choosing what music to listen to as well as major decisions, such as picking a research direction to pursue. The dilemma is the same despite the context: does one exploit the environment, using current knowledge to acquire a satisfactory solution, or explore other options and potentially find a better answer. An accurate cognitive model must be able to handle this trade-off because of the importance it plays in our lives. We are developing physio-cognitive models to better understand how …


Intelligent And Secure Underwater Acoustic Communication Networks, Chaofeng Wang Jan 2018

Intelligent And Secure Underwater Acoustic Communication Networks, Chaofeng Wang

Dissertations, Master's Theses and Master's Reports

Underwater acoustic (UWA) communication networks are promising techniques for medium- to long-range wireless information transfer in aquatic applications. The harsh and dynamic water environment poses grand challenges to the design of UWA networks. This dissertation leverages the advances in machine learning and signal processing to develop intelligent and secure UWA communication networks. Three research topics are studied: 1) reinforcement learning (RL)-based adaptive transmission in UWA channels; 2) reinforcement learning-based adaptive trajectory planning for autonomous underwater vehicles (AUVs) in under-ice environments; 3) signal alignment to secure underwater coordinated multipoint (CoMP) transmissions.

First, a RL-based algorithm is developed for adaptive transmission in …