Engineering | Open Access Articles | Digital Commons Network™

Task Scheduling For Internet Of Vehicles Based On Deep Reinforcement Learning In Edge Computing, Xiang Ju, Shengchao Su, Chaojie Xu, Beibei He

Journal of System Simulation

Abstract: Aiming at the offloading and execution of delay-constrained computing tasks for internet of vehicles in edge computing, a task scheduling method based on deep reinforcement learning is proposed. In multi-edge server scenario, a software-defined network-aided internet of vehicles task offloading system is built. On this basis, the task scheduling model of vehicle computation offloading is given. According to the characteristics of task scheduling, a scheduling method based on an improved pointer network is designed. Considering the complexity of task scheduling and computing resource allocation, the deep reinforcement learning algorithm is used to train the pointer network. The vehicle offloading …

Go to article

Intelligent Air Defense Task Assignment Based On Assignment Strategy Optimization Algorithm, Jiayi Liu, Gang Wang, Qiang Fu, Xiangke Guo, Siyuan Wang

Journal of System Simulation

Abstract: Aiming at the insufficient solving speed of assignment strategy optimization algorithm in largescale scenarios, deep reinforcement learning is combined with Markov decision process to carry out the intelligent large-scale air defense task assignment. According to the characteristics of large-scale air defense operations, Markov decision process is used to model the agent and a digital battlefield simulation environment is built. Air defense task assignment agent is designed and trained in digital battlefield simulation environment through proximal policy optimization algorithm. The feasibility and advantage of the method are verified by taking a large-scale ground-to-air countermeasure mission as an example.

Go to article

Intelligent Path Planning For Mobile Robots Based On Sac Algorithm, Laiyi Yang, Jing Bi, Haitao Yuan

Journal of System Simulation

Abstract: Aiming at the high dimension, slow convergence and complex modelling of traditional path planning algorithms for mobile robots, a new intelligent path planning algorithm is proposed, which is based on deep reinforcement learning soft actor-critic (SAC) algorithm to save the poor performance of robot in complicated environments with static and dynamic obstacles. An improved reward function is designed to enable mobile robots to quickly avoid obstacles and reach targets by using state dynamic normalization and priority experience pool techniques. To evaluate the performance, a pygame-based simulation environment is constructed. Compared with proximal policy optimization(PPO) algorithm, experimental …

Go to article

Obstacle Avoidance Path Planning And Simulation Of Mobile Picking Robot Based On Dppo, Junqiang Lin, Hongjun Wang, Xiangjun Zou, Po Zhang, Chengen Li, Yipeng Zhou, Shujie Yao

Journal of System Simulation

Abstract: Aiming at the autonomous decision-making difficulty of mobile picking robots in random and changeable complicated path environment during field operations, an autonomous obstacle avoidance path planning method based on deep reinforcement learning is propose. By setting the state space and action space and using the artificial potential field method to design the reward function, an obstacle penalty coefficient setting method based on collision cone collision avoidance detection is proposed to improve the autonomous collision avoidance ability. A virtual simulation system is constructed, in which the learning and training of the mobile picking robot is carried out and verified by …

Go to article

Generalization Through Diversity: Improving Unsupervised Environment Design, Wenjun Li, Pradeep Varakantham, Dexun Li

Research Collection School Of Computing and Information Systems

Agent decision making using Reinforcement Learning (RL) heavily relies on either a model or simulator of the environment (e.g., moving in an 8x8 maze with three rooms, playing Chess on an 8x8 board). Due to this dependence, small changes in the environment (e.g., positions of obstacles in the maze, size of the board) can severely affect the effectiveness of the policy learned by the agent. To that end, existing work has proposed training RL agents on an adaptive curriculum of environments (generated automatically) to improve performance on out-of-distribution (OOD) test scenarios. Specifically, existing research has employed the potential for the …

Go to article

Joint Optimization Strategy Of Computing Offloading And Edge Caching For Intelligent Connected Vehicles, Fei Ding, Yuchen Sha, Ying Hong, Xiao Kuai, Dengyin Zhang

Journal of System Simulation

To guarantee the low-delay communication of intelligent connected vehicles, the V2X channel model and the multi-access edge computing (MEC) technology, are used to carry out the research of the joint optimization strategy of computing offloading and edge caching.An intelligent connected vehicle with task offloading and edge caching model least-deep deterministic policy gradient(L-DDPG) is developed.By integrating the vehicular local and edge computing resources, the classification processing of different computing tasks in V2X scenarios is supported.The vehicular computing request is prejudged by edge platform to ensure the rapid response of continuous homogeneous computing tasks. Combining with the least recently …

Go to article

Mixed-Criticality Scheduling Using Reinforcement Learning, Omar Elseadawy

Theses and Dissertations

Mixed-criticality (MC) scheduling is necessary for many safety-critical real-time embedded systems, as a failure of high-criticality jobs could lead to fatal accidents. With the emergence of software technologies in software-defined vehicles in the automotive and avionics industries, studying Mixed-Critically (MC) systems is essential to their safety standards, similar to ISO26262. The real-time operation of MC systems makes it an inherently online problem, such that the scheduler is only aware of the jobs that are currently released at any point in time and has no knowledge of future jobs. Due to the overhead cost of preemption, this study focuses on enforcing …

Go to article

Deep Reinforcement Learning-Based Control Strategy For Boost Converter, Yuxuan Dai, Chenggang Cui

Journal of System Simulation

Abstract: In view of the problems of model uncertainty and nonlinearity in bus voltage stability control of Boost converter, an intelligent control strategy based on model-free deep reinforcement learning(RL) is proposed. RL double DQN(DDQN) algorithm and deep deterministic policy gradient(DDPG) algorithm are used, and the Boost converter controller is designed. The state, action space, reward function, and neural network are also designed to improve the dynamic performance of the controller. The joint simulation of the Boost converter model and RL agent is realized by RL modelica(RLM …

Go to article

Research Progress Of Opponent Modeling Based On Deep Reinforcement Learning, Haotian Xu, Long Qin, Junjie Zeng, Yue Hu, Qi Zhang

Journal of System Simulation

Abstract: Deep reinforcement learning is an agent modeling method with both deep learning feature extraction ability and reinforcement learning sequence decision-making ability, which can make up for the depleted non-stationary adaptation, complex feature selection and insufficient state-space representation ability of traditional opponent modeling. The deep reinforcement learning-based opponent modeling methods are divided into two categories, explicit modeling and implicit modeling, and the corresponding theories, models, algorithms and applicable scenarios are sorted out according to the categories. The applications of deep reinforcement learning-based opponent modeling techniques on different fields are introduced. The key problems and future development are summarized to provide …

Go to article

A Review On Learning To Solve Combinatorial Optimisation Problems In Manufacturing, Cong Zhang, Yaoxin Wu, Yining Ma, Wen Song, Zhang Le, Zhiguang Cao, Jie Zhang

Research Collection School Of Computing and Information Systems

An efficient manufacturing system is key to maintaining a healthy economy today. With the rapid development of science and technology and the progress of human society, the modern manufacturing system is becoming increasingly complex, posing new challenges to both academia and industry. Ever since the beginning of industrialisation, leaps in manufacturing technology have always accompanied technological breakthroughs from other fields, for example, mechanics, physics, and computational science. Recently, machine learning (ML) technology, one of the crucial subjects of artificial intelligence, has made remarkable progress in many areas. This study thoroughly reviews how ML, specifically deep (reinforcement) learning, motivates new ideas …

Go to article

Estimation Of Optimal Flight Trajectory In A Total Power Loss Scenario Using Proximal Policy Optimization, Eidahn Eliash

Theses and Dissertations

Recent research has been conducted in using deep reinforcement learning algorithms to generate closed form optimal control laws for helicopters in power loss scenarios. The control laws could allow for design of various cockpit visual pilot aids if proven successful. Such pilot aids are predicted to reduce pilot workload during execution of an emergency powerless landing, and to overall increase the probability of landing safely in such an event. The proposed topic discusses the use of the PPO algorithm in order to find an optimal control policy for a helicopter in a total power loss emergency landing scenario. The task …

Go to article

Virtual Synchronous Generator Control Using Twin Delayed Deep Deterministic Policy Gradient Method, Oroghene Oboreh-Snapps, Buxin She, Shah Fahad, Haotian Chen, Jonathan W. Kimball, Fangxing Li, Hantao Cui, Rui Bo

Electrical and Computer Engineering Faculty Research & Creative Works

This paper presents a data-driven approach that adaptively tunes the parameters of a virtual synchronous generator to achieve optimal frequency response against disturbances. In the proposed approach, the control variables, namely, the virtual moment of inertia and damping factor, are transformed into actions of a reinforcement learning agent. Different from the state-of-the-art methods, the proposed study introduces the settling time parameter as one of the observations in addition to the frequency and rate of change of frequency (RoCoF). In the reward function, preset indices are considered to simultaneously ensure bounded frequency deviation, low RoCoF, fast response, and quick settling time. …

Go to article

Deep Reinforcement Learning For Approximate Policy Iteration: Convergence Analysis And A Post-Earthquake Disaster Response Case Study, Abhijit Gosavi, L. (Lesley) H. Sneed, L. A. Spearing

Engineering Management and Systems Engineering Faculty Research & Creative Works

Approximate Policy Iteration (API) is a Class of Reinforcement Learning (RL) Algorithms that Seek to Solve the Long-Run Discounted Reward Markov Decision Process (MDP), Via the Policy Iteration Paradigm, Without Learning the Transition Model in the Underlying Bellman Equation. Unfortunately, These Algorithms Suffer from a Defect Known as Chattering in Which the Solution (Policy) Delivered in Each Iteration of the Algorithm Oscillates between Improved and Worsened Policies, Leading to Sub-Optimal Behavior. Two Causes for This that Have Been Traced to the Crucial Policy Improvement Step Are: (I) the Inaccuracies in the Policy Improvement Function and (Ii) the Exploration/exploitation Tradeoff Integral …

Go to article

A Data-Driven Scheduling Approach For Integrated Electricity-Hydrogen System Based On Improved Ddpg, Yaping Zhao, Jingsi Huang, Endong Xu, Jianxiao Wang, Xiaoyun Xu

Graduate School of Business Publications

The involvement of hydrogen energy systems has been recognised as a promising way to mitigate climate problems. As a kind of efficient multi-energy complementary system, the hydropower-photovoltaic-hydrogen (HPH) system could be an ideal approach to combining hydrogen with an installed renewable energy system to improve the flexibility of energy management and reduce power curtailment. However, the intra-day scheduling of HPH system brings challenges due to the time-related nonlinear hydropower generation process, the complex energy conversion process and the uncertain natural resource supply. Faced with these challenges, an improved deep deterministic policy gradient (DDPG)-based data-driven scheduling algorithm is proposed. In contrast …

Go to article

Engineering Commons^™

Full-Text Articles in Engineering

Task Scheduling For Internet Of Vehicles Based On Deep Reinforcement Learning In Edge Computing, Xiang Ju, Shengchao Su, Chaojie Xu, Beibei He

Journal of System Simulation

Intelligent Air Defense Task Assignment Based On Assignment Strategy Optimization Algorithm, Jiayi Liu, Gang Wang, Qiang Fu, Xiangke Guo, Siyuan Wang

Journal of System Simulation

Intelligent Path Planning For Mobile Robots Based On Sac Algorithm, Laiyi Yang, Jing Bi, Haitao Yuan

Journal of System Simulation

Obstacle Avoidance Path Planning And Simulation Of Mobile Picking Robot Based On Dppo, Junqiang Lin, Hongjun Wang, Xiangjun Zou, Po Zhang, Chengen Li, Yipeng Zhou, Shujie Yao

Journal of System Simulation

Generalization Through Diversity: Improving Unsupervised Environment Design, Wenjun Li, Pradeep Varakantham, Dexun Li

Research Collection School Of Computing and Information Systems

Joint Optimization Strategy Of Computing Offloading And Edge Caching For Intelligent Connected Vehicles, Fei Ding, Yuchen Sha, Ying Hong, Xiao Kuai, Dengyin Zhang

Journal of System Simulation

Mixed-Criticality Scheduling Using Reinforcement Learning, Omar Elseadawy

Theses and Dissertations

Deep Reinforcement Learning-Based Control Strategy For Boost Converter, Yuxuan Dai, Chenggang Cui

Journal of System Simulation

Research Progress Of Opponent Modeling Based On Deep Reinforcement Learning, Haotian Xu, Long Qin, Junjie Zeng, Yue Hu, Qi Zhang

Journal of System Simulation

A Review On Learning To Solve Combinatorial Optimisation Problems In Manufacturing, Cong Zhang, Yaoxin Wu, Yining Ma, Wen Song, Zhang Le, Zhiguang Cao, Jie Zhang

Research Collection School Of Computing and Information Systems

Estimation Of Optimal Flight Trajectory In A Total Power Loss Scenario Using Proximal Policy Optimization, Eidahn Eliash

Theses and Dissertations

Virtual Synchronous Generator Control Using Twin Delayed Deep Deterministic Policy Gradient Method, Oroghene Oboreh-Snapps, Buxin She, Shah Fahad, Haotian Chen, Jonathan W. Kimball, Fangxing Li, Hantao Cui, Rui Bo

Electrical and Computer Engineering Faculty Research & Creative Works

Deep Reinforcement Learning For Approximate Policy Iteration: Convergence Analysis And A Post-Earthquake Disaster Response Case Study, Abhijit Gosavi, L. (Lesley) H. Sneed, L. A. Spearing

Engineering Management and Systems Engineering Faculty Research & Creative Works

A Data-Driven Scheduling Approach For Integrated Electricity-Hydrogen System Based On Improved Ddpg, Yaping Zhao, Jingsi Huang, Endong Xu, Jianxiao Wang, Xiaoyun Xu

Graduate School of Business Publications