Open Access. Powered by Scholars. Published by Universities.®

Operations Research, Systems Engineering and Industrial Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 17 of 17

Full-Text Articles in Operations Research, Systems Engineering and Industrial Engineering

Intelligent Optimization Of Coal Terminal Unloading Scheduling Based On Improved D3qn Algorithm, Baoxin Qin, Yuxiao Zhang, Sirui Wu, Weichong Cao, Zhan Li Mar 2024

Intelligent Optimization Of Coal Terminal Unloading Scheduling Based On Improved D3qn Algorithm, Baoxin Qin, Yuxiao Zhang, Sirui Wu, Weichong Cao, Zhan Li

Journal of System Simulation

Abstract: Intelligent decision scheduling can improve the operation efficiency of large ports, which is one of the important research directions for the implementation of artificial intelligence technology in the smart port scenario. This article studies the intelligent unloading scheduling tasks of coal terminals and abstracts them as a Markov sequence decision problem. A deep reinforcement learning model for this problem is established, and an improved D3QN algorithm is proposed to realize intelligent optimization of unloading scheduling decisions by considering the characteristics of high action space dimension and sparse feasible action in the model. The simulation results show that for the …


Task Scheduling For Internet Of Vehicles Based On Deep Reinforcement Learning In Edge Computing, Xiang Ju, Shengchao Su, Chaojie Xu, Beibei He Dec 2023

Task Scheduling For Internet Of Vehicles Based On Deep Reinforcement Learning In Edge Computing, Xiang Ju, Shengchao Su, Chaojie Xu, Beibei He

Journal of System Simulation

Abstract: Aiming at the offloading and execution of delay-constrained computing tasks for internet of vehicles in edge computing, a task scheduling method based on deep reinforcement learning is proposed. In multi-edge server scenario, a software-defined network-aided internet of vehicles task offloading system is built. On this basis, the task scheduling model of vehicle computation offloading is given. According to the characteristics of task scheduling, a scheduling method based on an improved pointer network is designed. Considering the complexity of task scheduling and computing resource allocation, the deep reinforcement learning algorithm is used to train the pointer network. The vehicle offloading …


Intelligent Air Defense Task Assignment Based On Assignment Strategy Optimization Algorithm, Jiayi Liu, Gang Wang, Qiang Fu, Xiangke Guo, Siyuan Wang Aug 2023

Intelligent Air Defense Task Assignment Based On Assignment Strategy Optimization Algorithm, Jiayi Liu, Gang Wang, Qiang Fu, Xiangke Guo, Siyuan Wang

Journal of System Simulation

Abstract: Aiming at the insufficient solving speed of assignment strategy optimization algorithm in largescale scenarios, deep reinforcement learning is combined with Markov decision process to carry out the intelligent large-scale air defense task assignment. According to the characteristics of large-scale air defense operations, Markov decision process is used to model the agent and a digital battlefield simulation environment is built. Air defense task assignment agent is designed and trained in digital battlefield simulation environment through proximal policy optimization algorithm. The feasibility and advantage of the method are verified by taking a large-scale ground-to-air countermeasure mission as an example.


Obstacle Avoidance Path Planning And Simulation Of Mobile Picking Robot Based On Dppo, Junqiang Lin, Hongjun Wang, Xiangjun Zou, Po Zhang, Chengen Li, Yipeng Zhou, Shujie Yao Aug 2023

Obstacle Avoidance Path Planning And Simulation Of Mobile Picking Robot Based On Dppo, Junqiang Lin, Hongjun Wang, Xiangjun Zou, Po Zhang, Chengen Li, Yipeng Zhou, Shujie Yao

Journal of System Simulation

Abstract: Aiming at the autonomous decision-making difficulty of mobile picking robots in random and changeable complicated path environment during field operations, an autonomous obstacle avoidance path planning method based on deep reinforcement learning is propose. By setting the state space and action space and using the artificial potential field method to design the reward function, an obstacle penalty coefficient setting method based on collision cone collision avoidance detection is proposed to improve the autonomous collision avoidance ability. A virtual simulation system is constructed, in which the learning and training of the mobile picking robot is carried out and verified by …


Intelligent Path Planning For Mobile Robots Based On Sac Algorithm, Laiyi Yang, Jing Bi, Haitao Yuan Aug 2023

Intelligent Path Planning For Mobile Robots Based On Sac Algorithm, Laiyi Yang, Jing Bi, Haitao Yuan

Journal of System Simulation

Abstract: Aiming at the high dimension, slow convergence and complex modelling of traditional path planning algorithms for mobile robots, a new intelligent path planning algorithm is proposed, which is based on deep reinforcement learning soft actor-critic (SAC) algorithm to save the poor performance of robot in complicated environments with static and dynamic obstacles. An improved reward function is designed to enable mobile robots to quickly avoid obstacles and reach targets by using state dynamic normalization and priority experience pool techniques. To evaluate the performance, a pygame-based simulation environment is constructed. Compared with proximal policy optimization(PPO) algorithm, experimental …


Generalization Through Diversity: Improving Unsupervised Environment Design, Wenjun Li, Pradeep Varakantham, Dexun Li Aug 2023

Generalization Through Diversity: Improving Unsupervised Environment Design, Wenjun Li, Pradeep Varakantham, Dexun Li

Research Collection School Of Computing and Information Systems

Agent decision making using Reinforcement Learning (RL) heavily relies on either a model or simulator of the environment (e.g., moving in an 8x8 maze with three rooms, playing Chess on an 8x8 board). Due to this dependence, small changes in the environment (e.g., positions of obstacles in the maze, size of the board) can severely affect the effectiveness of the policy learned by the agent. To that end, existing work has proposed training RL agents on an adaptive curriculum of environments (generated automatically) to improve performance on out-of-distribution (OOD) test scenarios. Specifically, existing research has employed the potential for the …


Joint Optimization Strategy Of Computing Offloading And Edge Caching For Intelligent Connected Vehicles, Fei Ding, Yuchen Sha, Ying Hong, Xiao Kuai, Dengyin Zhang Jun 2023

Joint Optimization Strategy Of Computing Offloading And Edge Caching For Intelligent Connected Vehicles, Fei Ding, Yuchen Sha, Ying Hong, Xiao Kuai, Dengyin Zhang

Journal of System Simulation

To guarantee the low-delay communication of intelligent connected vehicles, the V2X channel model and the multi-access edge computing (MEC) technology, are used to carry out the research of the joint optimization strategy of computing offloading and edge caching.An intelligent connected vehicle with task offloading and edge caching model least-deep deterministic policy gradient(L-DDPG) is developed.By integrating the vehicular local and edge computing resources, the classification processing of different computing tasks in V2X scenarios is supported.The vehicular computing request is prejudged by edge platform to ensure the rapid response of continuous homogeneous computing tasks. Combining with the least recently …


Deep Reinforcement Learning-Based Control Strategy For Boost Converter, Yuxuan Dai, Chenggang Cui May 2023

Deep Reinforcement Learning-Based Control Strategy For Boost Converter, Yuxuan Dai, Chenggang Cui

Journal of System Simulation

Abstract: In view of the problems of model uncertainty and nonlinearity in bus voltage stability control of Boost converter, an intelligent control strategy based on model-free deep reinforcement learning(RL) is proposed. RL double DQN(DDQN) algorithm and deep deterministic policy gradient(DDPG) algorithm are used, and the Boost converter controller is designed. The state, action space, reward function, and neural network are also designed to improve the dynamic performance of the controller. The joint simulation of the Boost converter model and RL agent is realized by RL modelica(RLM …


Research Progress Of Opponent Modeling Based On Deep Reinforcement Learning, Haotian Xu, Long Qin, Junjie Zeng, Yue Hu, Qi Zhang Apr 2023

Research Progress Of Opponent Modeling Based On Deep Reinforcement Learning, Haotian Xu, Long Qin, Junjie Zeng, Yue Hu, Qi Zhang

Journal of System Simulation

Abstract: Deep reinforcement learning is an agent modeling method with both deep learning feature extraction ability and reinforcement learning sequence decision-making ability, which can make up for the depleted non-stationary adaptation, complex feature selection and insufficient state-space representation ability of traditional opponent modeling. The deep reinforcement learning-based opponent modeling methods are divided into two categories, explicit modeling and implicit modeling, and the corresponding theories, models, algorithms and applicable scenarios are sorted out according to the categories. The applications of deep reinforcement learning-based opponent modeling techniques on different fields are introduced. The key problems and future development are summarized to provide …


A Review On Learning To Solve Combinatorial Optimisation Problems In Manufacturing, Cong Zhang, Yaoxin Wu, Yining Ma, Wen Song, Zhang Le, Zhiguang Cao, Jie Zhang Mar 2023

A Review On Learning To Solve Combinatorial Optimisation Problems In Manufacturing, Cong Zhang, Yaoxin Wu, Yining Ma, Wen Song, Zhang Le, Zhiguang Cao, Jie Zhang

Research Collection School Of Computing and Information Systems

An efficient manufacturing system is key to maintaining a healthy economy today. With the rapid development of science and technology and the progress of human society, the modern manufacturing system is becoming increasingly complex, posing new challenges to both academia and industry. Ever since the beginning of industrialisation, leaps in manufacturing technology have always accompanied technological breakthroughs from other fields, for example, mechanics, physics, and computational science. Recently, machine learning (ML) technology, one of the crucial subjects of artificial intelligence, has made remarkable progress in many areas. This study thoroughly reviews how ML, specifically deep (reinforcement) learning, motivates new ideas …


Research On Fire Emergency Evacuation Simulation Based On Cooperative Deep Reinforcement Learning, Lingjia Ni, Xiaoxia Huang, Hongga Li, Zibo Zhang Jun 2022

Research On Fire Emergency Evacuation Simulation Based On Cooperative Deep Reinforcement Learning, Lingjia Ni, Xiaoxia Huang, Hongga Li, Zibo Zhang

Journal of System Simulation

Abstract: The fire accident is a major threat to the public safety, in which the high temperature, toxic and harmful gases seriously interfer the selection of the evacuation routes. Deep reinforcement learning is introduced into the research of emergency evacuation simulation, and a cooperative double deep Q network algorithm is proposed for the multi-agent environment. A fire scene model that changes dynamically over time is established to provide the real-time information on the distribution of the dangerous areas for the evacuation. The independent agent neural networks are integrated and the multi-agent unified deep neural network is established to realize the …


A Devs-Based Formal Description Method For Complex Product Behavior Models, Qingquan Lin, Jiaran Yang, Heming Zhang Apr 2022

A Devs-Based Formal Description Method For Complex Product Behavior Models, Qingquan Lin, Jiaran Yang, Heming Zhang

Journal of System Simulation

Abstract: For the online optimization of pedestrian flow control in subway station, an algorithm frame for pedestrian flow control in subway station based on machine learning is designed. The pedestrian flow control process of a subway station during morning rush hour is selected,and the agent-based model is built to simulate the control process. The training data is collected through the multiple runs of the model, which is used as the input of deep reinforcement learning network, and the mature net is obtained through adequate training to provide the optimizing scheduling policy. Linking the actual data with the mature net …


Research On Optimization Of Airport Cargo Business Based On Deep Reinforcement Learning, Hongwei Wang, Peng Yang Mar 2022

Research On Optimization Of Airport Cargo Business Based On Deep Reinforcement Learning, Hongwei Wang, Peng Yang

Journal of System Simulation

Abstract: An intelligent agent technology architecture is adopted to the simulation model development of airport cargo business. Aiming at the optimization of airport cargo resources, a decision support system framework combining deep reinforcement learning (DRL) and airport cargo business simulation model is proposed. The simulated results are applied as the training data of the DRL network, and the DRL is used to optimize operation parameter of the simulation model. The mature system can be run online, which can provide optimized operation order in real time. In order to verify the effectiveness of the architecture, model development and experiments are conducted …


Job Scheduling And Simulation In Cloud Based On Deep Reinforcement Learning, Qirui Li, Xinyi Peng Feb 2022

Job Scheduling And Simulation In Cloud Based On Deep Reinforcement Learning, Qirui Li, Xinyi Peng

Journal of System Simulation

Abstract: To solve the difficulty in job scheduling in the complex and transient multi-user, multi-queue, and multi-data-center cloud computing environment, this paper proposed a job scheduling method based on deep reinforcement learning. A system model of cloud job scheduling and its mathematical model were built, and an optimization goal consisting of transmission time, waiting time, and execution time was obtained. A job scheduling algorithm based on deep reinforcement learning was designed, and its state space, action space, and reward function were given. A simulated cloud job scheduler was designed and developed, and simulated scheduling experiments were conducted on it. The …


A Data-Driven Modeling Method For Game Adversity Agent, Zeng Bi, Fang Xiao, Deshuai Kong, Xiangxiang Song, Zhengxuan Jia, Tingyu Lin Jan 2022

A Data-Driven Modeling Method For Game Adversity Agent, Zeng Bi, Fang Xiao, Deshuai Kong, Xiangxiang Song, Zhengxuan Jia, Tingyu Lin

Journal of System Simulation

Abstract: Aiming at the problems of collaborative modeling of formation behavior and intelligent generation of decision-making in complex confrontation scenarios, based on the serious game to simulate the confrontation scenarios of complex maritime equipment against the air, this paper proposes a data-driven modeling method for game agent and uses a distributed modeling technology of parallel adversarial scenarios and opportunistic decision making technology of smart targets to achieve agent modeling. It provides support for the further exploration of multi-objective collaborative modeling in complex confrontation scenarios. The simulation results show that deep reinforcement learning algorithms can provide a basis for the modeling …


Step-Wise Deep Learning Models For Solving Routing Problems, Liang Xin, Wen Song, Zhiguang Cao, Jie Zhang Jul 2021

Step-Wise Deep Learning Models For Solving Routing Problems, Liang Xin, Wen Song, Zhiguang Cao, Jie Zhang

Research Collection School Of Computing and Information Systems

Routing problems are very important in intelligent transportation systems. Recently, a number of deep learning-based methods are proposed to automatically learn construction heuristics for solving routing problems. However, these methods do not completely follow Bellman's Principle of Optimality since the visited nodes during construction are still included in the following subtasks, resulting in suboptimal policies. In this article, we propose a novel step-wise scheme which explicitly removes the visited nodes in each node selection step. We apply this scheme to two representative deep models for routing problems, pointer network and transformer attention model (TAM), and significantly improve the performance of …


Research On Generation Technology Of Computer Generated Force In Lvc Training System, Gao Ang, Zhiming Dong, Guohui Zhang, Liang Tao, Qisheng Guo Mar 2021

Research On Generation Technology Of Computer Generated Force In Lvc Training System, Gao Ang, Zhiming Dong, Guohui Zhang, Liang Tao, Qisheng Guo

Journal of System Simulation

Abstract: LVC training system of the combat equipment under the condition of confrontation is an effective means of training, aiming at the problem that in LVC training system, computer generated forces are difficult to meet the demand of training problems. The concept of LVC training and LVC training system is clarified, according to the relationship between model and system structure, the corresponding modeling technology requirements of three different hierarchical models, namely logical range entity configuration, command entity and combat entity, are expounded. According to the specific requirements, four computer-generated force generation methods are proposed, namely, logical target range virtual and …