Open Access. Powered by Scholars. Published by Universities.®
Operations Research, Systems Engineering and Industrial Engineering Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Publication
- Publication Type
Articles 1 - 2 of 2
Full-Text Articles in Operations Research, Systems Engineering and Industrial Engineering
Simulation Of Neighbor Discovery Algorithm In Directional Antenna Ad Hoc Networks Based On Opnet, Li Mo, Zhao Liang
Simulation Of Neighbor Discovery Algorithm In Directional Antenna Ad Hoc Networks Based On Opnet, Li Mo, Zhao Liang
Journal of System Simulation
Abstract: Neighbor discovery problem in directional antenna Ad Hoc networks is studied. A Q-Learning neighbor discovery algorithm is proposed, and a simulation model for directional antenna Ad Hoc networks based on OPNET software is setup. This algorithm makes a decision of transmitting and receiving mode for every neighbor scanning without the information of neighbor position through the Q-Learning mechanism. The reward value is determined according to the current scanning result, and the experiences are learned in order to improve the neighbor discovery efficiency. The simulation model based on OPNET software is constructed using three-layer modeling mechanism. Then the directional antenna …
A New Reinforcement Learning Algorithm With Fixed Exploration For Semi-Markov Decision Processes, Angelo Michael Encapera
A New Reinforcement Learning Algorithm With Fixed Exploration For Semi-Markov Decision Processes, Angelo Michael Encapera
Masters Theses
"Artificial intelligence or machine learning techniques are currently being widely applied for solving problems within the field of data analytics. This work presents and demonstrates the use of a new machine learning algorithm for solving semi-Markov decision processes (SMDPs). SMDPs are encountered in the domain of Reinforcement Learning to solve control problems in discrete-event systems. The new algorithm developed here is called iSMART, an acronym for imaging Semi-Markov Average Reward Technique. The algorithm uses a constant exploration rate, unlike its precursor R-SMART, which required exploration decay. The major difference between R-SMART and iSMART is that the latter uses, in addition …