Open Access. Powered by Scholars. Published by Universities.®

Computer Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Electronic Theses and Dissertations

South Dakota State University

Accuracy Ensured Cmpression

Articles 1 - 1 of 1

Full-Text Articles in Computer Engineering

Kernel-Controlled Dqn Based Cnn Pruning For Model Compression And Acceleration, Romancha Khatri Jan 2020

Kernel-Controlled Dqn Based Cnn Pruning For Model Compression And Acceleration, Romancha Khatri

Electronic Theses and Dissertations

Apart from the accuracy, the size of convolutional neural networks (CNN) models is another principal factor for facilitating the deployment of models on memory, power and budget constrained devices. However, conventional model compression techniques require human experts to setup parameters to explore the design space which is suboptimal and time consuming. Various pruning techniques are implemented to gain compression, trading off speed and accuracy. Given a CNN model [11], we propose an automated deep reinforcement learning [9] based model compression technique that can effectively turned off kernels on each layer by observing its significance on decision making. By observing accuracy, …