FPGA accelerators for deep learning

Lowering Dynamic Power of a Stream-based CNN Hardware Accelerator

Read more about Lowering Dynamic Power of a Stream-based CNN Hardware Accelerator
Log in to post comments

Custom hardware accelerators of Convolutional Neural Networks (CNN) provide a promising solution to meet real-time constraints for a wide range of applications on low-cost embedded devices. In this work, we aim to lower the dynamic power of a stream-based CNN hardware accelerator by reducing the computational redundancies in the CNN layers. In particular, we investigate the redundancies due to the downsampling effect of max pooling layers which are prevalent in state-of-the-art CNNs, and propose an approximation method to reduce the overall computations.

poster_mmsp19.pdf

poster_mmsp19.pdf (531)

Categories:: Algorithm and architecture co-optimization

75 Views