Abstract
In this paper we investigate a method to reduce the number of computations and associated activations in Convolutional Neural Networks (CNN) by using bitmaps. The bitmaps are used to mask the input images to the network that fall within a rectangular window but do not fall within the boundaries of the objects the network is being trained upon. The mask has the effect of rendering the operations on these portions of the training images trivial. The thesis is that applying this approach to CNNs will not degrade accuracy while at the same time reducing the computational workload and reducing memory footprint. We found that we can remove up to 60% of the input images and see no decrease in accuracy. This leads to activation sparsity that can be exploited using a hardware accelerator to speedup training and inference, and decrease energy consumed.
Original language | English |
---|---|
DOIs | |
Publication status | Published - 1 Jan 2019 |
Externally published | Yes |
Event | IMVIP 2019: Irish Machine Vision & Image Processing - Technological University Dublin, Dublin, Ireland Duration: 28 Aug 2019 → 30 Aug 2019 |
Conference
Conference | IMVIP 2019: Irish Machine Vision & Image Processing |
---|---|
Country/Territory | Ireland |
City | Dublin |
Period | 28/08/19 → 30/08/19 |
Keywords
- bitmap masks
- pedestrian detection
- Convolutional Neural Networks
- CNN
- computational workload
- memory footprint
- activation sparsity
- hardware accelerator
- training
- inference
- energy consumption