Acceleration of Convolutional Neural Network Training using Field Programmable Gate Arrays

dc.contributor.advisorFitsum, Assamnew (PhD)
dc.contributor.authorGuta, Tesema
dc.date.accessioned2022-02-11T04:22:17Z
dc.date.accessioned2023-11-04T15:14:44Z
dc.date.available2022-02-11T04:22:17Z
dc.date.available2023-11-04T15:14:44Z
dc.date.issued2022-01
dc.description.abstractConvolutional neural networks (CNN) training often necessitates a considerable amount of computational resources. In recent years, several studies have proposed CNN inference and training accelerators, which the FPGAs have previously demonstrated good performance and energy efficiency. To speed processing, the CNN requires additional computational resources such as memory bandwidth, a FPGA plantform resource usage, time, and power consumption. As well as training the CNN needs large datasets and computational power, and they are constrained by the requirement for improved hardware acceleration to support scalability beyond existing data and model sizes. In this study, we propose a procedure for energy efficient CNN training in collaboration with an FPGA-based accelerator. We employed optimizations such as quantization, which is a common model compression technique, to speed up the CNN training process. Additionally, a gradient accumulation buffer is used to ensure maximum operating efficiency while maintaining gradient descent of the learning algorithm. Subsequently, to validate our design, we implemented the AlexNet and VGG16 models on an FPGA board and a laptop CPU and GPU. Consequently, our designs achieve 203.75 GOPS on Terasic DE1-SoC with the AlexNet model and 196.50 GOPS with the VGG16 model on Terasic DE-SoC. This, as far as we know, outperforms existing FPGA-based accelerators. Compared to the CPU and GPU, our design is 22.613X and 3.709X more energy efficient respectively.en_US
dc.identifier.urihttp://etd.aau.edu.et/handle/123456789/30005
dc.language.isoen_USen_US
dc.publisherAddis Ababa Universityen_US
dc.subjectGate Arraysen_US
dc.subjectConvolutional Neural Networken_US
dc.titleAcceleration of Convolutional Neural Network Training using Field Programmable Gate Arraysen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
GutaTesema.pdf
Size:
1.15 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: