Repository logo
 

Learning rate optimisation of an image processing deep convolutional neural network

Thumbnail Image

Date

2021

Authors

Buthelezi, Sibusiso Blessing

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

The major contribution of this dissertation is the proposal of the use of mathematical models to identify an optimal learning rate for an image processing deep convolutional neural network (DCNN). This model is derived from a nonlinear regression relationship between the learning rate and the accuracy of a test DCNN model. This relationship is meant to (A) resolve the problem of arbitrarily selecting the initial learning rate (B) reduce computational resource requirement and (C) reduce training instabilities. An algorithm is developed to analyse an inputted DCNN model and subsequently render output parameters that may be used to aid in the selection of an OLR. The benefit of an OLR includes improved training stability and reduced computational resources. The results rendered by the OLR algorithm proposes that an optimal learning rate improves model performance; this is described by the test model average accuracy of 91%. Furthermore, a model validation graph is also extrapolated. which will illustrate the mathematical model accuracy and the region of interest (ROI). The ROI defines the region in the learning rate spectrum with a positive effect on model performance.

Description

A dissertation submitted in fulfilment of the requirements for the degree of Master of Engineering Department of Electronics and Computer Engineering, Faculty of Engineering and the Built Environment, Durban University of Technology, 2021.

Keywords

Citation

DOI

https://doi.org/10.51415/10321/3750

Endorsement

Review

Supplemented By

Referenced By