This is part 2 of my previous post on LeNet-5, where we implemented and trained it on the MNIST dataset. This post will walk through the AlexNet architecture and its different implementations in other frameworks. After this, we will use transfer learning on AlexNet so it can learn the Caltech-256 dataset