Introduction Baseline solution Tensil RTL and Vivado implementation ResNet compiled for Tensil Tensil for Vitis embedded applications Dual clock solution Ultra RAM solution Solutions with large local memory Introduction Sometimes the application requires pushing the performance to its limits. In this tutorial we will show how to optimize Tensil running ResNet20 trained on CIFAR for maximum performance. To do this, we will use the powerful ZCU104 board and implement an embedded application to ...