Instructions to execute ONNX Runtime applications with CUDA | onnxruntime
Automatic Mixed Precision for Deep Learning | NVIDIA Developer
A guide to torch.cuda, a PyTorch module to run CUDA operations | pytorch.org