Stable Diffusion can run in ONNX in the browser in under FIXME compressed, FIXME uncompressed, without significant quality loss. This small size comes from quantizing weights to 6-bit and 8-bit precision, performing operations in full 16-bit precision, and implementing the quantization in a way that can run in a serialized ONNX model.