From: Colfax Research
FlashAttention-3 for Inference: INT8 Quantization and Query Head Packing for MQA/GQA (External)
https://research.colfax-intl.com/flashattention-3-for-inference-int8-quantization-and-query-head-packing-for-mqa-gqa-external/
Tagged with: benchmarks, deep learning, publications
In this blog post, presented on the Character.AI research blog, we explain two techniques that are important for using FlashAttention-3 for inference: in-kernel pre-processing of tensors via warp specialization, and query head packing for MQA/GQA.
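The query head packing idea can be illustrated outside the kernel. Below is a minimal PyTorch sketch, assuming the common GQA convention that consecutive query heads share one KV head; all names and shapes here are illustrative, and FlashAttention-3 performs this packing inside the kernel rather than via explicit tensor reshapes:

```python
import torch

# Illustrative GQA decode shapes (hypothetical, not taken from the post):
batch, seqlen_q, n_q_heads, head_dim = 2, 1, 32, 128
n_kv_heads = 8
group = n_q_heads // n_kv_heads      # 4 query heads share each KV head

q = torch.randn(batch, seqlen_q, n_q_heads, head_dim)

# Pack each group of query heads into the query sequence dimension, so a
# single attention "head" against one KV head processes seqlen_q * group
# query rows instead of seqlen_q. At decode time (seqlen_q == 1) this
# turns tiny per-head tiles into larger, better-utilized ones.
q_packed = (
    q.view(batch, seqlen_q, n_kv_heads, group, head_dim)
     .transpose(2, 3)   # (batch, seqlen_q, group, n_kv_heads, head_dim)
     .reshape(batch, seqlen_q * group, n_kv_heads, head_dim)
)

# Attention now runs with n_kv_heads heads and no K/V duplication; the
# output is unpacked with the inverse reshape.
out_packed = q_packed   # stand-in for the attention output
out = (
    out_packed.view(batch, seqlen_q, group, n_kv_heads, head_dim)
              .transpose(2, 3)
              .reshape(batch, seqlen_q, n_q_heads, head_dim)
)
```

Since every query head in a group attends to the same K and V, and softmax is taken independently per query row, the packed layout is mathematically identical to the unpacked one; only the tiling of the underlying GEMMs changes.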