Large Language Model (LLM) inference workloads involve extremely large model files (often tens of gigabytes) that must be loaded quickly and repeatedly across distributed GPU instances. Traditional single-tier storage, whether local disk alone or remote cloud storage alone, cannot meet the throughput and latency demands of serving these models at scale.