Cortex leverages sharding techniques to horizontally scale both single and multi-tenant clusters beyond the capacity of a single node. Background The default sharding strategy employed by Cortex distributes the workload across the entire pool of instances running a given service (eg. ingesters). For example, on the write path, each tenant’s series are sharded across all ingesters, regardless of how many active series the tenant has or how many different tenants are in the cluster. The defau...