Topic: How much LLM training data is there, in the limit?