Topic: [2310.08754] Tokenizer Choice For LLM Training: Negligible or Crucial?