Topic: [2106.12672] Charformer: Fast Character Transformers via Gradient-based Subword Tokenization