From: mlops.systems
Alex Strick van Linschoten - Tokenizing Balochi with HuggingFace’s Tokenizer and FastAI/Spacy
https://mlops.systems/posts/2023-06-03-training-custom-balochi-tokenizer.html
Tagged with: nlp, tokenisation, balochi, balochi-language-model
I explore language tokenization using fastai, spaCy, and Hugging Face Tokenizers, with a special focus on the under-represented Balochi language.