Lately, I've been thinking a lot about visualizing datasets, and good old-fashioned t-SNE embeddings came to mind. In this blog post, indulge me as I examine a "data map" of our Tagalog NER dataset. | Lj Miranda
Large language models showed promise on structured prediction tasks like named entity recognition and text categorization. But how well do they perform when ... | Lj Miranda
A development log on the calamanCy project and the Tagalog NLP pipeline. The tl;dr: we just finished re-annotating the dataset. I also want to share my learn... | Lj Miranda