This Buwan ng Wika (National Language Month), I'm proud to introduce FilBench, a big step forward in Filipino NLP evaluation. Read on to learn more! | Lj Miranda
The rise of LLMs is forcing us to rethink Filipino NLP. There's still a ton of work to do, just not the stuff you might think. Here's my take on what's worth doing, what's a waste of time, and where Filipino NLP research should be heading. | Lj Miranda
Lately, I've been thinking a lot about visualizing datasets, and good old-fashioned t-SNE embeddings came to mind. In this blog post, indulge me as I examine a "data map" of our Tagalog NER dataset. | Lj Miranda
A collection of notes, projects, and essays. | Lj Miranda
Large language models have shown promise on structured prediction tasks like named entity recognition and text categorization. But how well do they perform when ... | Lj Miranda
A development log on the calamanCy project and the Tagalog NLP pipeline. The tl;dr: we just finished re-annotating the dataset. I also want to share my learn... | Lj Miranda