Clustering of unlabeled data can be performed with the module sklearn.cluster. Each clustering algorithm comes in two variants: a class, that implements the fit method to learn the clusters on trai...| scikit-learn
Posted on Tuesday 7 Feb 2023. 1,208 words, 14 links. By Matt Webb.| Interconnected, a blog by Matt Webb
Open-source vector similarity search for Postgres. Contribute to pgvector/pgvector development by creating an account on GitHub.| GitHub
Embeddings are a really neat trick that often come wrapped in a pile of intimidating jargon. If you can make it through that jargon, they unlock powerful and exciting techniques …| Simon Willison’s Weblog
We’re on a journey to advance and democratize artificial intelligence through open source and open science.| huggingface.co
Let’s overcome decision fatigue by building a decision tree app from thousands of images of bathroom fixtures, an off-the-shelf image embedding model, and a few command-line tools.| Drew Breunig
LLM is my Python library and command-line tool for working with language models. I just released LLM 0.9 with a new set of features that extend LLM to provide tools …| simonwillison.net