Last year, I published a simple demo app that visualizes embedding vectors of a bunch of cat, dog and airplane photos as heatmaps. It’s a simple concept and does not have much practical use. I built it mostly for fun, but also for some “learning by doing”. Today I’m releasing a completely new version of my CatBench Vector Search Playground app! https://github.com/tanelpoder/catbench With this new app, you can interactively navigate through multiple Postgres-based vector similarity...| tanelpoder.com
Last year I wrote an article about visualizing embedding vectors of a variety of pictures as heatmaps. I used TorchVision with the ImageNet1K_V1 model to encode a bunch of different cat, dog and plane images into their embeddings, with 1000 floating-point values in each resulting vector. I used the generate_embeddings.py script for that. This whole exercise is just for my own learning (and fun), so it’s ok if there’s no other practical value coming from this work. - Linux, Oracle, SQL...| tanelpoder.com
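To make the heatmap idea concrete, here is a minimal sketch of how a flat 1000-value vector could be turned into a 2D grid for plotting. This is not the code from generate_embeddings.py: the function name `to_heatmap_grid`, the 25 x 40 grid shape, the min-max scaling and the random stand-in vector are all assumptions made for illustration.

```python
import random


def to_heatmap_grid(vector, rows=25, cols=40):
    """Min-max scale a flat vector to [0, 1] and reshape it into rows x cols.

    The 25 x 40 shape is an arbitrary choice that happens to fit 1000 values.
    """
    if len(vector) != rows * cols:
        raise ValueError(f"expected {rows * cols} values, got {len(vector)}")
    lo, hi = min(vector), max(vector)
    span = (hi - lo) or 1.0  # avoid division by zero for a constant vector
    scaled = [(v - lo) / span for v in vector]
    # Slice the flat list into consecutive rows of `cols` values each.
    return [scaled[r * cols:(r + 1) * cols] for r in range(rows)]


# Stand-in for a real embedding: 1000 random floats (hypothetical data).
random.seed(0)
fake_embedding = [random.gauss(0.0, 1.0) for _ in range(1000)]
grid = to_heatmap_grid(fake_embedding)
```

Each cell of `grid` is now a brightness value between 0 and 1, so it can be handed to any plotting library that draws a 2D array as an image.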
Disclaimer: I’m not an ML expert and not even a serious ML specialist (yet?), so feel free to let me know if I’m wrong! It seems to me that we have hit a bit of an “on-premises” vs. “on-premise” situation in the ML/AI and vector search terminology space. The majority of product announcements, blog articles and even some papers I’ve read use the term “vector embeddings” to describe embeddings, but embeddings already are vectors themselves! - Linux, Oracle, SQL performance tuning an...| tanelpoder.com
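A small sketch of the point that an embedding already is a vector: in code it is just a list of floats, and ordinary vector math such as cosine similarity works on it directly. The three-value "embeddings" below are made-up toy numbers, not output from any real model, and `cosine_similarity` is a hand-written helper rather than a library call.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy "embeddings" (hypothetical values): each one is simply a vector.
cat = [0.9, 0.1, 0.0]
kitten = [0.8, 0.2, 0.1]
plane = [0.0, 0.2, 0.9]

cat_vs_kitten = cosine_similarity(cat, kitten)
cat_vs_plane = cosine_similarity(cat, plane)
```

With these toy values, `cat_vs_kitten` comes out much higher than `cat_vs_plane`, which is the whole idea behind similarity search over embeddings.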