I have a hobby project called HN Distilled, where I needed to add a "Similar Topics" section at the bottom of each post summary. I implemented it with a bit of Python code that calculated the cosine similarity between vectorized representations of the posts. It might sound complex, but in essence it was just a few functions from scikit-learn. This is how it worked: