Exploring 6 noteworthy approaches for incorporating longer-term context in transformer models | machine learning musings