IBM Research and partners have released Bamba-9B-v2, an open-source hybrid Transformer-SSM model trained on 3T tokens, claiming faster inference than comparable LLMs.| WinBuzzer
The open-source LLM combines the sequence-modeling skill of a transformer with the inferencing speed of an SSM. IBM Granite will soon adopt key Bamba features.| IBM Research