Topic: [2410.00037] Moshi: a speech-text foundation model for real-time dialogue