We present our work on music generation with Perceiver AR, an autoregressive architecture that is able to generate high-quality samples as long as 65k tokens—the equivalent of minutes of music, or entire pieces! 🎵Music Samples📝ICML PaperGitHub CodeDeepMind Blog The playlist above contains samples generated by a Perceiver AR model trained on 10,000 hours of symbolic piano music (and synthesized with Fluidsynth). Introduction Transformer-based architectures have been recently used to ge...