Qwen2.5-Omni is a multimodal generative AI model capable of accepting text, image, audio, and video as input while outputting text and audio.| DebuggerCafe