Phi-3.5-MoE is a lightweight, open mixture-of-experts model built on the datasets used for Phi-3: synthetic data and curated publicly available documents, with an emphasis on high-quality, reasoning-intensive content. It is multilingual and supports a 128K-token context length. The model was refined through supervised fine-tuning, proximal policy optimization, and direct preference optimization to guarantee […]