Topic: [2311.06242] Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks