Large Language Models (LLMs) and their multi-modal variants offer significant benefits in automating complex processes, with Document Understanding (DU) being a particularly promising application. In DU, the challenge often lies in integrating text, layout, and graphical elements to accurately extract necessary information. In a new paper Arctic-TILT. Business Document Understanding at Sub-Billion Scale, a research