From: machine learning musings
A Survey of Document Understanding Models
https://www.pragmatic.ml/a-survey-of-document-understanding-models/
Tagged with: multimodal, attention, transformers, finetuning
The past three years have seen significant interest in applying language models to the task of visual document understanding – integrating spatial, textual, and visual signals to make sense of PDFs and scanned documents.