From:
machine learning musings
A Survey of Document Understanding Models
https://www.pragmatic.ml/a-survey-of-document-understanding-models/
Tagged with:
multimodal
attention
transformers
finetuning
The past three years have seen significant interest in applying language models to the task of visual document understanding – integrating spatial, textual, and visual signals to make sense of PDFs and scanned documents.