Information Extraction from Business Documents
Autoři | |
---|---|
Rok publikování | 2022 |
Druh | Článek ve sborníku |
Konference | Recent Advances in Slavonic Natural Language Processing (RASLAN 2022) |
Fakulta / Pracoviště MU | |
Citace | |
www | fulltext PDF |
Klíčová slova | OCR; Multi-modal learning; Information extraction; Transformers; Structured Documents |
Popis | Document AI is a relatively new research topic that refers to techniques for automatically reading, understanding, and analyzing business documents. Nowadays, many companies extract data from business documents through manual efforts that are time-consuming and expensive, requiring manual customization or configuration. This paper describes techniques to address these problems, apply them to real-world data, and implement them to an end-to-end solution for automatic information extraction from business documents. |
Související projekty: |