Document Classification and Key Information Extraction Using Multimodal Transformers

Mehmet Selman Baysan, Furkan Kizilay, Ayşe Irem Özmen, Gökhan Ince

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

Özet

Companies manage and track their expenses either physically or through software applications. However, manual expense entry steps are prone to errors. Manual expense entry errors losses in terms of money, time and productivity. Therefore, this study presents a novel system on the automation of document information entry with a special focus on financial documents through machine I earning techniques. The methodology involves training LayoutLM models for sequence and token classification to categorize and extract detailed information from various financial documents such a s receipts and invoices. The proposed system integrates state-of-the-art models such as LayoutLMv2, LayoutLMv3, and fastText to achieve accurate document classification a nd information extraction. The designed system was implemented and tested on various types of receipts and invoices containing financial values, using evaluation metrics such as accuracy, precision, recall, and F1-score. The capability of the proposed system to achieve high accuracy, precision and F1 scores above 90 % across various document types and in automated document processing tasks reaffirms its suitability for document processing applications.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığıUBMK 2024 - Proceedings
Ana bilgisayar yayını alt yazısı9th International Conference on Computer Science and Engineering
EditörlerEsref Adali
YayınlayanInstitute of Electrical and Electronics Engineers Inc.
Sayfalar276-281
Sayfa sayısı6
ISBN (Elektronik)9798350365887
DOI'lar
Yayın durumuYayınlandı - 2024
Etkinlik9th International Conference on Computer Science and Engineering, UBMK 2024 - Antalya, Turkey
Süre: 26 Eki 202428 Eki 2024

Yayın serisi

AdıUBMK 2024 - Proceedings: 9th International Conference on Computer Science and Engineering

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???9th International Conference on Computer Science and Engineering, UBMK 2024
Ülke/BölgeTurkey
ŞehirAntalya
Periyot26/10/2428/10/24

Bibliyografik not

Publisher Copyright:
© 2024 IEEE.

Parmak izi

Document Classification and Key Information Extraction Using Multimodal Transformers' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap