Abstract
Companies manage and track their expenses either physically or through software applications. However, manual expense entry is prone to errors, which cause losses of money, time, and productivity. Therefore, this study presents a novel system for automating document information entry, with a special focus on financial documents, using machine learning techniques. The methodology involves training LayoutLM models for sequence and token classification to categorize and extract detailed information from various financial documents such as receipts and invoices. The proposed system integrates state-of-the-art models such as LayoutLMv2, LayoutLMv3, and fastText to achieve accurate document classification and information extraction. The designed system was implemented and tested on various types of receipts and invoices containing financial values, using evaluation metrics such as accuracy, precision, recall, and F1-score. The system achieved accuracy, precision, and F1 scores above 90% across various document types and automated document processing tasks, which supports its suitability for document processing applications.
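
The abstract names accuracy, precision, recall, and F1-score as evaluation metrics for token classification. As a minimal sketch of how such metrics can be computed at the token level, the example below uses micro-averaging and treats a hypothetical `O` label as "no field". The function name `token_prf` and the labels `TOTAL`, `DATE`, and `VENDOR` are illustrative assumptions, not the label set or evaluation code used in the study.

```python
def token_prf(gold, pred, ignore_label="O"):
    """Micro-averaged token-level metrics (illustrative sketch only).

    gold, pred: equal-length lists of label strings, one per token.
    ignore_label: assumed "no field" label, excluded from precision/recall.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")

    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        if p != ignore_label:
            if p == g:
                tp += 1  # predicted a field label and it was correct
            else:
                fp += 1  # predicted a field label that was wrong
        if g != ignore_label and p != g:
            fn += 1      # a true field label was missed or mislabeled

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    # Accuracy counts every token, including the ignored label.
    accuracy = (sum(g == p for g, p in zip(gold, pred)) / len(gold)
                if gold else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical labels for a five-token receipt line.
gold = ["O", "TOTAL", "DATE", "O", "VENDOR"]
pred = ["O", "TOTAL", "O", "DATE", "VENDOR"]
metrics = token_prf(gold, pred)
print(metrics)
```

In this toy input, two of the three predicted field labels are correct and two of the three true field labels are recovered, so precision, recall, and F1 all equal 2/3, while accuracy over all five tokens is 0.6. Entity-level scoring, which many key information extraction benchmarks use, would group tokens into spans first and may give different numbers.
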
Original language | English |
---|---|
Title of host publication | UBMK 2024 - Proceedings |
Subtitle of host publication | 9th International Conference on Computer Science and Engineering |
Editors | Esref Adali |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 276-281 |
Number of pages | 6 |
ISBN (Electronic) | 9798350365887 |
Publication status | Published - 2024 |
Event | 9th International Conference on Computer Science and Engineering, UBMK 2024 - Antalya, Turkey. Duration: 26 Oct 2024 → 28 Oct 2024 |
Publication series
Name | UBMK 2024 - Proceedings: 9th International Conference on Computer Science and Engineering |
---|
Conference
Conference | 9th International Conference on Computer Science and Engineering, UBMK 2024 |
---|---|
Country/Territory | Turkey |
City | Antalya |
Period | 26/10/24 → 28/10/24 |
Bibliographical note
Publisher Copyright: © 2024 IEEE.
Keywords
- Document Automation
- Financial Document Processing
- Key Information Extraction (KIE)
- LayoutLM
- Token Classification