End to End Invoice Processing Application Based on Key Fields Extraction

ARSLAN, HALİL

doi:10.1109/access.2022.3192828

End to End Invoice Processing Application Based on Key Fields Extraction

ARSLAN H.

IEEE ACCESS, cilt.10, ss.78398-78413, 2022 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 10
Basım Tarihi: 2022
Doi Numarası: 10.1109/access.2022.3192828
Dergi Adı: IEEE ACCESS
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Compendex, INSPEC, Directory of Open Access Journals
Sayfa Sayıları: ss.78398-78413
Anahtar Kelimeler: Optical character recognition software, Data mining, Business, Image processing, Deep learning, Companies, Character recognition, Invoice processing, key fields extraction, text detection, deep learning, table extraction, optical character recognition
Açık Arşiv Koleksiyonu: AVESİS Açık Erişim Koleksiyonu
Sivas Cumhuriyet Üniversitesi Adresli: Evet

Özet

In this paper, an automatic invoice processing system, which is in great demand among private and public companies, was proposed. The proposed system supports all invoice file types that can be submitted by companies. Companies can easily submit invoices to the system via the web interface or email, and all invoices submitted to the system are queued and processed sequentially. If the invoice is a text file, the invoice information is extracted from the text by using template matching. If the invoice is an image, the text and table areas are detected and extracted. For table detection, we used both image processing based and YOLOv5-based deep learning method. Cell extraction was then performed from the extracted table images. As a result of these processes, all text and table cells were obtained as images and these images were converted into machine-readable text using the open-source software Tesseract OCR. Tesseract already provides trained models for English and Turkish. However, these models do not provide successful results for invoices submitted by companies in Turkish. Therefore, the new fine-tuned model trained with invoices in Turkish was used for OCR. The experimental results showed that the trained Turkish model was more accurate than the Turkish and English models provided by Tesseract. In addition, the YOLOv5-based table detection model was more accurate than the image-processing-based table detection method.