Unfolder: fast localization and image rectification of a document with a crease from folding in half
Presentation of folded documents is not an uncommon case in modern society. Digitizing such documents by capturing them with a smartphone camera can be tricky since a crease can divide the document contents into separate planes. To unfold the document, one could hold the edges potentially obscuring...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Samara National Research University
2024-08-01
|
Series: | Компьютерная оптика |
Subjects: | |
Online Access: | https://www.computeroptics.ru/eng/KO/Annot/KO48-4/480409e.html |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1823863794318704640 |
---|---|
author | A.M. Ershov D.V. Tropin E.E. Limonova D.P. Nikolaev V.V. Arlazarov |
author_facet | A.M. Ershov D.V. Tropin E.E. Limonova D.P. Nikolaev V.V. Arlazarov |
author_sort | A.M. Ershov |
collection | DOAJ |
description | Presentation of folded documents is not an uncommon case in modern society. Digitizing such documents by capturing them with a smartphone camera can be tricky since a crease can divide the document contents into separate planes. To unfold the document, one could hold the edges potentially obscuring it in a captured image. While there are many geometrical rectification methods, they were usually developed for arbitrary bends and folds. We consider such algorithms and propose a novel approach Unfolder developed specifically for images of documents with a crease from folding in half. Unfolder is robust to projective distortions of the document image and does not fragment the image in the vicinity of a crease after rectification. A new Folded Document Images dataset was created to investigate the rectification accuracy of folded (2, 3, 4, and 8 folds) documents. The dataset includes 1600 images captured when document placed on a table and when held in hand. The Unfolder algorithm allowed for a recognition error rate of 0.33, which is better than the advanced neural network methods DocTr (0.44) and DewarpNet (0.57). The average runtime for Unfolder was only 0.25 s / image on an iPhone XR. |
format | Article |
id | doaj-art-c691e0dcec1e48b58a31cd0cf0df44eb |
institution | Kabale University |
issn | 0134-2452 2412-6179 |
language | English |
publishDate | 2024-08-01 |
publisher | Samara National Research University |
record_format | Article |
series | Компьютерная оптика |
spelling | doaj-art-c691e0dcec1e48b58a31cd0cf0df44eb2025-02-09T09:45:23ZengSamara National Research UniversityКомпьютерная оптика0134-24522412-61792024-08-0148454255310.18287/2412-6179-CO-1406Unfolder: fast localization and image rectification of a document with a crease from folding in halfA.M. Ershov0D.V. Tropin1E.E. Limonova2D.P. Nikolaev3V.V. Arlazarov4Smart Engines Service LLC; Institute for Information Transmission Problems of RAS (Kharkevich Institute)Smart Engines Service LLC; Federal Research Center "Computer Science and Control" of the Russian Academy of SciencesSmart Engines Service LLC; Federal Research Center "Computer Science and Control" of the Russian Academy of SciencesSmart Engines Service LLC; Institute for Information Transmission Problems of RAS (Kharkevich Institute)Smart Engines Service LLC; Federal Research Center "Computer Science and Control" of the Russian Academy of SciencesPresentation of folded documents is not an uncommon case in modern society. Digitizing such documents by capturing them with a smartphone camera can be tricky since a crease can divide the document contents into separate planes. To unfold the document, one could hold the edges potentially obscuring it in a captured image. While there are many geometrical rectification methods, they were usually developed for arbitrary bends and folds. We consider such algorithms and propose a novel approach Unfolder developed specifically for images of documents with a crease from folding in half. Unfolder is robust to projective distortions of the document image and does not fragment the image in the vicinity of a crease after rectification. A new Folded Document Images dataset was created to investigate the rectification accuracy of folded (2, 3, 4, and 8 folds) documents. The dataset includes 1600 images captured when document placed on a table and when held in hand. The Unfolder algorithm allowed for a recognition error rate of 0.33, which is better than the advanced neural network methods DocTr (0.44) and DewarpNet (0.57). The average runtime for Unfolder was only 0.25 s / image on an iPhone XR.https://www.computeroptics.ru/eng/KO/Annot/KO48-4/480409e.htmlfolded documentsimage rectificationdewarpingon-device acquisitionopen dataset |
spellingShingle | A.M. Ershov D.V. Tropin E.E. Limonova D.P. Nikolaev V.V. Arlazarov Unfolder: fast localization and image rectification of a document with a crease from folding in half Компьютерная оптика folded documents image rectification dewarping on-device acquisition open dataset |
title | Unfolder: fast localization and image rectification of a document with a crease from folding in half |
title_full | Unfolder: fast localization and image rectification of a document with a crease from folding in half |
title_fullStr | Unfolder: fast localization and image rectification of a document with a crease from folding in half |
title_full_unstemmed | Unfolder: fast localization and image rectification of a document with a crease from folding in half |
title_short | Unfolder: fast localization and image rectification of a document with a crease from folding in half |
title_sort | unfolder fast localization and image rectification of a document with a crease from folding in half |
topic | folded documents image rectification dewarping on-device acquisition open dataset |
url | https://www.computeroptics.ru/eng/KO/Annot/KO48-4/480409e.html |
work_keys_str_mv | AT amershov unfolderfastlocalizationandimagerectificationofadocumentwithacreasefromfoldinginhalf AT dvtropin unfolderfastlocalizationandimagerectificationofadocumentwithacreasefromfoldinginhalf AT eelimonova unfolderfastlocalizationandimagerectificationofadocumentwithacreasefromfoldinginhalf AT dpnikolaev unfolderfastlocalizationandimagerectificationofadocumentwithacreasefromfoldinginhalf AT vvarlazarov unfolderfastlocalizationandimagerectificationofadocumentwithacreasefromfoldinginhalf |