DSpace Repository

GAN-based text line segmentation method for challenging handwritten documents

Show simple item record

dc.creator ÖZKAYA, Ufuk
dc.creator Özşeker, İbrahim
dc.creator DEMİR, Ali Alper
dc.date 2024-01-01T00:00:00Z
dc.date.accessioned 2024-08-26T12:05:20Z
dc.date.available 2024-08-26T12:05:20Z
dc.identifier 08076aa9-798b-417f-8001-7e17428ad2fd
dc.identifier 10.1007/s10032-024-00488-5
dc.identifier https://avesis.sdu.edu.tr/publication/details/08076aa9-798b-417f-8001-7e17428ad2fd/oai
dc.identifier.uri http://acikerisim.sdu.edu.tr/xmlui/handle/123456789/98659
dc.description Text line segmentation (TLS) is an essential step of the end-to-end document analysis systems. The main purpose of this step is to extract the individual text lines of any handwritten documents with high accuracy. Handwritten and historical documents mostly contain touching and overlapping characters, heavy diacritics, footnotes and side notes added over the years. In this work, we present a new TLS method based on generative adversarial networks (GAN). TLS problem is tackled as an image-to-image translation problem and the GAN model was trained to learn the spatial information between the individual text lines and their corresponding masks including the text lines. To evaluate the segmentation performance of the proposed GAN model, two challenging datasets, VML-AHTE and VML-MOC, were used. According to the qualitative and quantitative results, the proposed GAN model achieved the best segmentation accuracy on the VML-MOC dataset and showed competitive performance on the VML-AHTE dataset.
dc.language eng
dc.rights info:eu-repo/semantics/closedAccess
dc.title GAN-based text line segmentation method for challenging handwritten documents
dc.type info:eu-repo/semantics/article


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account