论文标题

希腊纸莎草纸上的作家检索和作家身份

Writer Retrieval and Writer Identification in Greek Papyri

论文作者

Christlein, Vincent, Marthot-Santaniello, Isabelle, Mayr, Martin, Nicolaou, Anguelos, Seuret, Mathias

论文摘要

数字化历史手稿的分析通常由古学专家解决。作者身份证是指已知作者的分类,而作家的检索则试图通过图像数据集中的图像相似性找到作者。尽管自动作家识别/检索方法已经为许多历史文档类型提供了有希望的结果,但由于纤维结构和严重的人工制品,纸莎草数据非常具有挑战性。因此,改进作者识别的重要步骤是预处理和特征抽样过程。我们研究了几种方法,并表明良好的二进制化是纸莎草著作中改进的作者认同的关键。我们主要关注基于传统或基于自我监督的方法的无监督特征方法的作者检索。但是,在作者分类/重新识别的情况下,它也可以与基于深度学习的最深层学习方法相媲美。

The analysis of digitized historical manuscripts is typically addressed by paleographic experts. Writer identification refers to the classification of known writers while writer retrieval seeks to find the writer by means of image similarity in a dataset of images. While automatic writer identification/retrieval methods already provide promising results for many historical document types, papyri data is very challenging due to the fiber structures and severe artifacts. Thus, an important step for an improved writer identification is the preprocessing and feature sampling process. We investigate several methods and show that a good binarization is key to an improved writer identification in papyri writings. We focus mainly on writer retrieval using unsupervised feature methods based on traditional or self-supervised-based methods. It is, however, also comparable to the state of the art supervised deep learning-based method in the case of writer classification/re-identification.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源