注释粒度在深度学习模型中对组织病理学图像的影响

论文标题

注释粒度在深度学习模型中对组织病理学图像的影响

Effects of annotation granularity in deep learning models for histopathological images

论文作者

Shi, Jiangbo, Gao, Zeyu, Zhang, Haichuan, Puttapirat, Pargorn, Wang, Chunbao, Zhang, Xiangrong, Li, Chen

论文摘要

病理对癌症诊断至关重要。通常，病理学家根据组织学幻灯片上观察到的细胞和组织结构得出结论。机器学习，尤其是深度学习的快速发展已经建立了强大而准确的分类器。它们被用于分析组织病理学幻灯片并协助病理学家诊断。大多数机器学习系统在很大程度上依赖于注释的数据集来获得经验和知识，以正确，准确地执行各种任务，例如分类和细分。这项工作研究了组织病理学数据集中注释的不同粒度，包括图像，边界框，椭圆方面和像素方面的粒度，以验证注释在病理幻灯片中对深度学习模型的影响。我们设计了相应的实验，以根据具有不同注释粒度的注释来测试深度学习模型的分类和分割性能。在分类中，当通过像素注释数据集培训时，基于最新的深度学习分类器的性能更好。平均而言，精度，召回和F1得分分别提高7.87％，8.83％和7.85％。因此，建议在分类任务中深入学习算法可以更好地利用较细的粒度注释。同样，在通过像素的注释训练时，语义分割算法可以实现8.33％的分割精度。我们的研究不仅表明，较细粒的注释可以改善深度学习模型的性能，还可以帮助从组织病理学幻灯片中提取更准确的表型信息。接受颗粒状注释培训的智能系统可以帮助病理学家检查某些区域以更好地诊断。类似于这项工作的分室预测方法可能有助于表型和基因型关联研究。

Pathological is crucial to cancer diagnosis. Usually, Pathologists draw their conclusion based on observed cell and tissue structure on histology slides. Rapid development in machine learning, especially deep learning have established robust and accurate classifiers. They are being used to analyze histopathological slides and assist pathologists in diagnosis. Most machine learning systems rely heavily on annotated data sets to gain experiences and knowledge to correctly and accurately perform various tasks such as classification and segmentation. This work investigates different granularity of annotations in histopathological data set including image-wise, bounding box, ellipse-wise, and pixel-wise to verify the influence of annotation in pathological slide on deep learning models. We design corresponding experiments to test classification and segmentation performance of deep learning models based on annotations with different annotation granularity. In classification, state-of-the-art deep learning-based classifiers perform better when trained by pixel-wise annotation dataset. On average, precision, recall and F1-score improves by 7.87%, 8.83% and 7.85% respectively. Thus, it is suggested that finer granularity annotations are better utilized by deep learning algorithms in classification tasks. Similarly, semantic segmentation algorithms can achieve 8.33% better segmentation accuracy when trained by pixel-wise annotations. Our study shows not only that finer-grained annotation can improve the performance of deep learning models, but also help extracts more accurate phenotypic information from histopathological slides. Intelligence systems trained on granular annotations may help pathologists inspecting certain regions for better diagnosis. The compartmentalized prediction approach similar to this work may contribute to phenotype and genotype association studies.

下载PDF全文

下载文献需遵守相关版权规定

论文标题