卵巢癌病理图像的单边噪声多实例学习分类方法
DOI:
CSTR:
作者:
作者单位:

1.昆明理工大学 信息工程与自动化学院;2.中国北京大学肿瘤医院云南医院,云南省肿瘤医院,昆明医科大学第三附属医院

作者简介:

通讯作者:

中图分类号:

TP 391.41

基金项目:

国家自然科学基金(00000000);国家高技术研究发展计划(863计划)(2008AA000000)


One-sided label noise multi-instance learning classification method for ovarian cancer pathological images
Author:
Affiliation:

1.School of Information Engineering and Automation,Kunming University of Science and Technology,Kunming;2.Peking University Cancer Hospital Yunnan Hospital (Yunnan Cancer Hospital, The Third Affiliated Hospital of Kunming Medical University)

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对数字病理学中训练样本标注成本高、易引入单边标签噪声(即良性样本被错标为恶性)的挑战,提出一种能够在该噪声环境下进行稳健学习的多实例学习(Multi-Instance Learning,MIL)分类方法,即一种基于双重加权机制的单边噪声多实例学习(MIL-OSLN)的方法,以提高卵巢癌病理图像的自动诊断准确率。该方法在实例(instance)和包(bag)两个层级同时进行动态加权,协同地从噪声数据中筛选出关键诊断区域,并抑制假阳性包的干扰。利用半监督聚类生成的弱标签数据集训练模型,并在独立的金标准测试集上与多种基线模型进行了全面的性能对比。实验结果显示,即使在训练集噪声率高达50%的情况下,模型AUC仍能稳定在0.9左右,综合性能显著优于所有对比基线模型。研究表明,该方法能够有效应对MIL中的单边标签噪声问题,适用于大规模弱标注病理数据的肿瘤智能诊断。

    Abstract:

    High annotation costs and one-sided label noise (benign samples mislabeled as malignant) pose challenges in digital pathology. To address these issues, a One-Sided Label Noise Multi-Instance Learning (MIL-OSLN) framework based on a dual-weighting mechanism was proposed to improve the automatic diagnosis accuracy of ovarian cancer pathological images. The framework dynamically weighted samples at both instance and bag levels simultaneously. It synergistically screened key diagnostic regions from noisy data and suppressed interference from false-positive bags. The model was trained using a weak label dataset generated via semi-supervised clustering, and performance was compared against various baseline models on an independent gold-standard test set. Experimental results indicated that the model"s Area Under the Curve (AUC) remained stable at approximately 0.9 even when the training set noise rate reached 50%. The comprehensive performance significantly outperformed all comparative baseline models. The study demonstrates that this method effectively addresses one-sided label noise in MIL and is suitable for intelligent tumor diagnosis using large-scale weakly labeled pathological data.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-12-12
  • 最后修改日期:2026-01-12
  • 录用日期:2026-04-08
  • 在线发布日期:
  • 出版日期:
文章二维码