Lightweight Medical Image Segmentation using Randomized Diverse Scribble Prompting
Affiliation:

1. Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences; 2. Chongqing School, University of Chinese Academy of Sciences; 3. First Affiliated Hospital, Army Medical University; 4. Department of Radiology, The First Affiliated Hospital of Chongqing Medical University; 5. Chongqing Shukang Technology Service Co., Ltd.

CLC Number:

TP399

Fund Project:

National Natural Science Foundation of China (62106247) and Natural Science Foundation of Chongqing (CSTB2024NSCQ-MSX0932)

    Abstract:

    Medical image segmentation is a crucial component of computer-aided diagnosis and has progressed remarkably in recent years under the "prompt-segmentation" paradigm built on large models. MedSAM performs well in medical scenarios but requires substantial computational resources. LiteMedSAM, its lightweight version, suits resource-constrained environments, yet it does not fully exploit diverse mask information during the prompt encoding stage, which makes it difficult to achieve ideal segmentation results when annotations are sparse. To address this issue, a lightweight medical image segmentation algorithm based on randomized diverse scribble prompts is proposed. The algorithm preserves the lightweight nature of LiteMedSAM while adding three modules that fit the prompt encoder structure: random diverse scribble generation, adaptive prompt weighting based on Gumbel-Softmax, and multi-level gated fusion. Specifically, it first uses the global representation of the sparse prompts to pre-select the most discriminative scribble patterns at the logical level, then randomly generates multiple binary masks with diverse geometric shapes according to the adaptive weights, and finally fuses the mask prompts with maskless prior information through a dynamically weighted gating mechanism that operates at the spatial and channel levels. Experimental results show that, without significantly increasing computational cost, the proposed method achieves a higher Dice similarity coefficient (DSC) and normalized surface distance (NSD) than LiteMedSAM on multiple medical image segmentation datasets. The method has also been applied to radiation dose assessment for medical imaging, which supports its clinical application value.
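    The abstract names two standard building blocks, Gumbel-Softmax weighting and the Dice similarity coefficient, without giving formulas. The following is a minimal sketch of both in plain Python, not the authors' implementation: the function names, the temperature parameter `tau`, the three example logits, and the convention that two empty masks score 1.0 are all assumptions made for illustration.

```python
import math
import random


def gumbel_softmax(logits, tau=1.0, rng=None):
    """Return soft selection weights (non-negative, summing to 1) over the options."""
    rng = rng or random.Random()
    noisy = []
    for logit in logits:
        # Gumbel(0, 1) noise is -log(-log(U)) with U ~ Uniform(0, 1);
        # U is clamped away from 0 so the inner log stays finite.
        u = max(rng.random(), 1e-12)
        gumbel = -math.log(-math.log(u))
        noisy.append((logit + gumbel) / tau)
    # Numerically stable softmax over the perturbed, temperature-scaled logits.
    peak = max(noisy)
    exps = [math.exp(v - peak) for v in noisy]
    total = sum(exps)
    return [e / total for e in exps]


def dice(pred, target):
    """Dice similarity coefficient, 2|A ∩ B| / (|A| + |B|), for flat 0/1 masks."""
    intersection = sum(p * t for p, t in zip(pred, target))
    size = sum(pred) + sum(target)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if size == 0 else 2.0 * intersection / size


# Hypothetical logits for three candidate scribble patterns.
weights = gumbel_softmax([2.0, 0.5, -1.0], tau=0.5, rng=random.Random(0))
score = dice([1, 1, 0, 0], [1, 0, 0, 0])
```

    A lower `tau` pushes the weights toward a one-hot choice of a single pattern, while a higher `tau` spreads them more evenly; how the paper maps these weights onto scribble generation is not specified in the abstract.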

History
  • Received: September 04, 2025
  • Revised: November 21, 2025
  • Adopted: November 24, 2025