Lightweight Medical Image Segmentation using Randomized Diverse Scribble Prompting

Affiliation: 1. Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences; 2. Chongqing School, University of Chinese Academy of Sciences; 3. First Affiliated Hospital, Army Medical University; 4. Department of Radiology, The First Affiliated Hospital of Chongqing Medical University; 5. Chongqing Shukang Technology Service Co., Ltd.
CLC Number: TP399
Fund Project: National Natural Science Foundation of China (62106247) and Natural Science Foundation of Chongqing (CSTB2024NSCQ-MSX0932)

Abstract:

Medical image segmentation is a crucial component of computer-aided diagnosis and has progressed rapidly in recent years under the large-model "prompt-segmentation" paradigm. MedSAM performs well in medical scenarios but requires substantial computational resources. LiteMedSAM, its lightweight version, suits resource-constrained environments, yet it does not fully exploit diverse mask information in the prompt encoding stage, so it struggles to segment well when annotations are sparse. To address this issue, this paper proposes a lightweight medical image segmentation algorithm based on randomized diverse scribble prompts. The algorithm preserves the lightweight design of LiteMedSAM and adds three modules tailored to its prompt encoder: randomized diverse scribble generation, adaptive prompt weighting based on Gumbel-Softmax, and multi-level gated fusion. Specifically, it first uses the global representation of the sparse prompts to pre-select the most discriminative scribble patterns at the logit level, then randomly generates multiple binary masks with diverse geometric shapes according to the adaptive weights, and finally fuses the mask prompts with the mask-free prior information through a dynamically weighted gating mechanism that operates at both the spatial and channel levels. Experimental results show that, without significantly increasing computational cost, the proposed method achieves a higher Dice similarity coefficient (DSC) and normalized surface distance (NSD) than LiteMedSAM on multiple medical image segmentation datasets. The method has also been applied to radiation dose assessment in medical imaging, which supports its clinical value.
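
The three steps named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the three scribble shapes (line, ring, random walk), every function name, and the parameters tau, n_masks, and gate_logits are assumptions made for illustration. In the actual method, the pattern logits and the gate would presumably come from learned layers in the prompt encoder, which this sketch does not model.

```python
import numpy as np

def gumbel_softmax(logits, tau, rng):
    """Soft, differentiable-style sample over pattern types (standard Gumbel-Softmax)."""
    u = rng.uniform(low=1e-9, high=1.0, size=logits.shape)
    gumbel = -np.log(-np.log(u))             # Gumbel(0, 1) noise
    y = (logits + gumbel) / tau
    e = np.exp(y - y.max())                  # numerically stable softmax
    return e / e.sum()

def line_scribble(h, w, rng):
    """Hypothetical pattern: a straight stroke between two random points."""
    mask = np.zeros((h, w), dtype=np.uint8)
    r0, r1 = rng.integers(0, h, size=2)
    c0, c1 = rng.integers(0, w, size=2)
    n = max(h, w)
    rows = np.linspace(r0, r1, n).round().astype(int)
    cols = np.linspace(c0, c1, n).round().astype(int)
    mask[rows, cols] = 1
    return mask

def ring_scribble(h, w, rng):
    """Hypothetical pattern: a thin ring around a random centre."""
    yy, xx = np.mgrid[:h, :w]
    cy, cx = rng.integers(0, h), rng.integers(0, w)
    radius = rng.integers(2, max(3, min(h, w) // 4))
    dist = np.hypot(yy - cy, xx - cx)
    return (np.abs(dist - radius) <= 1.0).astype(np.uint8)

def walk_scribble(h, w, rng, steps=64):
    """Hypothetical pattern: a free-form stroke traced by a random walk."""
    mask = np.zeros((h, w), dtype=np.uint8)
    r, c = int(rng.integers(0, h)), int(rng.integers(0, w))
    for _ in range(steps):
        mask[r, c] = 1
        dr, dc = rng.integers(-1, 2, size=2)
        r = int(np.clip(r + dr, 0, h - 1))
        c = int(np.clip(c + dc, 0, w - 1))
    return mask

GENERATORS = [line_scribble, ring_scribble, walk_scribble]

def sample_scribbles(logits, h, w, n_masks=4, tau=1.0, seed=0):
    """Weight the pattern types, then draw n_masks binary scribble masks."""
    rng = np.random.default_rng(seed)
    weights = gumbel_softmax(np.asarray(logits, dtype=float), tau, rng)
    kinds = rng.choice(len(GENERATORS), size=n_masks, p=weights)
    masks = np.stack([GENERATORS[k](h, w, rng) for k in kinds])
    return weights, masks

def gated_fusion(mask_emb, no_mask_emb, gate_logits):
    """Element-wise sigmoid gate over (channel, height, width) embeddings."""
    gate = 1.0 / (1.0 + np.exp(-gate_logits))
    return gate * mask_emb + (1.0 - gate) * no_mask_emb

# Example: three pattern logits, four 32x32 scribble masks.
weights, masks = sample_scribbles([0.5, 1.0, -0.3], 32, 32)
```

A lower tau pushes the weights toward a one-hot choice of a single pattern type, while a higher tau spreads the draws across shapes. In gated_fusion, a gate value near 1 keeps the mask prompt embedding and a value near 0 falls back to the mask-free prior.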

History

Received: September 04, 2025
Revised: November 21, 2025
Accepted: November 24, 2025