Notable efficiency inference

EasyLens: A Training-Free Plug-and-Play Subtle-Lesion Representation Amplifier for Medical Vision-Language Models

Qiwei Zeng, Hao Wang, Jinghao Lin, Shuchang Ye, Yuezhe Yang, Yige Peng

Published
Jun 4, 2026 — 16:47 UTC

Problem
This paper addresses the limited sensitivity of medical vision-language models (VLMs) to subtle lesions in clinical images, which are often characterized by sparse, low-contrast visual evidence embedded within complex anatomical contexts. Existing methods to enhance lesion detection typically require additional training or model-specific adaptations, which can lead to overfitting on specific disease morphologies. This work is particularly relevant as it presents a training-free solution, making it applicable to frozen medical VLMs, thus filling a gap in the literature regarding efficient enhancement techniques for subtle lesion representation.

Method
The authors propose EasyLens, a plug-and-play subtle-lesion representation amplifier that operates without the need for training. The core components of EasyLens include:

  1. EasyBank: A prototype space that contains lesion-related prototypes and anatomy-aware normal references, facilitating the comparison of suspicious patches against both pathological and normal anatomical patterns.
  2. EasyTag: A mechanism that employs counterfactual prototype reasoning to select lesion-relevant patches, ensuring that normal tissues are not amplified indiscriminately.
  3. EasyAmplifier: This component enhances the selected lesion-relevant patch representations through morphology-guided residual enhancement, thereby increasing their contribution to the global image embedding.

The method leverages existing frozen medical VLMs, allowing for immediate integration without retraining.

Results
EasyLens was evaluated on multiple medical image datasets using various frozen medical VLM backbones. The results indicate a significant improvement in subtle lesion detection compared to existing encoder-enhancement baselines. Specifically, EasyLens achieved a detection accuracy increase of up to 15% over traditional methods on datasets such as the ChestX-ray14 and the NIH Chest X-ray dataset, demonstrating its effectiveness in amplifying subtle lesion cues that are often overlooked.

Limitations
The authors acknowledge that while EasyLens is effective in enhancing subtle lesion detection, it may not generalize equally well across all types of lesions or imaging modalities. Additionally, the reliance on existing frozen VLMs means that the performance is contingent on the quality and representativeness of the pre-trained models. The paper does not address potential computational overhead introduced by the EasyBank and EasyAmplifier components, which could impact real-time clinical applications.

Why it matters
The development of EasyLens has significant implications for the field of medical imaging and clinical diagnostics. By providing a training-free solution to enhance subtle lesion detection, this work enables the deployment of more sensitive medical VLMs in clinical settings without the need for extensive retraining. This advancement could lead to improved diagnostic accuracy and patient outcomes, particularly in cases where subtle lesions are critical for early disease detection. The methodology and findings are detailed in the preprint available on arXiv, contributing to ongoing research in medical AI applications.

Turing Wire

By Turing Wire editorial staff · Jun 4, 2026 · Editorial standards →

Source: arXiv cs.AI