Localization Lens for Improving Medical Vision-Language Models
|

Localization Lens for Improving Medical Vision-Language Models

Hasan Farooq, Murtaza Taj, Mehwish Nasim, Arif Mahmood Abstract: Medical Vision-Language Models (Med-VLMs) have demonstrated strong capabilities in clinical tasks. However, they often struggle to understand anatomical structures and spatial positioning, which are crucial for medical reasoning. To address this, we propose a localization-aware enhancement to the Med-VLM pipeline, introducing improvements at three levels: data,…

CATVis: Context-Aware Thought Visualization
|

CATVis: Context-Aware Thought Visualization

Tariq Mehmood*, Hamza Ahmad*, Muhammad Haroon Shakeel, Murtaza Taj (* contributed equally) Abstract: EEG-based brain-computer interfaces (BCIs) have shown promise in various applications, such as motor imagery and cognitive state monitoring. However, decoding visual representations from EEG signals remains a significant challenge due to their complex and noisy nature. We thus propose a novel 5-stage…