Abstract: The application of vision-language (VL) models in Internet of Drones (IoD) frameworks is a groundbreaking solution for semantic comprehension of aerial tasks. In contrast to conventional ...