Using Vision Language Models t... Note