Running on Zero VLM Object Understanding π¦ Explore object detection, visual grounding, keypoint Detecti