Abstract: Foundation models such as ChatGPT have made significant strides in robotic tasks due to their universal representation of real-world domains. In this paper, we leverage foundation models to ...
Feb. 22nd, 2024: We released our paper on arXiv. Further details can be found in the code and our updated arXiv version. Weakly supervised visual recognition using inexact supervision is a critical yet ...