Six papers have been accepted to ECCV 2024. See their details: Grounding DINO; LLaVA-Plus; TAPTR: Tracking Any Point; T-Rex2: Text-Visual Prompted Detector; LLaVA-Grounding; Semantic-SAM.