View in Telegram
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation Publication date: 11 Oct 2024 Topic: Semantic Segmentation Paper: https://arxiv.org/pdf/2410.08613v1.pdf GitHub: https://github.com/hit-sirs/crobim Description: In contrast to natural scenarios, expressions in RRSIS often involve complex geospatial relationships, with target objects of interest that vary significantly in scale and lack visual saliency, thereby increasing the difficulty of achieving precise segmentation. To address the aforementioned challenges, a novel RRSIS framework is proposed, termed the cross-modal bidirectional interaction model (CroBIM). Specifically, a context-aware prompt modulation (CAPM) module is designed to integrate spatial positional relationships and task-specific knowledge into the linguistic features, thereby enhancing the ability to capture the target object.
Love Center - Dating, Friends & Matches, NY, LA, Dubai, Global
Love Center - Dating, Friends & Matches, NY, LA, Dubai, Global
Find friends or serious relationships easily