Abstract: Audio-visual segmentation (AVS) aims to achieve precise object segmentation by leveraging multimodal cues. However, effective alignment and fusion of audio and visual features are often ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results