Abstract: Audio-visual zero-shot learning (ZSL) leverages both video and audio information for model training, aiming to classify new video categories that were not seen during training. However, ...
Abstract: In recent years, vision-based sign language recognition (SLR) has become an active research area with advances in deep learning. However, it is difficult to collect sign language data ...