Visual transformer with depthwise separable convolution projections for video-based human action recognition
Yu Cao, Fang Wang and Qiusheng Zheng
MATEC Web Conf., 413 (2025) 06003
DOI: https://doi.org/10.1051/matecconf/202541306003