Continuous Sign Language Recognition with Correlation Network

Hu, Lianyu; Gao, Liqing; Liu, Zekang; Feng, Wei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.03202 (cs)

[Submitted on 6 Mar 2023 (v1), last revised 18 Mar 2023 (this version, v3)]

Title:Continuous Sign Language Recognition with Correlation Network

Authors:Lianyu Hu, Liqing Gao, Zekang Liu, Wei Feng

View PDF

Abstract:Human body trajectories are a salient cue to identify actions in the video. Such body trajectories are mainly conveyed by hands and face across consecutive frames in sign language. However, current methods in continuous sign language recognition (CSLR) usually process frames independently, thus failing to capture cross-frame trajectories to effectively identify a sign. To handle this limitation, we propose correlation network (CorrNet) to explicitly capture and leverage body trajectories across frames to identify signs. In specific, a correlation module is first proposed to dynamically compute correlation maps between the current frame and adjacent frames to identify trajectories of all spatial patches. An identification module is then presented to dynamically emphasize the body trajectories within these correlation maps. As a result, the generated features are able to gain an overview of local temporal movements to identify a sign. Thanks to its special attention on body trajectories, CorrNet achieves new state-of-the-art accuracy on four large-scale datasets, i.e., PHOENIX14, PHOENIX14-T, CSL-Daily, and CSL. A comprehensive comparison with previous spatial-temporal reasoning methods verifies the effectiveness of CorrNet. Visualizations demonstrate the effects of CorrNet on emphasizing human body trajectories across adjacent frames.

Comments:	CVPR2023, Camera ready version. code: this https URL. Made few modifications on explanations. arXiv admin note: text overlap with arXiv:2211.17081
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.03202 [cs.CV]
	(or arXiv:2303.03202v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.03202

Submission history

From: Lianyu Hu [view email]
[v1] Mon, 6 Mar 2023 15:02:12 UTC (1,201 KB)
[v2] Wed, 8 Mar 2023 14:21:22 UTC (1,204 KB)
[v3] Sat, 18 Mar 2023 12:31:42 UTC (1,204 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Continuous Sign Language Recognition with Correlation Network

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Computer Vision and Pattern Recognition

Title:Continuous Sign Language Recognition with Correlation Network

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.