Skip to main content
Log in

Gesture recognition of traffic police based on static and dynamic descriptor fusion

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

We present a method to recognize gestures made by Chinese traffic police based on the static and dynamic descriptor fusion for driver assistance systems and intelligent vehicles. Gesture recognition is made possible by combining the extracted static and dynamic features. First, the point cloud data of human upper body in each frame of input video is obtained to estimate the static descriptor with 2.5D gesture model. Then, the dynamic descriptor is estimated by computing the motion history image of the input RGB video sequence. Finally, the above two descriptors are fused and the mean structural similarity index is used to recognize the gestures made by Chinese traffic police. A comparative study and qualitative evaluation are proposed with other gesture recognition methods, which demonstrate that better recognition results can be obtained using the proposed method on a number of video sequences.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  1. Bradski G, Davis J (2000) Motion segmentation and pose recognition with motion history gradients. In: Proceedings of IEEE Workshop on Applications of Computer Vision, pp 174–184

  2. Cai ZX, Guo F (2015) Max-covering scheme for gesture recognition of Chinese traffic police. Pattern Anal Applic 18(2):403–418

    Article  MathSciNet  Google Scholar 

  3. Eichner M, Ferrari V (2009) Better appearance models for pictorial structures. In: Proceeding of British Machine Vision Conference, London, UK, pp 1–11

  4. Eichner M, Ferrari V (2012) Human pose co-estimation and applications. IEEE Trans Pattern Anal Mach Intell (PAMI) 34(11):2282–2288

    Article  Google Scholar 

  5. Eichner M, Marin-Jimenez M, Zisserman A, Ferrari V (2012) 2D articulated human pose estimation and retrieval in (almost) unconstrained still images. Int J Comput Vis (IJCV) 99(2):190–214

    Article  MathSciNet  Google Scholar 

  6. Ferrari V, Marin-Jimenez M, Zisserman A (2008) Progressive search space reduction for human pose estimation. In: Proceeding of IEEE Conference on Computer Vision & Pattern Recognition, Anchorage, AK, pp 1–8

  7. Guo F, Cai ZX, Tang J (2011) Chinese traffic police gesture recognition in complex scene. In: Proceeding of the 2011 International Joint Conference of IEEE FCST-11, Los Alamitos, USA, pp 1505–1511

  8. Guo F, Tang J, Cai ZX (2013) Automatic recognition of Chinese traffic police gesture based on max-covering scheme. Adv Inf Sci Serv Sci 5(1):428–436

    Google Scholar 

  9. Huang YM, Zhang GB, Li X, Da FP (2011) Improved emotion recognition with novel global utterance-level features. Appl Math Inf Sci 5(2):147–153

    Google Scholar 

  10. Johnson S, Everigham M (2011) Learning effective human pose estimation from inaccurate annotation. In: Proceeding of IEEE Conference on Computer Vision & Pattern Recognition, Colorado Springs, USA, pp 1465–1472

  11. Kang H, Lee CW, Jung K (2004) Recognition-based gesture spotting in video games. Pattern Recogn Lett 25(15):1701–1714

    Article  Google Scholar 

  12. Le QK, Pham CH, Le TH (2012) Road traffic control gesture recognition using depth images. IEEK Trans Smart Process Comput 1(1):1–7

    Google Scholar 

  13. Liu JG, Luo JB, Shan M (2009) Recognizing realistic actions from videos ‘in the wild’. In: Proceeding of IEEE Conference on Computer Vision & Pattern Recognition, Miami, FL, pp 1996–2003

  14. Sapp B, Jordan C, Taskar B (2010) Adaptive pose prior for pictorial structures. In: Proceeding of IEEE Conference on Computer Vision & Pattern Recognition, San Francisco, USA pp 422–429

  15. Singh M, Mandal M, Basu A (2005) Visual gesture recognition for ground air traffic control using the Radon transform. In: Proceeding of IEEE/RSJ International Conference on Intelligent Robots & Systems, Edmonton, Canada, pp 2586–2591

  16. Smisek J, Jancosek M, Pajdla T (2011) 3D with Kinect. In: Proceedings of the 2011 I.E. International Conference on Computer Vision Workshops, Barcelona, Spain, pp 1154–1160

  17. Song Y, Demirdjian D, Davis R (2011) Tracking body and hands for gesture recognition: NATOPS aircraft handling signal database. In: Proceeding of IEEE International Conference on Automatic Face & Gesture Recognition and Workshops, Santa Barbara, CA, pp 500–506

  18. Suau X, Casas JR, Ruiz-Hidalgo J (2011) Real-time head and hand tracking based on 2.5D data. In: Proceedings of the 2011 I.E. International Conference on Multimedia and Expo, Barcelona, Spain, pp 1–6

  19. Tang J, Luo J, Tjahjadi T, Gao Y (2014) 2.5D multi-view gait recognition based on point cloud registration. Sensors 14:6124–6143

    Article  Google Scholar 

  20. Visual Geometry Group (2015) 2D articulated human pose estimation software v1.22, http://groups.inf.ed.ac.uk/calvin/articulated_human_pose_estimation_code/. Accessed 15 May 2015

  21. Yuan T, Wang B (2010) Accelerometer-based Chinese traffic police gesture recognition system. Chin J Electron 19(2):270–274

    MathSciNet  Google Scholar 

  22. Zhou W, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13(4):600–612

    Article  Google Scholar 

  23. Zhou Z, Li ST, Sun B (2014) Extreme learning machine based hand posture recognition in color-depth image. In: Proceedings of Chinese Conference on Pattern Recognition, pp 1–10

  24. Zhu Y, Fujimura K (2010) A Bayesian framework for human body pose tracking from depth image sequences. Sensors 10:5280–5293

    Article  Google Scholar 

  25. Zou BJ, Chen S, Shi C et al (2009) Automatic reconstruction of 3D human motion pose from uncalibrated monocular video sequences based on markerless human motion tracking. Pattern Recogn 42:1559–1571

    Article  MATH  Google Scholar 

Download references

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China (No. 61502537, 91220301), China Postdoctoral Science Foundation (No. 2014 M552154), Hunan Planned Projects for Key Scientific Research Funds (No. 2015WK3006), Postdoctoral Science Foundation of Central South University (No. 126648).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jin Tang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Guo, F., Tang, J. & Wang, X. Gesture recognition of traffic police based on static and dynamic descriptor fusion. Multimed Tools Appl 76, 8915–8936 (2017). https://doi.org/10.1007/s11042-016-3497-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-016-3497-9

Keywords

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy