Abstract
Automatic surgical gesture segmentation and recognition can provide useful feedback for surgical training in robotic surgery. Most prior work in this field relies on the robot's kinematic data. Although recent work [1,2] shows that the robot's video data can be equally effective for surgical gesture recognition, the segmentation of the video into gestures is assumed to be known. In this paper, we propose a framework for joint segmentation and recognition of surgical gestures from kinematic and video data. Unlike prior work that relies on either frame-level kinematic cues or segment-level kinematic or video cues, our approach exploits both cues by using a combined Markov/semi-Markov conditional random field (MsM-CRF) model. Our experiments show that the proposed model improves over a Markov or semi-Markov CRF when using video data alone, gives results that are comparable to state-of-the-art methods on kinematic data alone, and improves over state-of-the-art methods when combining kinematic and video data.
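The joint segmentation and recognition described above can be illustrated with a semi-Markov (segmental) Viterbi sketch that combines frame-level scores (the Markov part, e.g. from kinematic cues) with segment-level scores (the semi-Markov part, e.g. from video cues). This is an illustrative reconstruction of the general decoding scheme, not the authors' implementation; all function and variable names are hypothetical.

```python
import numpy as np

def msm_decode(frame_scores, seg_score, n_labels, max_len):
    """Joint segmentation and labeling via semi-Markov dynamic programming.

    frame_scores: (T, n_labels) array of frame-level (Markov) scores.
    seg_score(s, t, y): segment-level (semi-Markov) score for assigning
        label y to the frame span [s, t).
    Returns a list of (start, end, label) segments maximizing the
    combined frame-level + segment-level score.
    """
    T = frame_scores.shape[0]
    best = np.full(T + 1, -np.inf)   # best[t]: best score over frames [0, t)
    best[0] = 0.0
    back = [None] * (T + 1)          # backpointer: (segment start, label)
    # Cumulative frame scores so each segment's frame-level sum is O(1).
    cum = np.vstack([np.zeros((1, n_labels)),
                     np.cumsum(frame_scores, axis=0)])
    for t in range(1, T + 1):
        for d in range(1, min(max_len, t) + 1):   # candidate segment length
            s = t - d
            for y in range(n_labels):
                score = best[s] + (cum[t, y] - cum[s, y]) + seg_score(s, t, y)
                if score > best[t]:
                    best[t] = score
                    back[t] = (s, y)
    # Backtrack the optimal segmentation.
    segs, t = [], T
    while t > 0:
        s, y = back[t]
        segs.append((s, t, y))
        t = s
    return segs[::-1]
```

With toy frame scores that favor label 0 for the first half of a sequence and label 1 for the second, and a segment score that mildly penalizes each segment boundary, the decoder recovers the two-segment partition jointly with its labels rather than classifying presegmented clips.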
References
Béjar Haro, B., Zappella, L., Vidal, R.: Surgical gesture classification from video data. In: Ayache, N., Delingette, H., Golland, P., Mori, K. (eds.) MICCAI 2012, Part I. LNCS, vol. 7510, pp. 34–41. Springer, Heidelberg (2012)
Zappella, L., Béjar, B., Hager, G., Vidal, R.: Surgical gesture classification from video and kinematic data. Medical Image Analysis (2013)
Barden, C., Specht, M., McCarter, M., Daly, J., Fahey, T.: Effects of limited work hours on surgical training. Obstetrical & Gynecological Survey 58(4), 244–245 (2003)
Lenihan, J., Kovanda, C., Seshadri-Kreaden, U.: What is the learning curve for robotic assisted gynecologic surgery? J. of Minimally Invasive Gynecology 15(5), 589–594 (2008)
Padoy, N., Hager, G.D.: Human-machine collaborative surgery using learned models. In: IEEE Conference on Robotics and Automation, pp. 5285–5292 (2011)
Rosen, J., Hannaford, B., Richards, C., Sinanan, M.: Markov modeling of minimally invasive surgery based on tool/tissue interaction and force/torque signatures for evaluating surgical skills. IEEE Trans. Biomedical Eng. 48(5), 579–591 (2001)
Reiley, C.E., Hager, G.D.: Task versus subtask surgical skill evaluation of robotic minimally invasive surgery. In: Yang, G.-Z., Hawkes, D., Rueckert, D., Noble, A., Taylor, C. (eds.) MICCAI 2009, Part I. LNCS, vol. 5761, pp. 435–442. Springer, Heidelberg (2009)
Loukas, C., Georgiou, E.: Surgical workflow analysis with Gaussian mixture multivariate autoregressive (GMMAR) models: a simulation study. Computer Aided Surgery (2013)
Rosen, J., Solazzo, M., Hannaford, B., Sinanan, M.: Task decomposition of laparoscopic surgery for objective evaluation of surgical residents’ learning curve using hidden Markov model. Computer Aided Surgery 7(1), 49–61 (2002)
Leong, J.J.H., Nicolaou, M., Atallah, L., Mylonas, G.P., Darzi, A., Yang, G.Z.: HMM assessment of quality of movement trajectory in laparoscopic surgery. In: Larsen, R., Nielsen, M., Sporring, J. (eds.) MICCAI 2006. LNCS, vol. 4190, pp. 752–759. Springer, Heidelberg (2006)
Varadarajan, B.: Learning and inference algorithms for dynamical system models of dextrous motion. PhD thesis, Johns Hopkins University (2011)
Tao, L., Elhamifar, E., Khudanpur, S., Hager, G.D., Vidal, R.: Sparse hidden Markov models for surgical gesture classification and skill evaluation. In: Abolmaesumi, P., Joskowicz, L., Navab, N., Jannin, P. (eds.) IPCAI 2012. LNCS, vol. 7330, pp. 167–177. Springer, Heidelberg (2012)
Wolf, R., Duchateau, J., Cinquin, P., Voros, S.: 3D tracking of laparoscopic instruments using statistical and geometric modeling. In: Fichtinger, G., Martel, A., Peters, T. (eds.) MICCAI 2011, Part I. LNCS, vol. 6891, pp. 203–210. Springer, Heidelberg (2011)
Mung, J., Vignon, F., Jain, A.: A non-disruptive technology for robust 3D tool tracking for ultrasound-guided interventions. In: Fichtinger, G., Martel, A., Peters, T. (eds.) MICCAI 2011, Part I. LNCS, vol. 6891, pp. 153–160. Springer, Heidelberg (2011)
Richa, R., Bó, A.P.L., Poignet, P.: Robust 3D visual tracking for robotic-assisted cardiac interventions. In: Jiang, T., Navab, N., Pluim, J.P.W., Viergever, M.A. (eds.) MICCAI 2010, Part I. LNCS, vol. 6361, pp. 267–274. Springer, Heidelberg (2010)
Blum, T., Feußner, H., Navab, N.: Modeling and segmentation of surgical workflow from laparoscopic video. In: Jiang, T., Navab, N., Pluim, J.P.W., Viergever, M.A. (eds.) MICCAI 2010, Part III. LNCS, vol. 6363, pp. 400–407. Springer, Heidelberg (2010)
Lalys, F., Riffaud, L., Morandi, X., Jannin, P.: Automatic phases recognition in pituitary surgeries by microscope images classification. In: Navab, N., Jannin, P. (eds.) IPCAI 2010. LNCS, vol. 6135, pp. 34–44. Springer, Heidelberg (2010)
Lalys, F., Riffaud, L., Morandi, X., Jannin, P.: Surgical phases detection from microscope videos by combining SVM and HMM. In: Menze, B., Langs, G., Tu, Z., Criminisi, A. (eds.) MICCAI 2010. LNCS, vol. 6533, pp. 54–62. Springer, Heidelberg (2011)
Lalys, F., Riffaud, L., Bouget, D., Jannin, P.: An application-dependent framework for the recognition of high-level surgical tasks in the OR. In: Fichtinger, G., Martel, A., Peters, T. (eds.) MICCAI 2011, Part I. LNCS, vol. 6891, pp. 331–338. Springer, Heidelberg (2011)
Lalys, F., Riffaud, L., Bouget, D., Jannin, P.: A framework for the recognition of high-level surgical tasks from video images for cataract surgeries. IEEE Transactions on Biomedical Engineering 59(4), 966–976 (2012)
Fathi, A., Farhadi, A., Rehg, J.M.: Understanding egocentric activities. In: IEEE International Conference on Computer Vision, pp. 407–414 (2011)
Shi, Q., Cheng, L., Wang, L., Smola, A.J.: Human action segmentation and recognition using discriminative semi-Markov models. Int. Journal of Computer Vision 93(1), 22–32 (2011)
Andrew, G.: A hybrid Markov/semi-Markov conditional random field for sequence segmentation. In: Conf. on Empirical Methods in Natural Language Processing, pp. 465–472 (2006)
Laptev, I.: On space-time interest points. International Journal of Computer Vision 64(2-3), 107–123 (2005)
Wang, Y., Tran, D., Liao, Z.: Learning hierarchical poselets for human parsing. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1705–1712 (2011)
Tsochantaridis, I., Joachims, T., Hofmann, T., Altun, Y.: Large margin methods for structured and interdependent output variables. J. of Machine Learning Research 6, 1453–1484 (2005)
Joachims, T., Finley, T., Yu, C.N.J.: Cutting-plane training of structural SVMs. Machine Learning 77(1), 27–59 (2009)
Reiley, C.E., Lin, H.C., Varadarajan, B., Vagolgyi, B., Khudanpur, S., Yuh, D.D., Hager, G.D.: Automatic recognition of surgical motions using statistical modeling for capturing variability. In: Medicine Meets Virtual Reality, pp. 396–401 (2008)
© 2013 Springer-Verlag Berlin Heidelberg
Cite this paper
Tao, L., Zappella, L., Hager, G.D., Vidal, R. (2013). Surgical Gesture Segmentation and Recognition. In: Mori, K., Sakuma, I., Sato, Y., Barillot, C., Navab, N. (eds) Medical Image Computing and Computer-Assisted Intervention – MICCAI 2013. MICCAI 2013. Lecture Notes in Computer Science, vol 8151. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40760-4_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40759-8
Online ISBN: 978-3-642-40760-4