Abstract
Augmented Reality (AR)-based video telephony can give mobile users a better user experience (UX) because it allows participants to place augmented objects on video frames and transmit them to a peer. Although a number of AR-based mobile video communication models exist today, they are limited in their support for technical services such as real-time object detection, dynamic data selection, and discrimination between local and remote data augmentation. This paper presents an enhanced AR-based mobile video telephony scheme in which an object of interest can be dynamically combined with a video frame through real-time object detection, so that users can immediately share their experience with a friend during a video call. To evaluate the effectiveness and feasibility of the proposed scheme, an application was implemented on a mobile system and its computational time was measured. Experimental results show that the proposed system can give users a better UX with only a small increase in computational time.
Acknowledgments
This paper was supported by Research Fund, Kumoh National Institute of Technology.
About this article
Cite this article
Jang, S.B., Kim, Y.G. & Ko, YW. Mobile video communication based on augmented reality. Multimed Tools Appl 76, 16893–16909 (2017). https://doi.org/10.1007/s11042-016-3627-4
DOI: https://doi.org/10.1007/s11042-016-3627-4