Abstract
In contrast to other services on the Internet, streaming media service needs to fetch data from local disks more frequently, since it always lasts long and the bit rate is quite high. In addition, because of the much slower reading/writing speed of disk than random access memory (RAM), adopting advisable RAM caching policy can efficiently reduce disk I/O. In this paper, we study the problem of reducing disk I/O by using a novel approach. We first provide a new popularity estimate algorithm. Then a formal optimization problem about average disk I/O is presented, and a suboptimal caching algorithm for a special case of the problem is given. Furthermore, a partially observable Markov decision process (POMDP) model is constructed for the caching problem. Based on the model, popularity is taken advantage of to predict clients’ randomized behaviors, data replacing decisions are made when the defined observations occur, and the impact of caching actions on disk performance for future infinite steps is assessed. The method of event-based optimization is applied in search of the optimal stochastic policy. Disk I/O, as the long-run average performance measure, is optimized by applying the policy-gradient algorithm. The simulation results illustrate that data required by clients could be better predicted and lower disk I/O could be achieved by using the model proposed in this paper.









Similar content being viewed by others
References
Acharya S, Smith BC, Parnes P (1999) Characterizing user access to videos on the world wide web Electronic Imaging, International Society for Optics and Photonics, pp 130–141
Aggarwal G, Kumar G, Sethia D (2012) Dynamic prefix caching of videos with lazy update Proceedings of the second international conference on Computational Science, Engineering and Information Technology. ACM, pp 689–693
Baxter J, Bartlett PL (2001) Infinite-horizon policy-gradient estimation. J Artif Intell Res 15:319–350
Bhat UN (2015) An introduction to queueing theory: modeling and analysis in applications, Birkhäuser
Borst S, Gupta V, Walid A (2009) Self-organizing algorithms for cache cooperation in content distribution networks. Bell Labs Tech J 14(3):113–125
Cao XR, Zhang J (2008) Event-based optimization of markov systems. IEEE Trans Autom Control 53(4):1076–1082
Chen S, Shen B, Wee S, Zhang X (2006) Segment-based streaming media proxy: modeling and optimization. IEEE Trans Multimedia 8(2):243–256
Chesire M, Wolman A, Voelker GM, Levy HM (2001) Measurement and analysis of a streaming media workload USITS, vol 1, pp 1–1
Cormen T, Leiserson C, Rivest R (1990) Introduction to algorithms. MIT Press, Cambridge
Dan A, Sitaram D (1994) Buffer management policy for an on-demand video server. Citeseer, Yorktown Heights
Dan A, Sitaram D (1996) Generalized interval caching policy for mixed interactive and long video workloads Electronic Imaging: Science & Technology, International Society for Optics and Photonics, pp 344–351
Eberhard M, Szkaliczki T, Hellwagner H, Szobonya L, Timmerer C (2010) Knapsack problem-based piece-picking algorithms for layered content in peer-to-peer networks. ACM
Gomaa H, Messier GG, Williamson C, Davies R (2013) Estimating instantaneous cache hit ratio using markov chain analysis. IEEE/ACM Trans Networking 21(5):1472–1483
Gramatikov S, Jaureguizar F (2016) Modelling and analysis of non-cooperative peer-assisted vod streaming in managed networks. Multimedia Tools and Applications 75(8):4321–4348
He Y, Shen G, Xiong Y, Guan L (2009) Optimal prefetching scheme in p2p vod applications with guided seeks. IEEE Trans Multimedia 11(1):138–151
Hsu SP, Chuang DM, Arapostathis A (2006) On the existence of stationary optimal policies for partially observed mdps under the long-run average cost criterion. Syst Control Lett 55(2):165–173
Hwang KS, Jiang WC, Chen YJ (2015) Model learning and knowledge sharing for a multiagent system with dyna-q learning. IEEE transactions on cybernetics 45(5):978–990
Lee D, Choi J, Kim JH, Noh SH, Min SL, Cho Y, Kim CS (2001) Lrfu: a spectrum of policies that subsumes the least recently used and least frequently used policies. IEEE Trans Comput 50(12):1352–1361
Li Y, Yin B, Xi H (2008) Partially observable markov decision processes and performance sensitivity analysis. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 38(6):1645–1651
Lim H, Du DH (2001) Protocol considerations for video prefix-caching proxy in wide area networks. Electron Lett 37(6):1
Ling Q, Xu L, Yan J, Zhang Y (2015) An adaptive caching algorithm suitable for time-varying user accesses in vod systems. Multimedia Tools and Applications 74 (24):11,117–11,137
Lu X, Yin B, Zhang X, Cao J, Kang Y (2016) Event-based optimization for admission control in distributed service system. Telecommun Syst 62(3):553–567
Luo JG, Zhang Q, Tang Y, Yang SQ (2009) A trace-driven approach to evaluate the scalability of p2p-based video-on-demand service. IEEE Trans Parallel Distrib Syst 20(1):59–70
Megiddo N, Modha DS (2004) Outperforming lru with an adaptive replacement cache algorithm. Computer 37(4):58–65
Oh HR, Song H (2007) Metafile-based scalable caching and dynamic replacing algorithms for multiple videos over quality-of-service networks. IEEE Trans Multimedia 9(7):1535–1542
Psounis K, Prabhakar B (2002) Efficient randomized web-cache replacement schemes using samples from past eviction times. IEEE/ACM Trans Networking (TON) 10(4):441–455
Puterman ML (2014) Markov decision processes: discrete stochastic dynamic programming, Wiley
Rizzo L, Vicisano L (2000) Replacement policies for a proxy cache. IEEE/ACM Trans Networking (ToN) 8(2):158–170
Sen S, Rexford J, Towsley D (1999) Proxy prefix caching for multimedia streams IEEE Proceedings of the INFOCOM’99. 18th Annual joint conference of the IEEE computer and communications societies, vol 3. IEEE, pp 1310–1319
Senda K, Hattori S, Hishinuma T, Kohda T (2014) Acceleration of reinforcement learning by policy evaluation using nonstationary iterative method. IEEE trans cybernetics 44(12):2696–2705
Sprangers O, Babuṡka R, Nageshrao SP, Lopes GA (2015) Reinforcement learning for port-hamiltonian systems. IEEE trans cybernetics 45(5):1017–1027
Tewari R, Vin HM, Dan A, Sitaram D (1997) Resource-based caching for web servers Photonics West’98 Electronic Imaging, International Society for Optics and Photonics, pp 191–204
Wilson L, Zipf GK (1949) Human behavior and the principle of least effort
Wu KL, Yu PS, Wolf JL (2004) Segmentation of multimedia streams for proxy caching. IEEE Trans Multimedia 6(5):770–780
Xie H, Gao L, Zhang L, Zhang Z, Yang M (2009) An efficient caching mechanism for video-on-demand service over peer-to-peer network International Conference on Scalable computing and communications; 8th international conference on embedded computing, 2009. SCALCOM-EMBEDDEDCOM’09. IEEE, pp 251–256
Yin B, Lu S, Guo D (2011) Analysis of admission control in p2p-based media delivery network based on pomdp. International Journal of Innovative Computing Information and Control 7(7B):4411–4422
Zhou Y, Chiu DM, Lui JC (2011) A simple model for chunk-scheduling strategies in p2p streaming. IEEE/ACM Trans Networking 19(1):42–54
Zhou Y, Fu TZ, Chiu DM (2015) A unifying model and analysis of p2p vod replication and scheduling. IEEE/ACM Trans Networking (TON) 23(4):1163–1175
Zink M, Suh K, Gu Y, Kurose J (2009) Characteristics of youtube network traffic at a campus network–measurements, models, and implications. Comput Netw 53 (4):501–514
Acknowledgments
This work is supported in part by the National Natural Science Foundation of China under grant Nos. 61174124, 61233003, 61503358, 61673361.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is supported in part by the National Natural Science Foundation of China under grant Nos. 61174124, 61233003,61304048 in part by Research Fund for the Doctoral Program of Higher Education of China under grant No. 20123402110029 and in part by Natural Science Research Program of the Anhui High Education Bureau of China under grant No. KJ2012A286.
Rights and permissions
About this article
Cite this article
Yin, B., Cao, J., Kang, Y. et al. A novel POMDP-based server RAM caching algorithm for VoD systems. Multimed Tools Appl 77, 13023–13045 (2018). https://doi.org/10.1007/s11042-017-4930-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-017-4930-4