Abstract
Real-time and accurate prediction of traffic flow plays an important role in intelligent transportation systems. However, short-term traffic flow forecasting is extremely challenging due to the highly nonlinear nature of the traffic system and the dynamic spatial and temporal correlation. Although various methods, including deep learning based ones, have been proposed, most of them still suffer from problems such as spatial nonstationarity and thus cannot achieve good prediction performance. Inspired by the recent superior performance of attention mechanism, we introduce it into the model for traffic flow prediction with regular grided input. To be specific, we propose a novel deep learning framework, Spatial-Temporal Attention Based Convolutional Networks (STAtt-Net), for accurate forecasting of citywide traffic flow. First, we model the traffic data as a two-dimensional matrix with two channels. Each cell in the matrix represents the traffic in the corresponding region. Taking into account the temporal correlation and dependence of traffic system, the periodic patterns contained in traffic data are modeled by three major components for weekly trend, daily periodicity, and hourly closeness respectively. Then, STAtt-Net employs a STBlock as the basis unit to learn temporal dependence and spatial dependence of traffic flow, taking advantage of attention mechanism. We conduct extensive experiments to evaluate the performance of our model on three real-world datasets (TaxiBJ, BikeNYC, TaxiSZ), with the results revealing better prediction accuracy and efficiency of the proposed model against existing ones.



Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Atrish A, Singh N, Kumar K, Kumar V (2017) An automated hierarchical framework for player recognition in sports image. In: Proceedings of the international conference on video and image processing, pp 103–108
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Chang H, Lee Y, Yoon B, Baek S (2012) Dynamic near-term traffic flow prediction: system-oriented approach based on past experiences. IET Intell Transp Syst 6(3):292–305
Cheng S, Lu F, Peng P (2020) Short-term traffic forecasting by mining the non-stationarity of spatiotemporal patterns. IEEE Trans Intell Transp Syst 22(10):6365–6383
Choi H, Cho K, Bengio Y (2018) Fine-grained attention mechanism for neural machine translation. Neurocomputing 284:171–176
Deng D, Shahabi C, Demiryurek U, Zhu L, Yu R, Liu Y (2016) Latent space model for road networks to predict time-varying traffic. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1525–1534
Ding Q Y, Wang X F, Zhang X Y, Sun Z Q (2011) Forecasting traffic volume with space-time arima model. In: Advanced materials research, vol 156. Trans Tech Publications, pp 979–983
Fu R, Zhang Z, Li L (2016) Using lstm and gru neural network methods for traffic flow prediction. In: 2016 31st Youth academic annual conference of Chinese association of automation (YAC). IEEE, pp 324–328
Guo S, Lin Y, Li S, Chen Z, Wan H (2019) Deep spatial–temporal 3d convolutional neural networks for traffic data forecasting. IEEE Trans Intell Transp Syst 20(10):3913–3926
Harvey A C (1990) Forecasting structural time series models and the kalman filter
He Z, Chow C -Y, Zhang J -D (2019) Stcnn: a spatio-temporal convolutional neural network for long-term traffic prediction. In: 2019 20th IEEE international conference on mobile data management (MDM). IEEE, pp 226–233
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
Huang W, Song G, Hong H, Xie K (2014) Deep architecture for traffic flow prediction: deep belief networks with multitask learning. IEEE Trans Intell Transp Syst 15(5):2191–2201
Hyndman R J, Koehler A B (2006) Another look at measures of forecast accuracy. Int J Forecast 22(4):679–688
Ke R, Li W, Cui Z, Wang Y (2020) Two-stream multi-channel convolutional neural network for multi-lane traffic speed prediction considering traffic volume impact. Transp Res Rec 2674(4):459–470
Kumar K (2021) Text query based summarized event searching interface system using deep learning over cloud. Multimed Tools Appl 80(7):11079–11094
Kumar K, Kumar A, Bahuguna A (2017) D-cad: deep and crowded anomaly detection. In: Proceedings of the 7th international conference on computer and communication technology, pp 100–105
Kumar K, Shrimankar D D, Singh N (2018) Somes: an efficient som technique for event summarization in multi-view surveillance videos 383–389
Kumar A, Purohit K, Kumar K (2019) Stock price prediction using recurrent neural network and long short-term memory. In: International conference on deep learning, artificial intelligence and robotics. Springer, pp 153–160
Lim B, Arık SÖ, Loeff N, Pfister T (2021) Temporal fusion transformers for interpretable multi-horizon time series forecasting. Int J Forecast 37 (4):1748–1764
Liu K, Gao S, Qiu P, Liu X, Yan B, Lu F (2017) Road2vec: measuring traffic interactions in urban road system from massive travel routes. ISPRS Int J Geo-Inf 6(11):321
Lu F, Liu K, Duan Y, Cheng S, Du F (2018) Modeling the heterogeneous traffic correlations in urban road systems using traffic-enhanced community detection approach. Physica A: Stat Mech Appl 501:227–237
Luo Q, Zhou Y (2021) Spatial-temporal structures of deep learning models for traffic flow forecasting: a survey. In: 2021 4th International conference on intelligent autonomous systems (ICoIAS). IEEE, pp 187–193
Luong M -T, Pham H, Manning C D (2015) Effective approaches to attention-based neural machine translation. arXiv:1508.04025
Ma X, Tao Z, Wang Y, Yu H, Wang Y (2015) Long short-term memory neural network for traffic speed prediction using remote microwave sensor data. Transp Res Part C: Emerg Technol 54:187–197
Martínez LM, Viegas JM, Silva EA (2009) A traffic analysis zone definition: a new methodology and algorithm. Transportation 36(5):581–599
Mnih V, Heess N, Graves A (2014) Recurrent models of visual attention. In: Advances in neural information processing systems, pp 2204–2212
Moorthy C, Ratcliffe B (1988) Short term traffic forecasting using time series methods. Transp Plan Technol 12(1):45–56
Negi A, Kumar K (2021) Face mask detection in real-time video stream using deep learning. In: Computational intelligence and healthcare informatics, pp 255–268
Negi A, Kumar K (2022) Chapter 1—ai-based implementation of decisive technology for prevention and fight with covid-19 1–14
Negi A, Kumar K, Chaudhari N S, Singh N, Chauhan P (2021) Predictive analytics for recognizing human activities using residual network and fine-tuning. In: International conference on big data analytics. Springer, pp 296–310
Okutani I, Stephanedes Y J (1984) Dynamic prediction of traffic volume through kalman filtering theory. Transp Res Part B: Methodol 18(1):1–11
Openshaw S (1984) The modifiable areal unit problem. Geo Books, Norwick
Sharma S, Kumar K (2021) Asl-3dcnn: American sign language recognition technique using 3-d convolutional neural networks. Multimed Tools Appl 80(17):26319–26331
Shi X, Chen Z, Wang H, Yeung D -Y, Wong W -K, Woo W-C (2015) Convolutional lstm network: a machine learning approach for precipitation nowcasting. arXiv:1506.04214
Smola A J, Schölkopf B (2004) A tutorial on support vector regression. Stat Comput 14(3):199–222
Solanki A, Bamrara R, Kumar K, Singh N (2020) VEDL: a novel video event searching technique using deep learning. In: Soft computing: theories and applications, pp 905–914
Srinivasu P N, Balas V E (2021) Self-learning network-based segmentation for real-time brain mr images through haris. PeerJ Comput Sci 7:654
Stathopoulos A, Karlaftis M (2001) Temporal and spatial variations of real-time traffic data in urban areas. Transp Res Rec 1768(1):135–140
Tang C, Zhu X, Liu X, Wang L, Zomaya A (2019) Defusionnet: defocus blur detection via recurrently fusing and refining multi-scale deep features. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 2700–2709
Tang C, Liu X, An S, Wang P (2021) Br2net: defocus blur detection via a bidirectional channel attention residual refining network. IEEE Trans Multimed 23:624–635. https://doi.org/10.1109/TMM.2020.2985541
Tobler W R (1970) A computer movie simulating urban growth in the detroit region. Econ Geogr 46(2)
Wang X, Girshick R, Gupta A, He K (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803
Williams B M, Hoel L A (2003) Modeling and forecasting vehicular traffic flow as a seasonal arima process: theoretical basis and empirical results. J Transp Eng 129(6):664–672
Williams B M, Durvasula P K, Brown D E (1998) Urban freeway traffic flow prediction: application of seasonal autoregressive integrated moving average and exponential smoothing models. Transp Res Rec 1644(1):132–141
Xingjian S, Chen Z, Wang H, Yeung D -Y, Wong W -K, Woo W-C (2015) Convolutional lstm network: a machine learning approach for precipitation nowcasting. In: Advances in neural information processing systems, pp 802–810
Yan Q, Wang B, Zhang W, Luo C, Xu W, Xu Z, Zhang Y, Shi Q, Zhang L, You Z (2020) Attention-guided deep neural network with multi-scale feature fusion for liver vessel segmentation. IEEE J Biomed Health Inform 25(7):2629–2642
Yan Q, Wang B, Zhang W, Luo C, Xu W, Xu Z, Zhang Y, Shi Q, Zhang L, You Z (2021) Attention-guided deep neural network with multi-scale feature fusion for liver vessel segmentation. IEEE J Biomed Health Inform 25(7):2629–2642. https://doi.org/10.1109/JBHI.2020.3042069
Yao Z-S, Shao C-F, Gao Y-L (2006) Research on methods of short-term traffic forecasting based on support vector regression [j]. J Beijing Jiaotong Univ 30(3):19–22
Yao H, Tang X, Wei H, Zheng G, Li Z (2019) Revisiting spatial-temporal similarity: a deep learning framework for traffic prediction. In: Proceedings of the AAAI conference on artificial intelligence, vol 33, pp 5668–5675
Yue Y (2006) Spatial-temporal dependency of traffic flow and its implications for short-term traffic forecasting. HKU Theses Online (HKUTO)
Zeng W, Lin C, Lin J, Jiang J, Xia J, Turkay C, Chen W (2020) Revisiting the modifiable areal unit problem in deep traffic prediction with visual analytics. IEEE Trans Vis Comput Graph 27(2):839–848
Zeng W, Lin C, Liu K, Lin J, Tung A K (2021) Modeling spatial nonstationarity via deformable convolutions for deep traffic flow prediction. IEEE Trans Knowl Data Eng
Zeng W, Lin C, Lin J, Jiang J, Xia J, Turkay C, Chen W (2021) Revisiting the modifiable areal unit problem in deep traffic prediction with visual analytics. IEEE Trans Visual Comput Graph 27(2):839–848. https://doi.org/10.1109/TVCG.2020.3030410
Zhao L, Song Y, Zhang C, Liu Y, Wang P, Lin T, Deng M, Li H (2019) T-gcn: a temporal graph convolutional network for traffic prediction. IEEE Trans Intell Transp Syst 21(9):3848–3858
Zhang X-L, He G-G, Lu H-P (2009) Short-term traffic flow forecasting based on k-nearest neighbors non-parametric regression. J Syst Eng 24(2):178–183
Zhang J, Zheng Y, Qi D (2017) Deep spatio-temporal residual networks for citywide crowd flows prediction. In: Proceedings of the AAAI conference on artificial intelligence, vol 31
Zheng C, Fan X, Wang C, Qi J (2020) Gman: a graph multi-attention network for traffic prediction. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 1234–1241
Funding
This work is supported by National Natural Science Foundation of China (62077039) and Research Project (PZ2020016).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
There are no other competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Lin, J., Lin, C. & Ye, Q. Attention based convolutional networks for traffic flow prediction. Multimed Tools Appl 83, 7379–7394 (2024). https://doi.org/10.1007/s11042-023-15395-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15395-w