Abstract
High Utility Item-set Mining (HUIM) is the futuristic remodel version of Frequent Item-set Mining (FIM). It discovers customer purchase trends in the retail market. This knowledge is useful to retailers to incorporate various innovative schemes in their businesses to attract the customers such as discounts, cross-marketing, seasonal sale offers…etc. Even though many HUIM algorithms are available to detect profitable patterns, most of them cannot apply to all kinds of retail market data sets due to certain assumptions. The first assumption is that the items always produce a positive profit. Even though purchased items’ overall profit could be positive, few items may have negative profit. Another assumption is they are built for static transactional data. The data is gathered up to the point of time and is used for analysis. It is helpful to make decisions at some intervals like quarterly, half-yearly, yearly. But, to take decisions at any time by analyzing the present sales trend, it is required to process the data stream. This paper presents an innovative idea named Extended Global Utility Item-sets Tree(EGUI-tree) to extract High utility item-sets in the retail market data stream with positive and negative profit items. The sliding window-based technique is applied to the data stream to pick up the very recent data to process. An experimental study on real-world datasets shows that the proposed EGUI-tree algorithm is faster and scalable.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Agrawal R, Imielinski T, Swami A (1993) Database mining: a performance perspective. IEEE Trans Knowl Data Eng 5(6):914–925
Bansal R, Dawar S, Goyal V (2015) An efficient algorithm for mining high-utility itemsets with discount notion. Springer, Berlin, pp 84–98
Borah A, Nath B (2019) Rare pattern mining: challenges and future perspectives. Complex Intell Syst 5:1–23
Fournier V, Philippe L, Chun-Wei R, Uday K, Yun S, Thomas R (2017) A survey of sequential pattern mining. Data Sci Pattern Recognit 1:54–77
Fournier-Viger P, Chun-Wei Lin J, Truong-Chi T, Nkambou R (2019) A survey of high utility itemset mining. In: Fournier-Viger P, Lin JW, Nkambou R, Vo B, Tseng V (eds) High-utility pattern mining. Studies in big data. Springer, Berlin
Gan W, Lin JCW, Fournier-Viger P, Chao HC, Tseng VS, Yu PS (2021) A survey of utility-oriented pattern mining. IEEE Trans Knowl Data Eng 33(4):1306–1327. https://doi.org/10.1109/TKDE.2019.2942594
Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. SIGMOD Rec 29(2):1–12
Krishnamoorthy S (2015) Pruning strategies for mining high utility itemsets. Expert Syst Appl 42:2371–2381
Lee V, Jin R, Agrawal G (2014) Frequent pattern mining in data streams. In: Aggarwal C, Han J (eds) Frequent pattern mining. Springer, Cham
Li H, Huang H, Lee S (2011) Fast and memory efficient mining of high-utilityitemsets from data streams: with and without negative item profits. Knowl Inf Syst 28:495–522
Lin JC-W, Fournier-Viger P, Gan W (2016a) FHN: an efficient algorithm for mining high-utility itemsets with negative unit profits. Knowl Based Syst 30:109–126
Lin C-W, Gan W, Viger F, Philippe H, Tzung-Pei H, Tsengs V (2016b) Fast algorithms for mining high-utilityitemsets with various discount strategies. Adv Eng Inform 30:109–126
Rakesh A, Ramakrishnan S (1994) Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th International Conference on Very Large Data Bases (VLDB ’94), pp 487–499
Singh K, Shakya HK, Singh A (2018) Mining of high-utility item sets with negative utility. Expert Syst 35(8):e12296
Singh K, Singh SS, Kumar A, Biswas B (2019) High utility itemsets mining with negative utility value: a survey. J Intell Fuzzy Syst 35(6):6551–6562
Truong-Chi T, Fournier-Viger P (2019) A survey of high utility sequential pattern mining. In: Fournier-Viger P, Lin JW, Nkambou R, Vo B, Tseng V (eds) High-utility pattern mining. Studies in big data. Springer, Berlin
Tseng V, Wu C-W, Viger F, Philippe Y (2015) Efficient algorithms for mining top-K High utility item sets. IEEE Trans Knowl Data Eng 28:1–1
Yun U, Lee G, Yoon E (2017) Efficient high utility pattern mining for establishing manufacturing plans with sliding window control. IEEE Trans Industr Electron 64(9):7239–7249
Zhang C, Almpanidis G, Wang W, Liu C (2018) An Empirical Evaluation of High Utility Itemset Mining Algorithms. Expert Syst Appl 101:91–115
Zhang C, Han M, Sun R, Du S, Shen M (2020) A Survey of key technologies for high utility patterns mining. IEEE Access 8:55798–55814
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Amaranatha Reddy, P., Hazarath Murali Krishna Prasad, M. High Utility Item-set Mining from retail market data stream with various discount strategies using EGUI-tree. J Ambient Intell Human Comput 14, 871–882 (2023). https://doi.org/10.1007/s12652-021-03341-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-021-03341-3