Abstract
The multi-label classification problem involves finding a multi-valued decision function that predicts an instance to a vector of binary classes. Two methods are widely used to build multi-label classifiers: the binary relevance method and the chain classifier. Both can induce a polynomial multi-valued decision function by using Bayesian network-augmented naive Bayes classifiers as base models. In this paper, we propose a feature weighting approach to improve the classification accuracy of the decision function. This method, called probability feature weighting, estimates the conditional probability of the positive class through deep computation of the frequency ratio of features from the training data. Moreover, we identify irrelevant variables in terms of probability to simplify the decision function. Experiments showed that the decision function with a probability feature weighting rarely degrades the quality of the model and drastically improves it in many cases.






Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Bielza C, Li G, Larranga P (2011) Multi-dimensional classification with Bayesian network. Int J Approx Reason 52:705–727
Zhang ML, Zhou ZH (2014) A review on multi-label learning algorithms. IEEE Trans Knowl Data Eng 26(8):1819–1837
Agrawal S, Agrawal J, Kaur S, Sharma S (2016) A comparative study of fuzzy PSO and fuzzy SVD-based RBF neural network for multi-label classification. Neural Comput Appl. https://doi.org/10.1007/s00521-016-2446-x
Vens C, Struyf J, Schietgat L (2008) Decision trees for hierarchical multi-label classification. Mach Lean 73:185–214. https://doi.org/10.1007/s10994-008-5077-3
Blockeel H, Schietgat L, Struyf J, Dzeroki S et al (2006) Decision tree for hierarchical multilabel classification: a case study in functional genomics. Springer, Berlin, pp 18–29
Boutell MR, Luo J, Shen X, Brown CM (2004) Learning multi-label scene classification. Pattern Recognit 37:1757–1771
Godbole S, Sarawagi S (2004) Discriminative methods for multi-labeled classification. Springer, Berlin, pp 22–30
Hüllermeier E, Fürnkranz J, Cheng W, Brinker K (2008) Label ranking by learning pairwise preferences. Artif Intell 172:1897–1916
Tsoumakas G, Vlahavas I (2007) Random \(k\)-labelsets: an ensemble method for multilabel classification. Machine learning ECML 2007. Lecture notes in computer science, vol 4701. Springer, Berlin, Heidelberg
Schapire RE, Singer Y (2000) Boos Texter: a boosting-based system for text categorization. Mach Learn 39:135–168. https://doi.org/10.1023/A:1007649029923
Zhang M-L, Zhou Z-H (2007) ML-KNN: a lazy learning approach to multi-label learning. Pattern Recognit 40(7):2038–2048
Sucar LE, Bielza C, Morales EF et al (2014) Multi-label classification with Bayesian network-based chain classifiers. Pattern Recognit Lett 41:14–22
Read J, Pfahringer B, Holmes G et al (2011) Classifier chains for multi-label classification. Mach Learn 85:333–359. https://doi.org/10.1007/s10994-011-5256-5
Varando G, Bielza C, Larrañaga P (2014) Expressive power of binary relevance and chain classifiers based on Bayesian networks for multi-label classification. Springer, Berlin, pp 519–534
Varando G, Bielza C, Larraãnaga P (2016) Decision function for chain classifiers based on Bayesian network for multi-label classification. Int J Approx Reason 68:164–178
Jiang L, Li C, Wang S et al (2016) Deep feature weighting for naive Bayes and its application to text classification. Eng Appl Artif Intell 52:26–39
Friedman N, Geiger D, Goldszmidt M (1997) Bayesian network classifiers. Mach Learn 29(2–3):131–163
Ouali A, Cherif AR, Krebs M-O (2006) Data mining based Bayesian networks for best classification. Comput Stat Data Anal 51(2):1278–1292
Varando G, Bielza C, Larrañaga P (2015) Decision boundary for disctete Bayesian network classifiers. J Mach Learn Res 16:2725–2749
O’Donnell R, Servedio RA (2010) New degree bounds for polynomial threshold functions. Combinatorica 30(3):327–358. https://doi.org/10.1007/s00493-010-2173-3
Tan J, Zhang Z, Zhen L et al (2012) Adaptive feature selection via a new version of support vector machine. Neural Comput Appl. https://doi.org/10.1007/s00521-012-1018-y
Hall MA (2000) Correlation-based feature selection for discrete and numeric class. Machine learning. In: Proceedings of the seventeenth international conference on machine learning, vol 1. Morgan Kaufmann Publishers Inc, pp 359–366
Wang S, Jiang L, Li C (2014) A CFS-based feature weighting approach to native Bayes text classifiers. Springer, Berlin, pp 555–562
Li Z, Lu W, Sun Z, Xing W (2016) A parallel feature selection method study for text classification. Neural Comput Appl. https://doi.org/10.1007/s00521-016-2351-3
Hall M (2007) A decision tree-based attribute weighting filter for native Bayes. Knowl Based Syst 20:120–126
Jiang L, Cai Z, Wang D et al (2012) Improving Tree augmented Naive Bayes for class probability estimation. Knowl Based Syst 26:239–245
Tsai C-J, Lee C-L, Yang W-P (2008) A discretization algorithm based on class-attribute contingency coefficient. Inf Sci 178:714–731
Tsoumakas G, Katakis L (2007) Multi-label classification: an overview. Int J Data Wareh Min 3(3):1–13
Read J, Bielza C, Larrañaga P (2014) Multi-dimensional classification with super-classes. IEEE Trans Knowl Data Eng 26(7):1720–1733
de Waal PR, van der Gaag LC (2007) Inference and Learning in multi-dimensional Bayesian network classifiers. Springer, Berlin, pp 501–511
Acknowledgements
The authors thank the editor and the anonymous reviewers for helpful comments and suggestions. This work was supported by the National Natural Science Foundation of China (Grant No. 61573266).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
We wish to confirm that there are no known conflicts of interest associated with this publication. We also confirm that the manuscript has been read and approved by all named authors and that there are no other persons who satisfied the criteria for authorship but are not listed.
Rights and permissions
About this article
Cite this article
Yang, Y., Ding, M. Decision function with probability feature weighting based on Bayesian network for multi-label classification. Neural Comput & Applic 31, 4819–4828 (2019). https://doi.org/10.1007/s00521-017-3323-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-017-3323-y