Hierarchical Ensemble Reduction and Learning for Resource-constrained Computing

Published: 04 December 2019

Abstract

Generic tree ensembles (such as Random Forest, RF) rely on a substantial number of individual models to attain desirable performance. The cost of maintaining a large ensemble can become prohibitive in applications where computing resources are stringent. In this work, a hierarchical ensemble reduction and learning framework is proposed. Experiments show that our method consistently outperforms RF in terms of both accuracy and retained ensemble size; in other words, ensemble reduction is achieved with an accuracy gain rather than a degradation. The method executes efficiently, achieving up to a >590× runtime reduction over a recent ensemble-reduction approach. We also develop Boolean logic encoding techniques that directly tackle multiclass problems. Moreover, our framework bridges the gap between software-based ensemble methods and hardware computing in the IoT era: we develop a novel conversion paradigm that supports the automatic deployment of >500 trees on a chip. Compared with RF, the proposed method reduces power consumption by >21.5% and overall area utilization by >62%. The hierarchical approach provides rich opportunities to balance computation (training and response time), hardware resources (memory and energy), and accuracy.
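The core idea of ensemble reduction — retaining a small sub-ensemble whose voting accuracy matches or exceeds that of the full ensemble — can be illustrated with a toy greedy forward-selection sketch. Everything below (the decision-stump "trees", the synthetic data, and the selection loop) is an illustrative assumption for exposition only, not the paper's hierarchical algorithm or its Boolean encoding:

```python
import random

random.seed(0)

def make_stump(dim):
    # A decision stump: vote 1 if one randomly chosen feature exceeds a threshold.
    f, t = random.randrange(dim), random.random()
    return lambda x, f=f, t=t: 1 if x[f] > t else 0

def majority(models, x):
    # Majority vote of the sub-ensemble (ties break toward class 1).
    return 1 if 2 * sum(m(x) for m in models) >= len(models) else 0

def accuracy(models, data):
    return sum(majority(models, x) == y for x, y in data) / len(data)

# Synthetic labeled data: class 1 iff the feature mean exceeds 0.5.
points = [[random.random() for _ in range(4)] for _ in range(200)]
data = [(x, 1 if sum(x) / 4 > 0.5 else 0) for x in points]

full = [make_stump(4) for _ in range(50)]  # the "large ensemble"

# Greedy forward selection: repeatedly add the stump that helps the
# validation accuracy most; stop once accuracy no longer improves.
selected, best = [], 0.0
while len(selected) < 10:
    cand = max((m for m in full if m not in selected),
               key=lambda m: accuracy(selected + [m], data))
    score = accuracy(selected + [cand], data)
    if score <= best:
        break
    selected, best = selected + [cand], score

print(f"kept {len(selected)}/{len(full)} models, "
      f"accuracy {best:.3f} vs full ensemble {accuracy(full, data):.3f}")
```

The sketch shows why reduction can improve accuracy: weak or redundant voters that drag the majority vote down are simply never selected, so the retained sub-ensemble can beat the full one while being an order of magnitude smaller.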


Cited By

  • (2024) Electronic Computer-Aided Design for Low-Level Modeling of Networks-on-Chip. IEEE Access, 12, 48750--48763. DOI: 10.1109/ACCESS.2024.3382710
  • (2023) SWEP-RF. Journal of King Saud University - Computer and Information Sciences, 35(8). DOI: 10.1016/j.jksuci.2023.101672. Online publication date: 1-Sep-2023


Published In

ACM Transactions on Design Automation of Electronic Systems, Volume 25, Issue 1
January 2020
299 pages
ISSN:1084-4309
EISSN:1557-7309
DOI:10.1145/3370083
Editor: Naehyuck Chang
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 December 2019
Accepted: 01 September 2019
Revised: 01 August 2019
Received: 01 May 2019
Published in TODAES Volume 25, Issue 1


Author Tags

  1. Boolean logic
  2. ensemble reduction
  3. edge computing
  4. hardware and energy efficiency
  5. hardware implementation
  6. hierarchical learning
  7. logic minimization
  8. machine learning

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • National Natural Science Foundation of China (NSFC)
