AutoKeras: An AutoML Library for Deep Learning
Haifeng Jin, François Chollet, Qingquan Song, and Xia Hu
Abstract
To use deep learning, one needs to be familiar with various software tools like TensorFlow or
Keras, as well as various model architecture and optimization best practices. Despite recent
progress in software usability, deep learning remains a highly specialized occupation. To
enable people with limited machine learning and programming experience to adopt deep
learning, we developed AutoKeras, an Automated Machine Learning (AutoML) library
that automates the process of model selection and hyperparameter tuning. AutoKeras
encapsulates the complex process of building and training deep neural networks into a
very simple and accessible interface, which enables novice users to solve standard machine
learning problems with a few lines of code. Designed with practical applications in mind,
AutoKeras is built on top of Keras and TensorFlow, and all AutoKeras-created models can
be easily exported and deployed with the help of the TensorFlow ecosystem tooling.
Keywords: AutoML, Machine Learning, Deep Learning, Python
1. Introduction
Deep learning has been widely adopted for its success in many real-world applications like
computer vision (He et al., 2016) and natural language processing (Devlin et al., 2019). To
adopt deep learning, people often need to go through a non-trivial learning curve (Bargava,
2018; Song et al., 2022). A strong foundation of machine learning theory and being proficient
in deep learning libraries like TensorFlow (Abadi et al., 2016) or Keras (Chollet et al., 2015)
are both prerequisites for building a deep learning solution (Yao et al., 2018).
To remove the barriers to adopting deep learning, we developed AutoKeras, an AutoML
library for deep learning. It automates the process of model selection and hyperparameter
tuning and encapsulates the end-to-end process from raw datasets to trained machine
learning models into an extremely simple and flexible interface. Novice users can
implement deep learning models with a few lines of code, while advanced users can also easily
customize different parts of the model to their needs. AutoKeras specializes in raw data
types like images and text, in addition to the structured data already supported by existing
AutoML libraries (Thornton et al., 2013; Feurer et al., 2015; Olson et al., 2016; Kotthoff
et al., 2017; Feurer et al., 2020; Erickson et al., 2020; Zimmer et al., 2021). It is also flexible
enough to cover multi-modal data and multi-task use cases. AutoKeras is built on top of
KerasTuner (O’Malley et al., 2019), Keras (Chollet et al., 2015), and TensorFlow (Abadi
et al., 2016). The models created by AutoKeras can be easily exported as Keras models,
which can be deployed in various production environments with the help of the TensorFlow
ecosystem.
2. API Design
The API design of AutoKeras follows the style of Keras, which is well-received by the deep
learning community. It has three levels of APIs: the task API, the IO API, and the functional
API, ranging from the simplest to the most configurable. The code for using these APIs is
shown in Figure 1, along with diagrams of the corresponding neural network models. The
parts with question marks are tuned automatically.
The task API requires the least configuration from the user. As shown in
Figure 1, lines 3 to 5, an image classification task is implemented in three lines of code.
The task API supports six tasks: classification and regression for image, text, and
structured data.
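Since Figure 1 is not reproduced here, the following is a minimal sketch of the task API, using the public AutoKeras entry point ak.ImageClassifier; the MNIST dataset stands in for the user's own data.

import autokeras as ak
from tensorflow.keras.datasets import mnist

# Load a standard image classification dataset.
(x_train, y_train), (x_test, y_test) = mnist.load_data()

# The task API hides model selection and hyperparameter tuning
# behind a single class.
clf = ak.ImageClassifier(max_trials=10)  # evaluate up to 10 configurations
clf.fit(x_train, y_train)                # search, then fit the best model
predictions = clf.predict(x_test)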
The IO API (input/output API) supports multi-modal data and multi-task use cases.
In Figure 1 from line 7 to 10, the dataset is a set of images with attributes, for example, an
image of a house with attributes describing the total area and location of the house. Each
data sample is associated with two prediction targets: a label for classification and a real
value for regression. The user specifies the input and output formats of the model,
as shown on lines 8 and 9. The training data are passed as lists in the same order on line 10.
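A sketch of this multi-modal, multi-task setup with the public IO API classes follows; the variable names image_data, house_attributes, classification_labels, and regression_targets are placeholders for the user's arrays.

import autokeras as ak

# Declare one image input and one structured-data input, plus a
# classification head and a regression head; order matters.
model = ak.AutoModel(
    inputs=[ak.ImageInput(), ak.StructuredDataInput()],
    outputs=[ak.ClassificationHead(), ak.RegressionHead()],
)
# Training data and targets are passed as lists in the declared order.
model.fit(
    [image_data, house_attributes],
    [classification_labels, regression_targets],
)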
The functional API enables advanced users to tailor the search space to their
needs. It resembles the Keras functional API, letting the user build the computational
graph of the deep learning model from building blocks. The example from line 12 to
line 19 connects both preprocessing steps and neural network blocks, which apply data
normalization and data augmentation to the data before passing it to a neural network
with ResNet (He et al., 2016) and Xception (Chollet, 2017) blocks. Notably, on line 15, the
version of the ResNet is specified as v2, which further reduces the size of the search space.
There are many such configurable hyperparameters for other blocks as well. They are tuned
automatically if left unspecified. Moreover, users can also create custom neural network
blocks to use with the functional API.
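A sketch of the functional API along the lines of the example described above; the block names follow the public AutoKeras interface (ak.Normalization, ak.ImageAugmentation, ak.ResNetBlock, ak.XceptionBlock, ak.Merge), and x_train/y_train are placeholders.

import autokeras as ak

input_node = ak.ImageInput()
x = ak.Normalization()(input_node)        # preprocessing: data normalization
x = ak.ImageAugmentation()(x)             # preprocessing: data augmentation
# Two parallel branches; unspecified options are tuned automatically.
resnet = ak.ResNetBlock(version="v2")(x)  # restrict the search to ResNet v2
xception = ak.XceptionBlock()(x)
x = ak.Merge()([resnet, xception])
output_node = ak.ClassificationHead()(x)

auto_model = ak.AutoModel(inputs=input_node, outputs=output_node)
auto_model.fit(x_train, y_train)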
Compared with other AutoML libraries, such as AutoGluon (Erickson et al., 2020),
which includes tree-based models in its search space, and Auto-PyTorch (Zimmer et al., 2021),
which focuses on structured data tasks, AutoKeras is optimized for raw data types and covers
deep neural network models only, which makes it fully compatible with the TensorFlow
and Keras ecosystem. The AutoKeras fit function accepts all the arguments of the Keras
fit function. The model found by AutoKeras can be easily exported as a Keras
model. With the help of the TensorFlow ecosystem, it is ready for deployment in various
production environments.
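Continuing the functional API sketch above, the export step can be as simple as the following; export_model is the public AutoKeras method, and the file name and x_test are placeholders.

# Retrieve the best pipeline found during the search as a plain Keras model.
best_model = auto_model.export_model()
best_model.save("best_autokeras_model")  # standard Keras/TensorFlow saving
predictions = best_model.predict(x_test)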
3. System Architecture
AutoKeras uses Keras and TensorFlow to build machine learning models. KerasTuner, a
hyperparameter tuning framework for Keras, provides the infrastructure for implementing
the search space and the search algorithm. Built on top of KerasTuner, AutoKeras
implements a series of carefully designed search spaces, task-specific search algorithms, and
easy-to-use APIs.
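The paper does not show the search space implementation itself; as a rough illustration of the infrastructure KerasTuner provides, the sketch below defines a toy search space with KerasTuner's public hyperparameter API (hp.Int, hp.Choice, hp.Boolean). The actual AutoKeras search spaces are far larger but rest on the same primitives.

import keras_tuner as kt
from tensorflow import keras

def build_model(hp):
    # A toy search space; AutoKeras defines architecture- and
    # preprocessing-level spaces on top of the same primitives.
    model = keras.Sequential([keras.layers.Flatten()])
    for i in range(hp.Int("num_layers", 1, 3)):
        model.add(keras.layers.Dense(hp.Choice(f"units_{i}", [64, 128, 256]),
                                     activation="relu"))
    if hp.Boolean("dropout"):
        model.add(keras.layers.Dropout(0.5))
    model.add(keras.layers.Dense(10, activation="softmax"))
    model.compile(
        optimizer=keras.optimizers.Adam(
            hp.Choice("learning_rate", [1e-2, 1e-3, 1e-4])),
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"])
    return model

tuner = kt.RandomSearch(build_model, objective="val_accuracy", max_trials=10)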
The core AutoKeras workflow consists of the following steps. First, AutoKeras analyzes
the training data to determine, for example, whether a given tabular data feature is categorical
or numerical, whether the image data includes a channel dimension, or whether the classification
labels need to be encoded. Second, it uses this information to construct a suitable search
space that encompasses both neural architecture patterns and common hyperparameters.
Finally, the search algorithm finds high-performing hyperparameter values.
The search space of AutoKeras includes state-of-the-art deep learning models for the
supported tasks. For models like EfficientNet (Tan and Le, 2019) and BERT (Devlin et al.,
2019), pretrained weights can be leveraged. Besides optimizing the model architecture,
AutoKeras also tunes the hyperparameters of the preprocessing steps and the training process,
for example, image data augmentation, text vectorization, categorical feature encoding,
optimizer, learning rate, and weight decay.
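For instance, pretrained weights can be requested through block-level hyperparameters in the functional API. The sketch below assumes the pretrained argument of ak.EfficientNetBlock available in recent AutoKeras releases; as before, unspecified options are tuned automatically.

import autokeras as ak

input_node = ak.ImageInput()
x = ak.ImageAugmentation()(input_node)        # augmentation policy is tuned
x = ak.EfficientNetBlock(pretrained=True)(x)  # start from pretrained weights
output_node = ak.ClassificationHead()(x)
auto_model = ak.AutoModel(inputs=input_node, outputs=output_node,
                          max_trials=5)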
4. Search Algorithm
Instead of treating hyperparameter tuning as a black-box optimization problem, AutoKeras
implements a novel search algorithm that leverages prior knowledge of the search space.
The main idea is to warm-start the search with good configurations (a configuration is a
complete set of hyperparameter values that builds and trains a model) and to keep exploiting
the neighborhood of good configurations.
Under the task API of AutoKeras, the search space is predefined. Instead of starting
from random configurations, the search starts by evaluating a list of predefined configurations
that are generally known to perform well. Then, the search algorithm always mutates the
current best configuration to create the next configuration to evaluate. This design is
inspired by the hill-climbing algorithm (Elsken et al., 2018). The pseudo-code of the search
algorithm is shown in Algorithm 1.
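Algorithm 1 is not reproduced here; the following sketch conveys its structure as described above. The functions evaluate (build, train, and validate a configuration) and mutate (detailed below) are placeholders.

def search(predefined_configs, mutate, evaluate, max_trials):
    # Warm-started greedy search: a sketch of the strategy described above.
    best_config, best_score = None, float("-inf")
    for trial in range(max_trials):
        if trial < len(predefined_configs):
            config = predefined_configs[trial]  # warm-start phase
        else:
            config = mutate(best_config)        # perturb the current best
        score = evaluate(config)                # build, train, and validate
        if score > best_score:
            best_config, best_score = config, score
    return best_config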
In the mutation process, prior knowledge of the search space is used again. The
hyperparameters are hierarchically grouped into sub-modules according to their locations in
the model. A sub-module can be a single hyperparameter, a layer, or the entire model. To
make the mutated configuration similar to the current best, in every mutation, only one of
the sub-modules is selected and all of its hyperparameter values are resampled. To make
sub-modules with more hyperparameters less likely to be selected, we assign probabilities
for the sub-modules to be selected as follows. A raw probability vector $\hat{p}$ is defined as
$$\hat{p} = \left( \frac{1}{n_1 + 1}, \frac{1}{n_2 + 1}, \ldots, \frac{1}{n_K + 1} \right) \in \mathbb{R}^K, \quad (1)$$
where $n_i$ is the number of hyperparameters in the $i$-th sub-module, $K$ is the total number
of sub-modules, and the $+1$ offset smooths the small values. To normalize $\hat{p}$ to sum to one,
the $\operatorname{logit}(\cdot)$ function and the softmax function $\sigma(\cdot)$ are applied:
$$p = \sigma(\operatorname{logit}(\hat{p})) = \sigma(-\ln \mathbf{n}) \quad \text{for } \mathbf{n} = (n_1, n_2, \ldots, n_K) \in \mathbb{R}^K, \quad \operatorname{logit}(x) = \ln \frac{x}{1 - x}. \quad (2)$$
The normalized vector $p$ contains the final probabilities for the sub-modules to be selected.
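A quick numeric check of Equations (1) and (2): since $\operatorname{logit}(1/(n_i+1)) = -\ln n_i$, the final probability of each sub-module is proportional to $1/n_i$. A minimal sketch:

import numpy as np

def submodule_probabilities(n):
    # Equations (1)-(2): p = softmax(-ln n) for hyperparameter counts n.
    n = np.asarray(n, dtype=float)
    logits = -np.log(n)                  # logit(1 / (n_i + 1)) = -ln n_i
    exp = np.exp(logits - logits.max())  # numerically stable softmax
    return exp / exp.sum()

# Three sub-modules with 1, 3, and 8 hyperparameters.
p = submodule_probabilities([1, 3, 8])
print(p)  # ~[0.686, 0.229, 0.086]: larger sub-modules are picked less often
chosen = np.random.choice(len(p), p=p)  # sub-module to resample next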
The experimental results are published on the AutoKeras official website (autokeras.com).
Acknowledgments
We thank the reviewers for their helpful comments, and we thank all the contributors from
our open-source community for their work. This work is, in part, supported by DARPA
(#FA8750-17-2-0116) and NSF (#IIS-1718840 and #IIS-1750074).
References
Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean,
Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, et al. TensorFlow:
A system for large-scale machine learning. In OSDI, 2016.
Bargava. How to learn deep learning in 6 months. Towards Data Science, 2018.
François Chollet. Xception: Deep learning with depthwise separable convolutions. In CVPR,
2017.
François Chollet et al. Keras. https://keras.io, 2015.
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: Pre-training
of deep bidirectional transformers for language understanding. In NAACL, 2019.
Thomas Elsken, Jan-Hendrik Metzen, and Frank Hutter. Simple and efficient architecture
search for convolutional neural networks. In ICLR Workshop Track, 2018.
Nick Erickson, Jonas Mueller, Alexander Shirkov, Hang Zhang, Pedro Larroy, Mu Li, and
Alexander Smola. AutoGluon-Tabular: Robust and accurate AutoML for structured data.
arXiv:2003.06505 [stat.ML], 2020.
Matthias Feurer, Aaron Klein, Katharina Eggensperger, Jost Springenberg, Manuel Blum,
and Frank Hutter. Efficient and robust automated machine learning. In NeurIPS, 2015.
Matthias Feurer, Katharina Eggensperger, Stefan Falkner, Marius Lindauer, and Frank
Hutter. Auto-sklearn 2.0: Hands-free AutoML via meta-learning. arXiv:2007.04074
[cs.LG], 2020.
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image
recognition. In CVPR, 2016.
Lars Kotthoff, Chris Thornton, Holger Hoos, Frank Hutter, and Kevin Leyton-Brown. Auto-
WEKA 2.0: Automatic model selection and hyperparameter optimization in WEKA.
JMLR, 2017.
Marius Lindauer and Frank Hutter. Best practices for scientific research on neural archi-
tecture search. JMLR, 2020.
Randal S. Olson, Nathan Bartley, Ryan J. Urbanowicz, and Jason H. Moore. Evaluation of
a tree-based pipeline optimization tool for automating data science. In GECCO, 2016.
Tom O’Malley, Elie Bursztein, James Long, François Chollet, Haifeng Jin, Luca Invernizzi,
et al. KerasTuner. https://github.com/keras-team/keras-tuner, 2019.
Qingquan Song, Haifeng Jin, and Xia Hu. Automated Machine Learning in Action. Manning
Publications, 2022.
Mingxing Tan and Quoc V. Le. EfficientNet: Rethinking model scaling for convolutional
neural networks. In ICML, 2019.
Chris Thornton, Frank Hutter, Holger H Hoos, and Kevin Leyton-Brown. Auto-WEKA:
Combined selection and hyperparameter optimization of classification algorithms. In
KDD, 2013.
Quanming Yao, Mengshuo Wang, Yuqiang Chen, Wenyuan Dai, Hu Yi-Qi, Li Yu-Feng,
Tu Wei-Wei, Yang Qiang, and Yu Yang. Taking human out of learning applications: A
survey on automated machine learning. arXiv:1810.13306 [cs.AI], 2018.
Lucas Zimmer, Marius Lindauer, and Frank Hutter. Auto-PyTorch: Multi-fidelity
meta-learning for efficient and robust AutoDL. TPAMI, 2021.