10 From Zero To ML
Make AI easy,
fast and useful
for enterprises
and developers
Serverless ML on Google Cloud Platform
Products: Machine Learning APIs, Cloud AutoML, BQML, Cloud ML Engine
Audiences: Application Developers; Data Scientists & ML Practitioners
AI building blocks for
all developers
Fully trained ML models from Google Cloud Platform
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning. Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, and Alex Alemi. AAAI, 2017.
Progressive Neural Architecture Search. Chenxi Liu, Barret Zoph, Jonathon Shlens, Wei Hua, Li-Jia Li, Li Fei-Fei, Alan Yuille, Jonathan Huang, Kevin Murphy. Arxiv, 2017.
Neural Architecture Search with Reinforcement Learning. Barret Zoph, Quoc V. Le. ICLR 2017.
Learning Transferable Architectures for Scalable Image Recognition. Barret Zoph, Vijay Vasudevan, Jonathon Shlens, and Quoc V. Le. Arxiv, 2017.
Large-Scale Evolution of Image Classifiers. Esteban Real, Sherry Moore, Andrew Selle, Saurabh Saxena, Yutaka Leon Suematsu, Quoc Le, Alex Kurakin. International Conference on Machine Learning, 2017.
Bayesian Optimization for a Better Dessert. Benjamin Solnik, Daniel Golovin, Greg Kochanski, John Elliot Karr
Neural Architecture Search
Idea: model-generating model trained via reinforcement learning
Neural Architecture Search with Reinforcement Learning, Zoph & Le, ICLR 2017 arxiv.org/abs/1611.01578
Neural Architecture Search to find a model
Iterate 20K times to find the most accurate model
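The idea above (a controller that proposes models and is trained by reinforcement learning on how accurate they turn out) can be illustrated with a toy loop. This is only a sketch under loud assumptions: the search space is a single hypothetical choice of layer width, `toy_reward` returns made-up accuracies instead of training a child model, and the controller is a plain softmax over four options rather than the recurrent controller used in the paper.

```python
import math
import random

# Hypothetical search space: the only decision is a hidden-layer width.
CHOICES = [16, 32, 64, 128]


def toy_reward(width):
    """Stand-in for 'train the child model and measure accuracy' (made-up values)."""
    return {16: 0.20, 32: 0.50, 64: 0.90, 128: 0.60}[width]


def softmax(logits):
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def search(steps=20000, lr=0.05, seed=0):
    """Run a REINFORCE loop and return the controller's final probabilities."""
    rng = random.Random(seed)
    logits = [0.0] * len(CHOICES)  # controller parameters
    baseline = 0.0                 # running mean reward, used to reduce variance
    for step in range(1, steps + 1):
        probs = softmax(logits)
        idx = rng.choices(range(len(CHOICES)), weights=probs)[0]
        reward = toy_reward(CHOICES[idx])
        advantage = reward - baseline
        # REINFORCE: d log pi(idx) / d logit_j = 1[j == idx] - probs[j]
        for j in range(len(logits)):
            grad = (1.0 if j == idx else 0.0) - probs[j]
            logits[j] += lr * advantage * grad
        baseline += (reward - baseline) / step
    return softmax(logits)


if __name__ == "__main__":
    for width, p in zip(CHOICES, search()):
        print(f"width={width:<4d} p={p:.3f}")
```

In a real search each reward costs a full training run, which is why the number of iterations dominates the computational cost.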
AutoML outperforms handcrafted models
Inception-ResNet-v2
computational cost
Learning Transferable Architectures for Scalable Image Recognition, Zoph et al. 2017, https://arxiv.org/abs/1707.07012
Effort: low for solution design; high for end-to-end model development
Simple: SQL
BigQuery ML
specifying model type
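In BigQuery ML the model type is specified in the `OPTIONS` clause of a `CREATE MODEL` statement. A minimal sketch of such a statement, built as a string in Python, is shown here. The training table name and the label column (`spend_2019`) are placeholders invented for illustration, and `linear_reg` is just one possible model type; none of these are taken from the slides.

```python
def create_model_sql(model, source_table, label_col, model_type="linear_reg"):
    """Build a BigQuery ML CREATE MODEL statement as a string."""
    return (
        f"CREATE OR REPLACE MODEL `{model}`\n"
        f"OPTIONS(model_type='{model_type}', input_label_cols=['{label_col}']) AS\n"
        f"SELECT * FROM `{source_table}`"
    )


# The table and label column below are hypothetical placeholders.
print(create_model_sql(
    model="sales.predict_2019_spend",
    source_table="your-project.sales.train_consumer_behavior",
    label_col="spend_2019",
))
```

The resulting text can be pasted into the BigQuery console or submitted through any BigQuery client.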
SELECT * FROM
  ML.PREDICT(MODEL `sales.predict_2019_spend`, (
    SELECT
      4 AS Occupation_ID,
      10014 AS Zip_code,
      "Married" AS Marital_status,
      5 AS Years_current_city,
      "Toys" AS Primary_purhcase_category
    FROM `sara-bigquery.sales.test_consumer_behavior` ))
Cloud ML
Engine
Build, train, and serve custom
models with your own data
Cloud Machine Learning Engine
cloud.google.com/ml-engine
Same Deploy
Training and Certification 36
Proprietary + Confidential
Create task.py to parse command-line parameters and ...
The model.py contains the ML model in ...
taxifare/
taxifare/PKG-INFO
taxifare/setup.cfg
taxifare/setup.py
taxifare/trainer/
taxifare/trainer/__init__.py
taxifare/trainer/task.py
taxifare/trainer/model.py

Python packages need to contain an __init__.py in every folder.
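A quick way to check the `__init__.py` requirement before packaging is to walk the package directory and list any folder that lacks the file. The helper below is a hypothetical convenience written for this purpose, not part of any Google Cloud tooling.

```python
from pathlib import Path


def dirs_missing_init(package_root):
    """Return the package root and sub-folders that have no __init__.py."""
    root = Path(package_root)
    candidates = [root] + [p for p in root.rglob("*") if p.is_dir()]
    return sorted(
        str(d) for d in candidates
        if d.name != "__pycache__" and not (d / "__init__.py").exists()
    )
```

For the layout above, `dirs_missing_init("taxifare/trainer")` should return an empty list.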
export PYTHONPATH=${PYTHONPATH}:/somedir/taxifare
python -m trainer.task \
  --train_data_paths="/somedir/datasets/*train*" \
  --eval_data_paths="/somedir/datasets/*valid*" \
  --output_dir=/somedir/output \
  --train_steps=100 --job-dir=/tmp
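The local run above passes several flags to `trainer.task`. A minimal sketch of how task.py might parse them with `argparse` follows. The flag names are copied from the command; the defaults, the `required` settings, and the function name `parse_args` are assumptions made for illustration.

```python
import argparse


def parse_args(argv=None):
    """Parse the flags used in the local run (sketch, not the course's task.py)."""
    parser = argparse.ArgumentParser(description="Trainer entry point (sketch)")
    parser.add_argument("--train_data_paths", required=True,
                        help="File pattern for training data")
    parser.add_argument("--eval_data_paths", required=True,
                        help="File pattern for evaluation data")
    parser.add_argument("--output_dir", required=True,
                        help="Where checkpoints and exports are written")
    parser.add_argument("--train_steps", type=int, default=100)
    # argparse stores --job-dir as args.job_dir (dash becomes underscore).
    parser.add_argument("--job-dir", default=None)
    return parser.parse_args(argv)


# Example with an explicit argument list instead of sys.argv.
args = parse_args([
    "--train_data_paths=/somedir/datasets/*train*",
    "--eval_data_paths=/somedir/datasets/*valid*",
    "--output_dir=/somedir/output",
    "--train_steps=100",
    "--job-dir=/tmp",
])
print(args.train_steps, args.job_dir)
```

Quoting the file patterns on the command line matters: it lets the pattern reach the program unexpanded, so the training code can resolve it itself.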
Hyperparameter tuning with HyperTune
[Figure: objective plotted against HyperParam #1; annotations: "Want to find this", "Not these"]
Automatic hyperparameter tuning
Runs multiple trials in a training job with the specified hyperparameter metrics
Hyperparameter tuning with HyperTune
hyperparameters:
  goal: MAXIMIZE
  maxTrials: 30
  maxParallelTrials: 1
  params:
  - parameterName: hidden1
    type: INTEGER
    minValue: 40
    maxValue: 400
    scaleType: UNIT_LINEAR_SCALE

Hyperparameter types: DOUBLE, INTEGER, CATEGORICAL, DISCRETE
Hyperparameter scaling: UNIT_LINEAR_SCALE, UNIT_LOG_SCALE, UNIT_REVERSE_LOG_SCALE
Then use the gcloud command to submit the training job
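To get a feel for what the scale types mean, the sketch below maps a value in [minValue, maxValue] onto the unit interval either linearly or logarithmically. This is one plausible reading of the names UNIT_LINEAR_SCALE and UNIT_LOG_SCALE, written for illustration; the exact formulas used by the tuning service are an assumption here, and the function names are hypothetical.

```python
import math


def unit_linear(value, min_value, max_value):
    """Map value linearly onto [0, 1]."""
    return (value - min_value) / (max_value - min_value)


def unit_log(value, min_value, max_value):
    """Map value logarithmically onto [0, 1]; all inputs must be positive."""
    return math.log(value / min_value) / math.log(max_value / min_value)


# hidden1 ranges from 40 to 400 in the configuration above.
for hidden1 in (40, 100, 220, 400):
    print(hidden1,
          round(unit_linear(hidden1, 40, 400), 3),
          round(unit_log(hidden1, 40, 400), 3))
```

Under the log mapping, small values take up more of the unit interval, which tends to suit parameters such as learning rates that span several orders of magnitude.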
Model
Deploy
We cannot reuse the training input function for serving
Training                      Serving
Data format might be CSV      Data format might be JSON
Features, Labels              Features, Labels
Training input_fn             Serving input_fn
Model                         Model
import tensorflow as tf  # TF 1.x API (tf.placeholder, tf.estimator)

def serving_input_fn():
    # One placeholder per raw feature; [None] allows any batch size.
    feature_placeholders = {
        'pickuplon': tf.placeholder(tf.float32, [None]),
        'pickuplat': tf.placeholder(tf.float32, [None]),
        'dropofflat': tf.placeholder(tf.float32, [None]),
        'dropofflon': tf.placeholder(tf.float32, [None]),
        'passengers': tf.placeholder(tf.float32, [None]),
    }
    # Add a trailing dimension so each feature has shape [batch, 1].
    features = {
        key: tf.expand_dims(tensor, -1)
        for key, tensor in feature_placeholders.items()
    }
    return tf.estimator.export.ServingInputReceiver(features,
                                                    feature_placeholders)
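The keys of `feature_placeholders` define what a prediction request has to contain. As a sketch, a JSON request body for online prediction can be assembled as follows, with one object per instance under an `instances` list. The coordinate and passenger values are illustrative only, and `make_request` is a hypothetical helper.

```python
import json

# Must match the placeholder names in serving_input_fn.
FEATURE_KEYS = ["pickuplon", "pickuplat", "dropofflat", "dropofflon", "passengers"]


def make_request(rows):
    """Wrap feature dicts in an {"instances": [...]} JSON body."""
    instances = []
    for row in rows:
        missing = [k for k in FEATURE_KEYS if k not in row]
        if missing:
            raise ValueError(f"missing features: {missing}")
        # Placeholders are float32, so send every feature as a float.
        instances.append({k: float(row[k]) for k in FEATURE_KEYS})
    return json.dumps({"instances": instances})


# Illustrative values only.
print(make_request([{
    "pickuplon": -73.99, "pickuplat": 40.75,
    "dropofflat": 40.76, "dropofflon": -73.97,
    "passengers": 2,
}]))
```

No label is sent: at serving time the model receives features only and returns the prediction.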
AutoML vs CMLE (Cloud ML Engine)
Attribute            AutoML                                  CMLE
Data requirements    Specific format (based on solution)     Any kind of data (can be transformed)
Custom requirements  No                                      Yes, fully
Status               Few solutions are available (in Beta)   Fully available, with autoscaling
[Diagram: data processing architecture. Labels: raw logs, files, assets, Google Analytics data, etc.; queue; stream processing; batch processing; storage; NoSQL database; warehouse; machine learning; visualization and BI; co-workers; applications and reports. Callouts: "Is this the same?" "As this?"]