Merge branch 'master' into add-pomerium

GoogleCloudPlatform · Aug 18, 2020 · 760b709 · 760b709
2 parents 47f5f63 + 5c7239f
commit 760b709
Show file tree

Hide file tree

Showing 5 changed files with 443 additions and 1 deletion.
diff --git a/tutorials/automl-tables-model-export/Dockerfile.template b/tutorials/automl-tables-model-export/Dockerfile.template
@@ -0,0 +1,17 @@
+# Copyright 2019 Google Inc. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+FROM gcr.io/cloud-automl-tables-public/model_server
+
+ADD model-export/tbl/YOUR_RENAMED_DIRECTORY /models/default/0000001
diff --git a/tutorials/automl-tables-model-export/convert_oss.py b/tutorials/automl-tables-model-export/convert_oss.py
@@ -0,0 +1,60 @@
+# Copyright 2019 Google Inc. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+# tested with TF1.14
+import sys
+import tensorflow as tf
+
+from absl import app
+from absl import flags
+from tensorflow.core.protobuf import saved_model_pb2
+from tensorflow.python.summary import summary
+
+FLAGS = flags.FLAGS
+
+flags.DEFINE_string('saved_model', '', 'The location of the saved_model.pb to visualize.')
+flags.DEFINE_string('output_dir', '', 'The location for the Tensorboard log to begin visualization from.')
+
+def import_to_tensorboard(saved_model, output_dir):
+  """View an imported saved_model.pb as a graph in Tensorboard.
+
+  Args:
+    saved_model: The location of the saved_model.pb to visualize.
+    output_dir: The location for the Tensorboard log to begin visualization from.
+
+  Usage:
+    Call this function with your model location and desired log directory.
+    Launch Tensorboard by pointing it to the log directory.
+    View your imported `.pb` model as a graph.
+  """
+  with open(saved_model, "rb") as f:
+    sm = saved_model_pb2.SavedModel()
+    sm.ParseFromString(f.read())
+    if 1 != len(sm.meta_graphs):
+      print('More than one graph found. Not sure which to write')
+      sys.exit(1)
+    graph_def = sm.meta_graphs[0].graph_def
+
+    pb_visual_writer = summary.FileWriter(output_dir)
+    pb_visual_writer.add_graph(None, graph_def=graph_def)
+    print("Model Imported. Visualize by running: "
+          "tensorboard --logdir={}".format(output_dir))
+
+
+def main(argv):
+  import_to_tensorboard(FLAGS.saved_model, FLAGS.output_dir)
+
+
+if __name__ == '__main__':
+  app.run(main)
diff --git a/tutorials/automl-tables-model-export/index.md b/tutorials/automl-tables-model-export/index.md
@@ -0,0 +1,304 @@
+---
+title: Export a custom AutoML Tables model and serve it with Cloud Run
+description: Learn how to export a custom AutoML Tables model and serve it with Cloud Run or any other environment where you can run a container.
+author: amygdala
+tags: ML, machine learning, TensorBoard
+date_published: 2020-08-18
+---
+
+With [AutoML Tables](https://cloud.google.com/automl-tables/docs/), you can automatically build and deploy state-of-the-art machine-learning models using your
+own structured data. See [this notebook](https://cloud.google.com/blog/products/ai-machine-learning/use-automl-tables-from-a-jupyter-notebook) for a walkthrough.
+
+AutoML Tables includes a feature with which you can [export](https://cloud.google.com/automl-tables/docs/model-export) your full custom model, packaged such that
+you can serve it with a Docker container. This lets you serve your models anywhere that you can run a container.
+
+This tutorial shows you how to package an exported AutoML Tables model to serve on [Cloud Run](https://cloud.google.com/run/docs/). With Cloud Run, your model 
+serving automatically scales up with traffic and scales down to 0 when it’s not being used. This tutorial also shows how you can examine your trained custom 
+model in [TensorBoard](https://www.tensorflow.org/tensorboard).
+
+This tutorial uses the [Cloud Console](https://console.cloud.google.com/automl-tables/datasets), but you could also accomplish the same steps through the 
+command-line interface or using the [AutoML Tables client libraries](https://googleapis.dev/python/automl/latest/gapic/v1beta1/tables.html).
+
+## About the dataset and scenario
+
+The [Cloud Public Datasets Program](https://cloud.google.com/bigquery/public-data/) makes available public datasets that are useful for experimenting with 
+machine learning. Just as in
+[Explaining model predictions on structured data](https://cloud.google.com/blog/products/ai-machine-learning/explaining-model-predictions-structured-data),
+this tutorial uses data that is essentially a join of two public datasets stored in
+[BigQuery](https://cloud.google.com/bigquery/)—[London bike rentals](https://console.cloud.google.com/bigquery?p=bigquery-public-data&d=london_bicycles&page=dataset) and
+[NOAA weather data](https://console.cloud.google.com/bigquery?p=bigquery-public-data&d=noaa_gsod&page=dataset)—with some additional processing to clean up 
+outliers and derive additional GIS and day-of-week fields. 
+
+You use this dataset to build a _regression model_ to predict the duration of a bike rental based on information about the start and end stations, the day of 
+the week, the weather on that day, and other data. If you were running a bike rental company, for example, these predictions and their explanations could help 
+you to anticipate demand and plan how to stock each location.
+
+You can use AutoML Tables for tasks as varied as asset valuations, fraud detection, credit risk analysis, customer retention prediction, and analyzing item 
+layouts in stores.
+
+## Create a dataset
+
+The first step in training a Tables model is to create a *dataset* using your data. This tutorial uses the bike rentals and weather dataset described above. You
+can also follow along with your own tabular dataset, but in that case you need to construct your own prediction instances, too. 
+
+1.  Go to the [**Tables** page](https://console.cloud.google.com/automl-tables/datasets) in the Cloud Console, and enable the API.
+
+    ![Enable the AutoML Tables API](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/enable_api.png)
+
+1.  Create a new Tables dataset.
+
+    ![Create a new Tables dataset](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/create_dataset.png)
+
+1.  Import your data into the dataset:
+
+    1.  On the **Import** tab, select **Import data from BigQuery**.
+    1.  Enter `aju-dev-demos` as the BigQuery project ID, `london_bikes_weather` as the dataset ID, and `bikes_weather` as the table name.
+    1.  Click **Import**.
+
+    ![Import the data](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/import_data.png)
+
+## Edit the dataset’s schema
+
+After the import is complete, you edit the dataset schema to change a few of the inferred types.
+
+On the **Train** tab, make sure that your schema matches the screenshot below:
+
+1.  Change `bike_id`, `end_station_id`, `start_station_id`, and `loc_cross` to be of type **Categorical**.
+1.  Select `duration` in the **Target column** section. 
+
+![Adjust the dataset schema](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/schema.png)
+
+Useful statistics are generated for the columns, including correlation statistics with the target column, which can help you determine which columns you want
+to use as model inputs.
+
+## Train the Tables model
+
+Now you're ready to train a model on the dataset.  
+
+For this example, you train a model to predict ride duration given all the other dataset inputs, so you train a
+[regression](https://cloud.google.com/automl-tables/docs/problem-types) model. 
+
+Enter a training budget of 1 hour, and include all available feature columns, as shown in the following screenshot.
+
+![Train a model to predict ride](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/train.png)
+
+## Export the trained model
+
+After the model is trained, you export the result, so that it can be served from any environment in which you can run a container. Alternatively, you could
+[deploy](https://cloud.google.com/automl-tables/docs/predict) your model to AI Platform for online prediction.
+
+For details about the export process, see [Exporting models](https://cloud.google.com/automl-tables/docs/model-export).
+
+Steps in this procedure use `gsutil`. To run these commands, you need [`gcloud`](https://cloud.google.com/sdk/install) installed. You can run these commands from
+the [Cloud Shell](https://cloud.google.com/shell/) instead of your local machine if you don't want to install the SDK locally.
+
+1.  On the **Test & Use** tab, under the **Use your model** heading, click the **Container** card to export your trained model to be run from a Docker container. 
+
+    ![Export trained model to be run from Docker container](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/export1.png)
+
+1.  Browse to select the Google Cloud Storage folder into which you want to export your model, and click the **Export** button.
+
+    You need to use a *regional* Cloud Storage bucket, in the same region as your model. 
+
+    Consider creating a sub-folder for the model export in the Cloud Storage bucket, so that if you have multiple exports, you can keep track of them.
+
+    ![Browse to Cloud Storage folder to export model](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/export2.png)
+
+1.  When the export is finished, create a local directory (for example, `bikes_weather`) to hold your model.
+
+1.  Copy the download command provided in the Cloud Console, which will look something like the following:
+
+    `gsutil cp -r gs://[YOUR_STORAGE_BUCKET]/model_export_1//* ./download_dir`
+
+1.  Edit this command as follows:
+
+    1.  Add quotation marks around the `gs` URI.
+    1.  Remove one of the end slashes.
+    1.  Edit `download_dir` to point to the directory that you created.
+
+    The result should look something like the following:
+
+        gsutil cp -r 'gs://[YOUR_STORAGE_BUCKET/model_export_1/*' ./bikes_weather
+
+1.  Run the command from the parent directory of your `bikes_weather` directory.
+
+    The exported model is copied to `./bikes_weather`.
+
+## Test your exported model locally
+
+After you've downloaded your model, you can run and test it locally. This provides a good check before deploying to Cloud Run.
+The process is described in detail in the [AutoML Tables documentation](https://cloud.google.com/automl-tables/docs/model-export).
+
+1.  Change to the `bikes_weather` directory.
+
+    You should see a `model_export` subdirectory, which is the result of your download.
+
+1.  Rename the `model_export/tbl/tf_saved_model*` subdirectory to remove the timestamp suffix.
+
+1.  Create and run a container to serve your new trained model:
+
+        docker run -v `pwd`/model-export/tbl/[YOUR_RENAMED_DIRECTORY]:/models/default/0000001 -p 8080:8080 -it gcr.io/cloud-automl-tables-public/model_server
+
+    This starts up a model server to which you can send requests. This command uses the `gcr.io/cloud-automl-tables-public/model_server` container image and
+    mounts your local directory.
+
+1.  Download or navigate to the [`instances.json`](https://github.com/GoogleCloudPlatform/community/tree/master/tutorials/automl-tables-model-export/instances.json)
+    file, which holds data for three prediction instances for the bikes and weather model.
+
+1.  From the directory where you placed `instances.json`, run the following command:
+
+        curl -X POST --data @instances.json http://localhost:8080/predict
+
+    You’ll get back predictions for all of the instances in the JSON file.
+
+    The actual duration for the third instance is 1200.
+
+## View information about your exported model in TensorBoard
+
+In this section, you view your exported custom model in [TensorBoard](https://www.tensorflow.org/tensorboard). 
+
+Viewing your exported model in TensorBoard requires a conversion step. You need to have TensorFlow 1.14 or 1.15
+[installed](https://www.tensorflow.org/install/pip#2.-create-a-virtual-environment-recommended) to run the the conversion script.
+
+1.  Download or navigate to the
+    [`convert_oss.py`](https://github.com/GoogleCloudPlatform/community/tree/master/tutorials/automl-tables-model-export/convert_oss.py) script, and copy it to 
+    the parent directory of `model_export`.
+
+1.  Create a directory for the output (for example, `converted_export`):
+
+        mkdir converted_export
+
+1.  Run the script:
+
+        python ./convert_oss.py --saved_model ./model-export/tbl/<your_renamed_directory>/saved_model.pb --output_dir converted_export
+
+1.  Point TensorBoard to the converted model:
+
+        tensorboard --logdir=converted_export
+
+1.  View the exported custom Tables model in Tensorboard.
+
+    You will see a rendering of the model graph, and you can pan and zoom to view model sub-graphs in more detail.
+
+    ![View exported custom Tables model in Tensorboard](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/tb1.png)
+
+    ![](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/tb2.png)
+
+    Zoom in to see part of the model graph in more detail.
+
+    ![Zooming in to see part of the model graph in more detail](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/tb3.png)
+
+## Create a Cloud Run service based on your exported model
+
+At this point, you have a trained model that you've exported and tested locally. You are almost ready to deploy it to
+[Cloud Run](https://cloud.google.com/run/docs/). As the last step of preparation, you create a container image that uses 
+`gcr.io/cloud-automl-tables-public/model_server` as a base image and adds the model directory, and you push that image to the
+[Google Container Registry](https://cloud.google.com/container-registry/), so that Cloud Run can access it.
+
+### Build a container to use for Cloud Run
+
+1.  In the same `bikes_weather` directory that holds the `model_export` subdirectory, create a file called `Dockerfile` that contains the following two lines,
+    replacing `[YOUR_RENAMED_DIRECTORY]` with the path to your exported model, the same path that you used in a previous step when running locally:
+
+        FROM gcr.io/cloud-automl-tables-public/model_server
+
+        ADD model-export/tbl/[YOUR_RENAMED_DIRECTORY]/models/default/0000001
+
+    The template is in
+    [`Dockerfile.template`](https://github.com/GoogleCloudPlatform/community/tree/master/tutorials/automl-tables-model-export/Dockerfile.template), too.
+
+1.  Build a container from the `Dockerfile` (in this example called `bw-serve`):
+
+        docker build . -t gcr.io/[YOUR_PROJECT_ID]/bw-serve
+
+1.  Push the container to the Google Container Registry:
+
+        docker push gcr.io/[YOUR_PROJECT_ID]/bw-serve
+
+If you get an error, you may need to configure Docker to use `gcloud` to
+[authenticate requests to Container Registry](https://cloud.google.com/container-registry/docs/quickstart#add_the_image_to).
+
+Alternately, you can use [Cloud Build](https://cloud.google.com/cloud-build/docs/quickstart-docker) to build the container instead, as follows:
+
+    gcloud builds submit --tag gcr.io/[YOUR_PROJECT_ID]/bw-serve .
+
+### Create your Cloud Run service
+
+Now you're ready to deploy the container to Cloud Run, where you can scalably serve it for predictions.
+
+1.  Go to the [Cloud Run page in the Cloud Console](https://console.cloud.google.com/marketplace/details/google-cloud-platform/cloud-run).
+
+1.  Click **Start using** if necessary.
+
+1.  Click **Create service**.
+
+    ![Creating a Cloud Run Service](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/cloud_run1.png)
+
+1.  For the container URL, enter the name of the container that you just built above.
+
+1.  Select **Cloud Run (fully managed)**.
+
+1.  Enter a service name, which can be anything you like.
+
+1.  Select **Require Authentication**. 
+
+1.  Click **Show optional revision settings**.
+
+1.  Change **Memory allocated** to **2GiB**.
+
+    ![Set service instances to use 2GiB of memory](https://storage.googleapis.com/gcp-community/tutorials/automl-tables-model-export/cloud_run2.png)
+
+1.  Leave the rest of the settings at their default values, and click **Create**.
+
+### Send prediction requests to the Cloud Run service
+
+After your Cloud Run service is deployed, you can send prediction requests to it. Your new service has a URL that starts with your service name and ends
+with `run.app`. You can send JSON predictions to the Cloud Run service just as with the local server you tested earlier; but with Cloud Run, the service will
+scale up and down based on demand. 
+
+Assuming that you selected the **Require Authentication** option, you can make prediction requests like this:
+
+```bash
+curl -X POST -H \
+"Authorization: Bearer $(gcloud auth print-identity-token)" --data @./instances.json \
+https:/[YOUR_SERVICE_URL]/predict
+```
+
+It may take a second or two for the first request to return, but subsequent requests will be faster.
+
+If you set up your Cloud Run service endpoint so that it does not require authentication, you don’t need to include the authorization header in your `curl` 
+request.
+
+## What’s next?
+
+In this tutorial, you saw how to export a custom AutoML Tables trained model, view model information in TensorBoard, and build a container image that lets you serve the model from any environment. Then you saw how you can deploy that image to Cloud Run for scalable serving. See the
+[Cloud Run documentation](https://cloud.google.com/run/docs/authenticating/overview) for more information on how to configure your prediction endpoint for
+end-user or service-to-service authentication.
+
+Once you’ve built a model-serving container image, you can deploy it to other environments as well. For example, if you have installed
+[Knative serving](https://github.com/knative/serving) on a [Kubernetes](https://kubernetes.io/) cluster, you can create a Knative *service* like this, using the 
+same container image (replacing `[YOUR_PROJECT_ID]` with your project ID):
+
+```yaml
+apiVersion: serving.knative.dev/v1
+kind: Service
+metadata:
+  name: bikes-weather
+spec:
+  template:
+    spec:
+      containers:
+        - image: gcr.io/[YOUR_PROJECT_ID]/bw-serve
+```
+
+Though the example model for this tutorial fits on a 2-GiB Cloud Run instance, you might have models that are too large for the managed Cloud Run service, and 
+serving it with Kubernetes/GKE is a good alternative.
+
+If you’re curious about the details of your custom model, you can use Cloud Logging to
+[view information about your AutoML Tables model](https://cloud.google.com/automl-tables/docs/logging). Using Cloud Logging, you can see the final model 
+hyperparameters and the hyperparameters and object values used during model training and tuning.
+
+You may also be interested in exploring the updated [AutoML Tables client libraries](https://googleapis.dev/python/automl/latest/gapic/v1beta1/tables.html), 
+which make it easy for you to
+[train and use Tables programmatically](https://github.com/GoogleCloudPlatform/python-docs-samples/tree/master/tables/automl/notebooks), or reading about how to create a _contextual bandit_ model pipeline
+[using AutoML Tables, without needing a specialist for tuning or feature engineering](https://cloud.google.com/blog/products/ai-machine-learning/how-to-build-better-contextual-bandits-machine-learning-models).