Seldon Deployment Canary Promotion¶
Iris Model¶
Iris is the genus of flower which contains 3 species: setosa, versicolor, and virginica. This demo is based on iris classification model based on flower properties like sepal length, sepal width, petal length, and petal width. The species are also the classes that will be used for the classification. Here we will:
Deploy a pretrained sklearn iris model
Load test the model
Observe requests and metrics
Deploy a canary XGBoost model
Load test canary model
Observe requests and metrics for both models
Promote the canary model
Launch a Seldon Deployment¶
From the
Overview
page, select theCreate new deployment
button in the top right cornerIn the
Deployment Creation Wizard
, enter the deployment details as follows:Name: iris-classifier
Namespace: seldon
Type: Seldon Deployment
Protocol: Seldon
Configure the default predictor as follows:
Runtime: Scikit Learn
Model URI:
gs://seldon-models/v1.18.2/sklearn/iris
Model Project: default
Storage Secret: (leave blank/none)
Skip
Next
for the remaining steps, then clickLaunch
.If your deployment is launched successfully, it will have an
Available
status in theOverview
page.
Start Load Test¶
Once the deployment is in an
Available
status, navigate to itsDashboard
page by clicking on it.In the
Requests Monitor
section, click on theStart a load test
button to start a load test with the following details:Connections(total): 1
Load Parameter: Duration(seconds)
Value: 120
Json payload:
{ "data": { "names": ["Sepal length", "Sepal width", "Petal length", "Petal Width"], "ndarray": [ [6.8, 2.8, 4.8, 1.4], [6.1, 3.4, 4.5, 1.6] ] } }
This will spawn a Kubernetes Job that will send continuous prediction requests for the specified seconds to the SKLearn model in the deployment.
Since we have 2 sets of prediction data in the Json payload, we should expect to see double the number of requests in the metrics and the Requests
page.
Observe requests and metrics¶
Once the load test has started, you can monitor the upcoming requests, their responses and metrics in the Requests
page the deployment.
If this doesn’t work, consult the request logging docs section for debugging.
You can also see core metrics in the Dashboard
page.
Deploy a Canary model¶
The next step is to create an XGBoost canary model that will be sharing a percentage of the traffic with the main model.
Navigate to the
Dashboard
of the deployment and click on theAdd Canary
button.In the
Update Deployment Wizard
, configure the default predictor as follows:Runtime: XGBoost
Model URI:
gs://seldon-models/xgboost/iris
Model Project: default
Storage Secret: (leave blank/none)
Canary Traffic Percentage: 10
Skip
Next
for the remaining steps, then clickLaunch
.While the canary model is being launched, the deployment status will move to an
Updating
state.If the canary model is launched successfully, the deployment status will become
Available
.
This will create a new canary deployment with the XGBoost model and roughly 10% of the traffic will be sent to it.
Note
The deployment status represents the overal status of the deployment, including the main and canary models.
Load test the canary model¶
This time, we will create a new load test with the canary model running and observe the requests and metrics for both models. You can use either the same Json payload from the previous load test or construct a new one with different values or number of predictions.
Warning
Remember that roughly 10% of the traffic will be sent to the canary model. If, however, the canary model is not available, all the traffic will be sent to the main model.
Observe requests and metrics for both models¶
Once the load test has started, you can monitor the upcoming requests, their responses and metrics in the Requests
page the deployment.
If this doesn’t work, consult the request logging docs section for debugging.
In order to see the requests for the canary model, you need to select the canary
predictor in the Node Selector
filter in the Requests
page, as shown on the screenshot.
Take a note that the number of requests for the canary model will be 10% of the total number of requests, as was specified in the canary deployment.
You can also see core metrics for both models in the Dashboard
page.
Promote the Canary model¶
Great! Now we have observed the requests and metrics for both models. If we are happy with how the canary model is performing, we can promote it to become the main model.
Navigate to the
Dashboard
of the deployment and click on thePromote Canary
button.In the
Promote Canary
dialog, clickConfirm
to promote the canary model to the main model.If the canary model is promoted successfully, the deployment status will become
Available
.