Seldon Deployment Canary Promotion¶
Iris Model¶

Iris is the genus of flower which contains 3 species: setosa, versicolor, and virginica. This demo is based on iris classification model based on flower properties like sepal length, sepal width, petal length, and petal width. The species are also the classes that will be used for the classification. Here we will:
- Deploy a pretrained sklearn iris model 
- Load test the model 
- Observe requests and metrics 
- Deploy a canary XGBoost model 
- Load test canary model 
- Observe requests and metrics for both models 
- Promote the canary model 
Launch a Seldon Deployment¶
- From the - Overviewpage, select the- Create new deploymentbutton in the top right corner
- In the - Deployment Creation Wizard, enter the deployment details as follows:- Name: iris-classifier 
- Namespace: seldon 
- Type: Seldon Deployment 
- Protocol: Seldon 
 
- Configure the default predictor as follows: - Runtime: Scikit Learn 
- Model URI: - gs://seldon-models/v1.18.2/sklearn/iris
- Model Project: default 
- Storage Secret: (leave blank/none) 
 
- Skip - Nextfor the remaining steps, then click- Launch.
- If your deployment is launched successfully, it will have an - Availablestatus in the- Overviewpage.
Start Load Test¶
- Once the deployment is in an - Availablestatus, navigate to its- Dashboardpage by clicking on it.
- In the - Requests Monitorsection, click on the- Start a load testbutton to start a load test with the following details:- Connections(total): 1 
- Load Parameter: Duration(seconds) 
- Value: 120 
- Json payload: - { "data": { "names": ["Sepal length", "Sepal width", "Petal length", "Petal Width"], "ndarray": [ [6.8, 2.8, 4.8, 1.4], [6.1, 3.4, 4.5, 1.6] ] } } 
 
This will spawn a Kubernetes Job that will send continuous prediction requests for the specified seconds to the SKLearn model in the deployment.
Since we have 2 sets of prediction data in the Json payload, we should expect to see double the number of requests in the metrics and the Requests page.
Observe requests and metrics¶
Once the load test has started, you can monitor the upcoming requests, their responses and metrics in the Requests page the deployment.
If this doesn’t work, consult the request logging docs section for debugging.
You can also see core metrics in the Dashboard page.
Deploy a Canary model¶
The next step is to create an XGBoost canary model that will be sharing a percentage of the traffic with the main model.
- Navigate to the - Dashboardof the deployment and click on the- Add Canarybutton.
- In the - Update Deployment Wizard, configure the default predictor as follows:- Runtime: XGBoost 
- Model URI: - gs://seldon-models/xgboost/iris
- Model Project: default 
- Storage Secret: (leave blank/none) 
- Canary Traffic Percentage: 10 
 
- Skip - Nextfor the remaining steps, then click- Launch.
- While the canary model is being launched, the deployment status will move to an - Updatingstate.
- If the canary model is launched successfully, the deployment status will become - Available.
This will create a new canary deployment with the XGBoost model and roughly 10% of the traffic will be sent to it.
Note
The deployment status represents the overal status of the deployment, including the main and canary models.
Load test the canary model¶
This time, we will create a new load test with the canary model running and observe the requests and metrics for both models. You can use either the same Json payload from the previous load test or construct a new one with different values or number of predictions.
Warning
Remember that roughly 10% of the traffic will be sent to the canary model. If, however, the canary model is not available, all the traffic will be sent to the main model.
Observe requests and metrics for both models¶
Once the load test has started, you can monitor the upcoming requests, their responses and metrics in the Requests page the deployment.
If this doesn’t work, consult the request logging docs section for debugging.
In order to see the requests for the canary model, you need to select the canary predictor in the Node Selector filter in the Requests page, as shown on the screenshot.
Take a note that the number of requests for the canary model will be 10% of the total number of requests, as was specified in the canary deployment.
You can also see core metrics for both models in the Dashboard page.
Promote the Canary model¶
Great! Now we have observed the requests and metrics for both models. If we are happy with how the canary model is performing, we can promote it to become the main model.
- Navigate to the - Dashboardof the deployment and click on the- Promote Canarybutton.
- In the - Promote Canarydialog, click- Confirmto promote the canary model to the main model.
- If the canary model is promoted successfully, the deployment status will become - Available.








