
Overview
**In this walkthrough, we will use the Arize platform to monitor Click-through Rate (CTR) model performance.**This use case will walk you through how to manage your model’s performance for an online advertising platform. While you have spent a great deal of your time collecting online data and training models for best performance, it’s common that your models in production have no tools available to monitor the performance of your models, identify any issues or get insights into how to improve your models. In this walkthrough, we will look at a few scenarios common to an advertisement use-case and more specifically look at CTR predictions versus actuals for a given ad or ad group.You will learn to:- Get training, validation, and production data into the Arize platform
- Setup a baseline and performance dashboards
- Create threshold alerts
- Monitor for Log-Loss
- Understand where the model is underperforming
- Discover the root cause of issues
Set up a Baseline
Within Arize, you are able to set a baseline for the model which compares your model’s behavior in production to a training set, validation set, or an initial model launch period. This allows us to determine when/where our model’s behavior has changed drastically.
-
Datasets:
Training Version 1.0 -
Default Metric:
Accuracy, Trigger Alert When:Accuracy is below .7, Positive Class:click - Turn On Monitoring: Drift ✅, Data Quality ✅, Performance ✅
Monitoring
Arize sets up monitors across all features, predictions, and actual values. For click-through rate, it’s important to monitor the model’s Accuracy.
Drift Detection
Drift is a change in distribution over time, measured for model inputs, outputs, and actuals of a model. Measure drift to identify if your models have grown stale, you have data quality issues, or if there are adversarial inputs in your model. Detecting drift in your models will help protect your models from performance degradation and allow you to better understand how to begin resolution.
Prediction Drift Impact can surface when drift has impacted your model. Drift (PSI) is a measurement of how much your distribution has drifted. Lastly, Feature Importance helps your explain why even small Drift (PSI) can have significant Drift Impact.
Data Quality Checks
It’s important to immediately surface data quality issues to identify how your data quality maps to your model’s performance. Utilize data quality monitoring to analyze hard failures in your data quality pipeline, such as missing data or cardinality shifts.- Missing / Null values could be an indicator of issues from an upstream data source.
- Cardinality is checked to ensure there are no spikes / drops in feature values.

Performance Analysis
Model performance metrics measure how well your model performs in production. Once a performance monitor is triggered, navigate to the Performance tab to start troubleshooting your model issues and gain an understanding of what caused the degradation.

Actionable Insight
Now that we understand what is affecting our model we can now:- Retrain the model in these brand new cohorts inside the device and domain features
- Handle the empty values in our data pipelines, affecting our data quality
Custom Dashboard
In only a few clicks, you can add widgets to provide a single glance view of your model’s import metrics and KPI. To visualize these metrics over time you can also create a custom time series widget which overlays three plots to showcase the fluctuation of these metrics over time. Below we’ll set up a templatized dashboard and then adjust the template to match our use case.
- Pred Shopping: Use Aggregation Function : Average with Average of set to Prediction Score with filter (feature category = [shopping]). Also add a filter (feature domain != [new_site.com])

Explainability
Log feature importances to the Arize platform to explain your model’s predictions. By logging these values, you gain the ability to view the global feature importances of your predictions as well as the ability to perform global and cohort prediction-based analysis to compare feature importances for your model’s features.