- What type of data types do you support?
- What model types does Arize support?
- How can Arize surface outliers/anomalies?
- What performance metrics does Arize support?
- What happens if a new categorical feature is seen in production?
- What happens if a new numerical feature is seen in production?
- How does Arize calculate drift?
- What metrics can be applied to individual features?
- How do you evaluate features?
- How does Arize handle concept drift?
- Does Arize have any security certifications?
- How are automatic thresholds set?
What data types does Arize support?
Arize natively supports tabular/structured data types (strings, floats, booleans, etc), as well as embedding support for NLP, Image, and other unstructured data types.What model types does Arize support?
Arize natively supports binary classification, multi-class classification, regression, ranking, NLP, and CV model types. Your model type informs the data ingestion format and the performance metrics that can be utilized in the platform.How can Arize surface outliers/anomalies?
Arize can surface outlier/anomalous data through:Data Quality checks
- Numeric Features: Arize will monitor outliers in numeric inputs ranges for your input data.
- Categorical Features: Arize will monitor outlier categories and the overall cardinality of categorical features.
Drift checks
If there are features slices that vary significantly from the set baseline distribution, Arize will alert you through drift detection monitors.Feature Performance Heatmap
If there are outlier slices that are poorly performing, Arize’s feature performance heatmap will automatically surface up the worst performing segments. These slices can also be monitored explicitly for proactively performance degradation detection.What performance metrics does Arize support?
Arize supports a comprehensive list of model performance metrics for both numeric and categorical model types. These metrics are available on dashboards as well as monitors. In addition to the out-of-the-gate metrics listed below, Arize also supports model data metrics, custom evaluation metrics, and user defined business impact metrics. Learn more about statistical widgets here and user-defined business impact formulas here.| Accuracy | MAE | Precision | Sensitivity |
| AUC | MAPE | R-Squared | Specificity |
| F1 | MSE | Recall | TP/TN/FP/FN |
| Log Loss | PR-AUC | RMSE | Custom Metrics |
How can I monitor the impact of a particular feature?
You can monitor the model’s performance for that particular feature, feature-value combination —also known as a slice. This feature performance heatmap helps visualize the performance of each slice and indicates what slices are the most problematic/performance degrading.What happens if a new categorical feature is seen in production?
Arize drift detection can flag when categorical features see a % of unseen categories. For example, if the baseline had 10 categories, but the production/serving distribution differed significantly in number, Arize will trigger an alert. Additionally, Arize captures the percentage of values that fall into these new feature categories not previously seen in the baseline distribution.What happens if a new numerical feature is seen in production?
Arize drift detection can show the % of values outside of the baseline range. Arize uses the quantiles of the data to calculate the bins of the distribution. If the baseline range has a larger range than the production/serving environment, the user can see the % of volume where the baseline distribution was outside of the production/serving distribution. If the production/serving distribution was outside the range of the baseline distribution, similarly Arize surfaces the % of volume for values outside the baseline range.How does Arize calculate drift?
Arize calculates drift metrics including Population Stability Index, KL Divergence, KS Statistic and JS Distance. Arize computes drift by measuring distribution changes between the model’s production values and a baseline (reference dataset). Users can configure a baseline to be any time window of a:- Pre-production dataset (training, test, validation) or
- Fixed or moving time period from production (e.g. last 30 days, last 60 days).
What metrics can be applied to individual features?
Arize supports automated schema detection of models and immediately computes statistics for all features of the model, including:| Cardinality | Average | Feature Type |
|---|---|---|
| Minimum Value | Standard Deviation | Missing Value |
| Maximum Value | Percentiles | Custom Metrics |