ShareChat’s Machine Learning Team Grows Engagement, Inclusivity

Key Facts

COMPANY
ShareChat

INDUSTRY
Social Media

ABOUT
Social media giant with over 400 million monthly active users across ShareChat, Moj, and MX TakaTak

MODELS IN PRODUCTION
>200

MACHINE LEARNING TEAM SIZE
>100

PRIMARY USE CASES
Advertising optimization, click-through rate, content intelligence, computer vision (CV), natural language processing (NLP), recommender systems, more

Challenges

Delays in detecting and diagnosing model performance issues lurked as a potential problem given the high importance of machine learning (ML) to ShareChat’s advertising optimization and user engagement. Before implementing ML observability, the team faced:

~ 24 hour delays to detect performance issues due to the limitations of existing internal dashboards and alerts
A time-consuming ML troubleshooting workflow involving querying, calculating metrics, and writing ad-hoc scripts to slice data across hundreds of models
Product and ad sales teams sometimes catching model performance issues before the ML team due to blindspots
Lack of tooling to proactively monitor unstructured data in production
An estimated 3-4 full time employees (FTEs) likely needed to build out and maintain a more robust ML observability solution in-house to meet future needs

Solution

In order to better detect model drift and to speed up time-to-resolution, ShareChat implemented Arize for ML observability earlier this year. Arize enables:

Pre-launch model validation
Automated monitors based on predefined thresholds
Monitoring of both structured and unstructured data
ML performance tracing to quickly pinpoint the source of model performance problems and map back to underlying data issues
Concept and feature drift monitoring and troubleshooting to compare across training, validation, and production environments
Data integrity checks to ensure the quality of model data inputs and outputs with automated checks for missing, unexpected, or extreme values
Bias and fairness tracing to ensure models are not generating potentially biased or unfair outcomes for protected segments of interest

Results

ShareChat’s monetization AI team is seeing improved model performance and significant time savings since deploying Arize. That translates to:

Hundreds of extra hours freed up per year across the team
A payback period of under a year; >100% ROI
Improved model performance from proactively surfacing feature drift and performance impact score at a cohort-level
Robust drift monitoring for structured data, with the plans to implement embedding drift monitoring for NLP models
Immediate visibility when issues arise based on predefined and automated thresholds, maximizing internal visibility and speeding up mean time-to-resolution
Positive impacts on AI fairness goals

CUSTOMER STORY