Predicted Classes
The model categorizes leads into three classes based on historical conversion patterns:Class Definitions
Closed Won
Leads predicted to convert successfully. These are high-priority prospects with strong conversion signals.
Closed Lost
Leads predicted not to convert. These may require different engagement strategies or deprioritization.
Other
Leads with uncertain outcomes. These may still be in progress or require additional nurturing.
Color Coding
The dashboard uses a consistent color scheme to visually distinguish predicted classes:- Green (#20C69E): Closed Won - High priority
- Red (#F86C7D): Closed Lost - Low priority
- Yellow (#F2BB67): Other - Medium priority
Probability Scores
Each prediction includes a probability score indicating the model’s confidence in a “Closed Won” outcome.Probability Binning
Probabilities are grouped into four quartiles for distribution analysis:0-25% Probability
0-25% Probability
Very Low Confidence - Leads in this range are unlikely to convert. Consider deprioritizing or using alternative engagement strategies.
25-50% Probability
25-50% Probability
Low-Medium Confidence - These leads show some potential but may need additional nurturing or qualification.
50-75% Probability
50-75% Probability
Medium-High Confidence - Solid prospects that warrant active engagement and follow-up.
75-100% Probability
75-100% Probability
High Confidence - Top-priority leads with strong conversion signals. Allocate maximum sales resources to these prospects.
Using Predictions for Lead Prioritization
Rank by Probability
Within each class, prioritize leads with higher probability scores (closer to 100%).
Analyze Contributing Factors
Review the lead features (Use Case, Source, Discount code, etc.) to understand why certain leads scored higher.
The model calculates probabilities using
predict_proba() from the Gradient Boosting classifier, which provides calibrated probability estimates based on the training data.Prediction Output Structure
The complete prediction output includes:- predictions_df: Individual lead predictions with features and classes
- probability_distribution: Aggregated count of leads in each probability quartile
- accuracy_score: Overall model performance metric
Review the accuracy score to understand the model’s reliability. Higher accuracy (closer to 1.0) indicates more trustworthy predictions.