Endigest AI Core Summary
This post describes how Lyft built a Bayesian hierarchical tree model to predict rider conversion in real time under sparse data conditions.
• Predicting whether a rider will request a ride after viewing price and ETA is central to Lyft's marketplace, but high-cardinality categorical features cause severe data sparsity for specific contexts
• Standard gradient-boosted trees (LightGBM, XGBoost) overfit badly when only a handful of examples exist for a specific context intersection (e.g., a business traveler leaving Detroit suburbs at 4 AM)
• A hierarchical tree partitions sessions by context keys such as city region, time of day, and supply-demand balance, with nodes becoming increasingly sparse deeper in the tree
• Bayesian smoothing applies a Gaussian prior via L2 regularization (||Θ_parent − Θ_child||²), pulling sparse child-node parameters toward the parent's stable estimates to prevent overfitting
• The regularization strength λ is scaled relative to the available data size, so the model automatically leans on the parent's estimate when a context is sparse and on the local data when it is plentiful
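The shrinkage described above has a simple closed form: minimizing the squared error on a node's data plus an L2 penalty λ(θ_child − θ_parent)² yields a pseudo-count blend of the local rate and the parent's rate. A minimal sketch (function name and the `strength` hyperparameter are illustrative, not Lyft's implementation):

```python
def smoothed_rate(conversions: float, sessions: float,
                  parent_rate: float, strength: float = 20.0) -> float:
    """Shrink a child node's conversion rate toward its parent's.

    Closed-form minimizer of
        sum_i (y_i - theta)^2 + strength * (theta - parent_rate)^2,
    i.e. the Gaussian-prior / L2 smoothing the summary describes.
    `strength` acts as a pseudo-count: with few sessions the parent
    dominates; with many sessions the local data dominates.
    """
    return (conversions + strength * parent_rate) / (sessions + strength)
```

For example, a node with 1 conversion in 2 sessions and a parent rate of 0.10 is pulled to roughly 0.14 rather than the noisy empirical 0.50, while a node with thousands of sessions stays close to its own observed rate.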
This summary was automatically generated by AI based on the original article and may not be fully accurate.