TIL: Variance reduction for A/B tests

During our predictive modeling office hours at work, someone mentioned CUPED (pronounced "cue-ped") in response to a question about high variance in A/B tests with small group sizes (side note: sharing knowledge at work is highly encouraged). It's a fairly simple method for reducing variance. From the original authors:

“[CUPED] utilizes data from the pre-experiment period to reduce metric variability and hence achieve better sensitivity.”

Deng, Xu, Kohavi, and Walker

Marton (Bytepawn) breaks down the CUPED formula quite nicely:

Assume an A/B testing setup where we’re measuring a metric M, e.g. $ spend per user. We have N users, randomly split into A and B. A is control, B is treatment. We have metric M for each user for the “before” period, when treatment and control were the same, and the “after” period, when B had the treatment applied, which we hope increased their spend.

Let Yi be the ith user’s spend in the “after” period, and Xi be their spend in the “before” period, both for A and B combined. We compute an adjusted “after” spend Y′i.

The CUPED recipe:

1. Compute the covariance cov(X,Y) of X and Y.
2. Compute the variance var(X) of X.
3. Compute the mean μX of X.
4. Compute the adjusted Y′i = Yi − (Xi − μX) * (cov(X,Y)/var(X)) for each user.
5. Evaluate the A/B test using Y′ instead of Y.

Marton Trencseni, Bytepawn
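
Here's a minimal sketch of that recipe in Python with NumPy, run on simulated data matching the setup above. The simulation parameters, the strength of the before/after correlation, and the function name `cuped_adjust` are my own assumptions for illustration, not something from the post or the paper:

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated setup: N users with "before" spend X and "after" spend Y correlated
# with X. Users are randomly split into A (control) and B (treatment); B gets a
# small lift in the "after" period. All numbers here are made up for the demo.
N = 2000
x = rng.gamma(shape=2.0, scale=10.0, size=N)   # "before" spend per user
noise = rng.normal(0.0, 10.0, size=N)
is_b = rng.random(N) < 0.5                     # random A/B assignment
lift = np.where(is_b, 2.0, 0.0)                # treatment effect on B only
y = 0.8 * x + noise + lift                     # "after" spend per user


def cuped_adjust(y, x):
    """Return the CUPED-adjusted metric Y' = Y - (X - mean(X)) * theta,
    where theta = cov(X, Y) / var(X), following the recipe above."""
    theta = np.cov(x, y)[0, 1] / np.var(x, ddof=1)   # steps 1-2 (matching ddof)
    return y - (x - np.mean(x)) * theta              # steps 3-4


y_adj = cuped_adjust(y, x)   # step 5: evaluate the A/B test on Y' instead of Y
```

Because X comes from the pre-experiment period, it is independent of the treatment assignment, so subtracting the part of Y explained by X removes per-user noise without biasing the measured treatment effect.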

This is a great way to increase statistical power with smaller groups and/or timeframes.
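
Continuing the sketch above, comparing the unadjusted and adjusted metrics shows the sensitivity gain on the simulated data (SciPy's independent-samples t-test stands in here for however you normally evaluate the experiment; it's just an illustration, not prescribed by the post):

```python
from scipy.stats import ttest_ind

# Same A-vs-B comparison, before and after the CUPED adjustment.
for label, metric in [("Y (raw)", y), ("Y' (CUPED)", y_adj)]:
    a, b = metric[~is_b], metric[is_b]
    t_stat, p = ttest_ind(b, a)
    print(f"{label:12s} diff={b.mean() - a.mean():5.2f}  "
          f"var={metric.var():7.1f}  p={p:.4f}")
```

The difference in means stays roughly the same, while the variance of the adjusted metric drops, which is where the extra power comes from.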
