How to A/B Test Cold Emails
Quick Answer
To A/B test cold emails, isolate one variable at a time (subject line, opener, CTA, or offer), randomly split your list into equal-sized segments of at least 100–200 recipients per variant, send simultaneously to eliminate time bias, and measure against a single primary metric—typically reply rate or positive reply rate, not open rate. Run tests for at least 5–7 business days before declaring a winner, then fold the winning variant into your control and iterate.
Why Most Cold Email A/B Tests Are Garbage (And How to Fix That)
The most common mistakes practitioners make are testing too many variables at once, using sample sizes that are too small, and optimizing for open rate—a metric that [Apple MPP has largely broken](https://support.apple.com/en-us/HT212115) since 2021. If you're seeing inflated open rates across your sequences, you're likely measuring Apple's proxy pings, not human intent.
Here's what a bad test looks like: you change the subject line, the first line, and the CTA in the same variant, send 40 emails per group, and call the one with more opens the winner. Every part of that is wrong.
A valid cold email A/B test has four properties:

1. **One variable changed** — subject line OR opener OR CTA, never combinations unless you're doing multivariate testing with 1,000+ recipients per variant.
2. **Sufficient sample size** — a minimum of 100 recipients per variant to get directional signal; 250+ before statistical significance at 95% confidence is realistic, and more if the lift you're chasing is small (see Step 3 below).
3. **Simultaneous send** — sending Variant A on Monday and Variant B on Thursday introduces day-of-week bias. Tools like [Instantly](https://instantly.ai) and [Smartlead](https://smartlead.ai) let you split within the same campaign launch window.
4. **Primary metric is reply rate or positive reply rate** — not opens. Booked meetings is ideal but requires longer test windows.
The practitioner framing: treat each test as a hypothesis. "We believe prospects in [ICP segment] respond better to pain-led openers than outcome-led openers. We'll know this is true when reply rate improves by 15%+ with >95% confidence."
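One lightweight way to enforce that discipline is to write the plan down as structured data instead of prose. Below is a minimal sketch in Python; every field and value is an illustrative placeholder, not a required schema.

```python
# A hypothetical test-plan record; all values are illustrative placeholders.
test_plan = {
    "segment": "Series B SaaS, RevOps leaders",   # the ICP under test
    "variable": "opener",                          # the ONE variable being changed
    "control": "outcome-led opener",
    "challenger": "pain-led opener",
    "primary_metric": "reply_rate",
    "min_relative_lift": 0.15,                     # success threshold from the hypothesis
    "confidence_required": 0.95,
    "n_per_variant": 500,
    "send_window": "same 30-minute launch window",
}
```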
Test one variable at a time, use reply rate as your primary metric, and send variants simultaneously with 100+ recipients per group.
What to Actually Test (Prioritized by Impact)
Not all variables move the needle equally. Here's a prioritized stack ranked by typical lift in reply rates:
**Tier 1 — High Impact**

- **Subject line** — length (short vs. descriptive), curiosity vs. clarity, personalized vs. generic (e.g., `{{first_name}}, quick question` vs. `Cutting CAC in SaaS`). Subject lines can shift open rates 20–50%, which still matters for deliverability read-through even if open rate is a noisy metric.
- **First line / opener** — this is the real reply driver. Test: compliment-based openers vs. trigger-event openers ("Saw you just raised a Series B...") vs. direct pain statements. [Research from Woodpecker](https://woodpecker.co/blog/cold-email-ab-testing/) shows personalized first lines consistently outperform generic ones by 30–50% on reply rate.
- **Call to action (CTA)** — soft ask ("Worth a 15-min chat?") vs. direct ask ("Are you free Thursday at 2pm?") vs. no-ask (value-first with implicit next step). The soft CTA typically outperforms in early funnel; the direct CTA can work better with warm or re-engaged lists.
**Tier 2 — Medium Impact**

- **Offer framing** — ROI-led vs. pain-led vs. social proof-led
- **Email length** — 3 sentences vs. 8 sentences vs. full-value-prop paragraph
- **Sending time/day** — Tuesday–Thursday morning vs. evening sends (lower priority given async reading behavior)
- **Signature** — with vs. without headshot/title/links (can affect deliverability)
**Tier 3 — Lower Priority Unless You've Exhausted Tier 1**

- Follow-up sequence length and spacing
- Plaintext vs. light HTML formatting
- Personalization tokens (company name vs. none)
Use a tool like [Clay](https://clay.com) to generate personalized variants at scale using AI or enrichment data, then route different personalization patterns into separate Instantly or Smartlead campaigns as your A and B.
Prioritize testing subject lines, openers, and CTAs first—these drive 80% of measurable lift before touching sequence structure or formatting.
Step-by-Step: Running a Statistically Valid Test
Here's the exact workflow practitioners use inside tools like Instantly, Smartlead, Apollo, or Outreach:
**Step 1: Define your hypothesis and success metric** Write it down: "Changing the CTA from 'open to a call?' to 'free Thursday at 3pm?' will increase positive reply rate by ≥15%." This forces clarity and prevents post-hoc rationalization of results.
**Step 2: Segment your list cleanly** Use random assignment, not manual splitting. In Instantly, use the built-in A/B split feature. In Smartlead, use variant sequences. In Apollo, create two separate sequences with randomized prospect upload. The segments should be identical in ICP characteristics—same industry, same persona, same list source. Don't test your Tier 1 list against your Tier 3 list.
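If you're in the manual-split camp (the Apollo approach above), a small script removes the temptation to hand-pick. Here is a minimal sketch assuming your prospect export is a CSV; the file names and columns are illustrative.

```python
# Minimal sketch: randomly split an exported prospect list into two equal
# variant files before uploading to your sending tool.
# "prospects.csv" and its columns are illustrative.
import csv
import random

random.seed(42)  # fixed seed so the split is reproducible

with open("prospects.csv", newline="") as f:
    prospects = list(csv.DictReader(f))

random.shuffle(prospects)            # random assignment, not manual picking
midpoint = len(prospects) // 2
splits = {"variant_a.csv": prospects[:midpoint],
          "variant_b.csv": prospects[midpoint:]}

for path, rows in splits.items():
    with open(path, "w", newline="") as out:
        writer = csv.DictWriter(out, fieldnames=prospects[0].keys())
        writer.writeheader()
        writer.writerows(rows)
```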
**Step 3: Set sample size before sending** Use a [sample size calculator](https://www.evanmiller.org/ab-testing/sample-size.html) — input your current baseline reply rate (say, 4%), your minimum detectable effect (15% relative lift = 4.6%), and your desired confidence level (80% for directional, 95% for conclusive). This gives you a required n per variant. Fair warning: small lifts on low baselines demand large samples. Detecting a 15% relative lift on a 4% baseline takes thousands of sends per variant, so in practice most teams test for bigger swings or accept directional confidence to keep each variant in the 200–500 range.
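If you prefer code to the calculator, the math behind it is the standard two-proportion sample-size formula. A minimal sketch using scipy; the baseline and lift values are illustrative.

```python
# Minimal sketch of the standard two-proportion sample-size formula
# (the same math an online sample-size calculator applies). Inputs are illustrative.
from scipy.stats import norm

def n_per_variant(baseline, relative_lift, alpha=0.05, power=0.80):
    p1 = baseline
    p2 = baseline * (1 + relative_lift)
    p_bar = (p1 + p2) / 2
    z_alpha = norm.ppf(1 - alpha / 2)   # significance threshold (two-sided)
    z_beta = norm.ppf(power)            # desired statistical power
    num = (z_alpha * (2 * p_bar * (1 - p_bar)) ** 0.5
           + z_beta * (p1 * (1 - p1) + p2 * (1 - p2)) ** 0.5) ** 2
    return int(num / (p2 - p1) ** 2) + 1

for lift in (0.15, 0.50, 1.00):
    print(f"{lift:.0%} relative lift on a 4% baseline: "
          f"{n_per_variant(0.04, lift)} emails per variant")
```

Note how sharply the requirement falls as the detectable effect grows; that is why testing for bold swings is usually more practical than chasing small lifts.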
**Step 4: Control for confounding variables**

- Same sending domain (or at minimum, similar domain age/warmup status)
- Same time window — launch both variants within the same 30-minute window
- Same follow-up sequence logic after the first email
- Verify deliverability with [GlockApps](https://glockapps.com) or [MailReach](https://mailreach.co) before launching
**Step 5: Wait for statistical significance** Don't peek and declare a winner after collecting 20% of your planned sample. Most cold email tests need 5–7 business days to collect replies because prospects respond asynchronously. Use a chi-squared test or a Bayesian calculator—[AB Testguide](https://abtestguide.com/calc/) works well for reply rate comparisons.
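To run the significance check yourself instead of pasting into a calculator, a chi-squared test on the raw reply counts is enough. A minimal sketch using scipy; the counts are illustrative, not real campaign data.

```python
# Minimal sketch: chi-squared test on reply counts exported from your
# sending tool. The counts below are illustrative.
from scipy.stats import chi2_contingency

# rows = variants, columns = [replied, did not reply]
observed = [[18, 482],   # Variant A: 18 replies out of 500 sends
            [31, 469]]   # Variant B: 31 replies out of 500 sends

chi2, p_value, dof, expected = chi2_contingency(observed)
print(f"p-value: {p_value:.3f}")
if p_value < 0.05:
    print("Difference is significant at 95% confidence")
else:
    print("Not significant yet; keep collecting data")
```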
**Step 6: Document and iterate** Store results in a shared testing log (Notion, Airtable, or even a Google Sheet). Track: hypothesis, variant copy, sample size, reply rates per variant, confidence level, winner, and next test hypothesis. This becomes your team's compound learning asset.
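If your log lives in a CSV rather than Notion or Airtable, a small helper keeps every entry in the same shape. A minimal sketch; the field names and sample row are illustrative.

```python
# Minimal sketch: append each completed test to a shared CSV log.
# Field names mirror the tracking list above; the sample row is illustrative.
import csv
import os
from datetime import date

FIELDS = ["date", "hypothesis", "variant_a", "variant_b", "n_per_variant",
          "reply_rate_a", "reply_rate_b", "confidence", "winner", "next_hypothesis"]

def log_test(path, result):
    is_new = not os.path.exists(path) or os.path.getsize(path) == 0
    with open(path, "a", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        if is_new:
            writer.writeheader()   # write the header only for a new file
        writer.writerow(result)

log_test("ab_test_log.csv", {
    "date": date.today().isoformat(),
    "hypothesis": "Soft CTA beats direct CTA for RevOps leaders",
    "variant_a": "Worth a 15-min chat?",
    "variant_b": "Are you free Thursday at 2pm?",
    "n_per_variant": 500,
    "reply_rate_a": 0.036,
    "reply_rate_b": 0.062,
    "confidence": 0.92,
    "winner": "inconclusive",
    "next_hypothesis": "Re-run soft vs. direct CTA with a larger sample",
})
```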
Pre-define your success metric and required sample size before sending—post-hoc analysis and premature winner declarations are the most common reasons tests mislead.
Tooling: Native A/B Features vs. Manual Splits
Different tools handle A/B testing with different levels of sophistication. Here's what practitioners actually use:
**Natively Supported A/B Testing**

- **[Instantly](https://instantly.ai)** — has built-in variant testing at the email step level. You can create multiple variants of a step and Instantly will automatically split sends. Best for high-volume senders (500+ emails/day).
- **[Smartlead](https://smartlead.ai)** — supports variant sequences with percentage-based splits. More flexible for testing entire sequence flows vs. individual steps.
- **[Outreach](https://outreach.io)** — has an A/B testing module for sequences (enterprise tier). Tracks reply rates, meeting booked rates, and sentiment scoring per variant.
- **[Salesloft](https://salesloft.com)** — similar enterprise-grade testing inside Cadences with reporting dashboards.
**Manual Split Approaches (Apollo, Lemlist, etc.)** In [Apollo](https://apollo.io), you create two separate sequences and manually assign prospects randomly using CSV upload or contact filters. It works but requires discipline to ensure random assignment and time-matched sends.
**Clay + Sending Tool Combo** For teams doing deep personalization testing, [Clay](https://clay.com) lets you build enrichment-driven personalization columns (different first lines based on job title, company size, tech stack) and pipe different cohorts into separate sending sequences. This is how growth-focused GTM teams run segmented tests without manual copywriting for each variant.
**Analytics Layer** None of the above tools have great built-in statistical significance calculators. Export your data to a chi-squared calculator or build a simple Airtable/Google Sheets dashboard that auto-calculates significance. For teams at scale, [Hex](https://hex.tech) or Looker dashboards pulling from sending tool APIs give live test monitoring.
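If you'd rather script that layer than maintain spreadsheet formulas, here is a minimal sketch that reports each variant's reply rate with a 95% Wilson confidence interval; it assumes the statsmodels package, and the counts are illustrative.

```python
# Minimal sketch for an analytics layer: reply rate with a 95% Wilson
# confidence interval per variant. Counts are illustrative.
from statsmodels.stats.proportion import proportion_confint

variants = {"A": (18, 500), "B": (31, 500)}   # (replies, emails sent)

for name, (replies, sent) in variants.items():
    low, high = proportion_confint(replies, sent, alpha=0.05, method="wilson")
    print(f"Variant {name}: {replies / sent:.1%} reply rate "
          f"(95% CI {low:.1%} to {high:.1%})")
```

Heavily overlapping intervals are a signal to keep collecting data rather than declare a winner.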
Use native A/B features in Instantly or Smartlead for volume testing; layer Clay for personalization variable testing; always add an external significance calculator to your workflow.
Reading Results: What Winning Actually Looks Like
A 0.5-percentage-point lift in reply rate sounds small, but it means 5 additional conversations per 1,000 emails—which at typical pipeline conversion rates could be $50K–$500K in pipeline depending on ACV.
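Here is the arithmetic behind that claim as a quick sketch; the conversion rates and ACV are assumptions for illustration, not benchmarks.

```python
# The arithmetic behind the claim above. Conversion rates and ACV are
# assumptions for illustration, not benchmarks.
emails_sent = 1_000
absolute_lift = 0.005          # +0.5 percentage points of reply rate
reply_to_meeting = 0.5         # assumed: half of extra replies become meetings
meeting_to_opportunity = 0.4   # assumed: opportunity creation rate from meetings
acv = 50_000                   # assumed annual contract value

extra_replies = emails_sent * absolute_lift   # 5 extra conversations
extra_pipeline = extra_replies * reply_to_meeting * meeting_to_opportunity * acv
print(f"{extra_replies:.0f} extra conversations, roughly ${extra_pipeline:,.0f} in pipeline")
```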
**The metrics hierarchy for cold email tests:**

1. **Positive reply rate** — replies that express interest or ask for more info. This is your North Star.
2. **Total reply rate** — includes negative replies ("not interested") and OOOs. Useful signal but can be gamed by provocative subject lines that drive angry replies.
3. **Meeting booked rate** — the truest signal, but requires 2–4 weeks of test window and larger sample sizes. Best for mature testing programs.
4. **Open rate** — use only as a proxy for subject line performance, and only if you're not seeing MPP inflation. If >60% of your list shows opens, your data is corrupted.
**Red flags that your test is misleading you:**

- One segment had significantly better domain health or deliverability scores
- The winner has <80% statistical confidence
- Sample sizes are unequal by more than 10%
- Test ran over a holiday period or during a major industry event
- You changed the subject line AND the first line and are calling it a "subject line test"
**What to do with a winner:** Promote it to your control (the new default). Don't just run it—document *why* you think it won (the mechanism) and use that insight to generate the next hypothesis. The compounding value of testing comes from building a causal model of your ICP's psychology, not just collecting winning variants.
Optimize for positive reply rate, not total reply rate or open rate—and always validate that deliverability, sample parity, and test duration are clean before declaring a winner.
Frequently Asked Questions
How many emails do I need to send for a statistically valid cold email A/B test?
Should I test subject lines or email body copy first?
Can I A/B test cold email follow-ups, not just the first email?
How long should I run a cold email A/B test?
Does Apple Mail Privacy Protection (MPP) make open-rate testing useless?
Can I use AI to generate A/B test variants at scale?
What's the difference between A/B testing and multivariate testing for cold email?
Sources
- Woodpecker Cold Email A/B Testing Guide — Cited for data showing personalized first lines outperform generic openers by 30–50% on reply rate in cold email A/B tests.
- Apple Mail Privacy Protection Overview — Cited to explain why open rate is an unreliable primary metric for cold email A/B tests due to MPP pre-loading email content.
- Evan Miller Sample Size Calculator for A/B Tests — Referenced as a practical tool for calculating required sample size per variant based on baseline reply rate and minimum detectable effect.
- AB Testguide Statistical Significance Calculator — Recommended as a chi-squared / Bayesian calculator for evaluating statistical significance of reply rate differences between cold email variants.
- GlockApps Email Deliverability Testing — Cited as a pre-send deliverability audit tool to control for inbox placement as a confounding variable in cold email A/B tests.