Question 1

What does statistical significance mean in an A/B test?

Accepted Answer

Statistical significance means the difference between your variants is unlikely to be explained by random chance alone. At a 95% confidence level (a significance threshold of 0.05), a significant result means that if there were truly no difference, you would see data at least this extreme less than 5% of the time. It does not guarantee the effect is large or important, and it is not a 95% probability that the effect is real; it only says that chance alone is an unlikely explanation.

Question 2

What is a p-value?

Accepted Answer

The p-value is the probability of observing a difference at least as extreme as the one in your data, assuming the two variants actually convert at the same rate. A p-value of 0.03 means that, if there were no true difference, random noise alone would produce a gap at least this large about 3% of the time. It is not the probability that your result is a fluke. If the p-value is below your significance threshold (0.05 at a 95% confidence level), the result is called significant.
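To make the definition concrete, here is a minimal sketch of how a p-value can be computed from raw counts. It assumes a pooled two-proportion z-test, which is a common choice for conversion data but not necessarily the method this calculator uses. The function name and the visitor and conversion counts are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_p_value(conv_a, n_a, conv_b, n_b):
    """Two-tailed p-value for a pooled two-proportion z-test (illustrative)."""
    rate_a = conv_a / n_a
    rate_b = conv_b / n_b
    # Under the null hypothesis both variants share one conversion rate,
    # so the standard error is built from the pooled rate.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (rate_b - rate_a) / se
    # Probability of a gap at least this large in either direction.
    return 2 * (1 - NormalDist().cdf(abs(z)))


# Hypothetical data: 500/10,000 conversions for A, 570/10,000 for B.
p = two_proportion_p_value(500, 10_000, 570, 10_000)
print(f"p-value: {p:.4f}")
print("significant at 0.05" if p < 0.05 else "not significant at 0.05")
```

With identical conversion rates in both variants the z-statistic is 0 and the function returns 1.0, the least surprising result possible under the no-difference assumption.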

Question 3

Should I use a one-tailed or two-tailed test?

Accepted Answer

A two-tailed test checks whether B differs from A in either direction, while a one-tailed test checks only one direction. Two-tailed is the safer default because B can plausibly perform worse. When the observed effect lies in the predicted direction, a one-tailed test halves the p-value, making it easier to declare a winner prematurely. Use one-tailed only when you genuinely don't care about detecting a negative effect, and choose the direction before looking at the data.
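The halving effect can be seen directly from a z-statistic. The sketch below assumes a z-test and a one-tailed hypothesis of "B is better than A"; the function name and the z values are hypothetical and chosen only for illustration.

```python
from statistics import NormalDist


def tail_p_values(z):
    """Return (one_tailed, two_tailed) p-values for a z-statistic.

    The one-tailed value tests only the direction "B beats A" (z > 0).
    """
    normal = NormalDist()
    one_tailed = 1 - normal.cdf(z)
    two_tailed = 2 * (1 - normal.cdf(abs(z)))
    return one_tailed, two_tailed


# Hypothetical z-statistic where B looks somewhat better than A.
one, two = tail_p_values(1.8)
print(f"z = 1.8:  one-tailed {one:.4f}, two-tailed {two:.4f}")

# Same size of gap, but B is worse: the one-tailed test cannot flag it.
neg_one, neg_two = tail_p_values(-1.8)
print(f"z = -1.8: one-tailed {neg_one:.4f}, two-tailed {neg_two:.4f}")
```

At z = 1.8 the one-tailed p-value falls below 0.05 while the two-tailed one does not, which is exactly how a one-tailed test can turn a borderline result into a declared winner.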

Question 4

How much traffic do I need for a reliable A/B test?

Accepted Answer

It depends on your baseline conversion rate and the smallest uplift you want to detect; smaller effects need much more traffic. As a rough rule, detecting a 20% relative improvement on a 5% conversion rate (5% to 6%) requires around 8,000 to 10,000 visitors per variant at 95% confidence and 80% power. As a common rule of thumb, each variant should also have at least 5 expected conversions and at least 5 expected non-conversions.
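The rough rule above can be checked with the standard normal-approximation formula for comparing two proportions. This is a sketch under stated assumptions (two-tailed test, equal traffic per variant, unpooled variance); other formulas and calculators give somewhat different numbers. The function name is hypothetical.

```python
from math import ceil
from statistics import NormalDist


def sample_size_per_variant(baseline, relative_uplift, alpha=0.05, power=0.80):
    """Approximate visitors needed per variant for a two-tailed test."""
    p1 = baseline
    p2 = baseline * (1 + relative_uplift)
    normal = NormalDist()
    z_alpha = normal.inv_cdf(1 - alpha / 2)  # about 1.96 for 95% confidence
    z_beta = normal.inv_cdf(power)           # about 0.84 for 80% power
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return ceil((z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2)


# 5% baseline, 20% relative uplift (5% -> 6%).
n_20 = sample_size_per_variant(0.05, 0.20)
# Halving the detectable uplift to 10% (5% -> 5.5%).
n_10 = sample_size_per_variant(0.05, 0.10)
print(f"20% uplift: {n_20} visitors per variant")
print(f"10% uplift: {n_10} visitors per variant")
```

Under these assumptions the 20% case lands near the low end of the 8,000 to 10,000 range, and halving the detectable uplift roughly quadruples the required traffic.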

Question 5

Why shouldn't I stop my A/B test as soon as it shows significance?

Accepted Answer

Checking results repeatedly and stopping the moment p dips below 0.05, known as peeking, dramatically inflates your false-positive rate, because random fluctuations will cross the threshold temporarily even when there is no real effect. Decide your sample size or test duration in advance and evaluate significance only at the end.
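A small simulation shows the size of the problem. The sketch below runs A/A tests, where both variants share the same true conversion rate so every "significant" result is a false positive, and compares peeking after every batch of traffic with evaluating once at the end. All parameters (number of tests, looks, batch size, conversion rate, seed) are hypothetical, and the pooled z-test is an assumed method.

```python
import random
from math import sqrt
from statistics import NormalDist


def p_value(conv_a, n_a, conv_b, n_b):
    """Two-tailed p-value from a pooled two-proportion z-test."""
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        return 1.0  # no conversions (or all conversions): no evidence
    z = (conv_b / n_b - conv_a / n_a) / se
    return 2 * (1 - NormalDist().cdf(abs(z)))


def simulate_aa_tests(n_tests=1000, looks=10, visitors_per_look=200,
                      rate=0.20, alpha=0.05, seed=0):
    """Return (false-positive rate with peeking, rate at final look only)."""
    rng = random.Random(seed)
    peek_hits = 0
    final_hits = 0
    for _test in range(n_tests):
        conv_a = conv_b = n = 0
        crossed = False
        for _look in range(looks):
            # Both arms draw from the same true rate: there is no real effect.
            conv_a += sum(rng.random() < rate for _ in range(visitors_per_look))
            conv_b += sum(rng.random() < rate for _ in range(visitors_per_look))
            n += visitors_per_look
            p = p_value(conv_a, n, conv_b, n)
            if p < alpha:
                crossed = True  # a peeker would have stopped here
        peek_hits += crossed
        final_hits += p < alpha  # p from the last look only
    return peek_hits / n_tests, final_hits / n_tests


peek_rate, final_rate = simulate_aa_tests()
print(f"false positives when peeking at every look: {peek_rate:.1%}")
print(f"false positives when testing once at the end: {final_rate:.1%}")
```

The loop deliberately keeps running after the threshold is crossed so that the same simulated data yields both numbers. The end-only rate should sit near the nominal 5%, while the peeking rate comes out several times higher.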

A/B Test Significance Calculator

