Jason Cohen recently authored a post on A/B testing that deserves both broader awareness and a deeper dive. Most of us in the online marketing world are aware of the power A/B tests can bring through improved click-through, sign-up and conversion rates. Getting a higher percentage of visitors to a page to take a desired action is powerful stuff.
The process by which we hypothesize, design, create and run tests, however, is fraught with peril. And one of the least obvious, but most insidious, potential pitfalls is actually what we choose to test.
Visualizing the "Local Minimum" Issue
It's definitely interesting and sometimes worthwhile to test individual elements of a landing page, but it's often not appropriate at the beginning of a landing page or product's life. As Conversion Rate Experts points out, the "let's throw stuff at a wall and see what sticks" approach typically has only a small impact. Researching the questions visitors have and answering them effectively can make a world of difference.
The problem is, it's so very tempting to be seduced by an easy answer.
The Tantalizing Tease of Testing Minutiae
It's likely that many of you have read case studies like the ones below:
- Changing Button Color from Red to Green = 72% Improvement from Dan McGrady
- 11% Conversion Rate Increase with a "Commitment Checkbox" from Conversion Voodoo
- How Adding "It's Free" Increased Conversion by 28% from Soocial
- Human Photos Double Conversion Rate from Carsonified
- The infamous Twitter Following Command Test from Dustin Curtis
In all of these, some simple change accounted for big increases in click-through or conversion rate, leading to widespread praise and sharing. The problem is - they're the exception, not the rule. In fact, that's precisely why they're newsworthy and get so many mentions. That's not to say you shouldn't read them or shouldn't take away value from the examples (you definitely should). It's just that the mentality of the small change can create a misleading mindset for marketers.
Very few websites have the experience of changing a button color, altering a headline or fiddling with some copy and seeing huge improvements in conversion rate. If you have good reason to believe you're an outlier, go for it - just be cautious. It's not only that small-scale changes tend to have a smaller positive impact; they also cost time and resources that you can't afford.
Some Simple, Compelling Math to Keep You Out of the Weeds
Let's say you're pretty good at conversion rate optimization - A/B and multivariate tests are relatively easy for you to perform and you've got solid instincts around them. And let's also say that you get reasonably decent traffic to your landing/test pages - in the several thousand range each day.
Even under these ideal conditions, massive problems emerge.
Knowing that each test takes a substantial amount of time to reach high confidence and that smaller tests (with less needle-moving potential) take MORE time is a pretty convincing reason to start out with the big ideas and big changes first. But it's not the only logic behind this. Let's say you find a page/concept you're relatively happy with and start testing the little things - optimizing around the local minimum. You might run tests for 4-6 months, eke out a 5% improvement in your overall conversion rate and feel pretty good.
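To put rough numbers on that, here's a minimal sample-size sketch using the standard two-proportion approximation at 95% confidence and 80% power. The 2% baseline conversion rate and 3,000 daily visitors are illustrative assumptions, not figures from any real test:

```python
# A rough sketch of the time cost of chasing small lifts, assuming a 2% baseline
# conversion rate and 3,000 visitors/day split across two variants (both numbers
# are illustrative). Standard two-proportion sample-size approximation.
from math import ceil
from statistics import NormalDist

def visitors_per_variant(baseline, relative_lift, alpha=0.05, power=0.80):
    """Approximate visitors needed in EACH variant to detect the lift."""
    p1, p2 = baseline, baseline * (1 + relative_lift)
    z_a = NormalDist().inv_cdf(1 - alpha / 2)   # two-sided significance
    z_b = NormalDist().inv_cdf(power)           # statistical power
    pooled = (p1 + p2) / 2
    top = (z_a * (2 * pooled * (1 - pooled)) ** 0.5
           + z_b * (p1 * (1 - p1) + p2 * (1 - p2)) ** 0.5) ** 2
    return ceil(top / (p2 - p1) ** 2)

daily_traffic = 3000
for lift in (0.05, 0.20, 0.50):
    n = visitors_per_variant(0.02, lift)
    days = ceil(2 * n / daily_traffic)
    print(f"{lift:.0%} lift: ~{n:,} visitors per variant, roughly {days} days")
```

On those assumptions, a 50% lift shows up in a few days, a 20% lift in a couple of weeks, and a 5% lift only after several months of traffic - which is exactly the opportunity cost problem.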
Until...
You run another big, new idea in a test and improve further. Now you know you've been wasting your time optimizing and perfecting a page whose overall concept isn't as good as the new, rough, unoptimized page you've just tested for the first time.
It's easy to see how you can get lost in this process and frustrated, too. That's why my recommendation (and the advice I get from lots of talented CRO folks) is to start with the big ideas and big projects, nail down the grand plans worth testing, let your audience pick a winner and then try to tweak, tune and improve.
What You Should Be Testing
What do I mean when I say "big ideas" and "overhauls?" Luckily, 37Signals provided a terrific example yesterday with their Basecamp Homepage Redesign:
They recorded a 14% improvement from new vs. old and can now decide whether they want to try another innovative concept or start optimizing the little things on this version. And while the numbers don't sound as compelling as a few of the bigger ones from the small tests, I'd argue they're going about things in exactly the right way. Perhaps a "little change" to the old version would have improved things quite substantially, but with this new version, they've got a higher base conversion rate and can benefit from every change that much more.
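To see why the higher base matters, here's a tiny bit of arithmetic. Only the 14% figure comes from the Basecamp test; the 2% starting rate and the 5% follow-up lift are assumptions of mine:

```python
# The same small follow-up win is worth more absolute conversions on a
# higher-converting page. Baseline and follow-up lift are assumed numbers.
old_base = 0.02               # assumed conversion rate of the old page
new_base = old_base * 1.14    # 14% better, per the Basecamp test
follow_up_lift = 0.05         # a later "little change" win of 5%

print(f"old page: +{old_base * follow_up_lift:.3%} points per small win")  # +0.100%
print(f"new page: +{new_base * follow_up_lift:.3%} points per small win")  # +0.114%
```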
Another great example is the case study Conversion Rate Experts did for SEOmoz itself. That test gave us a 52% improvement in conversion rate from the PRO landing page. As an addendum, in April of this year, we tested an in-house created, shorter, less story-like landing page that we all hoped would beat out the old long-form version. After a few weeks of testing, it lost out. Later this summer, we'll be trying something completely different in an attempt to beat our current best.
The process to follow for conversion rate optimization and testing was well described in Stephen Pavlovich's post - The Definitive How-to Guide for CRO. His PDF guide, in particular, made this dead easy:
Follow those steps, don't get lost in the minutiae, and you'll be on your way to exceptional results - no thousand monkeys with typewriters required.
p.s. I'd also suggest checking out this long but worthwhile post on stats for usability (and A/B) tests.
I get this all the time... "Let's test the button color or the size of the text or the font type..." To be honest, I believe that people who take this approach think of visitors as little monkeys that react to different colors or sizes.
People who visit our website are human beings, and therefore complex. Yes, changing the color of a button might change the way a page communicates, but the truth is that how and what it communicates is what has the impact on users, not the button colors.
Therefore, I totally agree with your approach of focusing on big ideas. From the way people react to test versions, you also get to learn so much about them.
"how and what it communicates has the impact on the users, not the button colors."
The only way you would know that is by testing the colours, i.e. testing the communication medium.
The whole point of testing is to simplify people's complex reactions down to the level of monkeys or robots.
That's why it's a quantitative and not a qualitative endeavour, and that's why we measure for significance.
Any post about CRO is very welcome... and this one is somehow a "best CRO advice repository" because of all the links you provide (I ask you to feel 'guilty' for my loss of productivity in the next few hours).
A post to be bookmarked surely.
As for whether it's better to prioritize small-scale or huge A/B tests, I think of it the way a shop approaches changes to its product displays. With the premise that it's always good to maintain a consistent look & feel so as not to "shock" your returning visitors, it would be good to refresh the image and visual architecture of a website every 12-18 months to improve conversion rates, and to prioritize the big changes.
I'm curious to read the comments from our CRO expert Mozzers (Dr. Pete, this is especially for you).
Any post about CRO is very welcome
Hear, hear! I echo Gianluca's sentiment. I can't get enough of good CRO posts.
CRO is my favorite part of internet marketing.
Good advice - I think the most pertinent phrase was "they've got a higher base conversion rate".
Changing the little things is always good if you have the time and resources available, but those changes should only come once you've nailed down the bigger areas.
Common sense, but so often overlooked.
As well as keeping out of the weeds, I guess there's a certain 'grass is always greener' mentality. Of course, it's natural to want to improve your conversion rates, but when there are often only minimal improvements available, I wonder if it can be counter-productive.
For instance, if you spend a week testing a completely new landing page design and get a 4% reduction in your conversion rates, then you try a new design again and get a 6% increase, presumably you're going to be tempted to go third time lucky and try yet another design. And another and another...
At some point, your regular visitors are going to get frustrated and confused with all the changes to your site. Every time they hit your front page they see something different. Eventually they don't bother to come back to your site. So maybe your conversion rate is improving but your actual visitor volume is dropping. Just a thought.
I agree with you. Anyway, aren't good designers supposed to know what is more attractive to the consumer (like the green button versus the red one)?
A better design can convert more, and that seems logical to me, but as you say, if it's done too often it's not better.
The Basecamp example is interesting:
"They recorded a 14% improvement from new vs. old"
The thing is - this improvement is in *click-throughs* from the homepage to the plans & pricing page, not *conversions* to the "thank you" page.
Basecamp should really run a follow-up test to see if this improvement is reflected in the conversion rate as well.
After all, there's no point increasing click-throughs if it doesn't also increase conversion. And it's even possible that a page that increases the CTR actually lowers the CR.
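To illustrate that last point with made-up downstream numbers (only the 14% CTR lift comes from the actual test):

```python
# Hypothetical funnel math: homepage CTR up 14%, but if the extra clickers
# convert worse on the pricing page, end-to-end conversion can still drop.
def overall_cr(homepage_ctr, pricing_page_cr):
    return homepage_ctr * pricing_page_cr

old = overall_cr(0.20, 0.10)          # assumed: 20% CTR x 10% pricing-page CR
new = overall_cr(0.20 * 1.14, 0.08)   # CTR +14%, weaker downstream conversion
print(f"old: {old:.2%} overall, new: {new:.2%} overall")  # old: 2.00%, new: 1.82%
```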
CRO is one of my favorite topics right now. That's one of the reasons we started doing web design and development - to get more control of the marketing and conversions. The best producer of ROI for an SEO campaign is increased leads and sales, and it is hard to get (what people consider) good SEO results without increasing ROI. SEO and conversion rates are getting increasingly connected and it's hard to offer one without the other.
Thanks for the visuals on where to expend energy. I love the Basecamp example, and I have read similar stories in Website Magazine. This reminds me of the post (https://www.seomoz.org/blog/design-trends-the-single-purpose-homepage) about single-focus home pages. There is definitely a trend in that direction, and Basecamp is another example of knowing exactly what action you want people to take and creating the entire page around that single focus.
I think there is a lot of branding that a company has to do before a single focus homepage will be effective, because if you don't have brand recognition, you have to use the valuable home page real estate to quickly and effectively educate your visitors as to who you are and what you do. That's slightly off the subject, but I thought it was worth mentioning :)
Thanks for the shout-out. Completely agree with your methodology, and agreed that everyone needs better knowledge about what "testing" means.
Interesting to see the Basecamp example. The new version just looks so much clearer and "uncluttered".
I always think less is more and try to ask myself "How much can I get away with NOT having on the page".
Cheers!
Jon
I think it depends. I tend to like less is more too. But if people aren't converting because they want MORE information, then maybe less is less. Though the Basecamp change is more aesthetically appealing (to me), it also appears to better illustrate their value. I think your point is well taken, though: if clutter hides your value, then clean it up.
Exactly - there's always a fine line between overpowering people with too much and not giving them enough...
Logic is the driving force behind changes. It's good to see the concept of a broad change that can have a greater overall impact. Then, once the big change is made, little tweaks can be done. Thanks...
Great article!
We've recently A/B tested a totally new landing page at my startup and got a 50% increase in registrations.
The new landing page basically added a signup form on the right side of the page, vs. the old landing page, which had the classic "register now" button that took the user to the pricing page and THEN to the form.
We still have to check if the new registered users are as likely to become paying users as before, so it's still a work in progress.
Thanks!
Yeah, I totally agree with the premise of this article. When I'm reporting on A/B testing case studies for WhichTestWon.com, I'm often drawn in by the hunt for those instances where a small change made a significant (10% or more) impact on conversions ONLY because our mission is to evangelize testing... so if we can show people that a small change made a big impact, mission accomplished. But you're right that most of the time it's the big overhaul that gets you the highest increase in conversions. I've seen it time and again.
Natalie Myers
Senior Reporter
WhichTestWon.com
Nice one - we have spent quite a while on this as well with our upcoming mobile app. We also realized that mobile A/B testing is a pain, so we are going to launch a new product around that!
Hm, I have to agree with "furstconversion" there - the cost/benefit calculation is what one should be looking at, and "statistical power" and "significance level" determination immediately come to mind. Neither is mentioned even once above.
Furthermore, the stats backing the graphs are dubious and uninformative, if not misleading. "Reaching statistical significance" is NOT a valid way to determine when to stop a test ("statistically significant confidence" makes no sense statistically, so I presume this is what you mean).
Having such a stopping rule exposes you to errors much greater than the ones your achieved significance level suggests. Furthermore, you say nothing about effect size confidence intervals, which are the data one should look at when determining the success of a test. I bet those would look quite horrible if you stopped your tests the first time you reached significance (specifying your level would have been nice).
For more on this -
https://blog.analytics-toolkit.com/2014/why-every-internet-marketer-should-be-a-statistician/
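To make the effect-size point concrete, here is a minimal sketch with invented counts - a simple 95% Wald interval on the difference between two conversion rates:

```python
# A 95% Wald interval on the difference between two conversion rates.
# The visitor and conversion counts below are invented for illustration.
from statistics import NormalDist

def diff_conf_interval(conv_a, n_a, conv_b, n_b, confidence=0.95):
    p_a, p_b = conv_a / n_a, conv_b / n_b
    se = (p_a * (1 - p_a) / n_a + p_b * (1 - p_b) / n_b) ** 0.5
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    diff = p_b - p_a
    return diff - z * se, diff + z * se

low, high = diff_conf_interval(conv_a=200, n_a=10_000, conv_b=230, n_b=10_000)
print(f"estimated lift of B over A: between {low:+.2%} and {high:+.2%}")
# A wide interval straddling zero is exactly the kind of inconclusive result
# you get when a test is stopped the first time it "reaches significance".
```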
Hi Rand, another great, useful post. In the upper area you wrote: "Changing Button Color from Red to Green = 72% Improvement from Dan McGrady" --> actually it must be "Changing Button from Green to Red" ;-)
Thank you, Petra
Great post. This is an area where I've made a lot of mistakes and learned these lessons the hard way. I recently shared some of my experiences in a 5 minute talk at the Boulder New Tech Meetup:
https://www.onlineaspect.com/2010/05/09/ab-testing/
Great post..!!
This post couldn't have come at a better time. I'm actually right in the middle of creating our test page for our upcoming A/B test. This helped a lot to produce ideas. Thanks!
Really liked your "Opportunity Cost" graphic Rand. It said it all in a nutshell.
My biggest takeaway from this is to really "count the cost" of doing A/B testing to ensure it will have a large enough payback for the dollars spent by the client.
It becomes even more critical when dealing with low volume websites that have to wait double, triple and even quadruple the time to be able to obtain statistically valid volumes.
For low volume websites it doesn't seem worthwhile to do this kind of testing. You could have a site up for two months collecting enough data only to find out you'd taken a 20% drop in conversions.
To be sure, freshfish, you need to be extremely circumspect before embarking upon an A/B test for a small site. You're right about having to wait a while for results.
Yet, at the same time, a designer can't know what users want from a site nearly as well as the users can tell you themselves.
That graph oversimplifies it.
For some sites the opportunity cost to create a 1% increase is minimal, and that 1% is a massive gain (think what Amazon would do with a 1% increase).
The way we think about design today is wrong.
Accept that everything you design needs to be tested and you can save a lot of time and headaches by doing it upfront in the initial design stage.
If I were going to an agency for a website today, I wouldn't want to know what one site they'd build me, but what sites they'd build me.
Having a designer who is a front-end developer saves you time and money because they design in a way that's easy to build. This will soon be the same for designers who design so it's easy to test.
By your logic, 37signals' big redesign was also the exception to the rule. If ordinary people embark on a major redesign, most likely they won't see large positive changes.
Simply saying success stories shouldn't be followed because they are the exception fails to consider the data and results these stories actually contain.
Even though it's 'hard'... math and visitor behavior always answer these questions for us.
Great post.
We are always told to TEST TEST TEST. And the reality is as you have stated above.
This is especially true for an organization that deals with many small clients (vs. just several large ones), such as ours. Our efforts, time and limited resources are better placed in other areas.
I am not saying not to test, but don’t split hairs over the minute details. Unless those minute details can actually correlate to BIG $$$.
Totally agree with you if you're using some manual testing tool (like GWO) where you have to keep manually changing code and updating your website by hand (that time could be put to much better use).
But with automated tools out there (such as the Conversion Chicken) that run and improve the site by themselves and don't require any maintenance, there is little point in not testing, as they don't require any time investment and may lead to much better results.
Of course it's great to do massive sweeping design changes (like 37signals), but after that big design change is done, why not test all the little things to squeeze out as many sales as possible?