How, When, and Why to Benchmark Your UX Research Findings

Posted on January 5, 2015
5 min read


A fellow UX professional recently came to me with a fun story: Her boss came to her desk and told her, “I want us to add benchmarking to our testing strategy.” When she asked him why, he told her that he’d recently heard that a competitor was doing it, and he wanted to keep up with the times.

Benchmarking is a hot topic in the UX industry, and if it’s done properly, it can provide a lot of spectacular information about how your site or app is performing over time!

Unfortunately, it’s often perceived as “just another” way to test the usability of a product. The truth is, benchmarking is a commitment---and it requires all the thought and reflection that any good commitment entails. Stick to the guidelines below, and you’ll gain a rich collection of data that showcases your team’s achievements and pitfalls across the life of your site or app.

So what is benchmarking, anyway?

Benchmarking is the process of testing a site or app’s progress over time.

This can be progress through different iterations of a prototype.

This can be progress across different versions of an application.

This can even be progress across different sites: yours and your competitors'.

What’s the benefit of benchmarking?

A lot of companies like to run benchmarking studies because they show actual data as it changes over time. It adds a quantitative dimension to the qualitative research you’re already doing, which can drive home your findings and add weight to your suggestions. Plus, it’s always nice to be able to share a pretty chart with the manager!

benchmarking chart

It’s intensely gratifying to be able to track a product’s successes and failures over time. In the above example, we can see that the app in question has gotten worse at helping users get started. Whatever the first round’s tutorial was doing, it was doing it well; it might be time to scrap the new tutorial flow and return to what worked.

benchmarking chart 2

Benchmarking can also clue you in to the ways that later iterations of your product are solving problems, like in the graph above. As you can see, navigation has steadily improved in each iteration---it even reached a perfect score for tablet users in the latest round! Without watching a single video, you can rest easier knowing that users are getting through your app without difficulty.

How is benchmarking done?

Set the plan

It starts with a conversation among your team. You need to agree on exactly what you're hoping to learn, because once you have decided on script for a benchmarking study, you shouldn’t make changes to it. (That could compromise the integrity of the research!) So ask yourselves:

  • What are we actually trying to measure? The tutorial for new users? What keeps current users coming back for more, or what makes them leave? How trustworthy we appear? How difficult or easy it is to complete the basic tasks on our site?
  • Can we commit to this on a regular basis? Do we have the budget? How often are we producing new versions? How often do we want to see how we stack up against our competitors?

Write the script

Once you know the big questions that your team wants to answer, then you need a script---a very specific kind of script.

1. The tasks have to be goal-oriented and as basic as possible.

  • For ThinkGeek, “Find a product you’d like to buy.”
  • For Evernote, “Create a new note.”
  • For Tax Slayer, “Sign up for an account.”

2. The tasks also have to avoid using signposts or descriptive instructions because those elements might not be there in the next round of testing.

  • “Click the blue + icon to subscribe to this post.” = BAD
  • “Subscribe to this post.” = GOOD

3. Follow each task with evaluative questions, and keep a consistent pattern of tasks and questions. Using measurable questions like rating scale or multiple choice will give you the data you need to make those pretty charts!

  • Task: “Create a new playlist with songs from 3 different artists.”
  • Rating scale: “How difficult (1) or easy (5) was it to create a playlist?”
  • Multiple choice: “Compared to other music apps, was it easier, harder, or about the same to create a playlist?”
    • Easier
    • Harder
    • About the Same
    • Other/None of the Above

Once you’ve got a script, it’s important to keep it as consistent as possible. Any shifts in the tasks or questions can compromise data and make it more difficult to compare results across different rounds of testing, so try not to tinker with it!

(Pro tip: The UserTesting dashboard makes it unbelievably easy to set up your next round of benchmarking: just click the “Create Similar Test” button!)

Create similar test

The Create Similar Test button copies the details from your previous test, so you can set up a second round of benchmarking with a click.

Pick your user demographics

Finally, it’s important to capture the same number and type of users.

If your budget allows, include more users than you would in a typical study. For ordinary usability tests, 5-7 users will normally give you plenty of qualitative insight. But because we’re looking more at numeric trends in a benchmarking study, you can bump up your total number of users to 10, 20, or even 50.

You don't need to test with the exact same people each time; it’s okay to get fresh eyes on each version of your product.

However, the total number of users and any distinguishing characteristics should remain the same. For example, if you tested with 20 college students the first time around, you'll need to test with 20 college students for the next round too.

How often should I be running benchmarking studies?

The frequency and spacing of new studies depends on a variety of things, including how quickly new designs are being produced and how much of a budget you have.

  • If your product is an established one, a quarterly benchmarking study may be all you need to keep tabs on any performance issues.
  • If you’re pumping out new iterations every week, you’ll want to be a bit more aggressive with your scheduling... and probably lighter on the total number of users.
  • And in some cases, your product may have sporadic updates, in which case the benchmarking may also be sporadic.

The important thing about benchmarking is seeing the progress over time, so whatever you do, don’t run an initial benchmarking study and then never run another one. You’ll never know what benefit a benchmarking study can really be!

In this Article