UI/UX Usability Testing Methods

Every designer believes their design makes sense. The problem is, designers are not their users. What feels obvious to a designer who built something can feel completely confusing to a first-time user. Usability testing is how you replace guesses and opinions with real observations and evidence. This page teaches you how to plan, run, and learn from usability tests — even with a small budget and no dedicated research team.

What Is Usability Testing?

Usability testing means watching real users try to use your product to complete real tasks, and observing where they succeed, where they struggle, and why. You are not testing the user — you are testing the design.

USABILITY TESTING CORE CONCEPT:

The Session:
  You show your product to one user at a time.
  You give them a task to complete.
  You watch them try.
  You do NOT help or explain anything.
  You take notes on what works and what breaks.

What You Discover:
  ✓ Where users get confused or lost
  ✓ What language makes sense to them
  ✓ Which features they cannot find
  ✓ What they expect to happen (vs what actually happens)
  ✓ Which parts feel smooth and satisfying

IMPORTANT MINDSET:
  "If the user can't figure it out, the design is wrong.
   The user is never wrong — the design is wrong."

A famous insight from usability researcher Jakob Nielsen: testing with just 5 users reveals approximately 85% of all usability problems in a product. You do not need hundreds of participants to find critical issues.

Types of Usability Tests

Not all usability tests work the same way. Different types of tests answer different questions at different stages of the design process.

Moderated vs Unmoderated

MODERATED TESTING:
  A facilitator (you or a researcher) is present during the session.
  They can ask follow-up questions and probe for deeper insights.
  
  Best for: Complex flows, early-stage designs, exploratory research
  Downside: Time-intensive, requires scheduling

UNMODERATED TESTING:
  The user completes tasks alone with recorded screen and voice.
  No facilitator present. Tools like Maze or UserTesting.com do this.
  
  Best for: Validating specific flows quickly, large sample sizes
  Downside: Cannot ask follow-up questions in real time

Remote vs In-Person

IN-PERSON TESTING:
  User and facilitator are in the same room.
  You see body language, facial expressions, hesitations.
  You can observe where their eyes land on the screen.
  
  Best for: Rich qualitative data, complex workflows
  Downside: Expensive, geographically limited

REMOTE TESTING:
  Conducted over video call (Zoom, Meet) or async tools.
  User shares their screen. You observe and take notes.
  
  Best for: Recruiting diverse users anywhere in the world
  Downside: Tech issues can disrupt sessions

Evaluative vs Exploratory Testing

EVALUATIVE TESTING:
  You have a finished or near-finished design.
  You test whether users can successfully complete tasks.
  You measure: completion rate, time on task, errors.
  
  Example question: "Can users successfully check out?"

EXPLORATORY TESTING:
  You have a rough prototype or a competitor's product.
  You explore how users think about a problem or task.
  You discover mental models and expectations.
  
  Example question: "How do users currently manage their expenses?"

Planning a Usability Test

Step 1: Define Your Research Questions

Before recruiting anyone or building tasks, know exactly what you want to learn. Research questions are the questions the test should answer for the team — not questions you ask users directly.

RESEARCH QUESTIONS EXAMPLE:

Product: A banking app's loan application feature

Research Questions:
  1. Can users find the loan application section?
  2. Do users understand what "debt-to-income ratio" means
     in the context of our form?
  3. Do users trust the process when entering financial information?
  4. Where do users drop off during the application?

These questions determine what tasks you give users
and what you observe during sessions.

Step 2: Define Tasks

A task is a realistic scenario that you give to the participant. Good tasks are realistic and specific. They describe a situation, not a set of instructions. They do not reveal which part of the interface the user should click.

TASK WRITING COMPARISON:

BAD TASK (Tells user what to click):
  "Click on the 'Apply for Loan' button in the top menu,
   then fill out the form on the next page."
  ← This is a tutorial, not a test.

GOOD TASK (Scenario-based, no hints):
  "You need to borrow ₹50,000 for a home renovation.
   You have heard this app offers personal loans.
   Please go ahead and start the application process."
  ← User must find and navigate the flow themselves.

TASK PRINCIPLES:
  ✓ Set the scene with a realistic scenario
  ✓ Give a goal, not instructions
  ✓ Avoid words that match button labels in the UI
  ✓ Make it feel like something the user would actually do

Step 3: Recruit Participants

The users you test with must match your actual target audience. Testing a senior citizen's health app with 25-year-old designers gives you useless data. Participants should match your user personas in terms of age, technical skill level, occupation, and familiarity with similar products.

PARTICIPANT RECRUITMENT GUIDE:

How Many to Test:
  5 users → Finds ~85% of usability problems (ideal for 1 round)
  8-10 users → Appropriate for diverse, multiple user types
  1-2 users → Not enough — results are not representative

Where to Recruit:
  → Existing customers (ask via email or in-app prompt)
  → Social media or community groups matching your audience
  → Recruitment platforms: UserTesting.com, Respondent.io
  → University campuses (for student-focused products)
  → Incentive: Gift card, discount, or cash payment for their time

Screener Questions:
  Before confirming a participant, ask screening questions:
  "Do you currently use any budgeting apps?" (Yes/No)
  "How often do you make purchases online?" (Scale)
  "What is your age range?" (Brackets)
  → Only confirm participants who match your user profile

Step 4: Prepare Your Test Protocol

A test protocol is a written script that the facilitator follows during every session. It ensures consistency across sessions — every participant gets the same introduction, tasks, and closing questions.

USABILITY TEST PROTOCOL STRUCTURE:

1. INTRODUCTION (5 minutes)
   "Thank you for joining us today. We are testing our
   design, not your abilities. There are no right or wrong
   answers. Please think out loud as you go — tell us
   what you are looking at, what you are thinking,
   and what you are trying to do."

2. WARM-UP QUESTIONS (3 minutes)
   "Before we start, tell me a bit about yourself.
   What apps do you use most often?"

3. TASKS (20-30 minutes)
   Present each task card one at a time.
   Observe and take notes silently.
   Only ask: "What are you thinking right now?"
   NEVER say: "Click here" or "You need to go to..."

4. POST-TASK QUESTIONS (5 minutes)
   After each task: "On a scale of 1-10, how easy was that?"
   "Was there any moment where you were unsure what to do?"

5. CLOSING QUESTIONS (5 minutes)
   "What was the most confusing part overall?"
   "What worked well for you?"
   "Is there anything else you would want to share?"

Running a Usability Test Session

The Think-Aloud Method

Ask participants to narrate their thoughts out loud as they use the product. This gives you a direct window into their mental model — how they are interpreting the interface.

THINK-ALOUD EXAMPLE:

User is trying to find the "Settings" page:

"Okay, I'm looking at the top... I see Home and Products.
Hmm, I don't see Settings anywhere obvious. Maybe it's
under my profile icon? Let me click that... okay I see
Account, Billing, and Help. No Settings specifically.
Maybe Help has settings inside? I'll click Help... no,
that's a FAQ page. I'm confused. Is Settings on mobile
somewhere different? Maybe I need to look in the menu."

What This Reveals:
  ✕ "Settings" is hidden and not where users expect it
  ✕ Users look under profile icon first (common mental model)
  ✕ "Help" and "Settings" categories feel ambiguous
  ✓ User's vocabulary: they say "Settings" not "Preferences"

How to Facilitate Without Leading

The hardest part of facilitating is staying silent while a user clearly struggles. Every instinct says to help them. But if you help, you lose the data — you never learn that the design had a problem there.

FACILITATOR RESPONSES:

When user is stuck, say:
  ✓ "What are you thinking right now?"
  ✓ "What would you expect to happen here?"
  ✓ "What would you do next if this were your own device?"

When user asks for help:
  User: "Am I supposed to click this button?"
  You:  "Whatever you would do if you were using this at home."

Never say:
  ✗ "Actually, the button is over here."
  ✗ "You're close, keep going!"
  ✗ "That's the right idea."

Your job is to observe, not to teach.

Measuring Usability: What to Track

Usability testing produces two types of data: qualitative (what users say and do) and quantitative (numbers you can measure and compare).

USABILITY METRICS TABLE:

METRIC              WHAT IT MEASURES              HOW TO COLLECT
────────────────────────────────────────────────────────────────
Task Completion     Did the user finish the task?  Facilitator notes
Rate                (Completed/Not completed/       Target: 80-100%
                    Completed with help)

Time on Task        How long did it take?          Screen recording
                    Faster = more efficient         Track in seconds

Error Rate          How many wrong clicks/paths?   Facilitator notes
                    Fewer errors = more intuitive   Count per task

Satisfaction        How easy did it feel?          Post-task rating
Score               1 (Very hard) to 10 (Very easy) Track per task

SUS Score           Overall system usability       10-question survey
(System Usability   Standardized 0-100 score       given after test
Scale)              Above 68 = acceptable          Standard benchmark

The System Usability Scale (SUS)

SUS QUESTIONNAIRE (Given after the session):

Rate each from 1 (Strongly Disagree) to 5 (Strongly Agree):

1. I think I would like to use this system frequently.
2. I found the system unnecessarily complex.
3. I thought the system was easy to use.
4. I think that I would need technical support to use this system.
5. I found the various functions in the system well integrated.
6. I thought there was too much inconsistency in this system.
7. I would imagine that most people would learn to use
   this system very quickly.
8. I found the system very cumbersome to use.
9. I felt very confident using the system.
10. I needed to learn a lot of things before I could get
    going with this system.

SUS SCORE INTERPRETATION:
  Above 85: Excellent — "Wow" level usability
  68-85:    Good — Acceptable for most products
  51-68:    OK — Noticeable problems, improvement needed
  Below 51: Poor — Significant redesign required

Analyzing and Reporting Findings

After testing, you have hours of notes and recordings. The goal is to extract patterns, prioritize problems, and present clear recommendations to the team.

Affinity Mapping

AFFINITY MAPPING PROCESS:

Step 1: Write each observation on a separate sticky note:
  "User missed the 'Proceed' button completely"
  "User expected a confirmation email immediately"
  "User confused 'Account' with 'Profile'"

Step 2: Group related observations together:
  NAVIGATION ISSUES GROUP:
    - Missed Proceed button
    - Could not find Settings
    - Back button behavior unexpected

  TRUST ISSUES GROUP:
    - Worried about entering credit card
    - Expected confirmation message
    - Did not know if payment went through

Step 3: Name each group. These become your theme findings.

Step 4: Prioritize by frequency + severity:
  High priority: 4 of 5 users experienced this issue
  Medium: 2-3 of 5 users
  Low: 1 of 5 users (watch for in next round)

Finding Severity Ratings

USABILITY PROBLEM SEVERITY SCALE:

4 — CATASTROPHIC:
  Users cannot complete the task at all.
  Must fix before launch.
  Example: "Submit" button does not respond on mobile.

3 — MAJOR:
  Users complete the task but with significant difficulty.
  Fix before launch if possible.
  Example: Users take 3+ minutes to find the Settings page.

2 — MINOR:
  Small friction point. Users get through but are annoyed.
  Fix in next design iteration.
  Example: Error message text is confusing but user figures it out.

1 — COSMETIC:
  Very minor issue. Does not prevent task completion.
  Fix when time permits.
  Example: A button is slightly misaligned on one screen.

When to Test: The Research Timeline

USABILITY TESTING IN THE DESIGN PROCESS:

PHASE 1 — DISCOVERY:
  Test: Competitor products or low-fi sketches
  Goal: Understand how users think about the problem
  Method: Exploratory, moderated, think-aloud

PHASE 2 — DESIGN:
  Test: Wireframes and low-fidelity prototypes
  Goal: Validate information architecture and flows
  Method: Moderated prototype testing (Figma prototype)

PHASE 3 — PRE-LAUNCH:
  Test: High-fidelity prototype or staging environment
  Goal: Confirm design is ready to ship
  Method: Evaluative, task completion rate + SUS score

PHASE 4 — POST-LAUNCH:
  Test: Live product with real users
  Goal: Find problems in production that testing missed
  Method: Unmoderated, remote, analytics-informed tasks

Low-Cost Usability Testing Methods

TESTING ON A TIGHT BUDGET:

Guerrilla Testing:
  Walk into a coffee shop.
  Ask strangers to spend 10 minutes with your prototype.
  Offer to buy their coffee.
  Works for: Quick gut-checks on basic usability

Hallway Testing:
  Test with colleagues from other departments.
  NOT design team or product team — they know too much.
  Works for: Fast sanity checks before design reviews

Remote Unmoderated (Tools):
  Maze.design → Upload Figma prototype, get completion rates
  UserTesting.com → Real users, video recordings, faster turnaround
  Lookback.io → Moderated and unmoderated sessions with recording

5-Second Test:
  Show a screen for 5 seconds, then hide it.
  Ask: "What is this page for?" and "What stood out?"
  Works for: Testing clarity of landing pages and hero sections

Key Points

Usability testing means watching real users attempt real tasks — you are testing the design, not the user.
Testing with just 5 users reveals approximately 85% of all usability problems.
Tasks must be scenario-based with a goal, never step-by-step instructions that guide users where to click.
Use the think-aloud method to hear users' reasoning in real time.
Never help a user who is struggling — their struggle is the most valuable data you collect.
Track task completion rate, time on task, error rate, and satisfaction score as quantitative metrics.
The System Usability Scale (SUS) gives you a standardized 0–100 usability score; above 68 is acceptable.
Use affinity mapping to group observations into themes after the sessions.
Rate problems by severity (1–4) and prioritize catastrophic and major problems for immediate fixes.
Test at every phase: exploratory sketches, wireframes, prototypes, and the live product.

Previous lessons

Back to courses

Next lessons