Thinking Aloud Test Report

Christopher Oser

Usability Testing d3-hypertree Hierarchy Browser

19th January 2019

1 Executive Summary

A thinking aloud test is made up of tasks performed by selected test users who are filmed and recorded during the test, while verbally expressing their thoughts. In this test, each user was asked to solve each of 5 tasks. Each user participated in two sessions, one on a PC and one on a tablet. Each of these sessions was done with two different datasets and configurations in order to obtain broader results:

After half of the users completed the procedure the sequence of devices used was reversed in order to prevent a bias to one device. Altogether, 10 test users participated in this test with one being a pilot test. This results in a total of 20 sessions recorded.

Conceptually, the d3-hypertree hierarchy browser was appreciated greatly by most users. Nine out ten users liked the idea and claimed they could definitely see it being used commercially. Only one user was strongly opposed and claimed he saw no utility in the application. Ironically this user ("Aubrey") had one of the best test performances. The remaining nine users were rather impressed and some even speculated about real-life applications where they might use the d3-hypertree. More about the positive findings can be found in Sections 4.3 and 4.4.

All users agreed that the d3-hypertree would need some adaptations and fixes in order to be able to be used by average users. After the first few test sessions, it became clear that many of the same issues were being experienced over again by almost every test user. These problems/issues can be grouped into performance-related issues and design-related issues.

In terms of performance, the major problem were seemingly random crashes of the entire tree, which could only be resolved by reloading. The d3-hypertree tends to crash after rash movements or fast zooming and/or certain zooming behavior. Depending on the browsing characteristics of the individual users, this happened up to four times within one session. Another performance issue was the general performance on tablet, which is sup-par. Ever present “laggyness” is only interrupted by short freezes from time to time, which is not very desirable for end users and terminates any immersion the users would build up.

The main design-related issues were lack of overview and lack of a search function. Almost every user remarked they were confused and overwhelmed by the sheer amount of data displayed on the screen, especially when moving deeper into the tree. Users were regularly unable to find their way back to locations they had previously visited, because they were unsure where they were. Suggestions by users to fix this included a mini-map or working with more colours. The d3-hypertree is very inefficient when searching for specific nodes or sub-trees. It will always be slower than text search which is why many users requested a textual search in order to move around the tree more quickly. However, test users also acknowledged that if an initial text search function were available, the d3-hypertree is much simpler to use to browse similar articles compared to other alternative implementations.

Another important remark was the lack of an introduction, tutorial, or other prompts which would explain how to use the d3-hypertree to a novice user. Without introduction, the tree was not very intuitive to most users. The remaining issues included inconsistent labelling, misleading emojis, and strange behavior as a result of the zoom function and tablet functionality. A list of all negative findings can be found in Section 4.6.

2 Introduction

The d3-hypertree was tested using the thinking aloud test method in order to discover issues or problems concerning the usability of the application. Two different datasets were tested, each with a slightly different configuration of d3-hypertree. Both configurations of dataset and hypertree implementation were chosen and set up explicitly for this test.

The first configuration used the Mammals subtree of Michael Glatzhofer's Hyperbolic Tree of Life. The Mammals subtree contains 13,623 nodes with a maximum depth of 42 levels. The d3-hypertree was set up to use a bottom-up layout, with the root node at the bottom of the panel and the tree growing upwards and outwards like a real-life tree. The integration of Wikipedia and the general visual representation were kept the same as in the original Hyperbolic Tree of Life. This test configuration will be referred to as the Tree of Life hereafter.

The second configuration used Google's Product Taxonomy which might be used in the context of online shopping, for example. The product tree has 5,427 nodes and is upto seven levels deep. The d3-hypertree interface was set up to use a 360 degree radial layout, with the root node in the center of the panel and the tree radiating out in all directions. This configuration will be referred to as the Product Tree hereafter.

In addition, the application was tested on two different devices (tablet and desktop), so differences between platforms would become apparent

3 Test Procedure

3.1 Test Methodology

A thinking aloud test is a way of testing the usability of computer applications, websites, or any type of user interface. Test users are asked to perform specific tasks while at the same time expressing their thoughts verbally, as described in Keith Andrews' course notes [And2018]. The screen and test user are recorded with the help of software and a video camera. This way the footage can be reviewed and analysed later on. Through this, it is possible to closely analyze the behavior and the decisions of the test user. The method is very good at finding many usability problems as well as giving examples of what led to these problems. It requires very little materials/resources and is therefore a very cheap and simple to set up. The main disadvantages of the testing method are that users are slowed down by aound 17% when thinking aloud and that thinking aloud sometimes influences their problem solving behavior. For these reasons, measurements of task completion times do not make sense in a thinking aloud test. The thinking aloud testing method is described in further detail by Carol Barnum [Bar2010, Chapter 7].

It is generally recommended to run a thinking aloud test with at least five test users. This test was done with 10 test users (1 pilot test user, 9 regular test users) in order to get a good amount of feedback from the entire procedure. Further discussion about the number of test users and its impact on the test can be found in a paper by Jakob Nielsen [Nie1994]. As Nielsen concluded, 4-5 test users are usually sufficient to achieve good results. Further reading on thinking aloud tests can be found in Janni Nielsen's paper [Nie2002].

3.2 User Profiles

The spectrum of users who could use the d3-hypertree is very diverse, and heavily depends on the dataset used. For instance Michael Glatzhofer's Tree of Life will most likely be used by biology focused people, as most of the tree consists of the latin names of different species.

The test was done with users in the university student age range. This age range was chosen for two reasons. Firstly, this is a demographic which is very relevant for the first users of the d3-hypertree, since that is where it comes from. Secondly, this was the easiest group of test users to recruit. All of the test users either study or have studied at university, but in different fields from law to marketing, and architecture to computer science. This made for a good mix of test users in terms of prior knowledge.

3.3 Test Users

The test users were chosen to represent university students. All test users were aged 18-25 and are either in the process of completing studies or have already completed a university degree.

Table 1 gives an overview of the test users who participated in the study. The mapping of real names to aliases is included in Appendix B.1.

Test User TP0 (Pilot Test) TP1 TP2 TP3 TP4 TP5 TP6 TP7 TP8 TP9
Alias "Kim" "Marshall" "Teyana" "Jermaine" "Aubrey" "Kendrick" "Anthony" "Arlissa" "Gary" "Robyn"
Date of Test 09.01.2019 11.01.2019 11.01.2019 11.01.2019 11.01.2019 12.01.2019 12.01.2019 13.01.2019 13.01.2019 14.01.2019
Time of Test 17:45 11:15 13:50 15:00 17:20 20:00 20:45 10:00 11:00 18:30
Language of Test EN EN EN EN EN EN EN EN EN EN
General Information
Sex female male female male male male male female male female
Age 23 23 22 19 22 22 22 20 22 20
Education secondary school,
studying interior design
secondary school,
studying law
BA,
International Management/ Economics
secondary school,
studying CS
secondary school,
studying law
secondary school,
studying Software Engineering
secondary school,
studying CS
secondary school,
studying psychology
secondary school,
studying law
secondary school,
studying physiotherapy
Sight Impairment
Sight Aid glasses glasses glasses - - - - glasses glasses glasses, contact lenses
Color Blindness no no no no no no no no no no
Personal Computer Use
OS Apple Macintosh Apple Macintosh Microsoft Windows Microsoft Windows Apple Macintosh Microsoft Windows, Unix Microsoft Windows Apple Macintosh Microsoft Windows Microsoft Windows
Years of Experience 10 20 12 12 10 10 16 9 15 13
Hours per Week 5 50 35 55 25 30 60 14 10 20
Tablet Use
Type of Tablet - - iOS iOS - - iOS - - iOS
Years of Experience - - 8 3 - - 5 - - 4
Hours per Week - - 20 3 - - 1 - - 2
Web Use
Hours per Week 12 40 75 60 50 20 60 35 35 42
Most Used Device smartphone smartphone smartphone desktop PC smartphone desktop PC desktop PC smartphone smartphone smartphone
Browser Safari Chrome Firefox Chrome Chrome, Firefox Firefox Chrome Safari Chrome Firefox
Domain-Specific Questions
File Managers used Mac Finder Windows Explorer, Mac Finder Windows Explorer Windows Explorer Mac Finder Windows Explorer Windows Explorer Mac Finder Windows Explorer Windows Explorer
Heard of Tree of Life no no no yes no yes no no no no
Browsed Product Hierarchies Amazon, Ebay, Clothing sites Amazon, Ebay, Geizhals Amazon Amazon, Ebay, Willhaben Amazon, Ebay, Willhaben, Shpock, Kleiderkreisel Amazon, Ebay Amazon, Ebay Amazon Amazon, Ebay, Geizhals Amazon
Heard of or Seen Hyperbolic Tree Browser no no no no no no no no no no
Previous Usability Tests
As Test Person yes yes yes yes no no yes no no no
In Test Team no no no no no yes yes no no -
Type of Test Thinking Aloud Formal Experiment Formal Experiment Thinking Aloud - Thinking Aloud Thinking Aloud - - -
Table 1: Overview of the test users.

3.4 Test Environment

The test setup for the PC is shown in Figure 1 and for the tablet in Figure 2.

Test room, test user in front of PC
Figure 1: Test setup for the PC.
Test room, testing on tablet
Figure 2: Test setup for the tablet.

The hardware (devices, browser, internet connection, etc.) and environment used for the test is shown in Table 2.

PC
Device Intel Core i7-5930k, 16gb RAM
Operating System Windows 10 Pro N 64-bit Build 17134 EN
Web Browser Chrome 71.0.3578.98
Internet Connection 40 mbps, A1 cable
Screen Resolution 1920×1080
Screen Size 21″ TFT
Tablet
Device Apple iPad Air 2
Operating System iOS 12.1.1
Web Browser Chrome 71.0.3578.89
Internet Connection WiFi, 40 mbps, A1 cable
Screen Resolution 2048×1536
Screen Size 9.7″
Test Setting
Location private home
Date of Pilot Test 2019-01-09
Date of Real Tests 2019-01-11 - 2019-01-14
Table 2: Hardware and environment used for the thinking aloud test.

The exact web browser version used for the tests is shown in Figure 3.

Screenshot of the Browser About Window
Figure 3: Browser version used for the tests.

A fresh install of the browser was used for this test. No AdBlockers or other software/add-ons were in use.

3.5 Test Recording

The tests were captured with an external video camera on a tripod in 720p HD resolution. The first tests were done with the help of an external microphone, but this was removed after noticing sound quality problems. Mirrors were used to capture the face of the test user.

The screen recordings were done using Morae Recorder. The whole screen was captured and the user's voice was recorded using an external microphone placed on the desk.

3.6 Training

No special training on the interface was given to the users, in order to test the intuitiveness of the application for users with no prior knowledge of it. The thinking aloud technique was explained verbally to each test user, but was not practised. The test users were simply asked to follow the tasks and solve them one by one, whilst expressing their thoughts verbally in English.

3.7 Tasks

The task list used for the Product Tree is shown in Table 3. The task descriptions given to the users for the Mammals branch of the Tree of Life are shown in Table 4. The actual task slips for both tests are included in Appendix A.5. The test users were able to read the tasks immediately before starting the test and check with the facilitator in order to prevent vocabulary misunderstandings.

Task No. Description
1 Please navigate to and open up the Food branch of the Products tree.
2 Within this Food branch, please navigate to and open up the Donuts node.
3 Would you say there are more items in the Bakery branch (where you found Donuts) or in the Meat branch?
4 Please zoom back out to the root node. Which of these two items is within the Electronics branch: Antennas or Web-cams?
5 Within the entire Products tree, please try to find the Basketballs node.
Table 3: Tasks for Product Tree.
Task No. Description
1 Please navigate to the root of the Bear (Ursidae) subtree.
2 Find the node corresponding to the Brown Bear and read the first paragraph of the corresponding wikipedia entry.
3 Would you say there are more species (sub-nodes) of Bear (Ursidae) or more of Dog (Canidae)?
4 Which species is more closely related to Whales (Baleen) according to this visualisation: Alpacas (Vicugna pacos) or Elefants (Elephantidae)?
5 Find the two black Howler Monkey nodes within the Primate tree.
Table 4: Tasks for Tree of Life.

The presentation order of device and dataset was varied according to a pre-defined scheme. In this study, the pilot test with TP0 showed that the Tree of Life dataset was very hard to use on a tablet, so an executive decision was made to only test the Product Tree on the tablet for the remaining test users. The final presentation order for each test user is shown in Table 5.

First Session Second Session
TP0 Tablet, Tree of Life PC, Product Tree
TP1 Tablet, Product Tree PC, Tree of Life
TP2 Tablet, Product Tree PC, Tree of Life
TP3 Tablet, Product Tree PC, Tree of Life
TP4 Tablet, Product Tree PC, Tree of Life
TP5 PC, Tree of Life Tablet, Product Tree
TP6 PC, Tree of Life Tablet, Product Tree
TP7 PC, Tree of Life Tablet, Product Tree
TP8 PC, Tree of Life Tablet, Product Tree
TP9 PC, Tree of Life Tablet, Product Tree
Table 5: The presentation order of devices and datasets.

3.8 Interview Questions

The following questions were asked of each user immediately after the final task, once for each test configuration. A summary of remarks made by test users can be found in Section 4.7.

  • Opening Question
    • "How was it?"
  • Standard Questions
    1. "Did anything strike you as particularly good?"
    2. "Did anything strike you as particularly bad?"
    3. "Do you have any other remarks, good or bad that come to mind?"
    4. "Now after filling out the questionnaire did anything else come to mind you would want to say about the application?"

3.9 Feedback Questionnaire

After the interview, which was done after each test, the user was asked to fill out the feedback questionnaire given in Appendix A.7. The feedback questionnaire was given on paper. An overview of the results can be seen in Section 4.8.

4 Results

4.1 Discussion and Analysis

With each test, it became more clear what the main negative points to average users are. Almost every test user had the same issues, stemming from a mixture of performance problems and design decisions or missing features. In terms of performance, one of the main issues was the performance on a tablet, which was permanently laggy and would freeze sometimes. Another major problem were crashes of the tree when performing rash movements or zooming.

Design-wise, a pressing problem was the implementation of Wikipedia synchronized to the Tree of Life on tablets, which led to the cancellation of testing of the Tree of Life on tablet altogether. The other main design problems were the lack of overview and search capabilities. All users agreed that the overview provided by the d3-hypertree was mediocre at best. It is very easy to become lost and to not arrive at the desired location even if it has been explored before. Some users proposed an overview map or working with colors to make it easier to disinguish between locations.

Searching for a specific node or subtree without knowing its location in advance is very slow compared to text search. This is why most users proposed a search bar that would be linked to the nodes and would enable users to find nodes by entering a search term. Once a specific node has been found, the d3-hypertree is perfect for browsing neighboring categories and much more intuitive than other visualizations for similar datasets.

4.2 Task Completion

A summary of how many users completed each task within a reasonable timeframe and whether any assistance was given is shown in Table 6 and 7.

Task 1 Task 2 Task 3 Task 4 Task 5
TP0 1 0 1 1 0*
TP1 1 0 1 1 0*
TP2 0* 1 1 1 1
TP3 0 1 1 1 1
TP4 1 1 1 1 1
TP5 1 1 1* 1 1
TP6 0 1 1 1 1
TP7 1 1 1 1 1*
TP8 1 1 1 0 1
TP9 1 1 1* 1 0
Total 7 8 10 9 7
% 70 80 100 90 70
Table 6: Task completion rates for the Tree of Life. An asterisk (*) indicates that assistance was given.
Task 1 Task 2 Task 3 Task 4 Task 5
TP0 1 1 1 1 1
TP1 1 1 1 1 1
TP2 1 1 1 1 1
TP3 1 1 1 1 1
TP4 1 1 1 1 1
TP5 1 1 1* 1 1
TP6 1 1 1 0 1
TP7 1 1 1 1 1
TP8 1 1 1 1 1
TP9 1 1 1 1 0
Total 10 10 10 9 9
% 100 100 100 90 90
Table 7: Task completion rates for the Product Tree. An asterisk (*) indicates that assistance was given.

4.3 Top Three Positive Findings

The three most positive findings according to their average (mean) positivity ratings are described in more detail below. The positivity rating scheme used to rank positive findings is shown in Table 8.

PositivityMeaning
4Extremely Positive
3Major Positive
2Minor Positive
1Cosmetic Positive
0Not a Positive
Table 8: Positivity ratings.

P01. Icons/Emojis

Title: Icons/Emojis
Description: Most users very much appreciated the emojis and said they would be lost without them. Many requested more icons/distinguishability.
Video Clip(s): p01-tp1.mp4
Timestamp(s): TP1_ToL 00:38, TP2_ToL 02:30
Location (How Reproducible?): Use the Tree of Life d3-hypertree
Positivity Rating: 4
Figure 4: "Marshall" only used the icons to navigate through the tree at first.

Most test users found the emojis very helpful and said they would be lost without them. Three test users specifically requested more icons and/or more use of colors or other functions in order to increase the distinguishability, since the tree can become very overwhelming without certain landmarks or markers.

P02. Wikipedia Integration

Title: Wikipedia Integration
Description: Most users were very impressed by the synchronization with Wikipedia. Some used it to fulfill tasks although not requested to.
Video Clip(s): p02-tp3.mp4
Timestamp(s): TP1_ToL 14:10,TP2_ToL 03:30, TP6_ToL 03:30
Location (How Reproducible?): Use the Tree of Life d3-hypertree
Positivity Rating: 3
Figure 5: "Jermaine" used Wikipedia to help him navigate.

The Wikipedia part of the Tree of Life was used much more than expected. Many users used it in all types of ways. Some requested that there would be a link from the Wikipedia pages to the respective tree nodes.

P03. Home Star Icon

Title: Home Star Icon
Description: The root node marker represented by a star was very helpful to test users.
Video Clip(s): p03-tp2-1.mp4 p03-tp2-2.mp4
Timestamp(s): TP0_Prod 03:20,TP2_Prod 02:22, TP2_Prod 03:35
Location (How Reproducible?): From any point within the Tree of Life, a star icon can be seen which takes you back to the root node
Positivity Rating: 3
Figure 6: "Teyana" was a big fan of the home star icon.

The home star icon provided by d3-hypertree to identify the root node was a very useful visual landmark for test users. It seems users are thankful for any type of distinguishability or markers which guide them through these large tree datasets.

4.4 List of All Positive Findings

Table 9 shows a list of all the positive findings which emerged from the test, sorted in descreasing order of average (mean) positivity, i.e. the most positive are at the top of the table. The positivity ratings were assigned by the author of this report.

No. Title Description Video Clip(s) Timestamps(s) Location (how reproducible?) Positivity
0 Concept 9 out of 10 test users liked the concept and could see it being used in major applications. Results from post-test interviews. - Use d3-hypertree 4
1 Icons/Emojis Most users very much appreciated the Emojis. p01-tp1.mp4
TP1_ToL 00:38, TP2_ToL 02:30 Use the Tree of Life d3-hypertree. 4
2 Wikipedia Integration Users very impressed by the integration of Wikipedia. p02-tp3.mp4 TP1_ToL 14:10,TP2_ToL 03:30, TP6_ToL 03:30 Use the Tree of Life d3-hypertree. 4
3 Home Star Icon The home star icon identifying the root of the tree was very helpful to test users. p03-tp2-1.mp4 p03-tp2-2.mp4 TP0_Prod 03:20,TP2_Prod 02:22, TP2_Prod 03:35 From any point within the Tree of Life, a star can be seen which takes you back to the root node. 3
4 Consistency Users said the 2nd test was always much easier - so after a short steep learning curve the application is easier to use. p04-tp5.mp4 TP0_ToL 00:28, TP1_Prod 00:26, TP6_Prod 05:15 Use the d3-hypertree for a while. 2
5 Alphabetical Ordering Some users were very happy with the alphabetically sorted data in the Product Tree. p05-tp2.mp4 TP2_Prod 03:50 Check the labels of the Product Tree. 1
6 360 Distribution 2 users greatly preferred the 360 degree view from the Product Tree over the more "tree-like" view in the Tree of Life. - - Use Product Tree d3-hypertree. 1
Table 9: Aggregated list of all positive findings, in descending order of positivity.

4.5 Top Five Problems

The five most serious problems according to their average (mean) severity ratings are described in more detail below. Problem number 1 is the problem (negative finding) with the highest mean severity. The severity rating scheme used to rank problems is shown in Table 10.

SeverityMeaning
4Catastrophic problem
3Serious problem
2Minor problem
1Cosmetic problem
0Not a problem
Table 10: Severity ratings.

N01. Lack of Overview

Title: Lack of Overview
Description: The d3-hypertree is very overwhelming for new users and lacks any form of overview.
Video Clip(s): n01-tp2.mp4, n01-tp6.mp4
Timestamp(s): TP0_ToL 05:30, TP1_ToL 05:40, TP2_ToL 09:08...
Location (How Reproducible?): Try solving tasks using the d3-hypertree.
Severity Rating: 4
Figure 7: "Teyana" explains how they are overwhelmed by the amount of information shown.

Very often users were overwhelmed by the situation and were lacking an overview over the entire tree. Things like a "minimap" or some other way to create more overview would be helpful.

N02. Random Crashes

Title: Crashes
Description: The d3-hypertree tends to crash on both PC and tablet (although more on PC) when zooming too aggressively.
Video Clip(s): n02-tp1.mp4
Timestamp(s): TP0_ToL 06:35, TP1_ToL 04:00, TP2_ToL 05:30...
Location (How Reproducible?): Zooming in or out or moving around very fast.
Severity Rating: 4
Figure 8: "Marshall" wants to proceed to the next task, but is interrupted by a sudden crash.

Sudden crashes does not speak for an application, especially if they happen as often as during the test sessions. Some users managed to crash the tree 3-4 times within one session, which would make regular users stop using it.

N03. Lack of Search Function

Title: Lack of Search-Function
Description: It is very hard to find things within the tree without any prior knowledge.
Video Clip(s): n03-tp6.mp4
Timestamp(s): TP0_Prod 04:00, TP0_ToL 07:25, TP1_ToL 07:35...
Location (How Reproducible?): Try looking for something in the d3-hypertree.
Severity Rating: 4
Figure 9: Searching for something specific in the hyperbolic tree might be amusing sometimes, but it is not very efficient, as can be seen here when watching "Anthony".

Most tasks took a long time, because the lack of knowledge about the dataset as well as the overwhelming nature of the tree. A simple text search function would make searching for a specific item much quicker and more intuitive for users who can then proceed to explore further from the node they searched for.

N04. Tablet Performance

Title: Tablet Performance
Description: The performance of the application is very weak on a tablet.
Video Clip(s): n04-tp3.mp4 n04-tp4.mp4
Timestamp(s): TP0_ToL 04:50, TP1_Prod 01:00, TP2_Prod 00:15...
Location (How Reproducible?): Using d3-hypertree on tablet.
Severity Rating: 3
Figure 10: "Aubrey" notices the performance right at the start of Task 2.

The performance on a tablet was not exactly a problem when trying to finish the tasks, but it is very annoying, as sometimes it will take more than a second for a user action to be executed on the tree.

N05. Lack of Tutorial/Introduction

Title: Lack of Tutorial/Introduction
Description: Users ran into problems because of not knowing how to interact with the tree or how it is set up.
Video Clip(s): n05-tp1.mp4 n05-tp7.mp4
Timestamp(s): TP0_ToL 09:30, TP1_ToL 02:10, TP2_ToL 02:33...
Location (How Reproducible?): Inital use of d3-hypertree
Severity Rating: 3
Figure 11: "Marshall" can not seem to select the correct node, because he does not know he is able to zoom.

Most users were overwhelmed for a while when first using the d3-hypertree. A small introduction to the entire concept as well as little things like a zoom indicator or similar helpers would ameliorate this.

4.6 List of All Problems Found

Table 11 shows a list of all the problems observed in the test, sorted in descreasing order of average (mean) severity, i.e. the most severe are at the top of the table. The severity ratings were assigned by the author of this report.

No. Title Description Video Clip(s) Timestamps(s) Location (how reproducible?) Severity
1 Lack of Overview The d3-hypertree is very overwhelming for new users and lacks any form of overview. n01-tp2.mp4
n01-tp6.mp4
TP0_ToL 05:30, TP1_ToL 05:40, TP2_ToL 09:08... Try solving tasks using the d3-hypertree 4
2 Random Crashes The d3-hypertree tends to crash on both PC and tablet (although more on PC) when zooming too aggressively. n02-tp1.mp4 TP0_ToL 06:35, TP1_ToL 04:00, TP2_ToL 05:30... Zooming in or out or moving around very fast. 4
3 Lack of Search Function It is very hard to find things within the tree without any prior knowledge. n03-tp6.mp4 TP0_Prod 04:00, TP0_ToL 07:25, TP1_ToL 07:35... Try looking for something in the d3-hypertree. 4
4 Tablet Performance The performance of the application is very weak on tablet. n04-tp3.mp4 n04-tp4.mp4 TP0_ToL 04:50, TP1_Prod 01:00, TP2_Prod 00:15... Using d3-hypertree on tablet. 3
5 Lack of Tutorial/Introduction Users ran into problems because of not knowing how to interact with the tree or how it is set up. n05-tp1.mp4 n05-tp7.mp4 TP0_ToL 09:30, TP1_ToL 02:10, TP2_ToL 02:33... Inital use of d3-hypertree. 3
6 Tablet Wikipedia Wikipedia integration is pretty much unusable on tablet. It does not show at all at first and if you get it to show it messes with the rest of the page. n06-tp0.mp4 TP0_ToL 02:00 Using Tree of Life on Tablet and trying to use Wikipedia 3
7 Clipping Labels Labels tend to disappear randomly or not show at all unless explicitly zoomed into. n07-tp2.mp4 n07-tp8.mp4 n07-tp9.mp4 TP0_Prod 05:00, TP2_Prod 01:35, TP6_Prod 00:30... Zoom around the d3-hypertree. 2
8 Data Inconsistency Data inconsistencies (2 meat branches in Product Tree) are very unsettling to users. n08-tp1.mp4 TP1_Prod 02:20, TP2_Prod 03:10 Interacting/searching with/for nodes. 2
9 Monkey Emojis misleading Within the Primate tree, 3 different monkey emojis (closed mouth, closed eyes, closed ears) were used which rather confused users. - - Looking for a monkey node in the Tree of Life. 1
10 Infinite Zoom It can happen rather easily on tablet that the user zooms to the far end of the globe which makes it seem like the application crashed. n10-tp1.mp4 TP1_Prod 04:40 Rash gestures on tablet. 1
Table 11: Aggregated list of all problems found, in descending order of severity.

4.7 Interviews

Nine out of ten users liked the concept of hyperbolic browsing and the d3-hypertree and could imagine using this somewhere else more regularly. Only one user was strongly opposed to the idea to using this in everyday situations and thought it was impracticable. Seven users requested more distinguishability and every single one said more overview is needed. Since there is so much data on the screen, it can become overwhelming for users very quickly.

Four users did say the interface would need some more work in order to be implemented into existing platforms, since it is not that intuitive for a typical computer user. The utility for the Tree of Life was confirmed by all of the test users. Four test users said the Product Tree needed some adaptations in order to be useful for existing product taxonomies. That siad, three users also mentioned it being more neatly arranged, due to its alphabetical ordering and the 360 degree spread of the tree.

4.8 Feedback Questionnaires

Table 12 shows a summary of the ratings given by users in the feedback questionnaire at the end of the Tree of Life test. Table 13 shows a summary of the ratings given by users in the feedback questionnaire at the end of the product tree test. The neutral scale in the original feedback questionnaire has been mapped to a weighted scale between 6 (best) and 0 (worst). The numbers in bold indicate the (rounded) mean rating. The original blank questionnaire can be found in Section A.7. Scans of the questionnaires completed by each user can be found in Section B.2.

Table 12: Summary of user ratings from the Tree of Life feedback questionnaire.
Table 13: Summary of user ratings from the Product Tree feedback questionnaire.

References

[And2018]
Keith Andrews; Human-Computer Interaction: Course Notes; 09 Mar 2018. https://courses.isds.tugraz.at/hci/hci.pdf
[Bar2010]
Carol M. Barnum; Usability Testing Essentials; Morgan Kaufmann, 2010. ISBN 012375092X.
[Nie1994]
Jakob Nielsen; Estimating the Number of Subjects Needed for a Thinking Aloud Test.; International Journal of Human-Computer Studies 41.3 (1994): 385-397. doi:10.1006/ijhc.1994.1065 PDF
[Nie2002]
Janni Nielsen, Torkil Clemmensen, and Carsten Yssing; Getting Access to What Goes On in People's Heads?: Reflections on the Think-Aloud Technique.; Proc. 2nd Nordic Conference on Human-Computer Interaction (NordiCHI 2002). doi:10.1145/572020.572033 PDF

A Test Team Materials

A.1 Checklist

The following checklist checklist.html was used for the tests.

A.2 Orientation Script

The following orientation script orient.html was used for the tests.

A.3 Background Questionnaire

The following blank background questionnaire background.html was used in the tests.

A.4 Consent Form

The following blank consent form consent.html was used in the tests.

A.5 Task Slips

The following task slips were presented to the test users:

A.6 Data Collection Form

The following data collection form datacoll.pdf was used for the test.

A.7 Feedback Questionnaire

The following blank feedback questionnaires were used in the test:

B User Materials

B.1 User Aliases

The mapping of test users' real names to their aliases is documented in the file users.html.

B.2 Completed Background Questionnaires

For each test user, the completed background questionnaire was scanned as PDF:

B.3 Signed Consent Forms

For each test user, the signed consent form was scanned as PDF:

  • consent_tp0.pdf
  • consent_tp1.pdf
  • consent_tp2.pdf
  • consent_tp3.pdf
  • consent_tp4.pdf
  • consent_tp5.pdf
  • consent_tp6.pdf
  • consent_tp7.pdf
  • consent_tp8.pdf
  • consent_tp9.pdf

B.4 Completed Feedback Questionnaires

For each test user, the completed feedback questionnaire was scanned as PDF:

B.5 Data Log Forms

For each test user, the log file written by the scribe was scanned as PDF:

C Test Videos

C.1 External Videos

For each user, the entire test was captured with a tripod-mounted external video camera:

Tree of Life Test:

  • TP0 "Kim": tp0-tol-ext.mp4.

  • TP1 "Marshall": tp1-tol-ext.mp4.

  • TP2 "Teyana": tp2-tol-ext.mp4.

  • TP3 "Jermaine": tp3-tol-ext.mp4.

  • TP4 "Aubrey": tp4-tol-ext.mp4.

  • TP5 "Kendrick": tp5-tol-ext.mp4.

  • TP6 "Anthony": tp6-tol-ext.mp4.

  • TP7 "Arlissa": tp7-tol-ext.mp4.

  • TP8 "Gary": tp8-tol-ext.mp4.

  • TP9 "Robyn": tp9-tol-ext.mp4.

Product Tree Test:

  • TP0 "Kim": tp0-prod-ext.mp4.

  • TP1 "Marshall": tp1-prod-ext.mp4.

  • TP2 "Teyana": tp2-prod-ext.mp4.

  • TP3 "Jermaine": tp3-prod-ext.mp4.

  • TP4 "Aubrey": tp4-prod-ext.mp4.

  • TP5 "Kendrick": tp5-prod-ext.mp4.

  • TP6 "Anthony": tp6-prod-ext.mp4.

  • TP7 "Arlissa": tp7-prod-ext.mp4.

  • TP8 "Gary": tp8-prod-ext.mp4.

  • TP9 "Robyn": tp9-prod-ext.mp4.

C.2 Session Capture Videos

For each user, the test session on the computer was captured with Morae and then exported in MP4 format:

Tree of Life Test:

  • TP0 "Kim": tp0-tol.mp4.

  • TP1 "Marshall": tp1-tol.mp4.

  • TP2 "Teyana": tp2-tol.mp4.

  • TP3 "Jermaine": tp3-tol.mp4.

  • TP4 "Aubrey": tp4-tol.mp4.

  • TP5 "Kendrick": tp5-tol.mp4.

  • TP6 "Anthony": tp6-tol.mp4.

  • TP7 "Arlissa": tp7-tol.mp4.

  • TP8 "Gary": tp8-tol.mp4.

  • TP9 "Robyn": tp9-tol.mp4.

Product Tree Test:

  • TP0 "Kim": tp0-prod.mp4.

  • TP1 "Marshall": tp1-prod.mp4.

  • TP2 "Teyana": tp2-prod.mp4.

  • TP3 "Jermaine": tp3-prod.mp4.

  • TP4 "Aubrey": tp4-prod.mp4.

  • TP5 "Kendrick": tp5-prod.mp4.

  • TP6 "Anthony": tp6-prod.mp4.

  • TP7 "Arlissa": tp7-prod.mp4.

  • TP8 "Gary": tp8-prod.mp4.

  • TP9 "Robyn": tp9-prod.mp4.

C.3 Session Capture Video Transcripts

The coding scheme for session capture video transcripts is shown in Table C.1. A chronological protocol of events during the test sessions is given in Tables C.2 through C.21. Times are given in minutes and seconds of elapsed time on the session capture video.

MarkerColour (r,g,b)Event
S (142,175,215) Start task.
E (172,205,245) End task.
N (248,180,203) Negative observation (problem).
P (178,208,129) Positive observation.
Q (255,242,230) Quote or comment from user.
F (230,230,230) Facilitator prompts user.
Table C.1: Coding scheme for session capture transcripts.
Time
mm:ss
Marker Event Description
TP0 "Kim" Product Tree PC
00:23.64 S Task 1
00:32.29 E Task 1
00:35.34 S Task 2
01:15.09 N Doesn't use zooming - needs tutorial or other prompt. N.05
01:28.96 E Task 2
01:37.42 S Task 3
01:56.46 E Task 3
02:01.93 S Task 4
02:30.43 F Prompted user to check for zooming
02:37.43 P Very happy with zooming
03:15.20 E Task 4
03:23.19 S Task 5
03:24.18 P Star was used P.03
04:03.34 N Needs search N.03
05:03.12 N Not all labels show - therefore the user can't see the basketballs node
05:55.35 E Task 5
Table C.2: Transcript of the session capture video of the Product Tree for user TP0 "Kim".
Time
mm:ss
Marker Event Description
TP0 "Kim" Tree of Life Tablet
00:22.80 S Task 1
00:31.02 P Interaction a lot better the second time around
00:45.67 E Task 1
00:48.09 S Task 2
01:27.63 N Annoyed because of lack of success
02:04.38 N Wikipedia is not shown as default on tablets - it must be drawn in from the side and is therefore not usable. Therefore no further tests were done wit the ToL on tablet
03:44.36 E Task 2
04:54.54 N Tablet performance not good N.04
05:11.40 S Task 3
05:35.69 N Lack of search or overview makes it hard to find something you already had found N.01
06:30.03 E Task 3
06:33.82 S Task 4
06:38.30 N Crash N.02
07:27.95 N Lack of search makes hard to find certain nodes/trees N.03
07:58.91 E Task 4
08:04.56 S Task 5
08:20.88 F Explained trees/hierarchy a bit - no prior knowledge
09:35.32 N Is at the correct location but doesn't understand that they need to proceed deeper into the tree - lack of introduction to the concept N.05
12:34.64 E Task 5
Table C.3: Transcript of the session capture video of the Tree of Life for the user TP0 "Kim".
Time
mm:ss
Marker Event Description
TP1 "Marshall" Tree of Life PC
00:23.26 S Task 1
00:40.59 P Likes the Emoji P.01
01:42.92 E Task 1
01:50.39 S Task 2
02:11.46 N Doesn't know there's a zoom function - has problems because of it (tutorial/introduction) N.05
03:54.88 E Task 2
04:01.69 N Crash by zooming. N.02
04:28.11 S Task 3
05:14.56 E Task 3
05:24.20 S Task 4
05:47.67 N Hard to notice the elephant even though it's on the screen - overview N.01
06:14.51 E Task 4
06:18.04 S Task 5
07:38.06 N No search-function and lack of biology makes it hard to find explicit nodes N.03
08:26.21 F Explained the concept of trees - subtrees/sub-sections to make the search simpler
11:18.23 N Crash by zooming N.02
12:55.99 N Crash N.02
14:13.10 P Found Wiki-Search - Used it to find out more about the Howler Monkey P.02
16:14.37 E Task 5
Table C.4: Transcript of the session capture video of the Tree of Life for the user TP1 "Marshall".
Time
mm:ss
Marker Event Description
TP1 "Marshall" Product Tree Tablet
00:21.19 S Task 1
00:29.64 P Tablet functionality was simple to this test user
00:32.65 E Task 1
00:34.70 S Task 2
00:37.61 Q Humor.
00:46.60 E Task 2
00:49.69 S Task 3
01:06.37 N Tablet performance N.04
02:22.16 N Data inconsistency leads to confusion
02:48.50 E Task 3
02:56.84 S Task 4
04:20.19 E Task 4
04:27.12 S Task 5
04:45.41 N Zooming into "infinity" is possible - which seems like a crash to the user
06:03.78 E Task 5
Table C.5: Transcript of the session capture video of the Product Tree for the user TP1 "Marshall".
Time
mm:ss
Marker Event Description
TP2 "Teyana" Tree of Life PC
00:20.27 S Task 1
00:31.31 N No interaction - only looking at the screen (tutorial,introduction) N.05
01:21.92 F Told test user that they are able to interact with the application
02:32.13 P Emoji helpful P.01
02:36.11 N Can't figure out how to zoom (tutorial,introduction) N.05
03:24.34 F Gave some advice because the user was discouraged from not knowing how to zoom
03:30.76 P Wiki P.02
04:29.42 E Task 1
04:33.59 S Task 2
05:20.21 E Task 2
05:29.74 S Task 3
05:33.26 N Crash N.02
06:35.21 N Crash N.02
07:37.89 E Task 3
07:44.63 S Task 4
08:59.41 E Task 4
09:10.61 N User critiques the lack of overview N.01
09:26.54 S Task 5
10:50.05 E Task 5
Table C.6: Transcript of the session capture video of the Tree of Life for the user TP2 "Teyana".
Time
mm:ss
Marker Event Description
TP2 "Teyana" Product Tree Tablet
00:13.61 S Task 1
00:17.09 N Tablet Performance N.04
00:53.29 E Task 1
01:03.15 E Task 2
01:38.64 N Labels clipping - hard to read
02:13.17 E Task 2
02:20.00 S Task 3
02:24.16 P Star - backwards movement
03:10.27 N Data inconsistency
03:30.24 E Task 3
03:36.35 S Task 4
03:38.36 P Another compliment for the star
03:52.11 P Alphabetical order in dataset is good
04:20.86 Q User would like to have the possibility to rotate the entire tree
05:12.32 E Task 4
05:14.70 S Task 5
05:40.81 N User overwhelmed - overview/search would be good N.01
06:17.54 E Task 5
Table C.7: Transcript of the session capture video of the Product Tree for the user TP2 "Teyana".
Time
mm:ss
Marker Event Description
TP3 "Jermaine" Tree of Life PC
00:18.36 S Task 1
00:20.31 P Wiki P.02
01:12.49 N Search requested - needed N.03
05:07.04 E Task 1
05:10.66 S Task 2
06:29.19 E Task 2
06:38.91 S Task 3
06:45.52 N Crash N.02
08:15.51 E Task 3
08:22.63 S Task 4
11:03.22 E Task 4
11:11.11 S Task 5
15:24.45 E Task 5
Table C.8: Transcript of the session capture video of the Tree of Life for the user TP3 "Jermaine".
Time
mm:ss
Marker Event Description
TP3 "Jermaine" Product Tree Tablet
00:13.09 S Task 1
00:19.62 E Task 1
00:22.32 S Task 2
00:47.99 E Task 2
00:53.67 S Task 3
01:25.52 E Task 3
01:30.10 S Task 4
01:38.54 N Tablet Performance N.04
03:25.37 E Task 4
03:32.94 S Task 5
03:48.94 N Can't find Sporting Goods - overview lacking N.01
06:36.22 E Task 5
Table C.9: Transcript of the session capture video of the Product Tree for the user TP3 "Jermaine".
Time
mm:ss
Marker Event Description
TP4 "Aubrey" Tree of Life PC
00:09.42 S Task 1
00:25.50 E Task 1
00:34.12 S Task 2
00:56.64 E Task 2
01:03.78 S Task 3
01:40.24 E Task 3
01:45.75 S Task 4
02:09.29 N Crash N.02
02:40.63 N Crash N.02
04:08.94 E Task 4
04:14.57 S Task 5
05:17.60 N Crash N.02
08:37.76 E Task 5
Table C.10: Transcript of the session capture video of the Tree of Life for the user TP4 "Aubrey".
Time
mm:ss
Marker Event Description
TP4 "Aubrey" Product Tree Tablet
00:07.84 S Task 1
00:16.04 E Task 1
00:19.28 S Task 2
00:23.68 N Tablet Performance N.04
00:43.45 E Task 2
00:47.84 S Task 3
01:19.93 E Task 3
01:21.52 S Task 4
03:15.58 E Task 4
03:15.91 S Task 5
03:49.65 E Task 5
Table C.11: Transcript of the session capture video of the Product Tree for the user TP4 "Aubrey".
Time
mm:ss
Marker Event Description
TP5 "Kendrick" Tree of Life PC
00:07.98 S Task 1
01:16.57 E Task 1
01:19.91 S Task 2
01:50.29 E Task 2
02:02.67 S Task 3
03:00.31 E Task 3
03:04.22 S Task 4
04:11.14 N Searching aimlessly - search function would be nice N.03
04:50.08 E Task 4
04:52.21 S Task 5
07:48.73 E Task 5
Table C.12: Transcript of the session capture video of the Tree of Life for the user TP5 "Kendrick".
Time
mm:ss
Marker Event Description
TP5 "Kendrick" Product Tree Tablet
00:09.61 S Task 1
00:31.45 E Task 1
00:34.50 S Task 2
01:53.05 E Task 2
02:04.71 F User was told zooming is an option
02:14.18 S Task 3
02:28.31 N Tablet Performance N.04
03:01.31 E Task 3
03:09.89 S Task 4
05:19.82 P User explains how the application is easy to use once you know how-to
05:50.83 E Task 4
05:51.69 S Task 5
07:16.78 E Task 5
Table C.13: Transcript of the session capture video of the Product Tree for the user TP5 "Kendrick".
Time
mm:ss
Marker Event Description
TP6 "Anthony" Tree of Life PC
00:07.68 S Task 1
00:22.87 N Lacking overview - the bear icon is on the screen but the user can't find it N.01
03:14.04 E Task 1
03:16.01 S Task 2
03:31.85 P Notices Wiki - likes it P.02
03:52.63 E Task 2
03:59.74 S Task 3
04:11.71 N Crash N.02
04:44.68 E Task 3
04:52.27 S Task 4
05:56.93 E Task 4
05:59.44 S Task 5
06:13.97 N Overwhelmed - doesn't know where to start - search would be great here N.03
07:43.37 Q Getting annoyed because of lack of success
09:00.25 E Task 5
Table C.14: Transcript of the session capture video of the Tree of Life for the user TP6 "Anthony".
Time
mm:ss
Marker Event Description
TP6 "Anthony" Product Tree Tablet
00:11.27 S Task 1
00:20.28 E Task 1
00:24.39 S Task 2
00:34.89 N Labels clipping
00:54.73 N Labels clipping
01:56.04 Q Annoyed by lack of success
02:29.41 E Task 2
02:33.75 S Task 3
02:35.75 F Introduced zooming to the user
03:16.21 E Task 3
03:19.56 S Task 4
04:55.15 N Getting annoyed - no success - lack of search/overview N.01
07:06.25 E Task 4
07:11.63 S Task 5
08:06.93 E Task 5
Table C.15: Transcript of the session capture video of the Product Tree for the user TP6 "Anthony".
Time
mm:ss
Marker Event Description
TP7 "Arlissa" Tree of Life PC
00:11.39 S Task 1
00:38.45 E Task 1
00:41.04 S Task 2
01:09.51 E Task 2
01:14.43 S Task 3
01:28.59 E Task 3
01:35.26 S Task 4
02:30.58 E Task 4
02:33.42 S Task 5
03:54.61 N Doesn't understand tree - found one black howler monkey and proceeds to leave the sub-tree (tutorial, introduction) N.05
04:34.16 F Explained how trees and sub-trees work in CS
05:33.06 E Task 5
Table C.16: Transcript of the session capture video of the Tree of Life for the user TP7 "Arlissa".
Time
mm:ss
Marker Event Description
TP7 "Arlissa" Product Tree Tablet
00:06.51 S Task 1
00:12.81 E Task 1
00:15.49 S Task 2
00:33.56 E Task 2
00:35.44 S Task 3
01:09.48 E Task 3
01:15.71 S Task 4
01:59.65 E Task 4
02:02.44 S Task 5
03:06.01 E Task 5
Table C.17: Transcript of the session capture video of the Product Tree for the user TP7 "Arlissa".
Time
mm:ss
Marker Event Description
TP8 "Gary" Tree of Life PC
00:07.58 S Task 1
00:31.04 E Task 1
00:34.72 S Task 2
00:46.20 N Labels clipping in and out
01:15.59 E Task 2
01:21.08 S Task 3
01:23.38 N Crash N.02
02:04.62 E Task 3
02:09.02 S Task 4
02:56.46 N Aimless searching - search function would be handy N.03
05:00.18 E Task 4
05:03.72 S Task 5
06:23.78 E Task 5
Table C.18: Transcript of the session capture video of the Tree of Life for the user TP8 "Gary".
Time
mm:ss
Marker Event Description
TP8 "Gary" Product Tree Tablet
00:11.36 S Task 1
00:23.00 E Task 1
00:26.24 S Task 2
00:37.18 E Task 2
00:45.82 S Task 3
01:39.69 E Task 3
01:42.58 S Task 4
02:15.55 E Task 4
02:18.97 S Task 5
03:27.82 E Task 5
Table C.19: Transcript of the session capture video of the Product Tree for the user TP8 "Gary".
Time
mm:ss
Marker Event Description
TP9 "Robyn" Tree of Life PC
00:07.19 S Task 1
01:46.77 E Task 1
02:02.49 S Task 2
02:49.98 E Task 2
02:55.86 S Task 3
04:54.76 F Explained trees & sub-trees
05:20.88 E Task 3
05:23.58 S Task 4
06:28.70 N Crash N.02
09:22.15 E Task 4
09:32.94 S Task 5
14:53.39 E Task 5
Table C.20: Transcript of the session capture video of the Tree of Life for the user TP9 "Robyn".
Time
mm:ss
Marker Event Description
TP9 "Robyn" Product Tree Tablet
00:14.41 S Task 1
00:23.89 E Task 1
00:27.95 S Task 2
01:11.38 E Task 2
01:18.10 S Task 3
01:47.48 E Task 3
01:53.66 S Task 4
03:23.78 E Task 4
03:29.00 S Task 5
04:29.53 N Basketballs Label doesn't show - user has a hard time
05:26.52 N Lack of introduction of tutorial makes it hard for people to understand the tree concept N.05
06:54.54 E Task 5
Table C.21: Transcript of the session capture video of the Product Tree for the user TP9 "Robyn".