Pink Trombone · appendix

Perceptual test — questionnaire

← Back to Pink Trombone

This appendix documents the questionnaire given to participants in the subjective evaluation. It ran online as a MUSHRA test; the listening questions themselves are available here. The test was originally written in Spanish — this is a faithful English translation of each section.

Introduction & control questions

Introduction

Welcome to our subjective test on intelligibility and the synthesizer’s ability to imitate the human voice. Thank you for taking part.

The aim is to assess the automatic control of an articulatory synthesizer, the Pink Trombone — a tool that mimics human speech by controlling the tongue, lips, vocal cords, and other articulators. Before you begin, get a feel for it at dood.al/pinktrombone.

We want to verify how accurately the Pink Trombone reproduces human vowel sounds that can be correctly interpreted, and how well it imitates a given recording. Please note:

Questions? Contact mateo.camara@upm.es.

Screening questions

Set 1 — interpret a static vowel

Rate how strongly each sound resembles the indicated vowel, from 0 (not that vowel at all) to 100 (perfectly interpreted). (Questions available online.)

Set 2 — interpret a sequence of vowels

Rate how well you perceive the indicated sequence of vowels, from 0 (not at all) to 100 (perfectly interpreted). (Questions available online.)

Set 3 — interpret imitations

Evaluate how well each synthetic sound imitates a human reference. The sounds can not be identical — judge coherence, as if a person were imitating another without being able to change their own vocal tract. Listen to the reference first, then rate from 0 (very poor imitation) to 100 (very credible). (Questions available online.)