0% found this document useful (0 votes)

18 views10 pages

Artigo VoxPlot

The study evaluates the effectiveness of the VOXplot software in analyzing voice quality, specifically focusing on hoarseness and breathiness through acoustic measures. It identifies four key acoustic parameters—harmonics-to-noise ratio (HNR), pitch perturbation quotient (PPQ5), smoothed cepstral peak prominence (CPPS), and glottal-to-noise excitation ratio (GNE)—that correlate strongly with perceptual judgments of voice quality. The findings suggest that VOXplot can reliably discriminate between hoarseness and breathiness in voice assessments.

Uploaded by

Leyrick Emanuelle

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views10 pages

Artigo VoxPlot

Uploaded by

Leyrick Emanuelle

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Journal of

Clinical Medicine

Article
Advances in Clinical Voice Quality Analysis with VOXplot
Ben Barsties v. Latoszek 1, * , Jörg Mayer 2 , Christopher R. Watts 3 and Bernhard Lehnert 4

1 Speech-Language Pathology, SRH University of Applied Health Sciences, 40210 Düsseldorf, Germany
2 Institute for Natural Language Processing, University of Stuttgart, 70049 Stuttgart, Germany;
jmayer@lingphon.net
3 Harris College of Nursing & Health Sciences, Texas Christian University, Fort Worth, TX 76109, USA
4 Department of Oto-Rhino-Laryngology, Phoniatrics and Pedaudiology Division, University Medicine
Greifswald, 17475 Greifswald, Germany
* Correspondence: benjamin.barstiesvonlatoszek@srh.de

Abstract: Background: The assessment of voice quality can be evaluated perceptually with standard
clinical practice, also including acoustic evaluation of digital voice recordings to validate and further
interpret perceptual judgments. The goal of the present study was to determine the strongest acoustic
voice quality parameters for perceived hoarseness and breathiness when analyzing the sustained
vowel [a:] using a new clinical acoustic tool, the VOXplot software. Methods: A total of 218 voice
samples of individuals with and without voice disorders were applied to perceptual and acoustic
analyses. Overall, 13 single acoustic parameters were included to determine validity aspects in
relation to perceptions of hoarseness and breathiness. Results: Four single acoustic measures could
be clearly associated with perceptions of hoarseness or breathiness. For hoarseness, the harmonics-to-
noise ratio (HNR) and pitch perturbation quotient with a smoothing factor of five periods (PPQ5), and,
for breathiness, the smoothed cepstral peak prominence (CPPS) and the glottal-to-noise excitation
ratio (GNE) were shown to be highly valid, with a significant difference being demonstrated for each
of the other perceptual voice quality aspects. Conclusions: Two acoustic measures, the HNR and the
PPQ5, were both strongly associated with perceptions of hoarseness and were able to discriminate
hoarseness from breathiness with good confidence. Two other acoustic measures, the CPPS and the
GNE, were both strongly associated with perceptions of breathiness and were able to discriminate
breathiness from hoarseness with good confidence.

Citation: Barsties v. Latoszek, B.;

Keywords: voice quality analysis; voice diagnostic; acoustic measures; hoarseness; breathiness
Mayer, J.; Watts, C.R.; Lehnert, B.
Advances in Clinical Voice Quality
Analysis with VOXplot. J. Clin. Med.
2023, 12, 4644. https://doi.org/
1. Introduction
10.3390/jcm12144644
Standard clinical practice for the evaluation of voice disorders includes a battery of
Academic Editor: Lee Michael Akst
multidimensional assessments (e.g., visual analysis, auditory-perceptual judgment, aerody-
Received: 12 June 2023 namic analysis, acoustic analysis, and self-assessment [1]) aimed to describe and diagnose
Revised: 4 July 2023 the voice complaint. Voice disorders affect quality, volume, pitch, resonance, flexibility,
Accepted: 8 July 2023 and/or stamina. These vocal changes are the manifestation of disordered respiratory,
Published: 12 July 2023 laryngeal, and vocal tract functions, which might result, in many cases, from heterogeneous
local etiologies [2]. Many voice disorders are associated with abnormal oscillation patterns
of the vocal folds. The resulting voiced energy can vary as a function of vibrational changes
at different vocal fold areas, but especially at the free vocal fold margin. Furthermore,
Copyright: © 2023 by the authors.
the more a critical region of one vocal fold or both vocal folds are affected by laryngeal
Licensee MDPI, Basel, Switzerland.
pathology, the more variation in vocal sound energy and subsequent perceptions of voice
This article is an open access article
quality severity can be expected [3].
distributed under the terms and
Although voice quality is not a clearly defined term, there are two general approaches
conditions of the Creative Commons
to evaluation [4]. First, the subjective approach of listening to the patient’s voice and
Attribution (CC BY) license (https://
creativecommons.org/licenses/by/
assigning a score to different perceptual domains is considered a gold standard approach
4.0/).
for perceptual voice analysis. Second, the use of an objective instrumental approach can be

J. Clin. Med. 2023, 12, 4644. https://doi.org/10.3390/jcm12144644 https://www.mdpi.com/journal/jcm

J. Clin. Med. 2023, 12, 4644 2 of 10

used, in which a specific computer algorithm is applied to recorded voice signals. Examples
of instrumental assessment of voice quality include analysis of the acoustic voice sound
signal and the inverse-filtered oral airflow signal or its derivative. Although many different
terms have been used to describe voice quality, a wide acceptance has been acknowledged
for terms such as hoarseness or overall voice quality, and major subtypes of the general
anomalies in voice quality such as breathiness, roughness, and strain [4,5].
An objective acoustic analysis of voice signals is the most commonly used instrumental
tool in clinical practice and research for objectively characterizing voice disorders [6]. Voice
signals can be analyzed acoustically in the domains of time, frequency, amplitude, and
quefrency. A large number of acoustic measures have been introduced and described to
objectively predict dysphonia types and severities. This is illustrated in a taxonomy by
Buder [6] with 15 signal-processing-based categories. The reliable and valid use of objective
acoustic analysis in research or clinical practice depends on specific requirements (e.g.,
hardware, software, and examination circumstances) to enable voice analysis with high
accuracy and reliability [4,7].
The quantification of voice quality with acoustic methods has traditionally been
analyzed on sustained vowels. Although the assessment of voice quality based on sustained
vowels (SV) does not necessarily correspond to that of continuous speech (CS) [8,9], acoustic
measures from sustained vowels are ubiquitous in research and clinical practice. Acoustic
parameters that correlate strongly with auditory-perceptual judgments are included in
two examples of multiparametric acoustic indices: the acoustic voice quality index (AVQI)
for the evaluation of hoarseness, and the acoustic breathiness index (ABI), which assesses
the hoarseness subtype, breathiness [10]. Both AVQI and ABI have been used with wide
international acceptance for research and clinical practice for a number of reasons: (a) their
multivariate constructs based on linear regression analysis that combines relevant acoustic
markers; (b) the inclusion of both continuous speech and sustained vowels in the acoustic
analysis; (c), signal processing that uses algorithms of the freeware Praat; and (d) a single
score ranging from 0 to 10 for the entire recording being analyzed (i.e., the higher AVQI or
ABI score, the more severe the related anomaly of voice quality, and vice versa) [10].
The acoustic measures of AVQI and ABI include smoothed cepstral peak prominence
(CPPS); harmonics-to-noise ratio (HNR); shimmer percentage; shimmer dB; general slope
of the spectrum (Slope); and tilt of the regression line through the spectrum (Tilt); jitter local;
glottal-to-noise excitation ratio with a maximum frequency of 4500 Hz (GNE); relative level
of high-frequency noise between energy from 0 to 6 kHz and energy from 6 to 10 kHz (HF
Noise); HNR by Dejonckere (HNR-D), which analyses the harmonic shape of the spectral
display by using the frequency bandwidth between 500 and 1500 Hz and a cepstrum
to determine F0, and thus locate the harmonic structure in the long-term average of the
spectrum; differences between the amplitude of the first and second harmonics in the
spectrum (H1H2); and period standard deviation (PSD).
Next to AVQI and ABI, a third multivariate index with a long tradition in the evaluation
of overall voice quality on sustained vowels is the dysphonia severity index (DSI) [11,12].
The DSI includes four voice parameters (jitter local; highest frequency and lowest intensity
of a voice range profile; and maximum phonation time), in which jitter local is the only
acoustic single parameter directly associated with voice quality. To use the DSI with Praat
algorithms for signal processing the pitch perturbation quotient was considered in place of
jitter local [13].
VOXplot (Lingphon, Straubenhardt, Germany; https://voxplot.lingphon.com, ac-
cessed on 11 June 2023) is a new freeware application for acoustic voice quality analysis
based on the Praat algorithms for signal processing. Whereas Praat is a versatile and
correspondingly complex software for acoustic analysis of arbitrary signals, VOXplot is
specifically tailored to the analysis of voice quality. With Praat, only the algorithms are
used, while the user interface of VOXplot is designed to meet the demands of standard-
ized and intuitive ease of use for clinicians and researchers. VOXplot covers the entire
workflow of acoustic voice quality assessment: recording and recording quality assess-
J. Clin. Med. 2023, 12, 4644 3 of 10

ment, acoustic voice quality analysis, and generation of a concise PDF (or JPEG/PNG)
sheet with the analysis results. The core analysis of VOXplot is the voice quality analyses
of continuous speech and sustained vowels with AVQI and ABI. VOXplot is currently
available in 12 analysis languages for AVQI and ABI, which are based on more than one
decade of research knowledge [14,15]. The validation results of both indices relate only to
an objective evaluation of the hoarseness and breathiness levels for heterogeneous voice
disorders in comparison with vocally healthy volunteers with no further specification of a
specific disorder or vocal symptom. The usability of VOXplot is currently available in three
interface languages. Further details of sustained vowels can be analyzed qualitatively with
the narrowband spectrogram and quantitatively with single acoustic parameters.
As mentioned before, AVQI, ABI, and DSI are used in combination with highly sen-
sitive acoustic markers for the evaluation of hoarseness and breathiness. However, a
direct comparison of these objective metrics using the VOXplot application with perceptual
ratings of hoarseness or breathiness is missing. Therefore, the aim of this study was to
compare the concurrent validity and diagnostic validity outcomes of 13 single acoustic
voice quality measures between hoarseness and breathiness aspects on sustained vowels.

2. Materials and Methods

2.1. Participants
In the present study, the voice recordings and auditory-perceptual judgment of hoarse-
ness and breathiness acquired in a previous study [16] were applied to new analyses.
The group of dysphonic participants consisted of 175 patients with various organic and
nonorganic voice disorders and various degrees of dysphonia severity. The control group
of 43 vocally healthy volunteers reported no voice complaints, history of voice, speech, or
hearing problems, and no impact of voice problems as measured with the voice handicap
index [17].
Table 1 summarizes the demographic data and the types of dysphonia for the two
groups. For further details regarding the data and recording acquisition, and inclusion and
exclusion criteria, we refer to Barsties v. Latoszek et al. (2020) [16].

Table 1. Demographic data and types of voice disorders of the dysphonia and control groups.

Gender Age in Years

Group Type of Dysphonia Number
Female Male Mean SD
Carcinoma of head
55 13 42 61.25 10.18
and neck
Functional
38 26 12 52.11 16.48
dysphonia
Larynx carcinoma 28 1 27 69.96 9.05
Paralyses 25 14 11 63.36 16.09
Nodules 8 5 3 33.25 19.43
Dysphonia
Reflux laryngitis 4 4 0 54.50 5.45
Group
Cancer of unknown
4 2 2 61.00 8.21
primary syndrome
Mutational falsetto 3 0 3 15.67 3.06
Leukoplakia 2 0 2 57.00 8.49
Granuloma 2 0 2 42.00 11.31
Laryngitis 2 1 1 39.50 12.02
Parkinson’s 2 0 2 74.00 11.31
Polyp 1 0 1 60.00 -
Laryngeal trauma 1 0 1 78.00 -
Control
None 43 23 20 26.79 7.06
group
Abbreviation. SD = standard deviation.

All the participants gave their informed consent for inclusion before they participated
in the study. The study was conducted in accordance with the Declaration of Helsinki, and
the protocol was approved by the Ethics Committee of Greifswald University (BB072/16).
J. Clin. Med. 2023, 12, 4644 4 of 10

2.2. Auditory-Perceptual Judgment

For the auditory-perceptual judgment ratings, a panel of three male experts specialized
in voice disorders with experience ranging from 8 to 31 years was used. The GRBAS
scale was used for data collection. Each listener rated ordinally on a four-point scale the
hoarseness level, which is represented in the G-parameter (Grade), and the breathiness
severity, which is represented in the B-parameter (which represents the degree of the extent
of air leakage through the glottis).
For further details regarding the rating scale, rating procedure, anchor voices, reliabil-
ity results of the raters, and deviation of the rating level results from the expert panel for
hoarseness and breathiness, we refer to Barsties v. Latoszek et al. (2020) [16].

2.3. Acoustic Measurements

The acoustic analyses were conducted only on recordings of the sustained vowel [a:]
across 3 s of the mid-vowel segment from a single trial. The [a:] vowel was used as a typical
open front vowel for the clinical and scientific acoustic tasks, which is easily recognized
regardless of the native language, linguistic competence, or individual health problems
(e.g., hearing disorders) from the test person in comparison to other vowels [18,19]. These
sound files were applied to a new analysis using VOXplot. In total, 13 single voice quality
parameters were acquired from each recording, which are listed in Table 2.

Table 2. List of 13 acoustic measures for the voice quality evaluation.

Category Acoustic Measures Abbreviation

Smoothed cepstral peak prominence is the
distance between the first harmonic peak and
the point with equal quefrency on the CPPS (dB)
regression line through the smoothed
cepstrum.
Differences between the amplitudes of the first
and second harmonics in the spectrum. To
H1H2 (dB)
localize the first harmonic peak, a cepstrum
was performed for F0 determination.
Relative level of high-frequency noise between
energy from 0 to 6 kHz and energy from 6 to HF-Noise (dB)
10 kHz.
Harmonics-to-noise ratio is the base
10 logarithm of the ratio between the periodic
HNR (dB)
energy and the noise energy, multiplied by
10 HNR.
Fourier and linear prediction Harmonics-to-noise ratio from Dejonckere and
coefficient spectra Lebacq, which analyzes the harmonic
emergence of the spectral display comprised
within the frequency bandwidth between
HNR-D (dB)
500 Hz and 1500 Hz. A cepstrum was
performed to determine F0 and thus to localize
the harmonic structure in the long-term
average spectrum.
General slope of the spectrum is defined as the
difference between the energy within
0–1000 Hz and the energy within Slope (dB)
1000–10,000 Hz of the long-term average
spectrum.
Tilt of the regression line through the spectrum
is the difference between the energy within
0–1000 Hz and the energy within 1000–10,000 Tilt (dB)
Hz of the trendline through the long-term
average spectrum.
J. Clin. Med. 2023, 12, 4644 5 of 10

Table 2. Cont.

Category Acoustic Measures Abbreviation

Period standard deviation is the variation in
Frequency of short-term the standard deviation of periods in which the
PSD (ms)
perturbation measures length of the sample is important for a valid
computation of the standard deviation.
Two jitter variations:
Jitter local is the average difference between
Jitter local (%)
successive periods, divided by the average
period.
Frequency of short-term
Jitter of the five-point period perturbation
perturbation measures
quotient is the average absolute difference
between a period and the average of it and its PPQ5 (%)
four closest neighbors, divided by the average
period.
Two shimmer variations:
Shimmer local is the absolute mean difference
Shimmer (%)
between the amplitudes of successive periods,
Amplitude of short-term divided by the average amplitude.
perturbations measures
Shimmer local dB is the base 10 logarithm of
the difference between the amplitudes of Shimmer (dB)
successive periods, multiplied by 20.
Combines spectral and The glottal-to-noise-excitation (GNE) ratio
GNE
perturbation features with a maximum frequency of 4500 Hz.

2.4. Statistics
The association of the 13 acoustic parameters with the two auditory-perceptual evalu-
ations of hoarseness and breathiness from 218 recorded voice samples was investigated by
calculating Spearman’s rank correlation coefficients. An absolute correlation score of ≥0.70
is marked as a high relationship for the concurrent validity aspect between the acoustic
parameter and the perceived voice quality evaluation [20].
The Fisher r-to-z transformation was used to assess the statistical significance of the
two correlation coefficients from the outcomes of the acoustic parameter and perceived
hoarseness vs. perceived breathiness levels.
A receiver operating characteristic (ROC) curve was then generated in order to analyze
the diagnostic accuracy of the 13 acoustic metrics according to sensitivity (results of the
participants with hoarseness or breathiness) and specificity (results of participants without
hoarseness or breathiness). The power of the acoustic markers to discriminate between
the absence and presence of hoarseness or breathiness was estimated using the area under
the ROC curve (AROC ). An AROC of >0.90 is considered to be exceptionally good; an AROC
of <0.70 is considered to be low, and an AROC of ≤ 0.50 corresponds to a chance level of
diagnostic accuracy [21]. In order to find the optimal threshold value that best differentiates
between without and with hoarseness or breathiness, the Youden index (a measure that
uses a receiver operating characteristic to determine which threshold value is best suited to
distinguish two groups in a measurement) was calculated as sensitivity + specificity − 1.
The significant differences between the two ROC curves (calculated for hoarseness
and breathiness) of the acoustic measures were determined by the difference between the
areas under the curves [22].
The statistical analyses were performed using SPSS, version 23, for Windows (IBM
Corp., Armonk, NY, USA). The tests of significance between the two correlation coefficients
and between the areas under two independent ROC curves were analyzed on VassarStats
(R. Lowry, Vassar College, NY, USA, 1998–2023; http://vassarstats.net/, accessed on
11 June 2023). Results were considered statistically significant at p ≤ 0.05.
J. Clin. Med. 2023, 12, 4644 6 of 10

3. Results
Table 3 presents the validation outcomes for the 13 single acoustic voice quality
parameters in direct comparison to the auditory-perceptual ratings of hoarseness and
breathiness. The thresholds with sensitivity and specificity, based on the ROC statistics and
the Youden Index, are also listed in Table 3.

Table 3. Validation results of the 13 single acoustic voice quality parameters of the sustained vowel
phonation [a:].

Voice Quality Parameters Validation Parameters Hoarseness Breathiness

Correlation −0.76 * −0.81 *
AROC 0.823 * 0.915 **
CPPS (dB) Threshold 15.02 dB 14.47 dB
Sensitivity 84.7% 88.1%
Specificity 71.2% 81.7%
Correlation −0.70 −0.78 *
AROC 0.798 * 0.886 *
GNE Threshold 0.91 0.89
Sensitivity 88.9% 91.7%
Specificity 62.3% 74.3%
Correlation 0.03 0.12
AROC 0.448 0.584
H1H2 (dB) Threshold Chance−level based on AROC 6.39 dB
Sensitivity Chance−level based on AROC 40.4%
Specificity Chance−level based on AROC 82.6%
Correlation −0.71 * −0.56
AROC 0.812 * 0.794 *
HNR (dB) Threshold 23.34 dB 23.34 dB
Sensitivity 90.3% 78.9%
Specificity 62.9% 68.5%
Correlation −0.57 −0.38
AROC 0.760 * 0.701 *
HNR-D (dB) Threshold 31.77 dB 24.23 dB
Sensitivity 61.1% 77.1%
Specificity 80.8% 53.2%
Correlation −0.48 −0.49
AROC 0.698 0.728 *
HF noise (dB) Threshold 2.28 dB 2.29 dB
Sensitivity 80.6% 77.1%
Specificity 54.1% 62.4%
Correlation 0.68 0.57
AROC 0.839 * 0.808 *
Jitter local (%) Threshold 0.50% 0.57%
Sensitivity 70.8% 71.0%
Specificity 84.7% 78.0%
Correlation 0.71 * 0.55
AROC 0.833 * 0.799 *
PPQ5 (%) Threshold 0.29% 0.32%
Sensitivity 67.2% 67.0%
Specificity 84.5% 75.9%
Correlation 0.59 0.41
AROC 0.802 * 0.730 *
PSD (ms) Threshold 0.00012 ms 0.00018 ms
Sensitivity 65.3% 50.5%
Specificity 81.9% 88.1%
Correlation 0.65 0.53
AROC 0.773 * 0.780 *
Shimmer (%) Threshold 3.08% 3.58
Sensitivity 53.5% 57.0%
Specificity 91.7% 90.8%
J. Clin. Med. 2023, 12, 4644 7 of 10

Table 3. Cont.

Voice Quality Parameters Validation Parameters Hoarseness Breathiness

Correlation 0.66 0.55
AROC 0.783 * 0.786 *
Shimmer (dB) Threshold 0.27 dB 0.33 dB
Sensitivity 54.9% 57.9%
Specificity 91.7% 91.7%
Correlation −0.09 −0.11
AROC 0.617 0.602
Slope (dB) Threshold −25.08 dB −25.34 dB
Sensitivity 81.9% 80.7%
Specificity 39.7% 43.1%
Correlation 0.30 0.43
AROC 0.592 0.673
Tilt (dB) Threshold −10.32 dB −11.73 dB
Sensitivity 34.9% 81.7%
Specificity 86.1% 46.8%
* High correlation or high AROC indicating a marked relationship in concurrent validity or sufficient diagnostic
accuracy; ** exceptionally good diagnostic accuracy level. Darker grey boxes indicate nonsignificant differences of
p > 0.05 (corresponding to Fisher r-to-z transformation for correlation results and/or significant differences in
ROC results of AROC ).

For hoarseness, a strong correlation was present for CPPS, HNR, and PPQ5. No
acoustic parameter reached an exceptionally good level of AROC , and 4 of the 13 acoustic
parameters revealed a low level of AROC , in which one of them was characterized by a
chance level in diagnostic accuracy (H1H2).
For breathiness, a strong correlation was present for CPPS and GNE. However, GNE
reached an exceptionally good AROC result, and 9 of the remaining 12 acoustic parameters
had a strong level of diagnostic accuracy.
To assign a single acoustic voice quality parameter with high validity to a type of
voice abnormality, (a) the absolute correlation value and the AROC had to be >0.70, and
(b) significant differences in validity performances between hoarseness and breathiness
must be obtained in the correlation results or the AROC outcomes. According to the results
listed in Table 3 for hoarseness, two acoustic parameters could be identified as highly
valid (HNR and PPQ5) in comparison to breathiness. For breathiness, two acoustic metrics
(CPPS and GNE) were also revealed to have outstanding validity results in comparison
to hoarseness.

4. Discussion
The aim of the present study was to investigate the validity of single acoustic pa-
rameters representing voice quality characteristics of hoarseness or breathiness in a direct
comparison of the auditory-perceptual voice quality ratings of those domains from sus-
tained vowel [a:] phonation. Although multiparametric models are preferred in highly
valid evaluations of hoarseness or breathiness [4,9,23,24], single acoustic parameters are
mostly used in clinical practice and recommended protocols for instrumental assessment
of voice [7]. The present study attempted to reveal the most relevant acoustic markers
for hoarseness and breathiness from a pool of metrics, which are already part of relevant
multiparametric models in the evaluation of voice quality, such as DSI, AVQI, and ABI.
In general, the results from the initial AVQI and ABI studies were confirmed by the
present study, with comparable results to the correlation coefficients for hoarseness and
breathiness [9,24]. Although continuous speech was also considered in the voice quality
evaluation for AVQI and ABI, CPPS and HNR showed high agreement for hoarseness,
and CPPS and GNE presented the strongest results for breathiness. Because perceptions
of breathiness are associated with high irregularity in the acoustic spectrum (e.g., a lot
of spectral aperiodicity or noise), while perceptions of hoarseness can be associated with
multidimensional acoustic factors other than spectral aperiodicity, it was logical that the
discriminative ability of CPPS (which measures the periodicity in the acoustic spectrum)
for breathiness was significantly higher than for hoarseness in this study. Originally, CPPS
J. Clin. Med. 2023, 12, 4644 8 of 10

was developed for the vocal quality abnormality of breathiness [25], in which breathiness
is a main subtype of hoarseness [24]. Just like GNE, which was also developed for the eval-
uation of breathiness [26], the present study confirmed its strength in the evaluation of this
voice quality aspect with significantly higher concurrent validity and diagnostic accuracy.
A clearer unique identifier for hoarseness versus breathiness was shown in this study
by the two parameters HNR and PPQ5. In the case of HNR, it is the second most important
acoustic parameter in the AVQI formula after CPPS, which is supported by the results of
this study [9]. The findings of this study suggest that HNR is a general parameter that does
not necessarily correspond to other strong breathiness measures such as CPPS or GNE.
Only PPQ5 achieved a sufficiently high agreement with hoarseness and was significantly
differentiated from breathiness in the current study. This result was contrary to the results
of the original study on AVQI by Maryn et al. (2010) [9]. Furthermore, in a meta-analysis
on the evaluation of hoarseness, jitter parameters generally ranked significantly lower
than spectral or cepstral parameters and some shimmer markers [27], but, according to the
present results, PPQ5 seems to be robust enough to assess hoarseness in the evaluation of
sustained vowels, which may explain why this parameter is included in the DSI formula.
J. Clin. Med. 2023, 12, x FOR PEER REVIEW The new developments based on the present study were updated in VOXplot and 9 of
are11
available from version 2.0 (see Figure 1).

(a) (b)
Figure 1. VOXplot version 2.0: (a) the user interface for preparing the acoustic analysis of continu-
Figure 1. VOXplot version 2.0: (a) the user interface for preparing the acoustic analysis of continuous
ous speech and/or sustained vowels selected in the English language with the analysis language
speech and/or sustained vowels selected in the English language with the analysis language German
German for the thresholds evaluations of AVQI and ABI; (b) the outcome of the main voice qual-
for
itythe thresholds
parameters inevaluations of AVQI
VOXplot, which areand ABI; (b)quantitatively
evaluated the outcome of the main
and/or voice quality
qualitatively parameters
for hoarseness
in VOXplot,
and which are evaluated quantitatively and/or qualitatively for hoarseness and breathiness.
breathiness.

5. Conclusions
For the voice quality evaluation on the sustained vowel HNR and PPQ5 (for hoarse-
ness), and CPPS and GNE (for breathiness) yielded the highest significant validity results
compared to each of the other voice quality aspect.” These four acoustic parameters
should have priority in the evaluation of hoarseness and breathiness and are prominently
included in VOXplot (e.g., in the voice quality circle plot).
J. Clin. Med. 2023, 12, 4644 9 of 10

5. Conclusions
For the voice quality evaluation on the sustained vowel HNR and PPQ5 (for hoarse-
ness), and CPPS and GNE (for breathiness) yielded the highest significant validity results
compared to each of the other voice quality aspect.” These four acoustic parameters should
have priority in the evaluation of hoarseness and breathiness and are prominently included
in VOXplot (e.g., in the voice quality circle plot).

Author Contributions: Conceptualization, B.B.v.L., B.L., C.R.W. and J.M.; methodology, B.B.v.L., C.R.W.
and B.L.; software, J.M.; validation, B.B.v.L. and B.L.; formal analysis, B.B.v.L.; resources, B.L. and
B.B.v.L.; data curation, B.L.; writing—original draft preparation, B.B.v.L. and J.M.; writing—review and
editing, C.R.W. and B.L. All authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: The study was conducted in accordance with the Declaration
of Helsinki and approved by the Ethics Committee of Greifswald University (protocol code: BB072/16
and date of approval: 05-04-2016).
Informed Consent Statement: Informed consent was obtained from all the subjects involved in
the study.
Data Availability Statement: The original contributions presented in the study are included in the
article; further inquiries can be directed to the corresponding author.
Conflicts of Interest: J.M. is the developer of the software, VOXplot, and the owner of the company
lingphon.de (Straubenhardt, Germany). B.B.v.L. created the ABI and contributed to the development
of AVQI v.03. He also acts as a scientific advisor in the creation of the VOXplot software.

References
1. Dejonckere, P.H.; Bradley, P.; Clemente, P.; Cornut, G.; Crevier-Buchman, L.; Friedrich, G.; Van De Heyning, P.; Remacle, M.;
Woisard, V.; Committee on Phoniatrics of the European Laryngological Society (ELS). A basic protocol for functional assessment of
voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques.
Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS). Eur. Arch. Otorhinolaryngol.
2001, 258, 77–82. [CrossRef] [PubMed]
2. Verdolini, K.; Rosen, C.A.; Branski, R.C. Classification manual for voice disorders-I. In Special Interest Division 3, Voice and Voice
Disorders, American Speech-Language-Hearing Association; Lawrence Erlbaum Associates, Inc.: Mahwah, NJ, USA, 2006.
3. Fleischer, S.; Hess, M. The significance of videostroboscopy in laryngological practice. HNO 2006, 54, 628–634. [CrossRef]
[PubMed]
4. Barsties, B.; De Bodt, M. Assessment of voice quality: Current state-of-the-art. Auris Nasus Larynx 2015, 42, 183–188. [CrossRef]
[PubMed]
5. Shrivastav, R. Evaluating voice quality. In Handbook of Voice Assessments; Ma, E.P.M., Yiu, E.M.L., Eds.; Singular Publishing Group:
San Diego, CA, USA, 2011; pp. 305–318.
6. Buder, E.H. Acoustic analysis of voice quality: A tabulation of algorithms 1902–1990. In Voice Quality Measurement; Kent, R.D.,
Ball, M.J., Eds.; Singular Publishing Group: San Diego, CA, USA, 2000; pp. 119–244.
7. Patel, R.R.; Awan, S.N.; Barkmeier-Kraemer, J.; Courey, M.; Deliyski, D.; Eadie, T.; Paul, D.; Švec, J.G.; Hillman, R. Recommended
protocols for instrumental assessment of voice: American Speech-Language-Hearing Association expert panel to develop a
protocol for instrumental assessment of vocal function. Am. J. Speech Lang. Pathol. 2018, 27, 887–905. [CrossRef]
8. Maryn, Y.; Roy, N. Sustained vowels and continuous speech in the auditory-perceptual evaluation of dysphonia severity. J. Soc.
Bras. Fonoaudiol. 2012, 24, 107–112. [CrossRef] [PubMed]
9. Maryn, Y.; Corthals, P.; Van Cauwenberge, P.; Roy, N.; De Bodt, M. Toward improved ecological validity in the acoustic
measurement of overall voice quality: Combining continuous speech and sustained vowels. J. Voice 2010, 24, 540–555. [CrossRef]
10. Barsties v. Latoszek, B.; Mathmann, P.; Neumann, K. The cepstral spectral index of dysphonia, the acoustic voice quality index and
the acoustic breathiness index as novel multiparametric indices for acoustic assessment of voice quality. Curr. Opin. Otolaryngol.
Head Neck Surg. 2021, 29, 451–457. [CrossRef]
11. Sobol, M.; Sielska-Badurek, E.M. The Dysphonia Severity Index (DSI)-normative values. Systematic review and meta-analysis.
J. Voice 2022, 36, 143.e9–143.e13. [CrossRef]
12. Uloza, V.; Barsties, V.; Latoszek, B.; Ulozaite-Staniene, N.; Petrauskas, T.; Maryn, Y. A comparison of Dysphonia Severity Index
and Acoustic Voice Quality Index measures in differentiating normal and dysphonic voices. Eur. Arch. Otorhinolaryngol. 2018, 275,
949–958. [CrossRef]
J. Clin. Med. 2023, 12, 4644 10 of 10

13. Maryn, Y.; Morsomme, D.; De Bodt, M. Measuring the Dysphonia Severity Index (DSI) in the program Praat. J. Voice 2017, 31,
644.e29–644.e40. [CrossRef]
14. Batthyany, C.; Barsties, V.; Latoszek, B.; Maryn, Y. Meta-Analysis on the Validity of the Acoustic Voice Quality Index. J. Voice 2022,
in press. [CrossRef] [PubMed]
15. Barsties v. Latoszek, B.; Kim, G.H.; Delgado Hernandez, J.; Hosokawa, K.; Englert, M.; Neumann, K.; Hetjens, S. The validity of
the Acoustic Breathiness Index in the evaluation of breathy voice quality: A Meta-Analysis. Clin. Otolaryngol. 2021, 46, 31–40.
[CrossRef] [PubMed]
16. Barsties v. Latoszek, B.; Lehnert, B.; Janotte, B. Validation of the Acoustic Voice Quality Index Version 03.01 and Acoustic
Breathiness Index in German. J. Voice 2020, 34, 157.e17–157.e25. [CrossRef] [PubMed]
17. Nawka, T.; Wiesmann, U.; Gonnermann, U. Validation of the German version of the Voice Handicap Index. HNO 2003, 51,
921–930. [CrossRef]
18. Franca, M.C. Acoustic comparison of vowel sounds among adult females. J. Voice. 2012, 26, 671.e9–671.e17. [CrossRef]
19. Brockmann, M.; Drinnan, M.J.; Storck, C.; Carding, P.N. Reliable jitter and shimmer measurements in voice clinics: The relevance
of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task. J. Voice. 2011, 25, 44–53. [CrossRef]
20. Frey, L.R.; Botan, C.H.; Friedman, P.G.K.G. Investigating Communication: An Introduction to Research Methods; Prentice-Hall:
Englewood Cliffs, NJ, USA, 1991.
21. Hosmer, D.W.; Lemeshow, S. Applied Logistic Regression, 2nd ed.; John Wiley & Sons: Hoboken, NJ, USA, 2000; pp. 156–164.
22. Hanley, J.A.; McNeil, B.J. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 1982,
143, 29–36. [CrossRef] [PubMed]
23. Jayakumar, T.; Benoy, J.J. Acoustic Voice Quality Index (AVQI) in the measurement of voice quality: A systematic review and
meta-analysis. J. Voice 2022, in press. [CrossRef]
24. Barsties v. Latoszek, B.; Maryn, Y.; Gerrits, E.; De Bodt, M. The Acoustic Breathiness Index (ABI): A Multivariate Acoustic Model
for Breathiness. J. Voice 2017, 31, 511.e11–511.e27. [CrossRef]
25. Hillenbrand, J.; Houde, R.A. Acoustic correlates of breathy vocal quality: Dysphonic voices and continuous speech. J. Speech Hear.
Res. 1996, 39, 311–321. [CrossRef]
26. Michaelis, D.; Gramss, T.; Strube, H.W. Glottal-to-Noise Excitation Ratio—A New Measure for Describing Pathological Voices.
Acustica 1997, 83, 700–706.
27. Maryn, Y.; Roy, N.; De Bodt, M.; Van Cauwenberge, P.; Corthals, P. Acoustic measurement of overall voice quality: A meta-analysis.
J. Acoust. Soc. Am. 2009, 126, 2619–2634. [CrossRef] [PubMed]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.

VenkatSAP MM Resume
No ratings yet
VenkatSAP MM Resume
6 pages
03 Design Apis
No ratings yet
03 Design Apis
16 pages
6th Sem LI-Fi Technology Research Paper Final
No ratings yet
6th Sem LI-Fi Technology Research Paper Final
42 pages
ServicePlus- Deprived Scheduled Caste Certificate - वंचित अनुसूचित जाति प्रमाण पत्र
No ratings yet
ServicePlus- Deprived Scheduled Caste Certificate - वंचित अनुसूचित जाति प्रमाण पत्र
2 pages
Assignment # 1: Course: Instructor
No ratings yet
Assignment # 1: Course: Instructor
3 pages
50watts Audio Amplifier Using TDA7265
No ratings yet
50watts Audio Amplifier Using TDA7265
26 pages
A Method of Classifying VQ
No ratings yet
A Method of Classifying VQ
5 pages
1 s2.0 S1808869415306637 Main
No ratings yet
1 s2.0 S1808869415306637 Main
9 pages
A Hybrid Approach For Binary and Multi-Class Class
No ratings yet
A Hybrid Approach For Binary and Multi-Class Class
15 pages
L1000A TM EN TOEP C710616 134G 6 0 Addendum
No ratings yet
L1000A TM EN TOEP C710616 134G 6 0 Addendum
106 pages
An Executive Guide Biometrics
From Everand
An Executive Guide Biometrics
alasdair gilchrist
No ratings yet
Advanced Ch.03 Management Accounting S4HANA
No ratings yet
Advanced Ch.03 Management Accounting S4HANA
112 pages
N2OS-UserManual-20 0 7 4
No ratings yet
N2OS-UserManual-20 0 7 4
256 pages
Maria Betania - Análise Acústica
No ratings yet
Maria Betania - Análise Acústica
1 page
Vozanalise
No ratings yet
Vozanalise
1 page
Handbook of Basic Quality Control Tests for Diagnostic Radiology
From Everand
Handbook of Basic Quality Control Tests for Diagnostic Radiology
IAEA
No ratings yet
April 2010: Zaatar Noureddine
No ratings yet
April 2010: Zaatar Noureddine
66 pages
Capital One Offers Terms and Conditions
No ratings yet
Capital One Offers Terms and Conditions
4 pages
Review Paper VPD
No ratings yet
Review Paper VPD
4 pages
Determining Spot Heights From Contours
0% (1)
Determining Spot Heights From Contours
13 pages
Jasman 000149 000457 - 1
No ratings yet
Jasman 000149 000457 - 1
9 pages
Codas 36 3 E20230175
No ratings yet
Codas 36 3 E20230175
19 pages
Python GUI Automation For Beginners
100% (1)
Python GUI Automation For Beginners
126 pages
Comparative Utility of Voice Symptom Scale, Voice Handicap Index, and GRBAS in Assessing Voice Disorders: A Clinical Study
No ratings yet
Comparative Utility of Voice Symptom Scale, Voice Handicap Index, and GRBAS in Assessing Voice Disorders: A Clinical Study
6 pages
Jsip 2014021010293134
No ratings yet
Jsip 2014021010293134
7 pages
Accuracy of Acoustic Analysis Measurements in The
No ratings yet
Accuracy of Acoustic Analysis Measurements in The
12 pages
TN Apogee Prepress 10.0 - Apogee Impose
No ratings yet
TN Apogee Prepress 10.0 - Apogee Impose
49 pages
CVEP Perceptual Paramters
No ratings yet
CVEP Perceptual Paramters
4 pages
Machine Learning Based Identification of Relevant
No ratings yet
Machine Learning Based Identification of Relevant
15 pages
Che205s17 Reading 01e
No ratings yet
Che205s17 Reading 01e
14 pages
Voice Quality Scales
No ratings yet
Voice Quality Scales
8 pages
Biomechanical Analysis of The Voice As A
No ratings yet
Biomechanical Analysis of The Voice As A
13 pages
Comparison of Results in Two Acoustic Analysis Programs - Praat An
No ratings yet
Comparison of Results in Two Acoustic Analysis Programs - Praat An
8 pages
Verification Academy Patterns Library: Pattern Name: The BFM-Proxy Pair Pattern
No ratings yet
Verification Academy Patterns Library: Pattern Name: The BFM-Proxy Pair Pattern
5 pages
1 s2.0 S0385814614002053 Main
No ratings yet
1 s2.0 S0385814614002053 Main
6 pages
Open Voice Projection PDF
No ratings yet
Open Voice Projection PDF
141 pages
Oscalc Manual
No ratings yet
Oscalc Manual
90 pages
Voice Evaluation
No ratings yet
Voice Evaluation
47 pages
An Improved Method For Voice Pathology D
No ratings yet
An Improved Method For Voice Pathology D
13 pages
Acoustic Measurement of Roughness and Breathiness
No ratings yet
Acoustic Measurement of Roughness and Breathiness
26 pages
Assesment of Voice Disorders
No ratings yet
Assesment of Voice Disorders
13 pages
3.quantization and Transmission of Audio
No ratings yet
3.quantization and Transmission of Audio
10 pages
J Jvoice 2004 08 008
No ratings yet
J Jvoice 2004 08 008
19 pages
10.3934 Mbe.2020404
No ratings yet
10.3934 Mbe.2020404
22 pages
Geisel Layout
No ratings yet
Geisel Layout
1 page
Strivers A2Z DSA Completion Plan
No ratings yet
Strivers A2Z DSA Completion Plan
2 pages
Commissioning Generator AVR, PSS and Model Validation: Wenyan Gu, Member, IEEE
100% (1)
Commissioning Generator AVR, PSS and Model Validation: Wenyan Gu, Member, IEEE
5 pages
Acoustic Voice Quality Index
No ratings yet
Acoustic Voice Quality Index
3 pages
The Computing Profession
No ratings yet
The Computing Profession
21 pages
ZXCTN 6150 V3.00 Product Description - 20140106 - EN
100% (1)
ZXCTN 6150 V3.00 Product Description - 20140106 - EN
155 pages
Data Communication
No ratings yet
Data Communication
543 pages
Solved - The Fourth-Degree Polynomial F (X) 230x4 + 18x3 + 9x2...
No ratings yet
Solved - The Fourth-Degree Polynomial F (X) 230x4 + 18x3 + 9x2...
7 pages
Cross-Lingual Detection of Dysphonic Speech For Dutch and Hungarian Datasets
No ratings yet
Cross-Lingual Detection of Dysphonic Speech For Dutch and Hungarian Datasets
6 pages
Workshop On Acoustic Voice Analysis PDF
100% (1)
Workshop On Acoustic Voice Analysis PDF
263 pages
Kreiman Perceptual Evaluation of Voice Quality
No ratings yet
Kreiman Perceptual Evaluation of Voice Quality
20 pages
TEXT 1. LOPES 2024. Accuracy and Cut-Off Values of Cepstral Measures in TH
No ratings yet
TEXT 1. LOPES 2024. Accuracy and Cut-Off Values of Cepstral Measures in TH
16 pages
How To Configure DHCP in Cisco Router Using Packet Tracer and Gns3 - Router Switch Configuration Using Packet Tracer GNS3
100% (1)
How To Configure DHCP in Cisco Router Using Packet Tracer and Gns3 - Router Switch Configuration Using Packet Tracer GNS3
5 pages
1 s2.0 S221201731400365X Main
No ratings yet
1 s2.0 S221201731400365X Main
10 pages
Teamwork Hotel Preopening Tasks Setupmyhotel
No ratings yet
Teamwork Hotel Preopening Tasks Setupmyhotel
108 pages
Acoustic Parameters in Persian-Speaking Patients With Dysphonia
No ratings yet
Acoustic Parameters in Persian-Speaking Patients With Dysphonia
10 pages
The Dysphonia Severity Index
No ratings yet
The Dysphonia Severity Index
14 pages
Validation of AVQI
No ratings yet
Validation of AVQI
10 pages
Missing Neighbors in WCDMA Analysis Guide
100% (2)
Missing Neighbors in WCDMA Analysis Guide
15 pages
Voice Assessment
No ratings yet
Voice Assessment
26 pages
Barsties 2016
No ratings yet
Barsties 2016
13 pages
(Factors Predicting Voice Handicap Index) : Acta Otorrinolaringológica Española March 2006
No ratings yet
(Factors Predicting Voice Handicap Index) : Acta Otorrinolaringológica Española March 2006
10 pages
Jennifer - Springer - 2020 Published PDF
No ratings yet
Jennifer - Springer - 2020 Published PDF
14 pages
Indexing Structures For Files: Database Design Database Design
No ratings yet
Indexing Structures For Files: Database Design Database Design
9 pages
Prakash B Indian Perspective
No ratings yet
Prakash B Indian Perspective
5 pages
Perceptual Voice Qualities Database Database Characteristics
No ratings yet
Perceptual Voice Qualities Database Database Characteristics
9 pages
Valores Normales Praat
No ratings yet
Valores Normales Praat
6 pages
Biomedical Signal Processing and Control: Lydia Stappenbeck, Ben Barsties v. Latoszek, Ben Janotte, Bernhard Lehnert
No ratings yet
Biomedical Signal Processing and Control: Lydia Stappenbeck, Ben Barsties v. Latoszek, Ben Janotte, Bernhard Lehnert
7 pages
USN-500 - User Manual: G, Oz, GN, DWT
No ratings yet
USN-500 - User Manual: G, Oz, GN, DWT
1 page
Voice Evaluation and Therapy: Key Points
No ratings yet
Voice Evaluation and Therapy: Key Points
12 pages
A Comparison of Trained and Untrained Vocalists On The Dysphonia Severity Index PDF
No ratings yet
A Comparison of Trained and Untrained Vocalists On The Dysphonia Severity Index PDF
6 pages
Voice Pathology Detection Based On The Modified Voice Contour and SVM
No ratings yet
Voice Pathology Detection Based On The Modified Voice Contour and SVM
9 pages
Validation of The German Version of The Voice Handicap Index (VHI) (English)
No ratings yet
Validation of The German Version of The Voice Handicap Index (VHI) (English)
10 pages
Telephony-Based Voice Pathology Assessment Using Automated Speech Analysis
No ratings yet
Telephony-Based Voice Pathology Assessment Using Automated Speech Analysis
10 pages
1 s2.0 S0892199709000563 Main - 2
No ratings yet
1 s2.0 S0892199709000563 Main - 2
11 pages
The Current State and Future Possibilities of Mobile Phone "Voice Analyser" Applications, in Relation To Otorhinolaryngology
No ratings yet
The Current State and Future Possibilities of Mobile Phone "Voice Analyser" Applications, in Relation To Otorhinolaryngology
6 pages
Articulo 3
No ratings yet
Articulo 3
9 pages
Wevosys: The Dysphonia Severity Index (DSI)
No ratings yet
Wevosys: The Dysphonia Severity Index (DSI)
4 pages
Acoustic and Aerodynamic Assessment of Voice
No ratings yet
Acoustic and Aerodynamic Assessment of Voice
4 pages
Normative Values For The Voice Handicap Index-10: Yzpittsburgh, Pennsylvania
No ratings yet
Normative Values For The Voice Handicap Index-10: Yzpittsburgh, Pennsylvania
4 pages
Carroll 2
100% (1)
Carroll 2
19 pages
Vocal Acoustic Analysis - Jitter, Shimmer and HNR Parameters
No ratings yet
Vocal Acoustic Analysis - Jitter, Shimmer and HNR Parameters
11 pages
Voice Assessment Procedures
No ratings yet
Voice Assessment Procedures
7 pages
DSI Publicação
No ratings yet
DSI Publicação
5 pages
MDVP
100% (1)
MDVP
10 pages
Latex Allergy
No ratings yet
Latex Allergy
5 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Artigo VoxPlot

Uploaded by

Artigo VoxPlot

Uploaded by

Journal of

Citation: Barsties v. Latoszek, B.;

J. Clin. Med. 2023, 12, 4644. https://doi.org/10.3390/jcm12144644 https://www.mdpi.com/journal/jcm

2. Materials and Methods

Gender Age in Years

2.2. Auditory-Perceptual Judgment

2.3. Acoustic Measurements

Table 2. List of 13 acoustic measures for the voice quality evaluation.

Category Acoustic Measures Abbreviation

Category Acoustic Measures Abbreviation

Voice Quality Parameters Validation Parameters Hoarseness Breathiness

Voice Quality Parameters Validation Parameters Hoarseness Breathiness

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.