Quantification of Voice Type Components Present in Human Phonation Using a Modified Diffusive Chaos Technique

Ann Otol Rhinol Laryngol. 2019 Oct;128(10):921-931. doi: 10.1177/0003489419848451. Epub 2019 May 14.

Abstract

Purpose: Signal typing has been used to categorize healthy and disordered voices; however, human voices are likely comprised of differing proportions of periodic type 1 elements, type 2 elements that are periodic with modulations, aperiodic type 3 elements, and stochastic type 4 elements. A novel diffusive chaos method is presented to detect the distribution of voice types within a signal with the goal of providing an objective and clinically useful tool for evaluating the voice. It was predicted that continuous calculation of the diffusive chaos parameter throughout the voice sample would allow for construction of comprehensive voice type component profiles (VTCP).

Methods: One hundred thirty-five voice samples of sustained /a/ vowels were randomly selected from the Disordered Voice Database Model 4337. All samples were classified according to the voice type paradigm using spectrogram analysis, yielding 34 type 1, 35 type 2, 42 type 3, and 24 type 4 voice samples. All samples were then analyzed using the diffusive chaos method, and VTCPs were generated to show the distribution of the 4 voice type components (VTC).

Results: The proportions of VTC1 varied significantly between the majority of the traditional voice types (P < .001). Three of the 4 VTCs of type 3 voices were significantly different from the VTCs of type 4 voices (P < .001). These results were compared to calculations of spectrum convergence ratio, which did not vary significantly between voice types 1 and 2 or 2 and 3.

Conclusion: The diffusive chaos method demonstrates proficiency in generating comprehensive VTCPs for disordered voices with varying severity. In contrast to acoustic parameters that provide a single measure of disorder, VTCPs can be used to detect subtler changes by observing variations in each VTC over time. This method also provides the advantage of quantifying stochastic noise components that are due to breathiness in the voice.

Keywords: acoustic analysis; diffusive chaos; nonlinear dynamics; otolaryngology; voice disorders.

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Female
  • Humans
  • Male
  • Middle Aged
  • Phonation*
  • Sound Spectrography
  • Speech Production Measurement / methods*
  • Voice Disorders / classification
  • Voice Disorders / diagnosis*
  • Voice Quality*
  • Young Adult