|
|
Last updated:
June 26, 2008KayPENTAX in the News...Recently Published Studies Involving KayPENTAX InstrumentationCurrent postings (see below) include 329 article abstracts that represent a sampling of the recent publications citing our instrumentation. By Product Line: Flexible Endoscopes (including Transnasal Esophagoscopes) Pulsed Dye Laser (PDL) - KayPENTAX/Cynosure
Acoustic Analysis“Voice of Postradiotherapy Nasopharyngeal Carcinoma Patients: Evidence of Vocal Tract Effect,” Lin, Emily, Tzer-Zen Hwang, Jeremy Hornibrook, and Tika Ormond, Journal of Voice, Vol. 22 No. 3, pp. 351-364, May 2008. Summary: This study was aimed at identifying acoustic and physiological measures useful for monitoring voice changes in postnasopharyngeal patients with nonlaryngeal malignancies, and providing evidences of vocal tract effect on voice through comparisons between individuals with and without intact vocal tract. Simultaneous acoustic-electroglottographic signals recorded during phonation of vowels /i/ and /a/ sustained at habitual, high, and low pitch levels were compared among 10 postradiotherapy patients with nasopharyngeal carcinoma (NPC), 10 voice patients (VPs) with intact vocal tract, and 10 healthy individuals with normal voice (NORM). Results from a series of discriminant analyses revealed that the NPC group generally exhibited lower signal-to-noise (SNR) and open quotient (OQ) and higher Formant 1 frequency (F1) and speed quotient (SQ) than the NORM group. Unlike both VP and NORM groups, the NPC group failed to show a pitch effect on all voice measures, including OQ, SQ, percent jitter, percent shimmer, and SNR, suggesting an effect of radiotherapy and/or vocal tract on laryngeal behaviors. For the vowel /i/, on the other hand, only the NPC and NORM groups showed a pattern of pitch-dependent F1 raising, a reflection of increased pharyngeal narrowing. These findings suggested that the pitch effect on laryngeal behaviors differed not only between individuals with intact vocal tract and those without but also between those with structural and dynamic changes of vocal tract.
The KayPENTAX
Electroglottograph (EGG), Model 6103, was used to acquire three
EGG measures (i.e., Fo, OQ, and SQ) in this study. “Effects of Bilateral Subthalamic Nucleus Stimulation and Medication on Parkinsonian Speech Impairment,” D’Alatri, Lucia D., Gaetano Paludetti, M. Fiorella Contarino, Stefania Galla, Maria Raffaella Marchese, and Anna Rita Bentivoglio, Journal of Voice, Vol. 22 No. 3, pp. 365-372, May 2008. Summary: This study aimed to assess quantitatively the effect of bilateral subthalamic nucleus (STN) stimulation and medication on hypokinetic parkinsonian dysarthria. Twelve Italian patients (11 males and 1 female) with idiopathic Parkinson’s disease (mean age 60.29 ± 7.50 years) and bilateral STN implantation were studied. Neurological assessments and acoustic recordings were performed in four clinical conditions combining stimulation and medication to assess the degree of motor disabilities and speech impairment. Acoustic analysis was performed by means of the Multidimensional Voice Program and the Advanced Motor Speech Profile (Kay Elemetrics, Lincoln Park, NJ). None of the evaluated parameters deteriorated after TN deep brain stimulation. STN stimulation significantly improved motor performances and vocal tremor and provided a major stability to glottal vibration. Effect of stimulation on these parameters was superior to that of levodopa. No significant variations were observed in perceptual evaluation and in acoustic parameters related to prosody, articulation, and intensity after either stimulation or medication. The improvement of acoustic parameters related to glottal vibration and voice tremor was not accompanied by a substantial effect on speech intelligibility. STN stimulation was more effective on global motor limb dysfunctions than on dysarthria, but we did not report negative consequences on speech. Acoustic analysis, in this study, was performed using the KayPENTAX Computerized Speech Lab (CSL), Model 4300B, in conjunction with the KayPENTAX MDVP, Model 5105, and Advanced Motor Speech Profile, Model 5141.
“Acoustic Changes in Chinese Patients With Cancer-Related Unilateral Vocal Fold Paralysis After Medialization Thyroplasty,” Ng, Manwa L., Ripley K. Wong, William I. Wei, Y.H. Wong, and Paul K. Y. Lam, Contemporary Issues in Communication Science and Disorders, Vol. 35, pp. 17-24, Spring 2008. Abstract. The present study investigated the change in the voice quality of patients with unilateral vocal fold paralysis (UVFP) of benign and malignant causes after medialization thyroplasty. Thirty-four native Cantonese adults who had been diagnosed with UVFP participated in the study. Acoustical parameters including the average voice fundamental frequency, percent jitter, relative average perturbation (RAP), percent shimmer, and noise-to-harmonic ration (NHR) were measured from the sustained vowel /a/ that was recorded before and after the thyroplasty procedure. Maximum phonation time (MPT) was also obtained. Results indicated that, for both benign and malignant patients, all acoustical parameters except for NHR showed improvement after thyroplasty: Percent jitter, RAP, and percent shimmer values were significantly reduced, and MPT was significantly lengthened. Our findings support the notion that medialization thyroplasty is a useful palliative procedure to improve voice production in Cantonese-speaking UVFP patients. Despite the cancerous condition, it is still beneficial to malignant UVFP patients, and better voice quality can be achieved. In this study, acoustic analysis was performed using the KayPENTAX Multi-Dimensional Voice Profile (MDVP), Model 5105.
“Respiratory and Laryngeal Function During Spontaneous Speaking in Teachers With Voice Disorders,” Lowell, Soren Y., Julie M. Barkmeier-Kraemer, Jeanette D. Hoit, and Brad H. Story, JSLHR, Vol. 51 No. 2, pp. 333-349, April 2008. Purpose: To determine if respiratory and laryngeal function during spontaneous speaking were different for teachers with voice disorders compared with teachers without voice problems. Method: Eighteen teachers, 9 with and 9 without voice disorders, were included in this study. Respiratory function was measured with magnetometry, and laryngeal function was measured with electroglottography during 3 spontaneous speaking tasks: a simulated teaching task at a typical loudness level, a simulated teaching task at an increased loudness level, and a conversational speaking task. Electroglottography measures were also obtained for 3 structured speaking tasks: a paragraph reading task, a sustained vowel, and a maximum phonation time vowel. Results: Teachers with voice disorders started and ended their breath groups at significantly smaller lung volumes than teachers without voice problems during teaching-related speaking tasks; however there were no between-group differences in laryngeal measures. Task-related differences were found on several respiratory measures and on one laryngeal measure. Conclusions: These findings suggest that teachers with voice disorders used different speech breathing strategies than teachers without voice problems. Implications for clinical management of teachers with voice disorders are discussed. The KayPENTAX Electroglottograph (EGG) was used to assess laryngeal adduction characteristics during continuous speaking in this study.
“Fundamental Frequency Change During Offset and Onset of Voicing in Individuals with Parkinson Disease,” Goberman, Alexander M., and Michael Blomgren, Journal of Voice, Vol. 22 No. 2, pp. 178-191, March 2008. Summary: After years of treatment with the medication levodopa, most individuals with Parkinson disease (PD) experience fluctuations in response to their medications. Although relatively consistent perceptual voice improvements have been documented to correspond with these fluctuations, consistent quantitative data to support this finding are lacking. This mismatch may have occurred because most of this phonation research has centered on long-term phonatory measures (i.e., across speaking samples and prolonged vowel tasks). The current study examined short-term phonatory behavior in individuals with PD, specifically examining fundamental frequency (F0) at the offset and onset of phonation, before and after a voiceless consonant. The F0 analysis at phonatory offset supported the conclusion that individuals with PD have difficulty with the rapid offset of voicing, and that they are stopping vocal fold vibration primarily through vocal fold abduction (without adding tension). The F0 analysis at phonatory onset revealed that all groups use some laryngeal tension at the initiation of voicing. The tension was lowest for the PD participants who were in their OFF medication state, and it was highest for the age-matched control participants and the PD participants in their ON medication states. The speech samples in this study were digitized and analyzed using a KayPENTAX Computerized Speech Lab (CSL), Model 4400.
“The Relationship Between Perceptual Evaluation and Objective Multiparametric Evaluation of Dysphonia Severity,” Hakkesteegt, Marieke M., Michael P. Brocaar, Marjan H. Wieringa, and Louw Feenstra, Journal of Voice, Vol. 22 No. 2, pp. 138-145, March 2008. Summary: The purpose of this study was to investigate the usefulness of the Dysphonia Severity Index (DSI) as an objective multiparametric measurement in assessing dysphonia. The DSI was compared with the score on Grade of the GRBAS scale. Investigated was also whether the DSI is related to severity of dysphonia, which was represented by different diagnosis groups. Furthermore, it was investigated whether the DSI can differentiate between a group of patients and a control group. A total of 294 patients with different voice pathologies were included. A control group consisted of 118 volunteers without any voice complaints. The voices of all participants were perceptually evaluated on Grade, and the DSI was measured. The groups of patients with voice complaints have a lower DSI and higher scores on Grade than the control group. The DSI discriminates between patients with nonorganic voice disorders, vocal fold mass lesions, and vocal fold paresis/paralysis. To determine whether the DSI discriminates between patients and controls, the sensitivity and specificity for different DSI cutoff points were calculated. With a DSI cutoff of 3.0, maximum sensitivity (0.72) and specificity (0.75) were found. We conclude that the DSI is a useful instrument to objectively measure the severity of dysphonia. In this study, the KayPENTAX Multi-Speech, Model 3700, was used for acoustic analysis of sound files.
“Pitch Deviation Analysis of Pathological Voice in Connected Speech,” Laflen, Brandon J., Cathy L. Lazarus and Milan R. Amin, Annals of Otology, Rhinology & Laryngology, Vol. 117 (2), pp. 90-97, Feb. 2008 Objectives: This study compares normal and pathologic voices using a novel voice analysis algorithm that examines pitch deviation during connected speech. The study evaluates the clinical potential of the algorithm as a mechanism to distinguish between normal and pathologic voices using connected speech. Methods: Adult vocalizations from normal subjects and patients with known benign free-edge vocal fold lesions were analyzed. Recordings had been previously obtained in quiet under controlled conditions. Two phrases and sustained /a/ were recorded per subject. The subject populations consisted of 10 normal and 31 abnormal subjects. The voice analysis algorithm generated 2-dimensional patterns that represent pitch deviation in time and under variable window widths. Measures were collected from these patterns for window widths between 10 and 250 ms. For comparison, jitter and shimmer measures were collected from sustained /a/ by means of the Computerized Speech Lab (CSL). A t-test and tests of sensitivity and specificity assessed discrimination between normal and abnormal populations. Results: More than 58% of the measures collected from connected speech outperformed the CSL jitter and shimmer measures in population discrimination. Twenty-five percent of the experimental measures (including /a/) indicated significantly different populations (p < .01%). Conclusions: The results demonstrate that the algorithm distinguishes between normal and abnormal populations by use of samples of connected speech. The KayPENTAX Computerized Speech Lab (CSL) was used to obtain the jitter and shimmer measures analyzed in this study.
“The Effects of Frequency Range, Vowel, Dynamic Loudness Level, and Gender on Nasalance in Amateur and Classically Trained Singers,” Jennings, Jori Johnson, and David P. Kuehn, Journal of Voice, Vol. 22 No. 1, pp. 75-89, January 2008. Summary: This study addresses two questions: (1) How much nasality is present in classical Western singing? (2) What are the effects of frequency range, vowel, dynamic level, and gender on nasality in amateur and classically trained singers? The Nasometer II 6400 by KayPENTAX (Lincoln Park, NJ) was used to obtain nasalance values from 21 amateur singers and 25 classically trained singers while singing an ascending five-tone scalar passage in low, mid, and high frequency ranges. Each subject sang the scalar passage at both piano and mezzo-forte dynamic loudness levels on each of the five cardinal vowels (/a/, /e/, /i/, /o/, /u/). A repeated mixed-model analysis indicated a significant main effect for the amateur/classically trained distinction, dynamic loudness level, and vowel, but not for frequency range or gender. The amateur singers had significantly higher nasalance scores than classically trained singers in all ranges and on all vowels except /o/. Dynamic loudness level had a significant effect on nasalance for all subject groups except for female majors in the mid- and high-frequency ranges. The vowel, /i/, received significantly higher nasalance than all of the other vowels. Although results of this study show that dynamic loudness level, vowel, and level of training in classical singing have a significant effect on nasality, nasalance scores for most subjects were relatively low. Only six of the subjects, all of whom were amateur singers, had average nasalance scores that could be considered hypernasal (i.e., a nasalance average of 22 or above). The KayPENTAX Nasometer II, Model 6400, was used in this study to obtain nasalance values from each subject.
“Vocal Improvement After Voice Therapy in Unilateral Vocal Fold Paralysis,” Schindler, Antonio, Alessandro Bottero, Pasquale Capaccio, Daniela Ginocchio, Fulvio Adorni, and Francesco Ottaviani, Journal of Voice, Vol. 22 No. 1, pp. 113-118, January 2008. Summary: Unilateral vocal fold paralysis (UVFP) is associated with changes in acoustic and aerodynamic voice measurements and can have a significant impact on a patient’s quality of life. Few objective data regarding the efficacy of voice therapy for UVFP exist. The aim of this study was to retrospectively analyze voice modifications in a group of patients with UVFP before and after voice therapy. Forty patients with UVFP of different etiology were included in the study. Each subject had voice therapy with an experienced speech/language pathologist twice a week; the mean number of sessions was 12.6. A multidimensional assessment protocol was used; it included videoendoscopy, the maximum phonation time (MPT), the GIRBAS scale, spectrograms and a perturbation analysis, and the Voice Handicap Index (VHI). Pre- and posttreatment data were compared by means of the Wilcoxon and Student’s t tests. A complete glottal closure was seen in 8 patients before voice therapy and in 14 afterward. Mean MPT increased significantly. In the perceptual assessment, the difference was significant for five out of six parameters. A significant improvement was found on spectrographic analysis; as for perturbation analysis, the differences in jitter, shimmer, and noise-to-harmonic ratio values were significant. VHI values showed a clear and significant improvement. A significant improvement of voice quality and quality of life after voice therapy is an often reached and reasonable goal in patients with UVFP. In this study, the KayPENTAX Multi-Dimensional Voice Program (MDVP), Model 5105 was used in conjunction with the CSL, Model 4300, to perform objective voice evaluation.
“Acoustic Analyses of Sustained and Running Voices From Patients With Laryngeal Pathologies,” Zhang, Yu, and Jack J. Jiang, Journal of Voice, Vol. 22 No. 1, pp. 1-9, January 2008. Summary: In this paper, we investigated the acoustic characteristics of sustained and running vowels from normal subjects and patients with laryngeal pathologies. Perturbation methods (including jitter and shimmer), signal-to-noise ratio (SNR), and nonlinear dynamic methods (such as correlation dimension and second-order entropy) were used to analyze sustained and running vowels. We found that the sustained vowels and running voices from normal subjects and patients with laryngeal pathologies had low-dimensional dynamic characteristics. For sustained vowels, the analyses of jitter, shimmer, correlation dimension, and second-order entropy revealed significant differences between normal and pathological voices. For running voices, jitter and shimmer did not statistically discriminate between normal and pathological voices, but a significant difference was found for SNR, correlation dimension, and second-order entropy. The results suggest that nonlinear dynamic analysis and traditional SNR analysis may be valuable for the analysis of sustained and running vowels; perturbation analysis may be applicable for the analysis of sustained vowels but should be applied with caution for running voice analysis The voice samples used in this study were selected from the KayPENTAX Disordered Voice Database, Model 4337.
“An Exploration of Skin Acceleration Level as a Measure of Phonatory Function in Singing,” Lamarche, Anick, and Sten Ternström, Journal of Voice, Vol. 22 No. 1, pp. 10-22, January 2008. Summary: Two kinds of fluctuations are observed in phonetogram recordings of singing. Sound pressure level (SPL) can vary due to vibrato and also due to the effect of open and closed vowels. Since vowel variation is mostly a consequence of vocal tract modification and is not directly related to phonatory function, it could be helpful to suppress such variation when studying phonation. Skin acceleration level (SAL), measured at the jugular notch and on the sternum, might be less influenced by effects of the vocal tract. It is explored in this study as an alternative measure to SPL. Five female singers sang vowel series on selected pitches and in different tasks. Recorded data were used to investigate two null hypotheses: (1) SPL and SAL are equally influenced by vowel variation and (2) SPL and SAL are equally correlated to subglottal pressure (PS). Interestingly, the vowel variation effect was small in both SPL and SAL. Furthermore, in comparison to SPL, SAL correlated weakly to PS. SAL exhibited practically no dependence on fundamental frequency, rather, its major determinant was the musical dynamic. This results in a non-sloping, square-like phonetogram contour. These outcomes show that SAL potentially can facilitate phonetographic analysis of the singing voice. In this study, both the microphone/sound level meter and the pressure transducer were connected to the KayPENTAX Computerized Speech Lab (CSL), Model 4500, which acquires up to 4 channels of data.
“Mucosal Wave: A Normophonic Study Across Visualization Techniques,” Shaw, Heather S., and Dimitar D. Deliyski, Journal of Voice, Vol. 22 No. 1, pp. 23-33, January 2008. Summary: Visualization of vocal fold vibration is essential for accurate diagnoses and optimal treatment of persons with voice disorders. Recently, scientific and anecdotal reports have evidenced an increased amount of variation in the diagnostically relevant features of extent and symmetry of mucosal wave magnitude in normophonic speakers. The objectives of this study were to preliminarily ascertain the variation in mucosal wave magnitude and symmetry for normophonic speakers as assessed via standard and novel techniques, and compare findings across modal and pressed phonations. A correlational design with a multiple baseline across visualization methods approach was used. Mucosal wave presence, magnitude, and symmetry from 52 normophonic speakers were judged via stroboscopy, high-speed videoendoscopy (HSV) playback, mucosal wave playback, and mucosal wave kymography playback. Results demonstrate a prevalence of atypical magnitude and symmetry of mucosal wave during modal and pressed phonations by normophonic persons, differences across techniques, and a relationship between judgments and habitual fundamental frequency. Given the prevalence of mucosal wave magnitude and symmetry variations in the normophonic population, overdiagnosis may be possible without caution. The various visualization techniques provided unique information suggesting that it may be beneficial to use both full view and kymographic visualization techniques in combination. A major restriction of the current commercial HSV systems is the frame rate, typically limited to 2000 frames per second, which appears insufficient for most female habitual phonations. In this study, the KayPENTAX Digital Strobe, Model 9100B, coupled to a KayPENTAX 70-degree rigid endoscope, Model 9106, was used along with a laryngeal contact microphone to track vocal fold vibratory frequency. Also used in the study were the KayPENTAX High-Speed Video (HSV) system, Model 9700, and xenon light source, Model 7152. The KayPENTAX Multi-Dimensional Voice Program (MDVP), Model 5105, coupled with a condenser head-mount microphone, was used to record the acoustic signal synchronized with the HSV recording. “Influence of Speaker Gender on Listener Judgments of Tracheoesophageal Speech,” Eadie, Tanya L., Philip C. Doyle, Kerry Hansen, and Paul G. Beaudin, Journal of Voice, Vol. 22 No. 1, pp. 43-57, January 2008. Summary: The objectives of this prospective and exploratory study are to determine: (1) naïve listener preference for gender in tracheoesophageal (TE) speech when speech severity is controlled; (2) the accuracy of identifying TE speaker gender; (3) the effects of gender identification on judgments of speech acceptability (ACC) and naturalness (NAT); and (4) the acoustic basis of ACC and NAT judgments. Six male and six female adult TE speakers were matched for speech severity. Twenty naïve listeners made auditory-perceptual judgments of speech samples in three listening sessions. First, listeners performed preference judgments using a paired comparison paradigm. Second, listeners made judgments of speaker gender, speech ACC, and NAT using rating scales. Last, listeners made ACC and NAT judgments when speaker gender was provided coincidentally. Duration, frequency, and spectral measures were performed. No significant differences were found for preference of male or female speakers. All male speakers were accurately identified, but only two of six female speakers were accurately identified. Significant interactions were found between gender and listening condition (gender known) for NAT and ACC judgments. Males were judged more natural when gender was known; female speakers were judged less natural and less acceptable when gender was known. Regression analyses revealed that judgments of female speakers were best predicted with duration measures when gender was unknown, but with spectral measures when gender was known; judgments of males was best predicted with spectral measures. Naïve listeners have difficulty identifying the gender of female TE speakers. Listeners show no preference for speaker gender, but when gender is known, female speakers are least acceptable and natural. The nature of the perceptual task may affect the acoustic basis of listener judgments. The KayPENTAX Computerized Speech Lab (CSL), Model 4500, was used to perform the acoustic analyses in this study.
“Objective and Subjective Evaluation of Voice Quality in Multiple Sclerosis,” Dogan, Müzeyyen, Ipek Midi, Mine Almaz Yazici, Ismail Kocak, Dilek Günal, and Mehmet Ali Sehitoglu, Journal of Voice, Vol. 21 No. 6, pp. 735-740, November 2007. Summary: The aim of this comparative, controlled, cross-sectional study is to evaluate the voice quality in patients with multiple sclerosis (MS) by subjective and objective methods. Female patients with MS (n = 27) and age-and sex-matched healthy controls (n = 27) were included in this stud. Vocal functions were evaluated by a multidimensional set composed of videolaryngostroboscopic examination, acoustic analysis, and subjective measurements (GRBAS and “Voice Handicap Index”). Jitter percent, shimmer percent, and softa phonation index (SPI) values were higher in MS patients compared to controls (Jitt, P – 0.001; Shim, P – 0.033; SPI P < 0.0001). Maximum phonation time was significantly shorter for MS patients compared to controls (P < 0.0001). Stroboscopic examination revealed that 16 out of 27 MS patients have a “posterior chink” as glottic closure pattern with higher SPI values (40%). Noise to harmonic ratio (NHR) and mean fundamental frequency (F0) values were similar for MS and control groups (NHR, P = 0.737; F0, P – 0.976). In this study, most of the MS patients had dysphonia due to weakness of voice. MS tends to worsen acoustic parameters including fundamental frequency, SPI, and jitter values. These results are consistent with the more asthenic voice quality observed in MS group. The KayPENTAX Multi-Dimensional Voice Program (MDVP), Model 5105, was used in conjunction with the KayPENTAX Multi-Speech, Model 3700, for the acoustic analysis of voice samples in this study.
“Short-Term Effects of Endotracheal Intubation on Voice,” Hamdan, Abdul-Latif, Abla Sibai, Charbel Rameh, and Ghassan Kanazeh, Journal of Voice, Vol. 21 No. 6, pp. 762-768, November 2007. Summary: The objective of this study was to examine the vocal symptoms and acoustic changes perceived in the short period after endotracheal intubation, and to find the association between these changes and the endotracheal tube parameters. A total of 35 subjects were included. They were examined preoperatively, and 2 and 24 hours postoperatively. The vocal symptoms of hoarseness, vocal fatigue, loss of voice, throat clearing, globus pharyngeus, throat pain, and the acoustic variables mainly average fundamental frequency, relative average perturbation, shimmer, noise to harmony ratio, voice turbulence index, habitual pitch, and maximum phonation time (MPT) were assessed as such and in relation to the following endotracheal tube parameters: duration of anesthesia, number of intubation attempts, size of the tube, cuff volume, cuff mean pressure, and the emergence. The association between anesthesia parameters with incidence of vocal complaints and changes in acoustic parameters were examined using logistic and linear regression. Vocal fatigue was associated significantly with the increase in cuff volume and the number of intubation attempts. Throat clearing was associated significantly with the increase in cuff mean pressure. Only the increase in habitual pitch was associated significantly with the increase in cuff volume. The acute short-term effect of endotracheal intubation on voice is significant. The most important endotracheal tube parameters that affect the vocal changes are the cuff mean pressure and volume. The laryngeal contribution to these vocal changes seems to be minimal All vocal symptoms increased significantly except for globus pharyngeus at 2 hours postoperatively. The acoustic parameters did not change significantly except for a decrease in MPT. At 24 hours postoperatively, all vocal symptoms subsided with no significant difference to baseline value. The habitual pitch increased significantly, and the rest of the parameters remained comparable to baseline value. The KayPENTAX Visi-Pitch, Model 3300, was used to perform the acoustic analysis in this study.
“Effect of Deep Brain Stimulation on Different Speech Subsystems in Patients with Multiple Sclerosis,” Putzer, Manfred, William John Barry, and Jean Richard Moringlane, Journal of Voice, Vol. 21 No. 6, pp. 741-753, November 2007. Summary: The effect of deep brain stimulation on articulation and phonation subsystems in seven patients with multiple sclerosis (MS) was examined. Production parameters in fast syllable repetitions were defined and measured, and the phonation quality during vowel productions was analyzed. Speech material was recorded for patients (with and without stimulation) and for a group of healthy control speakers. With stimulation, the precision of glottal and supraglottal articulatory gestures is reduced, whereas phonation has a greater tendency to be hyperfunctional in comparison with the healthy control data. Different effects on the two speech subsystems are induced by electrical stimulation of the thalamus in patients with MS. In this study, the KayPENTAX CSL, Model 4300B, was used to capture and digitize the EGG and microphone signals that were recorded simultaneously.
“Acoustic Voice Analysis of Prelingually Deaf Adults Before and After Cochlear Implantation,” Evans, Maegan K., and Dimitar D. Deliyski, Journal of Voice, Vol. 21 No. 6, pp. 669-682, November 2007. Summary: It is widely accepted that many severe to profoundly deaf adults have benefited from cochlear implants (CIs). However, limited research has been conducted to investigate changes in voice and speech of prelingually deaf adults who receive CIs, a population well known for presenting with a variety of voice and speech abnormalities. The purpose of this study was to use acoustic analysis to explore changes in voice and speech for three prelingually deaf males pre- and post implantation over 6 months. The following measurements, some measured in varying contexts, were obtained: fundamental frequency (F0), jitter, shimmer, voice-to-harmonic ratio, voice turbulence index, soft phonation index, amplitude- and F0 variation, F0-range, speech rate, nasalance, and vowel production. Characteristics of vowel production were measured by determining the first formant (F1) and second formant (F2) of vowels in various contexts, magnitude of F2-variation, and rate of F2-variation. Perceptual measurements of pitch, pitch variability, loudness variability, speech rate, and intonation were obtained for comparison. Results are reported using descriptive statistics. The results showed patterns of change for some of the parameters while there was considerable variation across the subjects. All participants demonstrated a decrease in Fo in at least one context and demonstrated a change in nasalance toward the norm as compared to their normal hearing control. The two participants who were oral language communicators were judged to produce vowels with an average of 97.2% accuracy and the sign-language user demonstrated low percent accuracy for vowel production. The KayPENTAX Computerized Speech Lab (CSL), Model 4400, and Nasometer II, Model 6400, were used to obtain the objective data in this study. In addition to the core CSL program, other CSL applications that were used included the Motor Speech Profile (MSP), Model 5141, and Multi-Dimensional Voice Program (MDVP), Model 5105.
“Speech Breathing Behavior and Vocal Fold Function in Dysphonic Participants Before and After Therapy During Connected Speech: Preliminary Observations,” Schaeffer, Natalie, Contemporary Issues in Communication Science and Disorders, Vol. 34, pp. 61-72, Fall 2007. Abstract. This research is an extension of the author’s previous research in which speech breathing values (on the respigraph) of participants with abuse-related dysphonia and those with normal voices were compared during connected speech. Results from the previous study revealed that the dysphonic group used significantly lower end-expiratory values (i.e., extended exhalation below resting expiratory levels) in comparison to the group with normal voices. The present study investigated speech breathing values (on the respigraph) simultaneously with vocal fold function (on the electroglottograph) in 10 dysphonic participants, before and after therapy, during connected speech. Preliminary results indicated a significant improvement in speech breathing data (higher end-expiratory levels) and a trend toward increased vocal fold symmetry (lower speed quotients) following therapy. Additionally, posttherapy perceptual ratings revealed significant improvements in the participants’ vocal quality when compared to pretherapy ratings. In this study, vocal fold function, specifically contact quotient and contact index or speed quotient, was measured on EGG waveforms acquired using the KayPENTAX Electroglottograph, Model 6103. “Effect of Syllable-Initial Voicing on Vowel Duration During Simultaneous Communication in Speech Produced by Inexperienced Signers: A Systematic Replication,” Allen, Kristin, Sarah Maisonet, Dale E. Metz, Nicholas Schiavetti, and Robert L. Whitehead, Contemporary Issues in Communication Science and Disorders, Vol. 34, pp. 101-105, Fall 2007. Abstract. Under natural speaking conditions, or speaking alone (SA), vowels following word-initial voiced stop consonants are longer in duration than vowels following word-initial voiceless stops. This study investigated vowel durations following the production of word-initial voiced and voiceless stop consonants produced during simultaneous communication (SC) by recording inexperienced sign language users during SC and SA. Although the results indicated longer sentence and vowel durations for SC than SA, they showed no differences in the relative duration of vowels following voiced or voiceless stops. Vowel durations following voiced stop consonants were uniformly longer and than vowel durations following voiceless stops across both speaking conditions. This finding is consistent with previous research indicating that global temporal alterations observed in SC do not degrade important temporal cues of spoken English. The findings are also consistent with the finding of D. E. Metz et al. (2006), who investigated experienced signers’ vowel durations under identical experimental conditions as the present study. The KayPENTAX Computerized Speech Lab (CSL), Model 4300B, was used in this study to digitize and display the acoustic audio signals.
“Functional Analysis of Voice Using Simultaneous High-Speed Imaging and Acoustic Recordings,” Yan, Yuling, Edward Damrose, and Diane Bless, Journal of Voice, Vol. 21 No. 5, pp. 604-616, September 2007. Summary: We present a comprehensive, functional analysis of clinical voice data derived from both high-speed digital imaging (HSDI) of the larynx and simultaneously acquired acoustic recordings. The goals of this study are to: (1) correlate dynamic characteristics of the vocal folds derived from direct laryngeal imaging with indirectly acquired acoustic measurements; (2) define the advantages of using a combined imaging/acoustic approach for the analysis of voice condition; and (3) identify new quantitative measures to evaluate the regularity of the vocal fold vibration and the complexity of the vocal output—these measures will be key to successful diagnosis of vocal abnormalities. Image- and acoustic-based analyses are performed using an analytic phase plot approach previously introduced by our group (referred to as ‘Nyquist’ plot). Fast Fourier Transform (FFT) spectral analyses are performed on the same data for a comparison. Clinical HSDI and acoustic recordings from subjects having normal and specific voice pathologies, including muscular tension dysphonia (MTD) and recurrent respiratory papillomatosis (RRP) were analyzed using the Nyquist plot approach. The results of these analyses show that a combined imaging/acoustic analysis approach provides better characterization of the vibratory behavior of the vocal folds as it correlates with vocal output and pathology. Researchers used a KayPENTAX High-Speed Video System for simultaneous image and acoustic data acquisition in this study.
“The Effectiveness of Oral Resonance Therapy on the Perception of Femininity of Voice in Male-to-Female Transsexuals,” Carew, Lisa, Georgia Dacakis, and Jennifer Oates, Journal of Voice, Vol. 21 No. 5, pp. 591-603, September 2007. Summary: Ten male-to-female transsexuals participated in five sessions of oral resonance voice therapy targeting lip spreading and forward tongue carriage. Acoustic analysis of recordings made pre- and posttherapy found that participant formant frequency values (F1, F2, and F3, from the vowels /a/, /i/, and /U/, as well as fundamental frequency (F0), underwent a general increase posttherapy. F3 values, in particular, increased significantly posttreatment. Trends in listener ratings of these recordings showed that the majority of participants were perceived to sound more feminine following treatment. Participants’ self-ratings of their voices pre- and posttreatment also indicated that participants perceived their voices as sounding more feminine and that they were more satisfied with their voices following treatment. The present study supports the findings of previous studies that have demonstrated that resonance characteristics in male-to-female transsexuals can be changed to more closely approximate those of females through oral resonance therapy. This intervention study also demonstrates that a spontaneous increase in F0 is achieved during the course of therapy. Further, this study provides preliminary evidence to suggest that oral resonance therapy may be effective in increasing femininity of voice in male-to-male transsexual clients. The KayPENTAX CSL, Model 4300B, was used to perform the acoustic analyses in this study.
“Acoustic Analysis of the Interaction of Choral Arrangements, Musical Selection, and Microphone Location,” Morris, Richard J., Ashley J. Mustafa, Christopher R. McCrea, Linda P. Fowler, and Christopher Aspaas, Journal of Voice, Vol. 21 No. 5, pp. 568-575, September 2007. Summary: Acoustic differences were evaluated among three choral arrangements and two choral textures recorded at three microphone locations. A choir was recorded when singing two musical selections of different choral texture, one homophonic and one polyphonic. Both musical selections were sung in three choral arrangements: block sectional, sectional-in-columns, and mixed. Microphones were placed at the level of the choristers, the conductor, and the audience. The recordings at each location were analyzed using long-term average spectrum (LTAS). The LTAS from the mixed arrangement exhibited more signal amplitude than the other arrangements in the range of 1000-3500 Hz. When considering the musical selections, the chorus produced more signal amplitude in the region of 1800-2200 Hz for the homophonic selection. In addition, the LTAS produced by the choir for the homophonic selection varied across the microphone locations. As for the microphone location, the LTAS of the signal detected directly in front of the chorus had a greater slope than the other two locations. Thus, the acoustic signal near the choristers differed from the signals near the conductor and in the audience. Conductors may be using acoustic information from the region of the second and third formants when they decide how to arrange a choir for a particular musical selection. In this study, productions from the choir were digitized via digital-analog-digital connections between the DAT and the KayPENTAX CSL, Model 4400. The CSL was also used to analyze these files.
“The Role of Pitch Memory in Pitch Discrimination and Pitch Matching,” Moore, Robert E., Casie Keaton, and Christopher Watts, Journal of Voice, Vol. 21 No. 5, pp. 560-567, September 2007 Summary: Accurate control of vocal pitch (fundamental frequency) requires coordination of sensory and motor systems. Previous research has supported the relationship between perceptual accuracy and vocal pitch matching accuracy. The purpose of this study was to investigate the role of memory for pitch in pitch matching and pitch discrimination ability. Three experimental tasks were used. First, a pitch matching task was completed, in which the participants listened to target tones and vocally matched the pitch of the tones. The second task was a pitch discrimination task that required the participants to judge the pitch (same or different) of complex tone pairs. The third task was pitch discrimination with memory interference task that was similar to the pitch discrimination task except interference tones were added. Results of the pitch matching and pitch discrimination tasks yielded a significant correlation between pitch discrimination and pitch matching. These results support earlier findings of a relationship between pitch discrimination and pitch matching abilities. The results also suggest a possible role of pitch memory in both tasks. These findings may have implications for abilities related to accurate pitch control. The KayPENTAX Multi-Dimensional Voice Program (MDVP) was used in conjunction with the KayPENTAX CSL to calculate the fundamental frequency of each recorded sample.
“The Effects of Three Nebulized Osmotic Agents in the Dry Larynx,” Tanner, Kristine, Nelson Roy, Ray M. Merrill, and Mark Elstad, Vol. 50 No. 3, pp. 635-646, June 2007. Purpose: This investigation examined the effects of nebulized hypertonic saline, isotonic saline (IS), and sterile (hypotonic) water on phonation threshold pressure (PTP) and self-perceived phonatory effort (PPE) following a surface laryngeal dehydration challenge. Method: In a double-blind, randomized experimental trial, 60 vocally healthy women (n = 15 per group) underwent a laryngeal desiccation challenge involving oral breathing for 15 min using medical-grade dry air (RH < 1%). Three of the four groups then received nebulized isotonic saline (0.9% NaCl), hypertonic saline (7% NaCl), or sterile (hypotonic) water, respectively; the 4th group served as a nontreatment control. PTP and PPE were estimated for high-pitched productions at baseline, immediately postdesiccation, and at 5, 30, 35, and 50 min postnebulization. Results: PTP increased significantly for all groups following the desiccation challenge. PTP values were, on average, 0.5 cm H2O greater immediately postdesiccation versus baseline. In contrast PTP values did not change significantly following the administration of nebulized treatments, although a temporary trend toward a reduction in PTP was observed for the IS group. Unexpectedly, PPE ratings decreased significantly after the desiccation challenge. In general, PPE ratings were poorly correlated with PTP measures. Conclusions: A laryngeal desiccation challenge (i.e., temporary exposure to extremely low relative humidity while breathing transorally) significantly increased PTP. Although interesting trends emerged, none of the nebulized treatments significantly enhanced recovery from the negative effects of desiccation on PTP. In light of very low correlations between PTP and PPE, serious questions are raised regarding presumed associations between these measures. The KayPENTAX Multi-Speech, Model 3700, was used to acquire Fo data in this study.
“F2 Locus Equations: Phonetic Descriptors of Coarticulation in 17- to 22-Month Old Children,” Gibson, Terrie and Ralph N. Ohde, JHLSR, Vol. 50 No. 1, pp. 97-108, February 2007. The general purpose of this research was to describe coarticulation across voiced stop consonant place of articulation in 10 children younger than 2 years of age. A total of 1,182 voiced stop CV productions was analyzed using the locus equation metric, which yielded 3 regression lines that described the relation of F2 onset and F2 vowel for /bV/, /dV/, and /gV/ productions. The results revealed significant differential effects for slope and y-intercept as a function of stop consonant place of articulation. The ordering of the mean slope values for stop consonant place of articulation was /g/>/b/ and /d/, indicating that /g/ was produced with significantly greater coarticulation than /b/ or /d/. However, the unique vowel allophonic pattern of [g] coarticulation reported in the literature for English-speaking adults was generally not learned by these young children. Group and individual coarticulation trends are described in relation to developmental theories of sound acquisition. Results suggest that early coarticulation patterns are phoneme specific. In this study, CV productions for 7 children were digitized for acoustic analysis using the KayPENTAX Computerized Speech Lab (CSL), while the CV productions of the remaining 3 children were analyzed using the KayPENTAX Multi-Speech, Model 3700.
“Subjective and Objective Evaluation of Voice Quality in Patients with Asthma,” Dogan, Muzeyyen, Emel Eryuksel, Ismail Kocak, Turgay Celikel, and Mehmet Ali Sehitoglu, Journal of Voice, Vol. 21 No. 2, pp. 224-230, March 2007. Summary: Objectives: To evaluate the voice quality in patients with mild-to-moderate asthma by subjective and objective methods. Study design: Comparative, controlled, cross-sectional study. Methods: Patients with mild-to-moderate asthma (n = 40) and age- and sex-matched healthy controls (n = 40) were included. Acoustic analyses were performed by the Multi-Dimensional Voice Program (MDVP; Kay Elemetrics Corporation, Lincoln Park, NJ) and the movements of the vocal cords were examined by videolaryngostroboscopy (VLS). In addition, the duration of illness, maximum phonation time, “s/z” values, and vital capacity were evaluated. Voice Handicap Index (VHI) and GRB scales were used for subjective evaluations. Results: Maximum phonation time values were significantly shorter both in male and female asthma patients compared with controls (P < 0.0001). Also, average shimmer values in MDVP were higher for both sexes in the patient group compared with controls (P = 0.002 and P = 0.04, respectively). There was a significant difference between female patients and sex-matched controls with regard to mean noise-to-harmonic ration values (P = 0.006). Female patients with asthma had higher average jitter values compared with sex-matched controls (P < 0.0001). A significant difference was noted between asthma and control groups with regard to GRB scale (P < 0.0001, P < 0.001, and P < 0.0001, respectively). The VHI score was above the normal limit in 16 (40%), and VLS findings were abnormal in 39 (97.5%) asthmatics. Conclusion: In asthmatic patients, maximum phonation time, frequency, and amplitude perturbation parameters were impaired, but the vital capacity and the duration of illness did not correlate with these findings. In this study, the voice analysis was performed using the KayPENTAX Multi-Dimensional Voice Program.
“Adductor Spasmodic Dysphonia Versus Muscle Tension Dysphonia: Examining the Diagnostic Value of Recurrent Laryngeal Nerve Lidocaine Block,” Roy, Nelson, Marshall E. Smith, Brynn Allen, and Ray M. Merrill, Annals of Otology, Rhinology & Laryngology, Vol. 116 (3), pp. 161-168, March 2007. Objectives: Differentiating adductor spasmodic dysphonia (ADSD) from muscle tension dysphonia (MTD) can be difficult. This investigation examined the precision of response to unilateral lidocaine block of the recurrent laryngeal nerve (RLN block) as a potential diagnostic test to discriminate ADSD from MTD. Methods: Patients with ADSD (n = 23) and MTD (n = 20) were audio-recorded before and during RLN block. The patients completed self-ratings of dysphonia severity, vocal effort, and laryngeal tightness, and blinded listeners completed auditory-perceptual ratings of overall severity, breathiness, and strain of voice samples before and during the block. Results: Repeated-measures analysis of variance, with “group” (ADSD/MTD) as the between-subjects variable and “time” (before block/during block) as the within-subjects variable, confirmed significant “time” effects, but no significant “group-by-time” interaction effects, indicating that both disorder groups responded favorably to RLN block, according to patient- and listener-based ratings. Furthermore, low estimates of sensitivity and specificity and weak receiver operating characteristic curves confirmed that a positive response to the RLN block test did not distinguish ADSD from MTD. Conclusions: We conclude that RLN block offers little discriminatory value in the differential diagnosis of ADSD versus MTD, and a positive response to RLN block should not be considered confirmatory of ADSD. The KayPENTAX Multi-Speech, Model 3700, was used to digitize and store speech samples recorded before and during RLN block in this study.
“Evolution of Vocal Fold Nodules from Childhood to Adolescence,” De Bodt, M.S., K. Ketelslagers, T. Peeters, FL. Wuyts, F. Mertens, J. Pattyn, L. Heylen, A. Peeters, A. Boudewyns, and P. Van de Heyning, Journal of Voice, Vol. 21 No. 2, pp. 151-156, March 2007. Summary: Bilateral (quasi) symmetrical lesions of the anterior third of the vocal folds, commonly called vocal fold nodules (VFNs) are the most frequent vocal fold lesions in childhood caused by vocal abuse and hyperfunction. This study evaluates their long-term genesis with or without surgery and voice therapy. A group of 91 postmutational adolescents (mean age, 16 years), in whom VFNs were diagnosed in childhood, were questioned to analyze the evolution of their complaints. Thirty four of them could be clinically reexamined by means of the European Laryngological Society-protocol, including a complete laryngological investigation and voice assessment. A total of 21% of the questioned group (n = 91) had voice complaints persisting into postpubescence with a statistically significant difference (P ≤ 0.001) between boys (8%) and girls (37%). VFNs were still present in 47% of the girls and 7% of the boys of the clinically evaluated group (n = 34). Analysis of the data before and after puberty shows that the variables gender, allergy, and degree of dysphonia (“G”) in childhood enable a fairly correct prediction of persisting voice complaints in adolescence (sensitivity of 89% and specificity of 67%). The results of this study show a clearly different evolution for both sexes, with significant higher long-term risks for dysphonic girls with allergy. In this study, the KayPENTAX Voice Range Profile (VRP) and Multi-Dimensional Voice Program (MDVP) were used in conjunction with the Computerized Speech Lab (CSL) for acoustic analysis. “Long-Term Outcome of Hyperfunctional Voice Disorders Based on a Multiparameter Approach,” Van Lierde, K.M., S. Claeys, M. De Bodt, and P. van Cauwenberge, Journal of Voice, Vol. 21 No. 2, pp. 179-188, March 2007. Summary: The purpose of this study is to determine the long-term voice outcome (6.1 years after a well-defined voice treatment program) of hyperfunctional voice disorders in 27 subjects. All patients showed a muscle tension pattern type I (MTP I). Perceptual ratings, aerodynamic and acoustical analyses, Voice Handicap Index (DSI) were performed. The laryngovideostroboscopic images indicated that 51% of the subjects still show pathological laryngological findings. The negative evolution of the DSI from -1 to -3.2 is in agreement with this finding. Analysis of the components of the DSI shows that the main responsible variable for this negative change is the lowest intensity (I-low) that increased with 8.1 dB, indicating that subjects generally speak too loud, which is a typical problem for vocal hyperfunction. The VHI-score indicates an unimportant psychosocial impact of the voice disorder. The more objective and laryngostroboscopic findings indicate a chronic situation for a substantial part of the subjects and even a worse situation for some of them. Whether the long-term voice outcome results can be changed with the insertion of several follow-up voice rehabilitation sessions over the years remains unanswered and is a subject for further research. Acoustic analysis in this study was performed using the KayPENTAX Multi-Dimensional Voice Program (MDVP) and Voice Range Profile (VRP) in conjunction with the CSL.
“Current Diagnosis and Treatment of Laryngocele in Adults,” Dursun, Gursel, Ozan B. Ozgursoy, Suha Beton, and Hunkar Batikhan, Otolaryngology-Head and Neck Surgery, Vol. 136, No. 2, pp. 211-215, February 2007. Objectives: To evaluate the treatment outcome of a series of laryngoceles and to comment on the current diagnosis and management of laryngoceles. Study Design and Setting: A retrospective review of charts, radiological and histopathological notes, videolaryngostroboscopic records, and acoustic voice analyses of patients with laryngocele treated over a 10-year period was undertaken. Results: Seven patients had internal laryngoceles; one had external; another one had combined laryngocele. Patients with internal laryngocele underwent endoscopic CO2 laser resection, while those with external or combined laryngocele were treated via external approach. Quality of voice was improved and no recurrences were encountered during the follow-up. No evidence of laryngeal cancer was found on the histological examinations. Conclusions: Endoscopic CO2 laser resection of internal laryngocele provides a reliable and cost-effective method that minimizes hospitalization and the need for tracheotomy. We believe that advances in the applications of laser in microlaryngosurgery will alter the traditional management of all type of laryngoceles. The acoustic analysis in this study was performed using the KayPENTAX Computerized Speech Lab in conjunction with the Multi-Dimensional Voice Program (MDVP).
“The Effect of Visible Speech in the Perceptual Rating of Pathological Voices,” Martens, Jan W.M.A.F., Huib Versnel, and Philippe H. Dejonckere, Archives of Otolaryngology-Head & Neck Surgery, Vol. 133 No. 2, pp. 178-185, February 2007. Objectives: To test a simple method for improving consistency among raters for the perceptual evaluation of pathological voice quality by providing visible speech (spectrogram) as additional information because, to date, the interrater variability still limits the widespread clinical use of the best available rating system. Design: Experimental comparison between 2 different ways (with and without the addition of visible speech) of perceptual rating by trained professionals of recorded pathological voices. Furthermore, the correlation between acoustical (jitter, shimmer, and noise-harmonic ration) and perceptual parameters was investigated in both rating conditions. Subjects: Six experts evaluated 70 recorded pathological voices using the GIRBAS (grade, instability, roughness, breathiness, asthenicity, and strain) sale in 2 separate sessions: first, conventionally, without visible speech as additional information, and several months later, with visible speech as additional information. Main Outcome Measures: The κ interrater agreement and the correlation coefficient between GIRBAS scores and acoustic measures. Results: We found a significant effect of visible speech on the agreement between the raters. The interrater agreement according to κ statistics was significantly stronger with the addition of visible speech than without for rating grade, roughness, and breathiness. The correlation between acoustical and perceptual parameters showed no significant effect of visible speech. Conclusion: The addition of visible speech to the perceptual evaluation of pathological voices is an interesting clinical asset to enhance its reliability. The addition of visible speech to the clinical setting is feasible, since affordable computer programs are currently available that can provide the spectrogram in quasi-real time while conversing with the patient. The acoustical analysis might be applied in addition to perceptual rating in a multi-dimensional approach to assess voice quality. In this study, the KayPENTAX Multi-Dimensional Voice Program (MDVP) was used for acoustical analysis; spectrograms were generated using the Kay DSP Sona-Graph, Model 7800.
“Effects of Vocal Training and Phonatory Task on Voice Onset Time,” McCrea, Christopher R. and Richard J. Morris, Journal of Voice, Vol. 21 No. 1, pp. 54-63, January 2007. Summary: Objectives/Hypothesis: The purpose of this study was to examine the temporal-acoustic differences between trained singers and nonsingers during speech and singing tasks. Methods: Thirty male participants were separated into two groups of 15 according to level of vocal training (i.e., trained or untrained). The participants spoke and sang carrier phrases containing English voiced and voiceless bilabial stops, and voice onset time (VOT) was measured for the stop consonant productions. Results: Mixed analyses of variance revealed a significant main effect between speech and singing for /p/ and /b/, with VOT durations longer during speech than singing for /p/, and the opposite true for /b/. Furthermore, a significant phonatory task by vocal training interaction was observed for /p/ productions. Conclusions: The results indicated that the type of phonatory task influences VOT and that these influences are most obvious in trained singers secondary to the articulatory and phonatory adjustments learned during vocal training. The KayPENTAX Computerized Speech Lab (CSL), Model 4300B, was used to perform the acoustic analysis in this study.
“Transoral Approach to Laser Thyroarytenoid Myoneurectomy for Treatment of Adductor Spasmodic Dysphonia: Short-Term Results,” Su, Chih-Ying, Hui-Ching Chuang, Shang-Shyue Tsai, and Jeng-Fen Chiu, Annals of Otology, Rhinology & Laryngology, Vol. 116 (1, pp. 11-18, January 2007. Objectives: The surgical technique for the resection of the recurrent laryngeal nerve for adductor spasmodic dysphonia (ASD) has high late failure rates. During the past decade, botulinum toxin has emerged as the treatment of choice for ASD. Although effective, it also has significant disadvantages, including a temporary effect and an unpredictable dose-response relationship. In this study we investigated the effectiveness of a new transoral approach to laser thyroarytenoid myoneurectomy for treatment of ASD. Methods: Fourteen patients with ASD underwent transoral laser myoneurectomy of bilateral thyroarytenoid muscles. Under general anesthesia, an operating microscope and a carbon dioxide laser were used to perform myectomy of the mid-posterior belly of bilateral thyroarytenoid muscles together with neurectomy of the terminal nerve fibers among the deep muscle bundles Care was taken not to damage the vocalis ligaments, arytenoid cartilages, and lateral cricoarytenoid muscles. Preoperative and postoperative videolaryngostroboscopy and vocal assessments were studied. Results: The 13 patients who completed more than 6 months follow-up were enrolled in this study. Moderate and marked vocal improvement was achieved in 92% of the patients (12 of 13) after laser surgery during an average follow-up period of 17 months (range, 6 to 31 months). No vocal fold atrophy or paralysis was observed in any patient. None of the patients had a recurrence during the follow-up period. Conclusions: Transoral laser myoneurectomy of bilateral thyroarytenoid muscles is a relatively simple, effective, and valuable technique for the treatment of ASD. The durability of outcome achieved with this procedure is encouraging. A KayPENTAX Computerized Speech Lab (CSL), Model 4300B, was used to measure acoustic parameters including mean fundamental frequency, noise-to-harmonics ratio, jitter, and shimmer.
“The Speaker’s Formant,” Irene Velsvik Bele, Journal of Voice, Vol. 20 No.4, pp. 555-578, December 2006. Summary: The current study concerns speaking voice quality in two groups of professional voice users, teachers (n = 35) and actors (n = 36), representing trained and untrained voices. The voice quality of text reading at two intensity levels was acoustically analyzed. The central concept was the speaker’s formant (SPF), related to the perceptual characteristics “better normal voice quality” (BNQ) and “worse normal voice quality” (WNQ). The purpose of the current study was to get closer to the origin of the phenomenon of the SPF, and to discover the differences in spectral and formant characteristics between the two professional groups and the two voice quality groups. The acoustic analyses were long-term average spectrum (LTAS) and spectrographical measurements of formant frequencies. At very high intensities, the spectral slope was rather quadrangular without a clear SPF peak. The trained voices had a higher energy level in the SPF region compared with the untrained, significantly so in loud phonation. The SPF seemed to be related to both sufficiently strong overtones and a glottal setting, allowing for a lowering of F4 and a closeness of F3 and F4. However, the existence of SPF also in LTAS of the WNQ voices implies that more research is warranted concerning the formation of SPF, and concerning the acoustic correlates of the BNQ voices. The spectrograms in this study were generated with a KayPENTAX DSP Sona-Graph, Model 5500. “Intelligibility of Tracheoesophageal Speech in Noise,” Douglas A. McColl, Journal of Voice, Vol. 20 No. 4, pp. 605-615, December 2006. Summary: The purpose of this investigation is to determine the extent to which background noise negatively impacts the intelligibility of tracheoesophageal (TE) speech. Four male TE speakers provided speech samples that were recorded in quiet and in noise conditions. The listener/subjects occupied a sound-treated booth and were presented with two tasks. In Task 1, the subjects were required to transcribe TE speech stimuli recorded in quiet. In Task 2, the subjects were required to transcribe TE speech stimuli recorded in noise. Repeated measures 2 x 4 factorial analyses of variance were calculated for the dataset. The results of the statistical analysis revealed that the TE speech produced in quiet was significantly more intelligible to the listeners than the TE speech produced in noise for three of the four TE speakers. Furthermore, the results seem to support the hypothesis that the activation of a Lombard effect in TE speakers may detract from their overall speech intelligibility. In this study, the acoustical analyses of the vocal parameters (jitter, shimmer, and noise-to-harmonic ratio) were performed using the KayPENTAX CSL, Model 4300B in conjunction with the Multi-Dimensional Voice Program, (MDVP), Model 5105.
“The Effect of Perceptual Training on Inexperienced Listeners’ Judgments of Dysphonic Voice,” Eadie, Tanya L. and Carolyn R. Baylor, Journal of Voice, Vol. 20 No. 4, pp. 527-544, December 2006. Objectives/hypothesis: The purpose of this study was (1) to determine whether changes in intra- and interrater reliability occur for inexperienced listeners’ judgments of overall severity, roughness, and breathiness in dysphonic and normal speakers after 2 hours of listener training; and (2) to determine the acoustic bases of inexperienced listeners’ judgments before and after training. Study Design: Prospective, single group, pre- and postdesign. Methods: Thirty adult dysphonic and six normal speaker samples were selected from a database. Samples included 21 test stimuli and 15 training stimuli of both sustained vowels and connected speech. Sixteen inexperienced listeners judged all samples for overall severity, roughness, and breathiness using visual analog scales. Each listener provided pretraining ratings at baseline. Listeners were then trained using 15 anchor voice samples and 15 training stimuli. During training, listeners were provided with definitions of rating dimensions, accuracy feedback, and anchor samples. Listeners then judged test stimuli in a posttraining session. Speaker samples also were analyzed acoustically. Results: Intrarater reliability was least variable for judgments of overall severity, but improved further with training. Listener judgments of roughness and breathiness in vowels were least reliable at baseline, but they significantly improved between listeners after training. Finally, measures of cepstral peak prominence significantly predicted all voice quality judgments except roughness in vowels, which was predicted by shimmer. The acoustic bases of group perceptual judgments did not seem to change with training. Conclusions: These findings have implications for developing training programs in perceptual evaluation and mapping relationships between acoustic and perceptual characteristics of voice disorders. Voice samples from the KayPENTAX Disordered Voice Database and Program, Model 4337, were used in this study; the KayPENTAX Multi-Dimensional Voice Program (MDVP), Model 5105, was used for acoustic analysis. “The Effect of Speaking Context on Elicitation of Habitual Pitch,” Zraick, Richard, I., Mollie A. Gentry, Laura Smith-Olinde, and Brent A. Gregg, Journal of Voice, Vol. 20 No. 4, pp. 251-262, December 2006. Summary: The purpose of this study was to investigate if there was an effect of speaking context on the elicitation of habitual pitch [speaking fundamental frequency (SFF)]. Six simulated speaking contexts were created (speaking during a voice evaluation, speaking in public, speaking to a peer, speaking to a superior, speaking to a subordinate, and speaking to a parent or spouse), and the SFF for 30 adult women with normal voice was compared across these contexts. A one-way analysis of variance (ANOVA) revealed a statistically significant (P < 0.001) effect of simulated speaking context on SFF, with post hoc analyses indicating a statistically significant difference in SFF while “speaking to a superior” (P < 0.001) and “speaking to a subordinate” (P < 0.001). Possible reasons for an effect of speaking context are discussed. Also, the implications of the use of varied speaking contexts when eliciting SFF are discussed, as in the possibility of an effect of speaking context on the elicitation of other clinically useful voice parameters. In this study, voice samples were captured with the KayPENTAX Computerized Speech Lab (CSL), Model 4500, and were analyzed with the KayPENTAX Multi-Dimensional voice Program (MDVP), Model 5105.
“Ambulatory Monitoring of Disordered Voices,” Hillman, Robert E., James T. Heaton, Asa Masaki, Steven M. Zeitels, and Harold A. Cheyne, Annals of Otology, Rhinology & Laryngology, Vol. 115 (11), pp. 795-801, November 2006. Objectives: Recently developed systems for ambulatory monitoring of voice use employ miniature accelerometers place at the base of the anterior neck to sense phonation. As it is hope that such systems will help improve the clinical assessment and management of voice disorders, this study was undertaken to determine the impact of dysphonia severity on the accuracy of accelerometer-based estimates of vocal function. Methods: Simultaneous recordings were made of oral acoustic (microphone) and neck skin acceleration signals for 6 normal speakers and 18 patients with voice disorders (mild to severe dysphonia) as they performed several speech tasks. Measures of phonation time, fundamental frequency, and sound pressure level were extracted from the two types of signals and compared. Results: It was generally demonstrated that accelerometer-based measures closely approximated corresponding measurements obtained from a microphone signal across all levels of dysphonia severity. Furthermore, there was evidence that in some cases the accelerometer may actually represent a more robust approach for estimating phonation parameters in disordered voices. Conclusions: The results generally support the recent application of accelerometers as phonation sensors in ambulatory voice monitoring systems that can be used in the clinical assessment and management of voice disorders. In this study, the KayPENTAX Ambulatory Phonation Monitor (APM), Model 3200, was used to perform both data collection and analysis.
“Birth Control Bills and Nonprofessional Voice: Acoustic Analyses” Amir, Ofer, Tal Biron-Shental, and Esther Shabtai, Vol. 49 No. 5, pp. 1114-1126, October 2006. Purpose: Two studies are presented here. Study 1 was aimed at evaluating whether the voice characteristics of women who use birth control pills that contain different progestins differ from the voice characteristics of a control group. Study 2 presents a meta-analysis that combined the results of Study 1 with those from 3 recent studies that compared voices of women who use and do not use birth control pills. Method: In Study 1, voice samples from 30 women with no history of voice training, who use pills with different progestins (drospirenone, desogestrel, gestodene), and 10 women who do not use the pill were recorded at specific time points across the menstrual cycle and were analyzed acoustically. In Study 2, results from Study 1 were analyzed jointly with results from three recent studies, which used similar methodologies. Results: Results of Study 1 did not reveal acoustic differences in sustained phonation of vowels across the pill groups and controls. Results of the meta-analysis performed in Study 2 indicated that pill users exhibited lower jitter and shimmer values on sustained vowels, whereas no difference of fundamental frequency was observed among women who use the pill. Conclusions: These results support findings from previous studies, which suggested that no adverse effect on voice was detected among nonprofessional speakers who use new-generation monophasic birth control pills, for the measures studied. Furthermore, results of the meta-analysis suggested that some acoustic properties of the voice, which are reflected in perturbation measures in sustained vowels, may be improved among women who use the pill. The KayPENTAX Multi-Dimensional Voice Program (MDVP), Model 5105, was used to perform the acoustic analysis in this study. “Articulation Rate and Vowel Space Characteristics of Young Males with Fragile X Syndrome: Preliminary Acoustic Findings” Zajac, David J., Joanne E. Roberts, Elizabeth A. Hennon, Adrianne A. Harris, Elizabeth F. Barnes, and Jan Misenheimer, Vol. 49 No. 5, pp. 1147-1155, October 2006. Purpose: Increased speaking rate is a commonly reported perceptual characteristic among males with fragile X syndrome (FXS). The objective of this preliminary study was to determine articulation rate—one component of perceived speaking rate—and vowel space characteristics of young males with FXS. Method: Young males with FXS (n = 38), developmental age (DA)-matched males (n = 21), and chronological age (CA)-matched males (n = 16) were audiotaped while engaged in spontaneous conversation and a picture-naming task. Articulation rate in syllables per second during intelligible utterances and vowel space area/dispersion measures were acoustically determined for each speaker. Results: Males with FXS did not articulate significantly faster than CA-matched males. Area and dispersion of the acoustic vowel space also were similar between the 2 groups. Males with FXS, however, used significantly shorter utterances and had a tendency to pause less often than CA-matched males. In addition, males with FXS exhibited greater intraspeaker variability of formants associated with the vowel /a/. Conclusions: These preliminary findings suggest that articulation rate may not be a primary factor contributing to perceived speaking rate of males with FXS. Limitations of the study relative to speech production tasks and utterance intelligibility are discussed. Researchers in this study used the KayPENTAX Computerized Speech Lab (CSL), Model 4400, in the acoustic analysis performed.
“Response of the Female Vocal Quality and Resonance in Professional Voice Users Taking Oral Contraceptive Pills: A Multiparameter Approach,” Van Lierde, Kristiane M., Sofie Claeys, Marc De Bodt, and Paul Van Cauwenberge, Laryngoscope, Vol. 116, pp. 1894-1898, October 2006. Objectives: The purpose of this study was to analyze the vocal quality and resonance (nasality and nasalance values) during the menstrual cycle in professional voice users using oral contraceptive pills (OCPs). Although professional voice users are more sensitive and aware of their vocal quality, no changes of voice and resonance characteristics were expected because OCPs create a stable hormonal balance through the menstrual cycle. Study Design: The authors conducted a comparative study of 24 healthy, young professional voice users using OCPs. One assessment was performed between the 10th and 17th day of pill intake, when hormonal levels reached a steady state. The second assessment was performed during the first 3 days of menses, when no pills were taken and hormonal levels were minimized. Methods: Subjective (perceptual evaluation of voice and nasality) and objective (aerodynamic, voice range, acoustic, Dysphonia Severity Index [DSI], nasometer) assessment techniques were used. Results: The Mann-Whitney U test showed no significant difference between the perceptual evaluation of the voice and the nasality in the two assessments. The paired Student t test showed no significant difference regarding the maximum phonation time, the vocal performance, the acoustic parameters, and the DSI. Conclusion: These findings indicate that OCPs do not have an impact on the objective and subjective voice and resonance parameters in young professional voice users. This information is specifically relevant to professional voice users who are more aware of vocal quality changes and ear, nose and throat specialists/voice therapists who treat professional voice users with voice problems/disorders. Further research regarding the impact of increased vocal load during the premenstrual or menstrual phase in professional voice users using OCPs should be considered. The KayPENTAX Nasometer, Model 6200, was used to obtain nasalance values in this study. The KayPENTAX Voice Range Profile (VRP) and Multi-Dimensional Voice Program (MDVP) were used to obtain voice range measures and perform the acoustic analysis, respectively.
“Clinical Evaluation of Parkinson’s-Related Dysphonia,” Sewall, Gregory K., Jack Jiang, and Charles N. Ford, Laryngoscope, Vol. 116, pp. 1740-1744, October 2006. Objective/Hypothesis: Nearly one third of patients with idiopathic Parkinson’s disease (IPD) cite dysphonia, characterized subjectively as causing a harsh and breathy voice, as their most debilitating deficit. Medical or behavioral treatments may lead to voice improvement. The purpose of this study was 1) to determine whether vocal fold injection of Cymetra (micronized form of collagen, elastin, proteoglycans; Lifecell Co.) is associated with changes in dysphonic voice characteristics in subjects with IPD, as judged perceptually using a standard instrument Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V), and (2) which acoustic and aerodynamic measurements of voice are most reflective of any observed perceptual changes in voice. Study Design: Prospective clinical evaluation of patients with Parkinson’s-related dysphonia (PRD). Methods: Six patients with PRD were evaluated before treatment for the presence of dysphonia and glottal gap. All subjects underwent trans-oral vocal fold collagen injection using topical anesthesia in the otolaryngology clinic as part of their clinical care. At the initial clinic visit, and 10 to 14 days after vocal fold collagen injection, patients were asked to complete the Voice Handicap Index (VHI), a questionnaire concerning voice-related quality of life, and perceptual analyses of voice quality were performed. In addition, patients underwent acoustic (pitch/loudness range, maximum phonation time [MPT], and aerodynamic phonation threshold pressure [PTP] voice analysis. Results: Five of six subjects had self-perceived improvements in voice after treatment, as determined by the VHI (range, +8 to -24). All five subjects who completed testing demonstrated decreased PTP (range, -1.3 to -2.7, P = .002). Five of six subjects demonstrated statistically significant improvements in MPT (range, -2-16 s, P = .05). Five of six subjects had improved pitch range (-26-343 Hz), whereas all subjects had increased intensity range (0.6-23 dB) after injection. Conclusion: Trans-oral collagen injection in patients with PRD is safe, well tolerated, and is an effective temporary method of subjectively improving voice and speech in selected patients with IPD. Reduction of glottal gap with collagen improved MPT and subglottal PTP. The resulting gain of vocal efficiency may reduce vocal fatigue and provide a useful adjunct to voice therapy for PRD.
|