Home  |  Contact Us    

Home
Company History and Profile
Visiting KayPENTAX
Product Information
Publications
KayPENTAX in the News
Conferences and Workshops
What's New
International Representatives
Sales
Tech Support
Employment Opportunities

 

 

COMMUNIKAY Vol. 3, No. 1


Motor Speech Profile Introduced

Kay introduced a new software option for the powerful CSL platform at ASHA '95 in Orlando called the Motor Speech Profile (MSP), Model 4341. The new program, which is the result of collaborative efforts between Kay and clinical speech scientists, provides acoustic analysis of motor disordered speech based on a variety of specific tasks elicited from the patient. These results are then quantified and/or graphed to help objectify patient performance.

MSP is similar in concept to another CSL option, the Multi-Dimensional Voice Program (MDVP), which objectifies many voicing parameters. MSP is focused specifically on motor speech assessment. Both programs were developed by Dimitar Deliyski, Ph.D., of Kay's engineering department.

The evaluation of motor speech disorders is challenging for the speech-language pathologist and neurologist. While perceptual analysis is valuable, there is also a need to objectively evaluate a variety of parameters of the subject's speech to fully assess speech production. These can include, among others, breath support, voicing behavior, nasality, rate of speech, and vowel and consonant production. Each of these can be evaluated objectively to more fully describe the subject's speech. The clinician can supplement the more subjective impressions of overall intelligibility with data that can be measured and monitored over time. Ideally, even subjective evaluation of intelligibility should be rated by a new listener unfamiliar with a patient's speech patterns.

MSP provides built-in protocols to analyze separate parameters. Says Deliyski, "The program uses the acoustic parameters most closely associated with a perceptual attribute to objectify speech behavior. For example, distorted vowels are often characterized by neutralized second formant transitions. MSP sets up a task to measure and assess this behavior."

In another example, diadochokinetic (DDK) rate and its periodicity have been shown to be closely associated with articulator mobility. A DDK task is also included in the protocol which provides a model followed by the subject's attempt, and then automatic analysis of the production.

According to Deliyski, a new feature (not displayed at ASHA) that will ultimately be included in MSP is the analysis and graphic display of long-term frequency and amplitude modulation (e.g., tremor, vibrato, etc.) in the patient's voice. These modulation parameters will also be derived from a simple, protocol-driven elicitation of the task to facilitate sample collection and analysis for the speech clinician. The results from other instrumental procedures (e.g., aerodynamic, nasometric, etc.) can also be entered into the analysis report summary.

Kay received thoughtful input from a large number of clinicians and speech scientists during the design of MSP. Though the following list is not comprehensive, we wish to especially acknowledge the valuable contributions from: Klaas Bakker, Ph.D., SW Missouri State University; Frank Boutsen, Ph.D., University of South Alabama; Eugene Buder, Ph.D., University of Memphis; Carla Gress, Ph.D., University of California at San Francisco; and Shimon Sapir, Ph.D., Northwestern University. Please note that suggestions/contributions from these individuals do not imply their endorsement of this or any other Kay product.


Researchers Adapting MDVP for Sung Voice

Politicians. Actors. Lawyers. World-class opera singers. Cantors. Music educators. Even 7-year-old singers on cruise ships. Professional voice users all, and all among the scores of patients who routinely visit the Texas Voice Center in Houston.

But making bold new strides in the research arena is as much a part of the routine at the Center as is restoring and maintaining the vocal health of their patients.

Under the direction of Dr. C. Richard Stasney, clinical associate professor of otolaryngology at Baylor College of Medicine, the Center uses a team approach that successfully interfaces the three professions of vocal pedagogy, laryngology, and voice science/speech pathology.

Among the most exciting and ground-breaking work under way is that of Sharon Radionoff, the Center's singing voice specialist and voice technologist. For her dissertation, Radionoff is completing one of the largest studies ever performed in the singing world. She has collected hundreds of voice samples of singers that she has analyzed using Kay's Multi-Dimensional Voice Program.

"The Texas Voice Center is among the first clinics to routinely use the MDVP on our huge volume of voice patients," says Dr. J. David Garrett, assistant professor at Baylor College of Medicine and the voice scientist/voice physiologist at the Center. "We have seen that there really are very clear-cut correlations among the acoustics, physiology, and pathology," he explains, "and our goal has to be to continue to tie those together."

MDVP extracts reliable and accurate analysis of voicing parameters. The analysis results are graphed against normative thresholds. These thresholds are appropriate for flat-tone sustained phonation of the /a/ vowel. "But here," Garrett says, "when a voice patient comes in, we're doing a spoken /a/, which we then follow by doing the sung /a/. The problem is that we have no normative data on singers."

Radionoff's work will redefine these normative standards for singers. "Right now, the MDVP displays analysis results outside of normative thresholds even in the most perfect singing voice," she says. "But if we and other labs are going to be using this instrumentation for the singing production," she says, "it's critical that we have new normative thresholds that we can compare the population of abnormal singers to. We need to know what is normal for a singer."

Radionoff tracked 28 voice students at Rice University's Shepherd School of Music during a year of voice study. She collected data on three occasions (i.e., in October, January, and April). On each occasion, she collected phonatory and respiratory data, and a series of five trials for spoken and for sung phonations. All of these samples, multiplied by the 33 parameters extracted by the MDVP, will be used to develop what Dr. Garrett calls, "the ultimate normative database."

Radionoff is lavish in her praise of the reliability of the MDVP/CSL instrumentation. "Without reliable instrumentation, all of the numbers that have been collected would mean nothing."

Garrett and Radionoff look forward to having optional normative thresholds on the MDVP that would allow the user to "Set for Singer's Values." This would display all of the different parameters, but with each one scaled according to Radionoff's data. Then, they explain, what now appears to be deviation from normal might just be a pattern that naturally occurs when using vibrato, for example. "It will also give singers a feedback mechanism which," Garret claims, "will never replace their voice teachers or the human ear, but which will make their lives easier."

"We're at the threshold of a brand new era. It's an exciting time to be in the field," says Garrett.



Our Customers Ask...

What is the advantage of the new three-chip CCD camera offered by Kay?

For stroboscopy, general endoscopy, and FEES, Kay's standard packages include a single-chip CCD camera. Kay now also offers a three-chip camera, though it is not standard due to its higher price. The major advantage of the three-chip camera technology is that instead of integrating red, green, and blue on a single chip, the three primary colors each have their own separate CCD chip and circuitry. The result is truer color, higher signal-to-noise ratio (56 dB), and higher resolution (700 lines) than are possible with single-chip cameras (46 dB signal-to-noise, and 430 lines of resolution). Some customers will undoubtedly view the improved clarity worth the extra expense. And although the cost difference is still substantial (approximately 70% higher for the three-chip models), it has declined steadily over the last two years, a trend that will likely continue.

On the RLS (Rhino-Laryngeal Stroboscope), the audio volume knob [located in upper right section of the front panel] controls the audio microphone signal level. Do adjustments of this knob affect extracted amplitude measurements?

No. The audio microphone (attached to the top of the CCD camera and plugged into the strobe unit) has two functions. One is to provide voice amplitude measurements in real time as the patient phonates during the stroboscopic examination. This measurement is completely independent of the audio volume knob's position.

The second function is to provide the voice signal to the VCR so you can hear the patient's voice on playback of video recordings. The volume knob controls the audio microphone's signal level being fed to the VCR. Observe the VU meter on the VCR during phonation, and make adjustments so that the meter is at mid-range during comfortable phonation. You may need to make adjustments in extreme cases: for patients with very soft voices (e.g., unilateral vocal fold paresis), turn the knob clockwise to increase output level to the VCR; for very loud voices (e.g., professional singer phonating fortissimo), adjust the knob counterclockwise.

How can I import Palatometry data into the CSL to compare physiologic and acoustic data?

The Palatometer acquires data from 96 electrodes embedded in a custom-fitted pseudopalate worn in the mouth as well as the acoustic signal (using microphone input). Within the Palatometer software, the speech waveform is time-linked to the linguapalatal contact patterns made during a speech utterance. These two sets of data are stored to a single file (.nsp format) when saved to disk.

Users may wish to perform in-depth acoustic analysis (e.g., spectrographic, pitch, amplitude, etc.) concurrently with analysis of the physiologic data provided by the Palatometer. The CSL allows importation of data for this purpose from both Kay's Palatometer as well as the Reading (Hardcastle) Electropalatograph.

Within CSL, load the file of interest; the acoustic waveform will load to view A. Now create a new view (preferably square-shaped) to accommodate palatometric data. With the new view active, click Analyze on the Main Menu, then "Show Palatometer Display". The file's palatometric data will appear in this view. You may now create additional views for spectrographic, pitch, and other data; link all views. Cursor positions in all views of data are precisely linked in time to the palatometric data of the spoken utterance.


Technically Speaking...

Loading the latest CSL software on 4300A hardware

CSL software has been updated on an ongoing basis to add new features and functionality; the more recent releases (5.0 and higher) take advantage of the newer 4300B hardware architecture introduced in August 1993 (see summary in CommuniKay Vol. 1, No.1). However, these later software versions also work with the 4300A hardware. Although a few software features specific to 4300B hardware (e.g., DAT interface, up to four channels of input, etc.) do not function with 4300A hardware, users can take advantage of many other advantages in these later versions.

When updating to CSL version 5.0, observe the following points to ensure the program will install and run properly. First, the recommended entry level computer for the newer software is a PC 486 DX. Second, your computer's available conventional (DOS) memory must be at least 565K (less was required in prior versions). Type mem from the root directory to verify. If not, type memmaker (MS-DOS 5.0 or higher required) from the root directory to optimize your conventional memory. If this is still not adequate, manually edit the autoexec.bat file to free up memory.

Third, the new HARDWARE.CFG file, which contains the I/O and IRQ default values set at the factory, must be altered if your previous CSL software was set to non-factory settings. Configuration modifications that you may have made originally are not automatically carried over when you install the CSL 5.0 software. Note that any modifications made in your prior CSL software (versions 4.x and earlier) can be viewed in your old CSL.CFG file.

Finally, any options you have purchased will also be updated when you upgrade to the CSL 5.0 version. These should be installed along with the core CSL software.

Call the factory (ext.160) or your local representative for additional information.


Tips for Accurate Signal Acquisition

When you capture or import acquired signals into Kay's CSL or Visi-Pitch II, an understanding of how unwanted noise can be mixed with the signal of interest will improve your ability to interpret the analysis of a signal compromised by noise. Noise is generally taken to mean random fluctuations, added to or modulated with, a wanted signal. Here, however, noise means any unwanted signal, periodic or not, existing with a desired signal. For example, if you import a voice signal acquired using a multimedia card, you may unwittingly report voice parameters (e.g., jitter, harmonic/noise ratios) which have been significantly altered by noise added during signal acquisition. In another example, if you connect other components to your CSL during capture (e.g., capturing through a DAT player simultaneously), you may be adding noise without realizing it.

Any method of acquiring and storing signals can affect the signal quality. Noise can be introduced during signal acquisition in a number of ways. System components (e.g., microphone, cabling, pre-amplifier, amplifier, anti-aliasing filters and A/D) could be of poor quality. The CSL and Visi-Pitch II use professional-level system components and careful design which minimize system-generated noise. However, if you import signals acquired with other products, you should evaluate their performance. Additionally, noise sources (e.g., fan noise, electromagnetic signals from monitors or fluorescent light fixtures, power supply hum) may inadvertently be acquired along with the signal. Poor room acoustics can also add noise.

A digital storage system (e.g., computer acquisition, DAT recorder) converts the incoming continuous analog signal to a discrete digital signal. Converting an analog signal to a digital representation is performed by an analog-to-digital converter (A/D converter). Most systems today use a 16-bit A/D converter which produces a 16-bit binary number to represent the range of incoming values. A binary number with 16 places has a range of 216 or 65,536 possible values. This range of values equals 96 dB of possible signal level variation (20 log10 65536). This is the maximum achievable range, also called the dynamic range, with a 16-bit linear converter. The full 96 dB range, however, is seldom achieved because the associated electronics can rarely take advantage of the full dynamic range available. For example, most sound cards which plug into a computer can lose half of the dynamic range to system noise and large DC offsets.

One simple way to perform a partial check of a recording system, without the need for test equipment, is to acquire a signal exactly as you would during your work except with the microphone turned off. Then, with the Visi-Pitch II Waveform Editor or CSL, analyze the acquired signal for noise by scrolling the cursor along the waveform to note the noise level. If you repeat the above test with the microphone on (but quiet), you can separately measure the noise added by microphone pickup of acoustic and electromagnetic signals. For example, if the values range -200 or +200, under either condition, the eight or nine least significant bits (about 50 dB) are corrupted by noise. The true dynamic range of the 16-bit A/D has been reduced from 96 dB to about 46 dB!

A useful dynamic range of above 85 dB is recommended by the National Center for Speech and Voice for voice measurements. This means that the noise should fall within the range of -4 to +4 using the tests described above.


Calendar of Upcoming Events

Please look for Kay products on display at the following conferences, workshops, and congresses.
Editor's note: Additional listings for 1996 will be provided in Vol. 3, No. 2.


Conferences

February 22-25 Illinois Speech-Language Hearing Association, Chicago, IL
  Contact: ISHA (312) 644-8557
March 4-6 Pennsylvania Speech-Language-Hearing Association, Pittsburgh, PA
  Contact: PSHA, (412) 778-2052
March 21-24 California Speech-Language-Hearing Association, Monterey, CA
  Contact: CSHA, (916) 921-1568
March 22-23 Spectrum of Dysphagia, MCW Dysphagia Institute, Milwaukee, WI
  Contact: Reza Shaker, M.D., (414) 259-2742
March 29-30 American Academy of Otolaryngology, Cherry Blossom Conference, Arlington, VA
  Contact: AAO, (703) 836-4444
April 11-13 Texas Speech-Language-Hearing Association, Dallas, TX
  Contact: TSHA, (512) 452-4636
May 4-1 Combined Otolaryngological Spring Meetings (COSM), Orlando, FL
  Contact: AAO, (703) 836-4444
May 13-17 Acoustical Society of America (ASA), Indianapolis, IN
  Contact: ASA, (516) 576-2360
June 3-10 Care of the Professional Voice and International Congress on Phonosurgery, Philadelphia, PA
  Contact: Voice Foundation, (215) 735-7999
October 3-6 International Conference on Spoken Language Processing, Philadelphia, PA
  Contact: ICSLP @ Univ. of Delaware, (302) 651-6830

Workshops

March 7-9 CSL Training Workshop, Kay Elemetrics Corp., Lincoln Park, NJ
  Contact: Kay Elemetrics Corp., (201) 628-6200
April 19 In-House Training and Demonstrations, Kay Elemetrics Corp., Lincoln Park, NJ
  Contact: Kay Elemetrics Corp., (201) 628-6200
June 14 In-House Training and Demonstrations, Kay Elemetrics Corp., Lincoln Park, NJ
  Contact: Kay Elemetrics Corp., (201) 628-6200
June 28-29 Laryngeal Video Endostroboscopy (LVES), Nashville, TN
  Contact: Vanderbilt Univ. CME, (615) 322-4030
September 19-21 CSL Training Workshop, Kay Elemetrics Corp., Lincoln Park, NJ
  Contact: Kay Elemetrics Corp., (201) 628-6200
October 11-12 Laryngeal Video Endostroboscopy (LVES), Nashville, TN
  Contact: Vanderbilt Univ. CME, (615) 322-4030

International Conferences

April 19-20 Clinical Voice Disorders Course, Maastricht, The Netherlands
  Contact: University Hospital, Maastricht, Fax: +31-43-87-7878
April 23 CSL Training Workshop, Brussels, Belgium
  Contact: Kay Elemetrics Corp., Fax: +1-201-628-6363
June 3-10 Care of the Professional Voice and International Congress on Phonosurgery, Philadelphia, PA, USA
  Contact: Voice Foundation, Fax: +1-215-735-9293
July 28-August 1 4th International Conference on Head and Neck Cancer , Toronto, Canada
  Contact: Jatin Shah, M.D., Fax: +1-507-281-8328
August 30-31 Congenital and Postoperative Swallowing Disorders Conference, Munich, Germany
  Contact: Organizing Committee, Fax: +49-89-4140-4940
October 3-6 International Conference on Spoken Language Processing, Philadelphia, PA, USA
  Contact: H. Timothy Bunnell, Ph.D., Fax: +1-302-651-6895

Add Video-Otoscopy to Your Strobe

Are you getting the most out of your Kay stroboscopy system? Components used for stroboscopy (e.g., light source, CCD camera, lens/adaptor, VCR, monitor, printer) can also be used for other endoscopic procedures. An increasingly popular one is video-otoscopy.

Otolaryngologists routinely perform otoscopic exams to evaluate the ear canal and tympanic membrane. For physicians who would like a convenient means of sharing their observations with their patients, or for educating residents at teaching hospitals, video-otoscopy provides a large, clear image of the structures of interest on the video monitor for all to see. The physician can point to critical findings in the otoscopic exam and be certain that all are observing the same features. The exam can be video recorded for later viewing (e.g., after treatment) or used for pedagogical purposes at a later time. Hard copy color prints can be placed in the patient's chart or sent to a referring physician.

Kay's stroboscopy systems (both the Basic and Computerized) contain all of the components, except the rod otoscope, to perform video-otoscopy. However, Kay now offers a rod otoscope, and others are available from a number of manufacturers. These otoscopes contain a rod lens for transmitting the image and a circumferential fiberoptic bundle which transmits light for structure illumination. Most commercially available devices are easily coupled to the standard lens/adaptor eyepiece used for other rigid and flexible endoscopes.

Kay's Computerized System has the added advantage of allowing image digitization for storage in the computer. Left ear versus right ear can be placed side by side on the screen and annotated to clarify specific findings.

Remember that components of your stroboscopy system can be shared across a variety of endoscopic applications.

 

 

 

 

Copyright © 1996-2008 KayPENTAX. All rights reserved. Site Map  |  Contact Us