PARKINSON’S DISEASE ASSESSMENT USING SPEECH ANOMALIES: A REVIEW Taha Khan Department of Computer Engineering Dalarna University, 78188 Borlange, Sweden [email protected] ABSTRACT This paper presents a detailed systematic literature review on Parkinson’s disease severity assessment methods based on speech impairment. Techniques that cope up the challenges of voice signal processing in real-time have been reviewed. An analysis on finding relevant acoustic parameters that are robustly influenced by Parkinson’s disease symptoms and medication-related changes was done. Time-series analysis of voice acoustics, feature extraction and pattern recognition methods are evaluated on the basis of portability and compatibility to mobile devices. A twotier review methodology is developed based on the applicability of pathological voice recognition methods in real-time. The goal is to design an algorithm based on the review synthesis that can tackle real-time constraints in pathological voice recognition for the assessment of Parkinson’s disease severity. Pause detection, peak to average power rate clipping and zero thresholding rate calculations are proposed for human voice segmentation. Previous work depicts that these methods produce rich voice features in real-time. We suggest that these features may be further processed using wavelet transforms and used with a neural network for detection and quantification of speech anomalies related to Parkinson’s disease. General Terms Algorithms, Measurement, Reliability, Experimentation. Keywords Parkinson Disease, Hypokinetic Dysarthria, Speech Recognition. 1. INTRODUCTION The need for real world Parkinson’s disease (PD) treatment remains unmet for the vast majority of patients. Since PD is progressive, there is a need of follow up treatments over time. Medicine dosing needs to be adjusted daily with respect to physical exercise, food intake and mood [1]. Only 3-4% of the patients receive timely treatment [2]. Reasons for this are that individuals may have physical limitations which make it difficult for them to come for treatment or they may not have easy access to treatment provided by a therapist. High-performance personal digital assistants (PDA) with advanced sound processing capabilities have potential to tackle the challenge of treatment accessibility [3]. Clinician monitoring and feedback can be preserved using the PDA since it has the portability to adapt clinician directed treatment to home self training[3]. Portable feedback devices for pathological individuals have been previously investigated. Zicker et al. [3] found that an individual with PD was able to modify her behavior when she received feedback from a device that her speech was too soft. Multisite treatment and the effective data to evaluate the delivery of consistent treatment among clinicians can be acquired using a PDA [3]. 1.1 PD effects on Human Voice PD is characterized by the loss of dopaminergic neurons in brain [4]. This loss results in dysfunction of basal ganglia pathway which is an essential part of the circuitry that mediates motor and cognitive functions. As a result of dopamine loss in basal ganglia, there can be a number of motor symptoms such as rigidity, akinesia (loss of control over voluntary muscle movements), bradykinesia (abnormal slowness in muscle movements), rest tremor, postural abnormalities, and speech dysfunction [4]. Physical symptoms that can occur in the limbs can also occur in the speech system. These symptoms are classified as hypokinetic dysarthria (HKD) [5]. "Dysarthria" refers to a speech disorder due to a change in muscle control. Hypokinetic means reduced movement. Thus, hypokinetic dysarthria is reduced movement of the muscles used for speech production. 1.2 Characteristics of Hypokinetic Dysarthria HKD can affect respiration, phonation, resonation and articulation in speech [4-6]. Respiration problems disturb voice loudness in PD patients [6]. The reason is control of inhalation and exhalation enables a person to maintain adequate loudness of speech through a conversation. Person with PD may speak on the "bottom" of his or her breath i.e. inhale, exhale, then speak; rather than on the "top" i.e. inhale, speak, exhale remaining air [6]. PD individual’s voice is an average of 2-4 dB softer than the normal voice [6]. Vocal folds vibration during phonation creates pitch of the voice. Vocal folds vibrate quickly during high-pitched sounds and vibrate slowly during low-pitched sounds [6]. Many individuals with PD notice changes in pitch of their voices [7]. Monotone or lack of vocal inflection or melody in voice is also a common complaint[8]. Resonating system determines richness of the voice. Soft palate, located in the back of the mouth roof closes off the nasal cavity while speaking, except when producing nasal sounds such as “...ing”, “m...” or "n...”. The soft palate does not move normally in PD. A nasal quality in voice is produced as the air is leaked into the nose due to the soft palate’s inadequate movement [8]. Articulator system comprises of the face muscles, lips, tongue, and jaw. Imprecise articulation in PD is attributed to the reduced movement or lack of coordination of face muscles [7]. Imagine 1.3 Research Challenges Previous research on HKD analysis had constraints due to laboratory controlled settings. Voice classification in real-time environment is challenging. Speech datasets could possibly be collected via phone calls by the PD patients so that speech assessment can be more timely accurate and productive. But due to the background noise or in the medium of transmission, human voice detection becomes difficult. Distance of patient’s mouth from the phone’s mouth-piece may also create problem in recognizing voice amplitude [9]. Male and female voices have different pitch properties [10]. All these issues can lead to incorrect speech segmentation. Enough data sets are required for training and classifying the data according to clinical rating scale [11]. Due to the medical ethics and patient’s privacy issue this is another challenge. Effect of speech rate on overall intelligibility in PD is a matter of debate. A comparison of results of previous studies on speech rate in PD is hampered by methodological differences [12]. The research should also contribute to prove the validity of the above hypothesis. The paper has been sectioned as follows; section 2 reveals the aim of the study. Section 3 elaborates the synthesis of previous research. An approach for the characterization of speech impairment based on the review synthesis has been proposed in section 4. The paper has been concluded in section 5. 2. AIM OF THE STUDY The purpose of this study is to review methods which can distinguish between pathological and normal human speech based on a synthesis from a literature review. The goal of the review is to find solutions to problems that occur in real time voice signal processing. [21] for speech items ranked from 0-to-4 (normal to severe pathological state) needs to be chosen. The review methodology has been illustrated in figure 1. 3. SYNTHESIS OF PREVIOUS WORK In this section, the literature on acoustic analysis of HKD in PD patient is reviewed and synthesized. Choice of publications for review is based on the validation of methodology, credibility of experimental techniques for real-time voice analysis and portability of algorithm for the mobile companion. Synthesis is presented in form of a time-line i.e. starting from the investigation of essential acoustic parameters for HKD recognition to the methods implemented for classification between HKD from the normal voice. Keywords such as “Parkinson’s disease”, “Hypokinetic Dysarthria”, “Speech Recognition in real time” were used in the search engines. Histogram (in figure 2) displays the search engines used, number of total relevant literature found and the filtered publications based on review methodology. The most useful search hits for the synthesis have been found using the IEEE Explorer search engine. Other search engines used were Google Scholar, ELIN, EBRARY, and LIBRIS. Below is the synthesis of relevant literature evaluated. Database Search 250 Num ber of Hits that your face was very cold and it was difficult to move your facial muscles, your speech becomes slurred and unclear [6]. 200 150 Relevant Hits 100 Useful Hits 50 0 Google Scholar ELIN Ebrary LIBRIS IEEE Explorer Search Engines Figure 2. Search for the Relevant Literature. 3.1 Acoustic Measures for HKD recognition Investigation of relation between the pathological acoustic parameters and Parkinson’s disease has been of primary concern for researchers. Kristin M. Rosen, et al. [5] identified acoustic signatures that were robust to phonetic variation in PD patients speech conversations. 20 healthy control (HC) and 20 PD patients had 2 minutes of conversational speech. Group differences were detected in T-test in 8 of the 9 measures. Most specific (95%) and accurate (95%) differentiators of HKD and HC speech were found to be ‘Pause Time’ and ‘Spectral Range’. It was concluded that HKD can be distinguished from HC voice on the basis of amplitude variation and spectral range. Figure 1. Review Methodology. Acoustic features that are robust to medication changes in patients need to be determined. A choice of an efficient time-series analysis tool for voice signals needs to be made for better evaluation of segmented speech. A classification tool to classify pathological speech according to clinical speech assessments using the UPDRS (Unified Parkinson’s Disease Rating Scale) An approach was adopted by Paul McRae and K. Tjaden to characterize the relationship between vocal tract acoustic output and perceptual impressions of PD patient’s speech [13]. In this method speakers read the passage with habitual slow and fast rates. Voice of vowels ‘i’, ‘u’ and ‘a’ and fricatives ‘f’ and ‘s’ were of interest. Acoustic measures included articulatory rate, segment durations, vowel formant frequencies, and first moment coefficients. Results indicated high variations in temporal acoustic measures for the PD group when compared to HC group. Kris Tjaden [14] used vowel co-articulation and vowel syllables to study acoustic variability between 9 HKD males with PD and a group of 10 HC males [14]. Graded speaking rate was used to investigate effect of rate variation on co-articulation. Ratio of F2 formant onset frequency and F2 target frequency was used to infer co-articulation. Fricative F2 was obtained for speech stimuli and compared to F2 onset measures. The result (shown in figure 3a and 3b) was, ratios tended to be smaller for PD speakers than for HC [14]. frequencies [7]. To improve differentiation of HKD from normal speech, the acoustic metric must be minimally affected by speaker-related variability and maximally affected by the articulatory impairment, as reflected by vowel formant centralization. The metric used was called the formant centralization ratio (FCR). FCR effectively and robustly differentiated the groups with dysarthria from the HC group in [7]. Also, the FCR is insensitive to gender effects. FCR correlated highly when the correlated variable was the change induced by treatment. These findings suggest that the FCR is a valid and highly sensitive metric of vowel articulation for normal as well as for abnormal speech. 3.3 Automatic HKD Recognition a. Two repetitions of word “dice” are shown. Dashed trajectory corresponds to a repetition for PD speaker. Solid trajectory corresponds to repetition for healthy speaker of similar age. b. Two repetitions of word “dice” illustrate varying coarticulation. Solid repetition was produced at relatively faster speaking rate and has shorter vowel duration; dashed repetition was produced at slower rate and has longer vowel duration. Above studies narrowed down the research to formant frequency of consonants and vowels as it contains most relevant acoustics for HKD analysis. Emily Budkowski [8] calculated PD severity ratings using fundamental frequency of Voice Onset Time (VOT is the interval between the initial articulatory release of a stop consonant and onset of voicing for the subsequent vowel). Participants’ speech ranged from mild to severe. He found out that Levodopa appeared to have greater effect on VOT within PD group and it is a measure of medication related rate change. Results indicated that all participants exhibited HKD in OFF medication state. Figure 3. Formant Analysis [14]. Jennifer Kleinow and Anne Smith [15] revealed that for all speaker groups (HC and HKD), STI (Spatiotemporal index, a measure of spatial and temporal variability) values from the loud speech were closest to those from habitual speech. The reason is normalized movement pattern for loud speech resembled that of habitual speech. Amy T. Neel [16] and Kris Tjaden [17] depicted that loud speech resulted in intelligibility improvement in HKD acoustics. Above studies narrowed down the focus of research to vowel formant time-frequency analysis to classify between pathological and normal voice. a. Control group b. Patients group Figure 4. Sound Power Analysis [10]. Izworski et al. [10] worked with continuous sound analysis of PD patients for automatic HKD recognition. Sound emitted for a long period of time allowed for sound power analysis. Power value of each frame is calculated by summing up the values of the signal energy within the respective x (a) and x (a+m) limits of the t frame given in equation 1.1 where m is the frame length. a +m 3.2 Vowel Formant Frequency The most relevant acoustic parameters for production of vowels are the frequencies of first two formants, F1 and F2 [7]. These formant frequencies change as a function of the movements of articulators. In general, the frequency of F2 increases, and that of F1 decreases, as the tongue moves forward (e.g., to form the vowel ‘i’), and the frequency of F2 decreases as the tongue moves backward (e.g., to form the vowels ‘u’ and ‘ao’). Also, the frequency of F1 decreases when the tongue is elevated (e.g., to form the vowels ‘i’ and ‘u’) and increases when the tongue is lowered, alone or with a downward movement of the jaw (e.g., to form the vowel ‘a’). Furthermore, the frequencies of both F1 and F2 decrease when the lips are rounded (e.g., to form the vowel ‘u’) and increase when the lips are retracted or become unrounded (e.g., to form the vowels ‘i’ and ‘a’). HKD results in vowel formant centralization, that is, formants that have high frequencies tend to have lower frequencies, and formants that have low frequencies tend to have higher P(t ) = ∑ x(k ) 2 (1.1) k =a Polynomial p(x) of degree 4 that fits the data, p(x (t)) to P (t) in least squares sense is calculated to represent average values of P (t) is given in equation 1.2. p( x) = p1 x n + p2 x n −1 + ... + p n x + p n+1 (1.2) Sum of differences between vector P (t) and p(x) can be used as a voice stability parameter which is given as. a+m Stab = ∑ P(k ) − p(k ) (1.3) k =1 Values obtained were compared with the values obtained in the control group as shown in figure 4a and 4b. Values of Stab parameter were much greater for patients (i.e. Stab>0.6) than for persons from the control group. With many patients, distinct and varying breaks in phonation were observed as shown in figure 4b. With healthy persons, gradual quietening occurred, whereas PD patients ended the emission abruptly. This audio signal is transformed into Fourier series and it is found out that changes are observable especially in the vowels articulation. Results constitute the beginning of tests concentrated on automatic voice classification [10]. Artificial Neural Networks (ANN) can be used as a statistical tool to formulate the boundaries between speech abnormalities using the voice features. Mehmet F. et al. [18] used ANN for feature selection of pathological voice datasets. A fuzzy logic based ANN (called ANFC: Adaptive Neuro-Fuzzy Classifier) gave best recognition results. Datasets for this study consisted of 195 sustained vowel phonation from 31 people, of which 23 were diagnosed with PD. Pathological voice features such as fundamental frequency of voice, shimmer and jitter were used for classification. ANFC produced 94.35% accuracy for classification of PD group dataset. Using Discrete Wavelets Transforms (DWT) a clear difference was noticed between wavelet evolutions of HKD and normal voice [19]. A visual pattern is shown in figure 6a and 6b. ANN classification with five DWT coefficients produced 90% classification rate for pathological voice and 100% classification rate for normal voice [19]. Experiment proved that the formant analysis methods using Fourier Transform is not a suitable tool for speech analysis [11]. Since speech is a highly non-stationary signal, wavelet transform proved to be a better tool for analysis of non stationary signals [19] as it is useful in localizing a symptom both in time and frequency scales. 3.4 Automatic HKD recognition using hybrid approach: Wavelet Analysis and ANN L. Salhi et al [19] used Linear Predictive coding method on PD patients voice on formants F1, F2 and F3. Comparing with the normal voice, pathological voice showed high variations. a. Normal Voice b. Pathological Voice Figure 6. Wavelet Analysis [11]. Above synthesis reveals that the assessment of PD is possible through speech datasets via reduced voice loudness, mono pitch, short rushes, prolonged syllables, long pauses, and reduced phonation time. The synthesis depicts that HKD effected PD patients can be effectively classified on the basis of UPDRS ratings using classification tool like ANN. 4. PROPOSED ALGORITHM The biggest drawback found in the previous work was the controlled setting of experiments in laboratory environment [9, 20]. Since this study is aimed for real time processing of speech signals in a mobile companion, the biggest challenge is voice segmentation and noise removal in the speech signals achieved during phone calls in home environment. An algorithm design is proposed to tackle this challenge. First pre-processing of voice signals needs to be done succeeded with voice segmentation. The segmented voice may be decomposed using wavelet and later trained in the neural network for classification. Below are the details of each step. Figure 5. A Hybrid approach for HKD recognition [11]. Although these methods distinguished HKD with the normal voice but the problem was, these methods do not produce quantification values to make the decision. They introduced a hybrid approach using wavelet analysis and ANN. Normalized energies and entropies of wavelet coefficients were used to formulate feature vector of speech sample. The feature vector was used to identify impaired speech. A 3-layered feed forward ANN with Back Propagation (BP) algorithm was used for classification. The flowchart of this algorithm is shown in fig. 5. 4.1 Voice Pre-processing Most intelligibility relevant speech segments (consonants and vowels) contains minority of speech energy [9]. Intelligibility can be improved by amplifying if total energy remains constant. Amplitude compression is a dynamic function which provides larger amplification when input has low power over certain time. The effect is that, power of sound varies less in time and hence it is less noisy [9]. Speech enhancements can be done using High Pass filter and clipping [9]. Also the speech signal does not have unique maximum amplitude. Some waveforms are asymmetric than others, that is, they reach higher peaks in positive amplitude than negative or vice versa. More asymmetric waveforms have larger peak power. Human speech has “Peak to Average Peak Power” (PAPR) [9]. In order to remove noise, peaks over or less than PAPR threshold range can be clipped off in the speech signals. The speech signal can be further “windowed” using a Hamming Window [9]. Once the signal is pre-processed, voice can be segmented based on vocal sounds. Syllable segmentation of speech is important for correct voice classification because it reduces search space effectively. An automatic processor should succeed at identifying silence, vocalic, fricative and nasal sounds and stop gaps to provide gains in speech intelligibility [19]. In case of HKD, segmentation is difficult because syllable units are spread roughly via intensity changes but exact boundary positions are elusive in successive vowels. 4.2 Decomposition with Wavelet Transforms Once the voice signal is segmented, speech can be decomposed in transient and tonal components using Discrete Wavelet transforms (DWT) [11]. In disordered speech, the non-stationary behavior of the pitch can be analyzed using wavelets. The criteria for selecting a proper mother wavelet is to have a wavelet function with enough number of vanishing moments in order to represent the salient features of the disturbance. At the same time, this wavelet should provide sharp cut-off frequencies. Selected wavelet should be ortho normal. Daubechies 40 (Db40) shows sharper cut off frequency compared with the others. Using Db40, leakage energy between different resolution levels can be reduced [11]. Wavelet transforms outputs energy coefficients that represent the signal in time and frequency. Since wavelet energy coefficients are sensitive to transients, areas of changes can be amplified and detected using Laplace derivative. This can make the speech transients very rich [9]. Amplitude Compression Acquire Speech High-Pass Filter Figure 7. Five possible pause points in an utterance [20]. The first step in voice segmentation is “Pause Detection”. A pause exists at the end of several continuous speech words [20]. Each character is a syllable and each pause indicates syllable boundary, therefore pause detection plays a very important role in syllable segmentation. Short term energy in voice signals can be used to detect pauses. A threshold of short term energy is set first to detect pause. If the energy is less than certain threshold, it is marked as a pause and a syllable boundary [20]. After pause detection, speech signal may be divided into pieces (fig. 7). Voice Preprocessing PAPR Clipping Windowing Voice Segmentation Pause Detection Wavelet Transform ZCR threshold of Formants F1 and F2 ANN UPDRS 5.4(Unintelligible) Table 1. Voice Segmentation based on ZCR analysis [20]. With ZCR Rules Without ZCR Rules Syllables 856 856 Correct 819 803 Deletion 37 53 Insertion 64 79 Correct Rate 95.7% 93.8% Error Rate 11.8% 15.4% For continuous speech segmentation, zero crossing rates (ZCR) provide spectral information at low computational cost. ZCR is the rate when a waveform crosses time axis or changes its algebraic sign. Consonants have high ZCR whereas vowels have low ZCR [20]. Therefore an onset of high ZCR means beginning of consonant and it should be a starting boundary. An offset of high ZCR means an endpoint of consonant and it should be an ending boundary [20]. Table 1 depicts the comparison between voice segmentation with ZCR rules and without ZCR rules. Segmentation accuracy is drastically improved when ZCR rules are applied to the voice segmentation algorithm as shown in table 1. Decision UPDRS 5.0 (Normal) UPDRS 5.3 (Severe) UPDRS 5.1 (Mild) UPDRS 5.2 (Moderate) Figure 8. Flowchart of Proposed Algorithm. 4.3 Classification Using ANN Resulting DWT coefficients can be used as feature vector for training artificial neural networks using Back Propagation algorithm. In order to obtain optimal results, nature of the input coefficients vector is to be varied for training in ANN. As suggested by L. Salhi et al.[19], three different input vectors can be used for ANN. They are (1) DWT coefficients (2) DWT energy coefficients and (3) DWT entropy coefficients. The resulted pathological voice can be further classified according to UPDRS ratings [21] Based on the classification results, the level adjustments of medication can be made and other likely treatments can be opted. The flowchart of the proposed algorithm is shown in figure 8. 5. CONCLUSION This review evaluated HKD recognition algorithms to develop a tool for PD severity assessment using speech datasets of PD patients. Previous research had constraints due to the controlled settings of the laboratory environment. Possible solutions for voice analysis in real-time have been investigated. Voice acoustic parameters that were robust to medication changes are evaluated. Since voice signals are highly non-stationary, focus was laid on quantization of speech signals. During the study it was found that time-frequency methods fail to quantify the voice signals over the time-series based on the voice frequency. Wavelet transformation is found promising for voice analysis because it quantizes the non-stationary voice signals over the time-series using scale and translation parameters. In this way voice intelligibility in the waveforms can be analyzed in each time frame. During the review, artificial neural network was found to be a promising tool for voice classification. This review is a basis for future work to conceive a HKD recognition tool in a PDA for self-treatment of PD patients. Also clinicians will be aided from feedback generated by this tool in follow up of patients who suffer from HKD. 6. ACKNOWLEGDEMENT This research is a module of the project “E-Motions”. This project has been running in Dalarna University in collaboration with Abbott Laboratories and Nordforce Technology and is funded by a grant from Swedish Knowledge Foundation. 7. REFERENCES [1] Stewart, A. F. and Weiner, W.J. 2008.Parkinson's disease: diagnosis and clinical management. Demos Medical Publishing, New York. [2] Hartelius, L. and Svensson, P. 1994. Speech and swallowing symptoms associated with Parkinson's disease and multiple sclerosis. Folia Phoniatr Logop. 46, 1(1994), 9-17. [3] Zicker, J., Tompkins, W., Rubow, R., and Abbs, J. A Portable Microprocessor-Based Biofeedback Training Device. IEEE Trans Biomed Eng. 27 (Sep. 1980), 509-515. [4] Joan, K.Y., Tara,M., Whitehill, L., and So, S.Y.S. Intonation Contrast in Cantonese Speakers with Hypokinetic Dysarthria Associated with Parkinson’s disease. J Speech Lang Hear Res. 53(2010), 836-849. [5] Rosen, K. M., Kent, R. D., Delaney, A. L., and Duffy, J. R. Parametric Quantitative Acoustic Analysis of Conversation Produced by Speakers with Dysarthria and Healthy Speakers. J Speech Lang Hear Res. 49 (2006), 395-411. [6] Stefanie, C. and Schwantz, J.1998. Parkinson's disease: Speaking Out. The National Parkinson Foundation, Inc, Denver, Colorado. [7] Sapir, S., Ramig, L.O., and Spielman, J.L. Formant Centralization Ratio: A Proposal for a New Acoustic Measure of Dysarthric Speech. J Speech Lang Hear Res. 53 (2010), 114-125. [8] Budkowski, E. 2007. Voice Onset Time in Parkinson Disease. Master’s thesis. Bowling Green State University. [9] Dominguez, A.C.2009. Pre-Processing of Speech Signals for Noisy and Band-Limited Channels. Master’s thesis. KTH, Stockholm. [10] Izworski, A., Augustyniak, P., and Orzechowski, T. 2006. Processing and analysis of voice anomalies in course of Parkinson's diseases. In Proceedings of Eighth IASTED (Honolulu, Hawaii, USA, 14-16 August, 2006), 354-357. [11] Salhi, L., Mourad, T., and Cherif, A. Voice Disorders Identification Using Multilayer Neural Network. IAJIT, 7, 2 (2010), 177-185. [12] De Letter, M., Santens, P., de Bodt, M., Boon, P. and Borsel, J. V. Levodopa induced alterations in speech rate in advanced Parkinson’s disease. Acta Neurol Belg, 106 (2006) 19-22. [13] Salhi, L., Talbi, M., and Cherif, A. Voice Disorders Identification Using Hybrid Approach: Wavelet Analysis and Multilayer Neural Networks. WASET. 45 (2008) 330-339. [14] McRae, P., and Tjaden, K. Acoustic and Perceptual Consequences of Articulatory Rate Change in Parkinson Disease. J Speech Lang Hear Res. 45(2002), 35-50. [15] Tjaden, K. An Acoustic Study of Co articulation in Dysarthric Speakers with Parkinson Disease. J Speech Lang Hear Res. 43 (2000), 1466-1480. [16] Kleinow, J., Smith, A., and Ramig, L.O. Speech Motor Stability in IPD: Effects of Rate and Loudness Manipulations. J Speech Lang Hear Res. 44 (2001) 10411051. [17] Neel, A.T. Effects of Loud and Amplified Speech on Sentence and Word Intelligibility in Parkinson Disease. J Speech Lang Hear Res. 52 (2009), 1021-1033. [18] Tjaden, K., and Wilding, G.E. Rate and Loudness Manipulations in Dysarthria: Acoustic and Perceptual Findings. J Speech Lang Hear Res. 47 (2004), 766-783. [19] Caglar, M.F., Cetisli, B., and Toprak, I.B. Automatic Recognition of Parkinson’s disease from Sustained Phonation Tests Using ANN and Adaptive Neuro-Fuzzy Classifier. J Speech Lang Hear Res. 1, 2 (2010), 59-64. [20] Zhao, X., and O’Shaughnessy, D. 2008. A new hybrid approach for Automatic Speech Signal Segmentation using silence signal detection, energy convex hull and spectral variation. In IEEE CCECE, (Canada, 2008) 145-148. [21] Fahn S et al. 1987. UPDRS. Unified Parkinson Disease Rating Scale. In Fahn S, Marsden CD, Calne DB, Goldstein M, eds. Recent developments in Parkinson’s disease. (Florham Park, New Jersey: Macmillan Healthcare Information, 1987), 153-163.
© Copyright 2025 Paperzz