📗 Cite This Artifact
Speech and Voice Acoustic Analysis for Corticobasal Syndrome Diagnosis
Overview
Speech and voice acoustic analysis represents an emerging objective diagnostic tool for corticobasal syndrome (CBS), leveraging quantitative measures of speech production to distinguish CBS from other atypical parkinsonian disorders. While clinical speech evaluation has long been part of the neurological assessment, machine learning-based acoustic analysis can achieve high diagnostic accuracy for differentiating corticobasal degeneration (CBD) from progressive supranuclear palsy (PSP) and Parkinson's disease (PD)[@godinho2025].
Why Acoustic Analysis Matters for CBS
Clinical Gap
CBS presents with a distinctive speech profile that differs from PSP and PD:
- Apraxia of speech (AOS) — a core feature of CBS, rare in PSP
- Nonfluent aphasia — cortical language involvement
- Axial dysarthria — later-stage involvement
However, traditional clinical assessment relies on subjective listener judgment. Acoustic analysis provides objective, quantifiable biomarkers that can:
Evidence Base
Recent studies demonstrate that machine learning algorithms applied to speech samples can distinguish CBD from PSP/PD with up to 92% accuracy[@godinho2025]. This approaches the accuracy of more invasive or expensive diagnostic methods.
Acoustic Features
Fundamental Frequency (F0)
Description: The base frequency of the vocal fold vibration, perceived as pitch.
Overview
Speech and voice acoustic analysis represents an emerging objective diagnostic tool for corticobasal syndrome (CBS), leveraging quantitative measures of speech production to distinguish CBS from other atypical parkinsonian disorders. While clinical speech evaluation has long been part of the neurological assessment, machine learning-based acoustic analysis can achieve high diagnostic accuracy for differentiating corticobasal degeneration (CBD) from progressive supranuclear palsy (PSP) and Parkinson's disease (PD)[@godinho2025].
Why Acoustic Analysis Matters for CBS
Clinical Gap
CBS presents with a distinctive speech profile that differs from PSP and PD:
- Apraxia of speech (AOS) — a core feature of CBS, rare in PSP
- Nonfluent aphasia — cortical language involvement
- Axial dysarthria — later-stage involvement
However, traditional clinical assessment relies on subjective listener judgment. Acoustic analysis provides objective, quantifiable biomarkers that can:
Evidence Base
Recent studies demonstrate that machine learning algorithms applied to speech samples can distinguish CBD from PSP/PD with up to 92% accuracy[@godinho2025]. This approaches the accuracy of more invasive or expensive diagnostic methods.
Acoustic Features
Fundamental Frequency (F0)
Description: The base frequency of the vocal fold vibration, perceived as pitch.
CBS-specific findings:
- Increased variability in sustained vowel production compared to healthy controls
- Reduced F0 stability during continuous speech
- Lower mean F0 in advanced CBS compared to early-stage patients
Formant Frequencies (F1, F2, F3)
Description: Resonant frequencies of the vocal tract that shape vowel quality.
CBS-specific findings:
- Abnormal formant trajectories in apraxia of speech
- Imprecise vowel articulation (reduced formant differentiation)
- Prolonged vowel duration during consonant-vowel transitions[@harmon2024]
Jitter
Description: Cycle-to-cycle variation in fundamental frequency, reflecting vocal fold instability.
CBS-specific findings:
- Elevated jitter values in CBS compared to healthy controls
- Higher jitter correlates with disease severity[@johansson2023]
- Differentiates CBS from PD: CBS shows higher jitter in early stages
- Jitter (local) = (average absolute difference between consecutive periods / average period) × 100
Shimmer
Description: Cycle-to-cycle variation in amplitude, reflecting vocal fold closure irregularities.
CBS-specific findings:
- Increased shimmer in CBS with dysarthria
- Shimmer increases with disease progression
- More pronounced on the side of greater motor impairment
- Shimmer (local) = (average absolute difference between consecutive amplitudes / average amplitude) × 100
Harmonics-to-Noise Ratio (HNR)
Description: Ratio of harmonic energy to noise in the voice signal.
CBS-specific findings:
- Reduced HNR in CBS with vocal dysarthria
- Lower HNR correlates with breathy voice quality
- Can detect subclinical voice changes before clinical symptoms
Speech Rate and Pause Analysis
Description: Quantification of articulation rate, pause frequency, and pause duration.
CBS-specific findings:
- Reduced speech rate due to articulatory slowing
- Increased pause frequency between syllables
- Prolonged pause duration between sentences
- Irregular pause patterns in AOS
Comparison with PSP and PD
| Acoustic Feature | CBS | PSP | PD |
|-----------------|-----|-----|-----|
| Jitter | Markedly elevated | Moderately elevated | Mildly elevated |
| Shimmer | Elevated | Moderate | Mild |
| F0 variability | High | Moderate | Low |
| Formant precision | Impaired (AOS) | Preserved | Preserved |
| Speech rate | Slow, irregular | Slow, regular | Normal to slow |
| HNR | Reduced | Reduced | Preserved early |
Key Differentiators
- Prolonged phoneme durations
- Imprecise consonant production
- Decreased formant transition velocity
- Disrupted prosody
- Predominantly hypokinetic dysarthria
- Reduced pitch range
- Monopitch, monoloudness
- Hoarse voice quality
- Hypokinetic dysarthria
- Reduced loudness
- Variable speech rate
- Breathiness in advanced stages
Machine Learning Approaches
Feature Extraction Pipeline
Commonly Used Features
Time-domain:
- Jitter, shimmer, HNR
- Pitch variation coefficient
- Zero-crossing rate
- Formant frequencies (F1-F4)
- Spectral centroid
- Spectral entropy
- Mel-frequency cepstral coefficients (MFCCs)
- Speech rate
- Pause ratio
- Duration of vowels/consonants
- Fundamental frequency contours
Classification Algorithms
| Algorithm | Performance | Notes |
|-----------|-------------|-------|
| Random Forest | ~88-92% accuracy | Good for feature importance analysis |
| Support Vector Machine (SVM) | ~85-90% | Effective with limited data |
| Neural Networks | ~90-95% | Requires larger datasets |
| Gradient Boosting | ~87-92% | Robust to overfitting |
Validation Studies
- Godinho et al. (2025): ML model using 30 speech features achieved 92% accuracy distinguishing CBD from PSP/PD[@godinho2025]
- Rusz et al. (2023): Acoustic analysis correctly classified 86% of atypical parkinsonian cases[@rusz2023]
- Tsanas et al. (2024): Validated speech metrics as progression markers in PSP[@tsanas2024]
Smartphone-Based Platforms
Advantages
Platforms and Apps
| Platform | Features | Validation |
|----------|----------|------------|
| Voicewise | Cloud-based analysis, HIPAA compliant | [@robinzon2023] |
| mPower (Apple) | Research platform, large dataset | Parkinson disease focused |
| Kardia | Passive monitoring, voice tasks | Cardiac, adaptable |
| Praat (desktop) | Gold-standard acoustic analysis | Research use |
| VoiceSauce | Multi-parameter extraction | Research use |
Implementation Considerations
Clinical Integration
Assessment Protocol
- Record sustained vowel /a/ (5 seconds)
- Record diadochokinetic task (rapid "pa-ta-ka")
- Record reading passage (e.g., "The Rainbow Passage")
- Record spontaneous speech (1-2 minutes)
- Jitter, shimmer, HNR values
- Formant frequency measurements
- Speech rate metrics
- Compare to normative databases
- Compare to previous recordings
- Correlate with clinical findings
Integration with Existing Diagnostics
- Complementary to neuropsychological testing
- Adjunctive to neuroimaging (MRI, PET)
- Monitoring between clinical visits
- Research endpoint for clinical trials
Limitations and Future Directions
Current Limitations
Future Directions
References
Related Pages
- [Speech and Language Deficits in Corticobasal Syndrome](/mechanisms/speech-language-deficits-cbs)
- [Speech and Voice Disorders in Progressive Supranibular Palsy](/mechanisms/psp-speech-voice-disorders)
- [Corticobasal Syndrome](/diseases/corticobasal-syndrome)
- [Neuropsychological Testing for CBS/PSP](/diagnostics/neuropsychological-testing-cbs-psp)
- [Smartwatch Digital Biomarkers](/mechanisms/smartwatch-digital-biomarkers)
- [LSVT Voice Therapy for CBS/PSP](/therapeutics/section-249-advanced-lsvt-voice-speech-therapy-cbs-psp)
▸Metadataorigin_type: v1_polymorphic_backfill
| slug | diagnostics-cbs-speech-voice-acoustic-analysis |
| kg_node_id | None |
| entity_type | diagnostic |
| origin_type | v1_polymorphic_backfill |
| source_table | wiki_pages |
| wiki_page_id | wp-4704ba240104 |
| __merged_from | {'merged_at': '2026-05-13', 'unprefixed_id': 'diagnostics-cbs-speech-voice-acoustic-analysis'} |
| _schema_version | 1 |
No provenance edges found
Use ?embed=1 to load the artifact without SciDEX chrome — suitable for iframing into wiki pages or external sites.
<iframe src="http://scidex.ai/artifact/wiki-diagnostics-cbs-speech-voice-acoustic-analysis?embed=1" width="100%" height="600" style="border:0;border-radius:8px"></iframe>
[Speech and Voice Acoustic Analysis for Corticobasal Syndrome Diagnosis](http://scidex.ai/artifact/wiki-diagnostics-cbs-speech-voice-acoustic-analysis)
http://scidex.ai/artifact/wiki-diagnostics-cbs-speech-voice-acoustic-analysis