Ai Voice Test Could Help Spot Dangerous Throat Lesions Before Symptoms Appear

Trending 4 weeks ago

By measuring subtle changes successful sound quality, AI could thief doctors observe vulnerable vocal fold lesions earlier symptoms worsen.

 3dMediSphere / ShutterstockStudy: Voice arsenic a biomarker: exploratory study for benign and malignant vocal fold lesions. Image Credit: 3dMediSphere / Shutterstock

An exploratory study reveals that subtle changes successful sound patterns, particularly variability successful harmonic-to-noise ratio, could service arsenic early informing signs of vocal fold lesions, paving nan measurement for early AI-powered screening tools.

A caller study led by Oregon Health and Science University and Portland State University researchers identified chopped vocal features that whitethorn service arsenic imaginable biomarkers for early discovery of benign and malignant vocal fold lesions. The study is published successful nan journal Frontiers successful Digital Health.

Background

Alterations successful sound pitch, loudness, and value qualify vocal disorders. Various factors tin perchance trigger these disorders, including vocal fold pathology, neurologic conditions, aliases functional sound usage patterns.

Individuals pinch sound disorders often acquisition mediocre value of life, debased self-esteem, work-related difficulties, and societal isolation. These experiences are peculiarly much pronounced among individuals whose master roles importantly dangle connected sound communication.

Both benign and malignant vocal fold lesions (laryngeal cancer) are associated pinch sound disorders. While benign lesions substantially impact sound value and origin morbidity, malignant lesions are often life-threatening if near untreated.

Dysphonia (a information characterized by abnormal voice) is 1 of nan first symptoms of vocal fold lesions, which requires a diagnostic process including visualization of nan larynx and appraisal of nan lesion's morphology done video endoscopy. The larynx is an anatomical building successful nan cervix wherever vocal folds are located.

Recent advancements successful artificial intelligence (AI) technologies person facilitated quality sound study for early discovery of a assortment of wellness conditions, including laryngeal pathology, neurological and psychological disorders, caput and cervix cancers, and diabetes.

The usage of sound arsenic a integer biomarker provides a promising level for non-invasive discovery and screening of these perchance life-threatening conditions. The Voice to AI project, arsenic portion of nan National Institutes of Health (NIH) Bridge to Artificial Intelligence (Bridge2AI) consortium, intends to analyse sound arsenic a biomarker of wellness for usage successful objective care.

In nan existent study, researchers analyzed nan Bridge2AI-Voice dataset to place circumstantial acoustic features that efficaciously separate laryngeal crab and benign vocal fold lesions from different vocal pathologies and patient sound function. Acoustic features mention to measurable sound properties, including pitch, loudness, and quality.

The study

The dataset analyzed successful nan study includes 12,523 recordings of 306 participants collected crossed 5 sites successful North America. Acoustic analyses focused connected Rainbow Passage recordings (180 recordings from 176 participants) pinch features pre-extracted utilizing openSMILE software. The main purpose of nan study was nan recognition of acoustic features that tin separate nan voices of participants pinch vocal fold lesions from those without immoderate vocal disorders, arsenic good arsenic separate nan voices of participants pinch lesions from those pinch different vocal disorders.

The participants were categorized into 2 groups based connected lesion type and vocal upset diagnosis. The first group included participants pinch laryngeal cancer, benign lesions, aliases nary sound disorder, and nan 2nd group included participants pinch laryngeal crab aliases benign lesions without different sound disorders, arsenic good arsenic those pinch different vocal disorders (spasmodic dysphonia aliases vocal fold paralysis). Transgender participants were excluded from sex-stratified analyses because anterior voice-altering attraction could not beryllium verified.

Four acoustic features positive nan variability (standard deviation) of HNR, basal frequency, jitter, shimmer, and harmonic-to-noise ratio (HNR) were extracted from nan sound recordings of participants for comparative analysis. Fundamental wave refers to nan wave astatine which nan vocal cords vibrate; jitter is nan measurement of basal wave fluctuations; shimmer is nan measurement of fluctuations successful nan amplitude of sound waves; and HNR is nan ratio of nan periodic to aperiodic constituent successful a reside signal.

Key findings

The study of acoustic features revealed that participants pinch benign lesions person importantly different mean HNR and basal wave compared to those without immoderate sound disorder, and importantly different HNR variability (SD) compared to laryngeal cancer. HNR variability (SD) was not importantly different betwixt benign lesions and nary sound disorder. Mean HNR and basal wave did not disagree importantly betwixt benign lesions and laryngeal cancer.

The gender-related comparison revealed successful cisgender men akin differences successful mean HNR and HNR variability vs nary sound upset and HNR variability vs laryngeal cancer, but not successful female participants, which mightiness beryllium owed to nan smaller sample size.

No important differences were recovered for jitter aliases shimmer successful immoderate comparison, and nary acoustic characteristic importantly distinguished lesion groups from different vocal disorders successful nan 2nd study group.

Study significance

The study identifies harmonic-to-noise ratio variability (standard deviation) arsenic a promising voice-related biomarker for early discovery and monitoring of vocal fold lesions. The periodic constituent of this ratio arises from regular glottal pulses during phonation, and nan aperiodic constituent is nan sound produced from turbulence arsenic aerial flows done nan glottis (the halfway of nan larynx).

Both nan mean and nan modular deviation of nan harmonic-to-noise ratio were measured successful nan study, arsenic nan researchers believed that this variability would thief measurement consistency successful vocal production. The observed differences successful modular deviation betwixt benign and malignant lesion groups propose that this characteristic whitethorn service arsenic a useful marker for monitoring lesion progression and detecting laryngeal crab astatine an early stage.

However, nan study could not observe important differences successful nan harmonic-to-noise ratio and its variability betwixt participants pinch benign aliases malignant lesions and those pinch different vocal disorders. This indicates that distinguishing lesions from different vocal pathologies whitethorn beryllium much challenging.

Notably, nan study could not observe important differences successful nan harmonic-to-noise ratio and its variability among female participants. This highlights nan request for analyzing further acoustic features successful bid to see sound arsenic a promising early parameter of vocal fold lesions.

The authors emphasise that these are exploratory findings and do not represent a validated screening test. They telephone for larger, much divers cohorts and further acoustic features to beryllium assessed, peculiarly successful women, earlier integration into objective tools.

Overall, nan study findings item nan early imaginable of validated AI-based sound screening devices to place individuals pinch subtle sound changes who whitethorn not different activity care, particularly successful superior attraction aliases telehealth settings. Such devices could punctual earlier referrals to sound specialists, thief prioritize urgent cases, and trim diagnostic delays.

Journal reference:

  • Jenkins P. 2025. Voice arsenic a biomarker: exploratory study for benign and malignant vocal fold lesions. Frontiers successful Digital Health.DOI: 10.3389/fdgth.2025.1609811, https://www.frontiersin.org/journals/digital-health/articles/10.3389/fdgth.2025.1609811/full
More