PPGSpeech: A Wearable Silent Speech Interface Leveraging Neck-worn Photoplethysmography

  • Lingde Hu
  • , Wenbo Zhang
  • , Wenkang Zhang
  • , Yu He
  • , Seokmin Choi
  • , Yang Gao
  • , Jagmohan Chauhan
  • , Zhanpeng Jin*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Silent speech interfaces (SSIs) promise private and noise-immune communication, but current solutions often sacrifice user comfort, mobility, or privacy. This paper introduces PPGSpeech, a novel SSI that overcomes these limitations by pioneering the use of photoplethysmography (PPG) acquired from a comfortable, necklace-style wearable device. Our core discovery is that subtle neck muscle movements during silent articulation induce distinct, measurable modulations in the underlying PPG signal. To harness this phenomenon, we developed a complete end-to-end system featuring (1) a custom neck-worn sensor for multi-wavelength PPG acquisition, (2) a deep learning pipeline that converts 1D PPG signals into 2D time-frequency images via Continuous Wavelet Transform (CWT) and classifies them using a lightweight CNN, and (3) a Pix2Pix GAN model to reconstruct audible speech from the captured signals. In a 16-participant study covering a vocabulary of 15 commands and four confounding actions, our user-dependent model achieved a recognition accuracy of 81.41% ± 9.74. Furthermore, our speech reconstruction achieved a Mean Opinion Score (MOS) of 3.48 and a Word Correct Rate (WCR) of 60.67%, demonstrating that the PPG signal is sufficiently rich to recover intelligible speech. By establishing the viability of neck-based PPG for silent speech, PPGSpeech offers a discreet, privacy-preserving, and continuously wearable paradigm for next-generation human-computer interaction.

Original languageEnglish
JournalIEEE Internet of Things Journal
DOIs
StateAccepted/In press - 2025
Externally publishedYes

Keywords

  • Neck-worn Sensor
  • PPG
  • Silent Speech Recognition
  • Wearable

Fingerprint

Dive into the research topics of 'PPGSpeech: A Wearable Silent Speech Interface Leveraging Neck-worn Photoplethysmography'. Together they form a unique fingerprint.

Cite this