Towards Voice Reconstruction from EEG during Imagined Speech

Young-Eun Lee; Seo-Hyun Lee; Sang-Ho Kim; Seong-Whan Lee

AAAI 2023

Abstract

Translating imagined speech from human brain activity into voice is a challenging and compelling research problem that could provide a new means of human communication via brain signals. Efforts to reconstruct speech from brain activity have shown promise using invasive recordings of spoken speech, but have struggled to reconstruct imagined speech. In this paper, we propose NeuroTalk, which converts non-invasive brain signals of imagined speech into the user's own voice. Our model was trained on spoken-speech EEG and generalized to adapt to the domain of imagined speech, allowing a natural correspondence between the imagined speech and the voice used as ground truth. In our framework, an automatic speech recognition decoder contributed to decomposing the phonemes of the generated speech, demonstrating the potential of voice reconstruction for unseen words. Our results suggest the potential of speech synthesis from human EEG signals, not only from spoken speech but also from the brain signals of imagined speech.

Topics

Machine Learning > Core Methods > Representation Learning
Speech & Audio > Synthesis > Text-to-Speech
Healthcare & Medicine > Research > Biosignal Processing
Machine Learning > Learning Types > Transfer Learning
Deep Learning > Learning Types > Representation Learning
Speech & Audio > Synthesis > Speech Synthesis
Artificial Intelligence > Core AI > Brain-Computer Interface

Keywords

brain-computer interface, speech synthesis, automatic speech recognition, phoneme recognition, EEG signal, imagined speech, brain signal, voice reconstruction
