Automatic Glottis Detection and Segmentation in Stroboscopic Videos Using Convolutional Networks

Divya Degala; Achuth Rao M.V.; Rahul Krishnamurthy; Pebbili Gopikishore; Veeramani Priyadharshini; Prakash T.K.; Prasanta Kumar Ghosh

2020 INTERSPEECH INTERSPEECH 2020

Automatic Glottis Detection and Segmentation in Stroboscopic Videos Using Convolutional Networks

Abstract

Laryngeal videostroboscopy is widely used for the analysis of glottal vibration patterns. This analysis plays a crucial role in the diagnosis of voice disorders. It is essential to study these patterns using automatic glottis segmentation methods to avoid subjectiveness in diagnosis. Glottis detection is an essential step before glottis segmentation. This paper considers the problem of automatic glottis segmentation using U-Net based deep convolutional networks. For accurate glottis detection, we train a fully convolutional network with a large amount of glottal and non-glottal images. In glottis segmentation, we consider U-Net with three different weight initialization schemes: 1) Random weight Initialization (RI), 2) Detection Network weight Initialization (DNI) and 3) Detection Network encoder frozen weight Initialization (DNIFr), using two different architectures: 1) U-Net without skip connection (UWSC) 2) U-Net with skip connection (USC). Experiments with 22 subjects’ data reveal that the performance of glottis segmentation network can be increased by initializing its weights using those of the glottis detection network. Among all schemes, when DNI is used, the USC yields an average localization accuracy of 81.3% and a Dice score of 0.73, which are better than those from the baseline approach by 15.87% and 0.07 (absolute), respectively.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Healthcare & Medicine

🧭 Keyword Pioneer — glottis detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Divya Degala , Achuth Rao M.V. , Rahul Krishnamurthy , Pebbili Gopikishore , Veeramani Priyadharshini , Prakash T.K. , Prasanta Kumar Ghosh

Topics

Deep Learning > Architectures > Neural Networks Computer Vision > Processing > Image Segmentation Computer Vision > Domain-Specific > Medical Imaging Healthcare & Medicine > Clinical > Medical Imaging

Keywords

image segmentation object detection medical imaging convolutional network glottis segmentation glottis detection

Download PDF

Related papers

Memory Controlled Sequential Self Attention for Sound Recognition 2020

Dual Attention in Time and Frequency Domain for Voice Activity Detection 2020

Automatic Prediction of Speech Intelligibility Based on X-Vectors in the Context of Head and Neck Cancer 2020

A Noise Robust Technique for Detecting Vowels in Speech Signals 2020

Joint Detection of Sentence Stress and Phrase Boundary for Prosody 2020