Realtime Generation of Audible Textures Inspired by a Video Stream

Simone Mellace; Jérôme Guzzi; Alessandro Giusti; Luca M. Gambardella

2019 AAAI AAAI 2019

Realtime Generation of Audible Textures Inspired by a Video Stream

Abstract

Abstract We showcase a model to generate a soundscape from a camera stream in real time. The approach relies on a training video with an associated meaningful audio track; a granular synthesizer generates a novel sound by randomly sampling and mixing audio data from such video, favoring timestamps whose frame is similar to the current camera frame; the semantic similarity between frames is computed by a pretrained neural network. The demo is interactive: a user points a mobile phone to different objects and hears how the generated sound changes.

🚀 Conference Pioneer — AAAI 2019

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning and Speech & Audio

🧭 Keyword Pioneer — pretrained neural network

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Simone Mellace , Jérôme Guzzi , Alessandro Giusti , Luca M. Gambardella

Topics

Machine Learning > Core Methods > Representation Learning Computer Vision > Generation > Video Generation Speech & Audio > Synthesis > Speech Enhancement Deep Learning > Learning Types > Multi-Modal Learning Speech & Audio > Synthesis > Speech Synthesis

Keywords

multimodal learning video analysis semantic similarity audio synthesis pretrained neural network sound generation neural network real-time audio soundscape generation granular synthesis video stream processing

Download PDF

Related papers

Cooperative Multimodal Approach to Depression Detection in Twitter 2019

Learning to Align Question and Answer Utterances in Customer Service Conversation with Recurrent Pointer Networks 2019

Community Detection in Social Networks Considering Topic Correlations 2019

Session-Based Recommendation with Graph Neural Networks 2019

Blameworthiness in Multi-Agent Settings 2019