Shinji Watanabe
182 papers · 2013–2026 · 11 conferences · across top CS/AI conferences
Achievements
Academic Marathon (12)
Cross-Pollinator (9)
Interdisciplinary Bridge
Taxonomy Completionist (39)
Conference Polyglot (11)
Keyword Pioneer
Hot Topic Early Bird
Renaissance Researcher (5)
Keyword Trendsetter Combo (8)
Conference Loyalist (22)
Domain Dominant (51)
Dynamic Duo (35)
Triple Crown
Topic Pioneer
Deep Specialist (27)
Topic Evolution
Keyword Champion (6)
Grand Slam
Mega-Team (76)
Century Club (180)
Conference Pioneer
Unstoppable (10)
The Questioner (3)
Prolific Year (31)
Keyword Collector (199)
Trend Setter
Conferences
INTERSPEECH (120)
ACL (22)
NAACL (12)
EMNLP (6)
EACL (4)
ICLR (4)
ICML (4)
AAAI (3)
IJCNLP (3)
IJCAI (2)
NIPS (2)
Keywords
automatic speech recognition (51)
speech recognition (30)
self-supervised learning (22)
end-to-end speech recognition (21)
speech translation (21)
speech enhancement (16)
end-to-end model (16)
spoken language understanding (15)
connectionist temporal classification (12)
beam search (10)
attention mechanism (9)
end-to-end learning (9)
speech processing (9)
neural network (9)
speaker diarization (8)
speech separation (8)
speech synthesis (8)
language model (8)
data augmentation (7)
transfer learning (7)
Papers
Summarizing Speech: A Comprehensive Survey
EMNLP 2025
Cross-Talk Reduction
IJCAI 2024
Self-training ASR Guided by Unsupervised ASR Teacher
INTERSPEECH 2024
BASS: Block-wise Adaptation for Speech Summarization
INTERSPEECH 2023
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
INTERSPEECH 2023
Exploration on HuBERT with Multiple Resolution
INTERSPEECH 2023
Deep Speech Synthesis from MRI-Based Articulatory Representations
INTERSPEECH 2023
TriniTTS: Pitch-controllable End-to-end TTS without External Aligner
INTERSPEECH 2022
Deep Speech Synthesis from Articulatory Representations
INTERSPEECH 2022
Online Continual Learning of End-to-End Speech Recognition Models
INTERSPEECH 2022
Two-Pass Low Latency End-to-End Spoken Language Understanding
INTERSPEECH 2022
When Is TTS Augmentation Through a Pivot Language Useful?
INTERSPEECH 2022
Residual Language Model for End-to-end Speech Recognition
INTERSPEECH 2022
Memory-Efficient Training of RNN-Transducer with Sampled Softmax
INTERSPEECH 2022
ASR2K: Speech Recognition for Around 2000 Languages without Audio
INTERSPEECH 2022
Better Intermediates Improve CTC Inference
INTERSPEECH 2022
Toward Streaming ASR with Non-Autoregressive Insertion-Based Model
INTERSPEECH 2021
Layer Pruning on Demand with Intermediate CTC
INTERSPEECH 2021
Acoustic Event Detection with Classifier Chains
INTERSPEECH 2021
SUPERB: Speech Processing Universal PERformance Benchmark
INTERSPEECH 2021
Multi-Mode Transformer Transducer with Stochastic Future Context
INTERSPEECH 2021
Leveraging Pre-Trained Language Model for Speech Sentiment Analysis
INTERSPEECH 2021
End-to-End ASR with Adaptive Span Self-Attention
INTERSPEECH 2020
Learning Speaker Embedding from Text-to-Speech
INTERSPEECH 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
INTERSPEECH 2020
Insertion-Based Modeling for End-to-End Automatic Speech Recognition
INTERSPEECH 2020
End-to-End SpeakerBeam for Single Channel Target Speech Recognition
INTERSPEECH 2019
Speaker Recognition Benchmark Using the CHiME-5 Corpus
INTERSPEECH 2019
End-to-End Multilingual Multi-Speaker Speech Recognition
INTERSPEECH 2019
Vectorized Beam Search for CTC-Attention-Based Speech Recognition
INTERSPEECH 2019
Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis
INTERSPEECH 2019
Multi-Head Decoder for End-to-End Speech Recognition
INTERSPEECH 2018
Student-Teacher Learning for BLSTM Mask-based Speech Enhancement
INTERSPEECH 2018
Semi-Supervised End-to-End Speech Recognition
INTERSPEECH 2018
Auxiliary Feature Based Adaptation of End-to-end ASR Systems
INTERSPEECH 2018
Multi-Modal Data Augmentation for End-to-end ASR
INTERSPEECH 2018
ESPnet: End-to-End Speech Processing Toolkit
INTERSPEECH 2018
Single-Channel Multi-Speaker Separation Using Deep Clustering
INTERSPEECH 2016