Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Architectures
Deep Learning
›
Architectures
›
Transformers
9294 directly classified papers
Papers per year
2011: 1
2014: 2
2015: 6
2016: 17
2017: 67
2018: 156
2019: 404
2020: 769
2021: 1217
2022: 1446
2023: 1628
2024: 1574
2025: 1647
2026: 360
Papers
Multi-Attention Network for One Shot Learning
CVPR 2017
Generating the Future With Adversarial Transformers
CVPR 2017
Diversity driven attention model for query-based abstractive summarization
ACL 2017
Accelerating Eulerian Fluid Simulation With Convolutional Networks
ICML 2017
Sentence Modeling with Deep Neural Architecture using Lexicon and Character Attention Mechanism for Sentiment Classification
IJCNLP 2017
To Plan or not to Plan? Discourse Planning in Slot-Value Informed Sequence to Sequence Models for Language Generation
INTERSPEECH 2017
Sequence to Sequence Modeling for User Simulation in Dialog Systems
INTERSPEECH 2017
End-to-End Speech Recognition with Auditory Attention for Multi-Microphone Distance Speech Recognition
INTERSPEECH 2017
Image-to-Markup Generation with Coarse-to-Fine Attention
ICML 2017
Approaching Human Performance in Behavior Estimation in Couples Therapy Using Deep Sentence Embeddings
INTERSPEECH 2017
Video Question Answering via Hierarchical Spatio-Temporal Attention Networks
IJCAI 2017
DeepAM: Migrate APIs with Multi-modal Sequence to Sequence Learning
IJCAI 2017
SentiNLP at IJCNLP-2017 Task 4: Customer Feedback Analysis Using a Bi-LSTM-CNN Model
IJCNLP 2017
The Meaning Factory at SemEval-2017 Task 9: Producing AMRs with Neural Semantic Parsing
SEMEVAL 2017
Experiments in Character-Level Neural Network Models for Punctuation
INTERSPEECH 2017
Learning Scalable Deep Kernels with Recurrent Structure
JMLR 2017
Hashtag Recommendation for Multimodal Microblog Using Co-Attention Network
IJCAI 2017
Local Monotonic Attention Mechanism for End-to-End Speech And Language Processing
IJCNLP 2017
Speech Bandwidth Extension Using Bottleneck Features and Deep Recurrent Neural Networks
INTERSPEECH 2016
Attention Assisted Discovery of Sub-Utterance Structure in Speech Emotion Recognition
INTERSPEECH 2016
Attending to Characters in Neural Sequence Labeling Models
COLING 2016
A Neural Attention Model for Disfluency Detection
COLING 2016
Improving Attention Modeling with Implicit Distortion and Fertility for Machine Translation
COLING 2016
First Step Towards End-to-End Parametric TTS Synthesis: Generating Spectral Parameters with Neural Attention
INTERSPEECH 2016
A Convolutional Attention Network for Extreme Summarization of Source Code
ICML 2016
<
1
…
368
369
370
371
372
>