Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Evaluation
345 directly classified papers
Papers per year
2014: 1
2016: 3
2017: 1
2018: 9
2019: 21
2020: 34
2021: 32
2022: 50
2023: 28
2024: 90
2025: 76
Papers
Pitfalls in the Evaluation of Sentence Embeddings
ACL 2019
Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study
ACL 2019
QADiver: Interactive Framework for Diagnosing QA Models
AAAI 2019
Detecting Overfitting of Deep Generative Networks via Latent Recovery
CVPR 2019
Auditing Deep Learning processes through Kernel-based Explanatory Models
EMNLP 2019
Knowing When to Stop: Evaluation and Verification of Conformity to Output-Size Specifications
CVPR 2019
Visualizing the Loss Landscape of Neural Nets
NIPS 2018
Understanding Deep Learning Performance through an Examination of Test Set Difficulty: A Psychometric Case Study
EMNLP 2018
When data permutations are pathological: the case of neural natural language inference
EMNLP 2018
Portable, layer-wise task performance monitoring for NLP models
EMNLP 2018
Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Machine Translation
EMNLP 2018
Diachronic degradation of language models: Insights from social media
ACL 2018
Evaluating neural network explanation methods using hybrid documents and morphosyntactic agreement
ACL 2018
A Boo(n) for Evaluating Architecture Performance
ICML 2018
What Do Deep Networks Like to See?
CVPR 2018
Massive Exploration of Neural Machine Translation Architectures
EMNLP 2017
Spatially Binned ROC: A Comprehensive Saliency Metric
CVPR 2016
Predicting When Saliency Maps Are Accurate and Eye Fixations Consistent
CVPR 2016
Group MAD Competition - A New Methodology to Compare Objective Image Quality Models
CVPR 2016
How to Evaluate Foreground Maps?
CVPR 2014
<
1
…
10
11
12
13
14
>