Evaluation

345 directly classified papers

Papers

Pitfalls in the Evaluation of Sentence Embeddings ACL 2019

Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study ACL 2019

QADiver: Interactive Framework for Diagnosing QA Models AAAI 2019

Detecting Overfitting of Deep Generative Networks via Latent Recovery CVPR 2019

Auditing Deep Learning processes through Kernel-based Explanatory Models EMNLP 2019

Knowing When to Stop: Evaluation and Verification of Conformity to Output-Size Specifications CVPR 2019

Visualizing the Loss Landscape of Neural Nets NIPS 2018

Understanding Deep Learning Performance through an Examination of Test Set Difficulty: A Psychometric Case Study EMNLP 2018

When data permutations are pathological: the case of neural natural language inference EMNLP 2018

Portable, layer-wise task performance monitoring for NLP models EMNLP 2018

Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Machine Translation EMNLP 2018

Diachronic degradation of language models: Insights from social media ACL 2018

Evaluating neural network explanation methods using hybrid documents and morphosyntactic agreement ACL 2018

A Boo(n) for Evaluating Architecture Performance ICML 2018

What Do Deep Networks Like to See? CVPR 2018

Massive Exploration of Neural Machine Translation Architectures EMNLP 2017

Spatially Binned ROC: A Comprehensive Saliency Metric CVPR 2016

Predicting When Saliency Maps Are Accurate and Eye Fixations Consistent CVPR 2016

Group MAD Competition - A New Methodology to Compare Objective Image Quality Models CVPR 2016

How to Evaluate Foreground Maps? CVPR 2014