Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Evaluation
345 directly classified papers
Papers per year
2014: 1
2016: 3
2017: 1
2018: 9
2019: 21
2020: 34
2021: 32
2022: 50
2023: 28
2024: 90
2025: 76
Papers
Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks
NIPS 2020
Identifying Model Weakness with Adversarial Examiner
AAAI 2020
ML-LOO: Detecting Adversarial Examples with Feature Attribution
AAAI 2020
Sanity Checks for Saliency Metrics
AAAI 2020
Building Calibrated Deep Models via Uncertainty Matching with Auxiliary Interval Predictors
AAAI 2020
Relative Attributing Propagation: Interpreting the Comparative Contributions of Individual Units in Deep Neural Networks
AAAI 2020
Beyond Accuracy: Behavioral Testing of NLP Models with CheckList
ACL 2020
Adversarial NLI: A New Benchmark for Natural Language Understanding
ACL 2020
On The Evaluation of Machine Translation Systems Trained With Back-Translation
ACL 2020
Probing Linguistic Features of Sentence-Level Representations in Neural Relation Extraction
ACL 2020
Cold Case: The Lost MNIST Digits
NIPS 2019
Higher-order Comparisons of Sentence Encoder Representations
EMNLP 2019
What Part of the Neural Network Does This? Understanding LSTMs by Measuring and Dissecting Neurons
EMNLP 2019
Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
EMNLP 2019
Tightness-Aware Evaluation Protocol for Scene Text Detection
CVPR 2019
Spectral Metric for Dataset Complexity Assessment
CVPR 2019
Sensitive-Sample Fingerprinting of Deep Neural Networks
CVPR 2019
Rethinking the Evaluation of Video Summaries
CVPR 2019
Interpretable and Fine-Grained Visual Explanations for Convolutional Neural Networks
CVPR 2019
Demystifying Black-box Models with Symbolic Metamodels
NIPS 2019
Certifying Geometric Robustness of Neural Networks
NIPS 2019
A Benchmark for Interpretability Methods in Deep Neural Networks
NIPS 2019
Evaluating BERT for natural language inference: A case study on the CommitmentBank
EMNLP 2019
The Feasibility of Embedding Based Automatic Evaluation for Single Document Summarization
EMNLP 2019
REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning
EMNLP 2019
<
1
…
10
11
12
13
14
>