Evaluation
345 directly classified papers
Papers per year
2014: 1
2016: 3
2017: 1
2018: 9
2019: 21
2020: 34
2021: 32
2022: 50
2023: 28
2024: 90
2025: 76
Papers
Hard Gate Knowledge Distillation - Leverage Calibration for Robust and Reliable Language Model
EMNLP 2022
GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation
EMNLP 2022
Model Criticism for Long-Form Text Generation
EMNLP 2022
FALTE: A Toolkit for Fine-grained Annotation for Long Text Evaluation
EMNLP 2022
SEAL: Interactive Tool for Systematic Error Analysis and Labeling
EMNLP 2022
Iterative Stratified Testing and Measurement for Automated Model Updates
EMNLP 2022
Improved Evaluation of Automatic Source Code Summarisation
EMNLP 2022
When does dough become a bagel? Analyzing the remaining mistakes on ImageNet
NIPS 2022
GULP: a prediction-based metric between representations
NIPS 2022
Spectral Bias in Practice: The Role of Function Frequency in Generalization
NIPS 2022
BackdoorBench: A Comprehensive Benchmark of Backdoor Learning
NIPS 2022
Introspective Learning: A Two-Stage Approach for Inference in Neural Networks
NIPS 2022
NAS-Bench-360: Benchmarking Neural Architecture Search on Diverse Tasks
NIPS 2022
Video compression dataset and benchmark of learning-based video-quality metrics
NIPS 2022
AutoML Two-Sample Test
NIPS 2022
Deconfounded Representation Similarity for Comparison of Neural Networks
NIPS 2022
Agreement-on-the-line: Predicting the Performance of Neural Networks under Distribution Shift
NIPS 2022
Distilled Gradient Aggregation: Purify Features for Input Attribution in the Deep Neural Network
NIPS 2022
MORA: Improving Ensemble Robustness Evaluation with Model Reweighing Attack
NIPS 2022
Verifiability and Predictability: Interpreting Utilities of Network Architectures for Point Cloud Processing
CVPR 2021
Building Reliable Explanations of Unreliable Neural Networks: Locally Smoothing Perspective of Model Interpretation
CVPR 2021
Ranking Neural Checkpoints
CVPR 2021
CAMERAS: Enhanced Resolution and Sanity Preserving Class Activation Mapping for Image Saliency
CVPR 2021
Debiasing Methods in Natural Language Understanding Make Bias More Accessible
EMNLP 2021
Automatic Text Evaluation through the Lens of Wasserstein Barycenters
EMNLP 2021