Evaluation

345 directly classified papers

Papers

Hard Gate Knowledge Distillation - Leverage Calibration for Robust and Reliable Language Model EMNLP 2022

GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation EMNLP 2022

Model Criticism for Long-Form Text Generation EMNLP 2022

FALTE: A Toolkit for Fine-grained Annotation for Long Text Evaluation EMNLP 2022

SEAL: Interactive Tool for Systematic Error Analysis and Labeling EMNLP 2022

Iterative Stratified Testing and Measurement for Automated Model Updates EMNLP 2022

Improved Evaluation of Automatic Source Code Summarisation EMNLP 2022

When does dough become a bagel? Analyzing the remaining mistakes on ImageNet NIPS 2022

GULP: a prediction-based metric between representations NIPS 2022

Spectral Bias in Practice: The Role of Function Frequency in Generalization NIPS 2022

BackdoorBench: A Comprehensive Benchmark of Backdoor Learning NIPS 2022

Introspective Learning: A Two-Stage Approach for Inference in Neural Networks NIPS 2022

NAS-Bench-360: Benchmarking Neural Architecture Search on Diverse Tasks NIPS 2022

Video compression dataset and benchmark of learning-based video-quality metrics NIPS 2022

AutoML Two-Sample Test NIPS 2022

Deconfounded Representation Similarity for Comparison of Neural Networks NIPS 2022

Agreement-on-the-line: Predicting the Performance of Neural Networks under Distribution Shift NIPS 2022

Distilled Gradient Aggregation: Purify Features for Input Attribution in the Deep Neural Network NIPS 2022

MORA: Improving Ensemble Robustness Evaluation with Model Reweighing Attack NIPS 2022

Verifiability and Predictability: Interpreting Utilities of Network Architectures for Point Cloud Processing CVPR 2021

Building Reliable Explanations of Unreliable Neural Networks: Locally Smoothing Perspective of Model Interpretation CVPR 2021

Ranking Neural Checkpoints CVPR 2021

CAMERAS: Enhanced Resolution and Sanity Preserving Class Activation Mapping for Image Saliency CVPR 2021

Debiasing Methods in Natural Language Understanding Make Bias More Accessible EMNLP 2021

Automatic Text Evaluation through the Lens of Wasserstein Barycenters EMNLP 2021