Research Explorer
Evaluation
1654 directly classified papers
Papers per year
2005: 1
2006: 1
2007: 1
2008: 2
2009: 1
2010: 3
2011: 2
2012: 3
2013: 5
2014: 4
2015: 4
2016: 11
2017: 19
2018: 32
2019: 39
2020: 72
2021: 110
2022: 202
2023: 222
2024: 351
2025: 569
Papers
Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison
EACL 2017
Improving Evaluation of Document-level Machine Translation Quality Estimation
EACL 2017
JFLEG: A Fluency Corpus and Benchmark for Grammatical Error Correction
EACL 2017
Re-evaluating Automatic Metrics for Image Captioning
EACL 2017
Inference is Everything: Recasting Semantic Resources into a Unified Evaluation Framework
IJCNLP 2017
A Parallel Corpus for Evaluating Machine Translation between Arabic and European Languages
EACL 2017
Rescale-Invariant SVM for Binary Classification
IJCAI 2017
Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses
ACL 2017
Evaluating Compound Splitters Extrinsically with Textual Entailment
ACL 2017
A Challenge Set Approach to Evaluating Machine Translation
EMNLP 2017
Further Investigation into Reference Bias in Monolingual Evaluation of Machine Translation
EMNLP 2017
Empirical Evaluation of Resampling Procedures for Optimising SVM Hyperparameters
JMLR 2017
Adversarial Examples for Evaluating Reading Comprehension Systems
EMNLP 2017
Time for a Change: a Tutorial for Comparing Multiple Classifiers Through Bayesian Analysis
JMLR 2017
Fisher Consistency for Prior Probability Shift
JMLR 2017
Level Playing Field for Million Scale Face Recognition
CVPR 2017
Revisiting the Evaluation for Cross Document Event Coreference
COLING 2016
Two Illuminant Estimation and User Correction Preference
CVPR 2016
Semantic overfitting: what ‘world’ do we consider when evaluating disambiguation of text?
COLING 2016
Power of Ordered Hypothesis Testing
ICML 2016
Marginal Contrast Among Romanian Vowels: Evidence from ASR and Functional Load
INTERSPEECH 2016
Estimating Accuracy from Unlabeled Data: A Bayesian Approach
ICML 2016
Using Spatial Order to Boost the Elimination of Incorrect Feature Matches
CVPR 2016
Choice of V for V-Fold Cross-Validation in Least-Squares Density Estimation
JMLR 2016
Guarding against Spurious Discoveries in High Dimensions
JMLR 2016