Research Explorer
AI Safety
2972 directly classified papers
Papers per year
2002: 1
2006: 1
2007: 1
2012: 4
2013: 1
2015: 5
2016: 1
2017: 13
2018: 40
2019: 91
2020: 111
2021: 181
2022: 204
2023: 333
2024: 642
2025: 1031
2026: 312
Papers
Querying to Find a Safe Policy under Uncertain Safety Constraints in Markov Decision Processes
AAAI 2020
Adversarial Separation Network for Speaker Recognition
INTERSPEECH 2020
Robustness to Programmable String Transformations via Augmented Abstract Training
ICML 2020
Towards Controllable Biases in Language Generation
EMNLP 2020
Neural Network Control Policy Verification With Persistent Adversarial Perturbation
ICML 2020
Guaranteeing Safety of Learned Perception Modules via Measurement-Robust Control Barrier Functions
CORL 2020
Sampling-based Reachability Analysis: A Random Set Theory Approach with Adversarial Sampling
CORL 2020
Fastened CROWN: Tightened Neural Network Robustness Certificates
AAAI 2020
Weight Poisoning Attacks on Pretrained Models
ACL 2020
On Isometry Robustness of Deep 3D Point Cloud Models Under Adversarial Attacks
CVPR 2020
A Self-supervised Approach for Adversarial Robustness
CVPR 2020
When NAS Meets Robustness: In Search of Robust Architectures Against Adversarial Attacks
CVPR 2020
Fundamental Tradeoffs between Invariance and Sensitivity to Adversarial Perturbations
ICML 2020
Black-box Certification and Learning under Adversarial Perturbations
ICML 2020
What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images
CVPR 2020
Towards Socially Responsible AI: Cognitive Bias-Aware Multi-Objective Learning
AAAI 2020
Efficient Proximal Mapping of the 1-path-norm of Shallow Networks
ICML 2020
Efficient Robustness Certificates for Discrete Data: Sparsity-Aware Randomized Smoothing for Graphs, Images and More
ICML 2020
Constrained Markov Decision Processes via Backward Value Functions
ICML 2020
Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder
EMNLP 2020
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification
INTERSPEECH 2020
Learning for Safety-Critical Control with Control Barrier Functions
L4DC 2020
Adversarially Robust Streaming Algorithms via Differential Privacy
NeurIPS 2020
TBT: Targeted Neural Network Attack With Bit Trojan
CVPR 2020
Transferable Calibration with Lower Bias and Variance in Domain Adaptation
NeurIPS 2020