Research Explorer
AI Safety
2972 directly classified papers
Papers per year
2002: 1
2006: 1
2007: 1
2012: 4
2013: 1
2015: 5
2016: 1
2017: 13
2018: 40
2019: 91
2020: 111
2021: 181
2022: 204
2023: 333
2024: 642
2025: 1031
2026: 312
Papers
Adaptive Risk Sensitive Model Predictive Control with Stochastic Search
L4DC 2021
Neural Lyapunov Redesign
L4DC 2021
Probabilistic robust linear quadratic regulators with Gaussian processes
L4DC 2021
Randomized Smoothing of All Shapes and Sizes
ICML 2020
FastLAS: Scalable Inductive Logic Programming Incorporating Domain-Specific Optimisation Criteria
AAAI 2020
A Multi-Objective Approach to Mitigate Negative Side Effects
IJCAI 2020
Incorporating Failure Events in Agents’ Decision Making to Improve User Satisfaction
IJCAI 2020
Towards Adversarially Robust Knowledge Graph Embeddings
AAAI 2020
Adversary for Social Good: Protecting Familial Privacy through Joint Adversarial Attacks
AAAI 2020
Strategic Classification is Causal Modeling in Disguise
ICML 2020
Safe non-smooth black-box optimization with application to policy search
L4DC 2020
SAFER: A Structure-free Approach for Certified Robustness to Adversarial Word Substitutions
ACL 2020
LSTM Neural Networks: Input to State Stability and Probabilistic Safety Verification
L4DC 2020
Policy Optimization for $\mathcal{H}_2$ Linear Control with $\mathcal{H}_\infty$ Robustness Guarantee: Implicit Regularization and Global Convergence
L4DC 2020
Towards Certificated Model Robustness Against Weight Perturbations
AAAI 2020
Diagnosing Software Faults Using Multiverse Analysis
IJCAI 2020
Towards Trustable Explainable AI
IJCAI 2020
Adversarially Robust Distillation
AAAI 2020
Efficient Verification of ReLU-Based Neural Networks via Dependency Analysis
AAAI 2020
Detecting Adversarial Attacks via Subset Scanning of Autoencoder Activations and Reconstruction Error
IJCAI 2020
Handling Black Swan Events in Deep Learning with Diversely Extrapolated Neural Networks
IJCAI 2020
Generating Adversarial Examples for Holding Robustness of Source Code Processing Models
AAAI 2020
Contracting Implicit Recurrent Neural Networks: Stable Models with Improved Trainability
L4DC 2020
On the Robustness of Data-Driven Controllers for Linear Systems
L4DC 2020
Lagrangian Decomposition for Neural Network Verification
UAI 2020