Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Safety
317 directly classified papers
Papers per year
2016: 1
2017: 1
2018: 4
2019: 8
2020: 11
2021: 21
2022: 29
2023: 36
2024: 87
2025: 117
2026: 2
Papers
TBT: Targeted Neural Network Attack With Bit Trojan
CVPR 2020
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models
EMNLP 2020
Abstract Constraints for Safe and Robust Robot Learning from Demonstration
AAAI 2020
Certifying Geometric Robustness of Neural Networks
NIPS 2019
Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers
NIPS 2019
Linear Stochastic Bandits Under Safety Constraints
NIPS 2019
Probabilistic Model Checking of Robots Deployed in Extreme Environments
AAAI 2019
Towards Reliable Learning for High Stakes Applications
AAAI 2019
End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks
AAAI 2019
Why ReLU Networks Yield High-Confidence Predictions Far Away From the Training Data and How to Mitigate the Problem
CVPR 2019
Convergent Policy Optimization for Safe Reinforcement Learning
NIPS 2019
Constrained Cross-Entropy Method for Safe Reinforcement Learning
NIPS 2018
Scaling provable adversarial defenses
NIPS 2018
A Lyapunov-based Approach to Safe Reinforcement Learning
NIPS 2018
Learning Safe Policies with Expert Guidance
NIPS 2018
Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation
NIPS 2017
Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
NIPS 2016
<
1
…
9
10
11
12
13
>