Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Safety
317 directly classified papers
Papers per year
2016: 1
2017: 1
2018: 4
2019: 8
2020: 11
2021: 21
2022: 29
2023: 36
2024: 87
2025: 117
2026: 2
Papers
Sample-Specific Output Constraints for Neural Networks
AAAI 2021
Fooling Thermal Infrared Pedestrian Detectors in Real World Using Small Bulbs
AAAI 2021
Safe Reinforcement Learning by Imagining the Near Future
NIPS 2021
Safety Assurance for Systems with Machine Learning Components
AAAI 2021
Safe Reinforcement Learning with Linear Function Approximation
ICML 2021
Profanity-Avoiding Training Framework for Seq2seq Models with Certified Robustness
EMNLP 2021
Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer
EMNLP 2021
FaceSec: A Fine-Grained Robustness Evaluation Framework for Face Recognition Systems
CVPR 2021
Preventing Repeated Real World AI Failures by Cataloging Incidents: The AI Incident Database
AAAI 2021
Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
NIPS 2021
Counterexample Guided RL Policy Refinement Using Bayesian Optimization
NIPS 2021
Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds
NIPS 2021
REM: Efficient Semi-Automated Real-Time Moderation of Online Forums
ACL 2021
Anti-Backdoor Learning: Training Clean Models on Poisoned Data
NIPS 2021
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs
NIPS 2021
Topological Detection of Trojaned Neural Networks
NIPS 2021
Leveraging on Deep Reinforcement Learning for Autonomous Safe Decision-Making in Highway On-ramp Merging (Student Abstract)
AAAI 2021
Toward Operational Safety Verification of AI-Enabled CPS (Student Abstract)
AAAI 2020
Supervised Discovery of Unknown Unknowns through Test Sample Mining (Student Abstract)
AAAI 2020
Clean-Label Backdoor Attacks on Video Recognition Models
CVPR 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
ICML 2020
Certifying Confidence via Randomized Smoothing
NIPS 2020
Querying to Find a Safe Policy under Uncertain Safety Constraints in Markov Decision Processes
AAAI 2020
Efficient Verification of ReLU-Based Neural Networks via Dependency Analysis
AAAI 2020
Learning from Interventions Using Hierarchical Policies for Safe Learning
AAAI 2020
<
1
…
9
10
11
12
13
>