Dmitrii Krasheninnikov
4 papers
· 2019–2024
· 3 conferences
· across top CS/AI conferences
Achievements
π
Interdisciplinary Bridge
π§
Keyword Pioneer
π
Conference Polyglot
(3)
π
Academic Marathon
(5)
π
Cross-Pollinator
(12)
πΊοΈ
Taxonomy Completionist
(11)
π£
Hot Topic Early Bird
Conferences
NIPS (2)
ICLR (1)
ICML (1)
Top co-authors
Keywords
reinforcement learning
(2)
model evaluation
(1)
model safety
(1)
reward function
(1)
reward hacking
(1)
capability elicitation
(1)
password-locked model
(1)
safety evaluation
(1)
hidden capability
(1)
deterministic policy
(1)
stochastic policy
(1)
llm alignment
(1)
large language model
(1)
proxy reward
(1)
dangerous capability
(1)
unhackable proxy
(1)
fine-tuning evaluation
(1)