David Kaczér
2 papers
· 2025–2025
· 2 conferences
· across top CS/AI conferences
Achievements
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🌍
Conference Polyglot
(2)
🐝
Cross-Pollinator
(12)
🗺️
Taxonomy Completionist
(11)
Conferences
ACL (1)
EMNLP (1)
Top co-authors
Keywords
language model
(2)
multilingual nlp
(1)
cross-lingual transfer
(1)
text representation
(1)
multilingual pretraining
(1)
markov chain monte carlo
(1)
markov model
(1)
data filtering
(1)
multilingual embedding
(1)
subword tokenisation
(1)
path counting
(1)
subword regularization
(1)
text quality
(1)
pretraining datum
(1)
multilingual data curation
(1)
knowledge distillation
(1)
pretraining data filtering
(1)