Papers
290 papers found
IPA CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language Modeling
Zébulon Goriely, Paula Buttery
LawToken: a single token worth more than its constituents
Yu-Hsiang Tseng, Hsin-Yu Chou, Shu-Kai Hsieh
Lost in Variation? Evaluating NLI Performance in Basque and Spanish Geographical Variants
Jaione Bengoetxea, Itziar Gonzalez-Dios, Rodrigo Agerri
Planning for Success: Exploring LLM Long-term Planning Capabilities in Table Understanding
Thi-Nhung Nguyen, Hoang Ngo, Dinh Phung et al.
Polarity inversion operators in PLM
David Kletz, Pascal Amsili, Marie Candito
Principal Parts Detection for Computational Morphology: Task, Models and Benchmark
Dorin Keshales, Omer Goldman, Reut Tsarfaty
Quasi-symbolic Semantic Geometry over Transformer-based Variational AutoEncoder
Yingji Zhang, Danilo Carvalho, Andre Freitas
Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification
Leon Eshuijs, Shihan Wang, Antske Fokkens
Timestep Embeddings Trigger Collapse in Diffusion Text Generation
Ryota Nosaka, Takuya Matsuzaki
What does memory retrieval leave on the table? Modelling the Cost of Semi-Compositionality with MINERVA2 and sBERT
Sydelle de Souza, Ivan Vegner, Francis Mollica et al.
What is an “Abstract Reasoner”? Revisiting Experiments and Arguments about Large Language Models
Tian Yun, Chen Sun, Ellie Pavlick
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
Ine Gevers, Victor De Marez, Luna De Bruyne et al.
Advancing Arabic Sentiment Analysis: ArSen Benchmark and the Improved Fuzzy Deep Hybrid Network
Yang Fang, Cheng Xu, Shuhao Guan et al.
Aligning Alignments: Do Colexification and Distributional Similarity Align as Measures of cross-lingual Lexical Alignment?
Taelin Karidi, Eitan Grossman, Omri Abend
A Multimodal Large Language Model “Foresees” Objects Based on Verb Information but Not Gender
Shuqi Wang, Xufeng Duan, Zhenguang Cai
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches For Language Models
Nandini Mundra, Aditya Nanda Kishore Khandavally, Raj Dabre et al.
A Novel Instruction Tuning Method for Vietnamese Mathematical Reasoning using Trainable Open-Source Large Language Models
Quang-Vinh Nguyen, Thanh-Do Nguyen, Van-Vinh Nguyen et al.
AntLM: Bridging Causal and Masked Language Models
Xinru Yu, Bin Guo, Shiwei Luo et al.
Are BabyLMs Second Language Learners?
Lukas Edman, Lisa Bylinina, Faeze Ghorbanpour et al.
A surprisal oracle for when every layer counts
Xudong Hong, Sharid Loáiciga, Asad Sayeed
Automatic Quality Estimation for Data Selection and Curriculum Learning
Hiep Nguyen, Lynn Yip, Justin DeBenedetto