Papers
Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning
Mohammad Amin Ghanizadeh, Mohammad Javad Dousti
TpT-ADE: Transformer Based Two-Phase ADE Extraction
Suryamukhi Kuchibhotla, Manish Singh
Transformer verbatim in-context retrieval across time and scale
Kristijan Armeni, Marko Pranjić, Senja Pollak
Translating Across Cultures: LLMs for Intralingual Cultural Adaptation
Pushpdeep Singh, Mayur Patidar, Lovekesh Vig
Using Curriculum Masking Based on Child Language Development to Train a Large Language Model with Limited Training Data
Evan Lucas, Dylan Gaines, Tagore Rao Kosireddy et al.
WhatIf: Leveraging Word Vectors for Small-Scale Data Augmentation
Alex Lyman, Bryce Hepner
What should Baby Models read? Exploring Sample-Efficient Data Composition on Model Performance
Hong Meng Yam, Nathan Paek
Words That Stick: Using Keyword Cohesion to Improve Text Segmentation
Amit Maraj, Miguel Vargas Martin, Masoud Makrehchi
A Block Metropolis-Hastings Sampler for Controllable Energy-based Text Generation
Jarad Forristal, Fatemehsadat Mireshghallah, Greg Durrett et al.
A Comparative Study on Textual Saliency of Styles from Eye Tracking, Annotations, and Language Models
Karin de Langis, Dongyeop Kang
Alignment via Mutual Information
Shinjini Ghosh, Yoon Kim, Ramon Fernandez Astudillo et al.
A Minimal Approach for Natural Language Action Space in Text-based Games
Dongwon Ryu, Meng Fang, Gholamreza Haffari et al.
ArchBERT: Bi-Modal Understanding of Neural Architectures and Natural Languages
Mohammad Akbari, Saeed Ranjbar Alvar, Behnam Kamranian et al.
Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue
Aron Molnar, Jaap Jumelet, Mario Giulianelli et al.
Can Language Models Be Tricked by Language Illusions? Easier with Syntax, Harder with Semantics
Yuhan Zhang, Edward Gibson, Forrest Davis
Challenging the “One Single Vector per Token” Assumption
Mathieu Dehouck
ChiSCor: A Corpus of Freely-Told Fantasy Stories by Dutch Children for Computational Linguistics and Cognitive Science
Bram van Dijk, Max van Duijn, Suzan Verberne et al.
Cross-Document Event Coreference Resolution: Instruct Humans or Instruct GPT?
Jin Zhao, Nianwen Xue, Bonan Min
Enhancing Code-mixed Text Generation Using Synthetic Data Filtering in Neural Machine Translation
Dama Sravani, Radhika Mamidi
Exploring Transformers as Compact, Data-efficient Language Models
Clayton Fields, Casey Kennington
Future Lens: Anticipating Subsequent Tokens from a Single Hidden State
Koyena Pal, Jiuding Sun, Andrew Yuan et al.
HNC: Leveraging Hard Negative Captions towards Models with Fine-Grained Visual-Linguistic Comprehension Capabilities
Esra Dönmez, Pascal Tilli, Hsiu-Yu Yang et al.
How Fragile is Relation Extraction under Entity Replacements?
Yiwei Wang, Bryan Hooi, Fei Wang et al.
Humans and language models diverge when predicting repeating text
Aditya Vaidya, Javier Turek, Alexander Huth