Research Explorer

Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning

Mohammad Amin Ghanizadeh, Mohammad Javad Dousti

2024 CONLL

TpT-ADE: Transformer Based Two-Phase ADE Extraction

Suryamukhi Kuchibhotla, Manish Singh

2024 CONLL

Transformer verbatim in-context retrieval across time and scale

Kristijan Armeni, Marko Pranjić, Senja Pollak

2024 CONLL

Translating Across Cultures: LLMs for Intralingual Cultural Adaptation

Pushpdeep Singh, Mayur Patidar, Lovekesh Vig

2024 CONLL

Using Curriculum Masking Based on Child Language Development to Train a Large Language Model with Limited Training Data

Evan Lucas, Dylan Gaines, Tagore Rao Kosireddy et al.

2024 CONLL

WhatIf: Leveraging Word Vectors for Small-Scale Data Augmentation

Alex Lyman, Bryce Hepner

2024 CONLL

What should Baby Models read? Exploring Sample-Efficient Data Composition on Model Performance

Hong Meng Yam, Nathan Paek

2024 CONLL

When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets?

Srikrishna Iyer

2024 CONLL

Words That Stick: Using Keyword Cohesion to Improve Text Segmentation

Amit Maraj, Miguel Vargas Martin, Masoud Makrehchi

2024 CONLL

A Block Metropolis-Hastings Sampler for Controllable Energy-based Text Generation

Jarad Forristal, Fatemehsadat Mireshghallah, Greg Durrett et al.

2023 CONLL

A Comparative Study on Textual Saliency of Styles from Eye Tracking, Annotations, and Language Models

Karin de Langis, Dongyeop Kang

2023 CONLL

Alignment via Mutual Information

Shinjini Ghosh, Yoon Kim, Ramon Fernandez Astudillo et al.

2023 CONLL

A Minimal Approach for Natural Language Action Space in Text-based Games

Dongwon Ryu, Meng Fang, Gholamreza Haffari et al.

2023 CONLL

ArchBERT: Bi-Modal Understanding of Neural Architectures and Natural Languages

Mohammad Akbari, Saeed Ranjbar Alvar, Behnam Kamranian et al.

2023 CONLL

Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue

Aron Molnar, Jaap Jumelet, Mario Giulianelli et al.

2023 CONLL

Can Language Models Be Tricked by Language Illusions? Easier with Syntax, Harder with Semantics

Yuhan Zhang, Edward Gibson, Forrest Davis

2023 CONLL

Challenging the “One Single Vector per Token” Assumption

Mathieu Dehouck

2023 CONLL

ChiSCor: A Corpus of Freely-Told Fantasy Stories by Dutch Children for Computational Linguistics and Cognitive Science

Bram van Dijk, Max van Duijn, Suzan Verberne et al.

2023 CONLL

Cross-Document Event Coreference Resolution: Instruct Humans or Instruct GPT?

Jin Zhao, Nianwen Xue, Bonan Min

2023 CONLL

Enhancing Code-mixed Text Generation Using Synthetic Data Filtering in Neural Machine Translation

Dama Sravani, Radhika Mamidi

2023 CONLL

Exploring Transformers as Compact, Data-efficient Language Models

Clayton Fields, Casey Kennington

2023 CONLL

Future Lens: Anticipating Subsequent Tokens from a Single Hidden State

Koyena Pal, Jiuding Sun, Andrew Yuan et al.

2023 CONLL

HNC: Leveraging Hard Negative Captions towards Models with Fine-Grained Visual-Linguistic Comprehension Capabilities

Esra Dönmez, Pascal Tilli, Hsiu-Yu Yang et al.

2023 CONLL

How Fragile is Relation Extraction under Entity Replacements?

Yiwei Wang, Bryan Hooi, Fei Wang et al.

2023 CONLL

Humans and language models diverge when predicting repeating text

Aditya Vaidya, Javier Turek, Alexander Huth

2023 CONLL
