← Resources & Methods

Natural Language Processing › Resources & Methods ›

Text Representation

2246 directly classified papers

Papers per year

Papers

Exploring the Integration of Eye Movement Data on Word Embeddings NAACL 2025

mStyleDistance: Multilingual Style Embeddings and their Evaluation ACL 2025

Evaluating LLM-Prompting for Sequence Labeling Tasks in Computational Literary Studies NAACL 2025

Field to Model: Pairing Community Data Collection with Scalable NLP through the LiFE Suite ACL 2025

TEXT-CAKE: Challenging Language Models on Local Text Coherence COLING 2025

Modeling Complex Semantics Relation with Contrastively Fine-Tuned Relational Encoders ACL 2025

PosterSum: A Multimodal Benchmark for Scientific Poster Summarization IJCNLP 2025

ClaimCatchers at SemEval-2025 Task 7: Sentence Transformers for Claim Retrieval ACL 2025

Formalizing Style in Personal Narratives EMNLP 2025

Dictionaries to the Rescue: Cross-Lingual Vocabulary Transfer for Low-Resource Languages Using Bilingual Dictionaries ACL 2025

Clustering LLM-based Word Embeddings to Determine Topics from Bangla Articles IJCNLP 2025

Experiential Semantic Information and Brain Alignment: Are Multimodal Models Better than Language Models? ACL 2025

How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages EMNLP 2025

SELEXINI – a large and diverse automatically parsed corpus of French COLING 2025

AutoChunker: Structured Text Chunking and its Evaluation ACL 2025

Can Uniform Meaning Representation Help GPT-4 Translate from Indigenous Languages? ACL 2025

Constrained Non-negative Matrix Factorization for Guided Topic Modeling of Minority Topics EMNLP 2025

Challenging Assumptions in Learning Generic Text Style Embeddings NAACL 2025

Conditional Dichotomy Quantification via Geometric Embedding ACL 2025

LawToken: a single token worth more than its constituents CONLL 2025

Beyond Contrastive Learning: Synthetic Data Enables List-wise Training with Multiple Levels of Relevance EMNLP 2025

Cyber Protectors@DravidianLangTech 2025: Abusive Tamil and Malayalam Text Targeting Women on Social Media using FastText NAACL 2025

Exploring morphology-aware tokenization: A case study on Spanish language modeling EMNLP 2025

A GitHub-based Workflow for Annotated Resource Development ACL 2025

Detecting Inconsistencies in Narrative Elements of Cross Lingual Nakba Texts COLING 2025