Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Resources & Methods
Natural Language Processing
›
Resources & Methods
›
Text Representation
2246 directly classified papers
Papers per year
2006: 2
2007: 4
2008: 1
2009: 6
2010: 2
2011: 3
2012: 3
2013: 7
2014: 7
2015: 4
2016: 30
2017: 126
2018: 177
2019: 231
2020: 245
2021: 296
2022: 240
2023: 210
2024: 292
2025: 297
2026: 63
Papers
Exploring the Integration of Eye Movement Data on Word Embeddings
NAACL 2025
mStyleDistance: Multilingual Style Embeddings and their Evaluation
ACL 2025
Evaluating LLM-Prompting for Sequence Labeling Tasks in Computational Literary Studies
NAACL 2025
Field to Model: Pairing Community Data Collection with Scalable NLP through the LiFE Suite
ACL 2025
TEXT-CAKE: Challenging Language Models on Local Text Coherence
COLING 2025
Modeling Complex Semantics Relation with Contrastively Fine-Tuned Relational Encoders
ACL 2025
PosterSum: A Multimodal Benchmark for Scientific Poster Summarization
IJCNLP 2025
ClaimCatchers at SemEval-2025 Task 7: Sentence Transformers for Claim Retrieval
ACL 2025
Formalizing Style in Personal Narratives
EMNLP 2025
Dictionaries to the Rescue: Cross-Lingual Vocabulary Transfer for Low-Resource Languages Using Bilingual Dictionaries
ACL 2025
Clustering LLM-based Word Embeddings to Determine Topics from Bangla Articles
IJCNLP 2025
Experiential Semantic Information and Brain Alignment: Are Multimodal Models Better than Language Models?
ACL 2025
How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages
EMNLP 2025
SELEXINI – a large and diverse automatically parsed corpus of French
COLING 2025
AutoChunker: Structured Text Chunking and its Evaluation
ACL 2025
Can Uniform Meaning Representation Help GPT-4 Translate from Indigenous Languages?
ACL 2025
Constrained Non-negative Matrix Factorization for Guided Topic Modeling of Minority Topics
EMNLP 2025
Challenging Assumptions in Learning Generic Text Style Embeddings
NAACL 2025
Conditional Dichotomy Quantification via Geometric Embedding
ACL 2025
LawToken: a single token worth more than its constituents
CONLL 2025
Beyond Contrastive Learning: Synthetic Data Enables List-wise Training with Multiple Levels of Relevance
EMNLP 2025
Cyber Protectors@DravidianLangTech 2025: Abusive Tamil and Malayalam Text Targeting Women on Social Media using FastText
NAACL 2025
Exploring morphology-aware tokenization: A case study on Spanish language modeling
EMNLP 2025
A GitHub-based Workflow for Annotated Resource Development
ACL 2025
Detecting Inconsistencies in Narrative Elements of Cross Lingual Nakba Texts
COLING 2025
<
1
…
9
10
11
…
90
>