Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Models
Deep Learning
›
Models
›
Language Models
705 directly classified papers
Papers per year
2008: 1
2012: 1
2014: 1
2015: 1
2017: 8
2018: 21
2019: 41
2020: 86
2021: 95
2022: 127
2023: 126
2024: 98
2025: 99
Papers
Distribution Prompting: Understanding the Expressivity of Language Models Through the Next-Token Distributions They Can Produce
EMNLP 2025
R-BPE: Improving BPE-Tokenizers with Token Reuse
EMNLP 2025
Tokenization and Representation Biases in Multilingual Models on Dialectal NLP Tasks
EMNLP 2025
Untie the Knots: An Efficient Data Augmentation Strategy for Long-Context Pre-Training in Language Models
ACL 2025
LLäMmlein: Transparent, Compact and Competitive German-Only Language Models from Scratch
ACL 2025
SR-LLM: Rethinking the Structured Representation in Large Language Model
ACL 2025
Positional Overload: Positional Debiasing and Context Window Extension for Large Language Models using Set Encoding
ACL 2025
A Text is Worth Several Tokens: Text Embedding from LLMs Secretly Aligns Well with The Key Tokens
ACL 2025
On Support Samples of Next Word Prediction
ACL 2025
Scaling up the State Size of RNN LLMs for Long-Context Scenarios
ACL 2025
EAC-MoE: Expert-Selection Aware Compressor for Mixture-of-Experts Large Language Models
ACL 2025
M2RC-EVAL: Massively Multilingual Repository-level Code Completion Evaluation
ACL 2025
A New Formulation of Zipf’s Meaning-Frequency Law through Contextual Diversity
ACL 2025
VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models
ACL 2025
Controllable Style Arithmetic with Language Models
ACL 2025
A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment
ACL 2025
Enhancing Lexicon-Based Text Embeddings with Large Language Models
ACL 2025
Towards the Law of Capacity Gap in Distilling Language Models
ACL 2025
TESS 2: A Large-Scale Generalist Diffusion Language Model
ACL 2025
mRNA2vec: mRNA Embedding with Language Model in the 5'UTR-CDS for mRNA Design
AAAI 2025
GigaChat Family: Efficient Russian Language Modeling Through Mixture of Experts Architecture
ACL 2025
L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression
AAAI 2025
VenusFactory: An Integrated System for Protein Engineering with Data Retrieval and Language Model Fine-Tuning
ACL 2025
Knowledge Graph Completion with Relation-Aware Anchor Enhancement
AAAI 2025
Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models
ACL 2024
<
1
2
3
4
5
…
29
>