← Application Areas

Machine Learning › Application Areas ›

Model Compression

1503 directly classified papers

Papers per year

Papers

EAC-MoE: Expert-Selection Aware Compressor for Mixture-of-Experts Large Language Models ACL 2025

StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data ICCV 2025

Beyond Low-Rank Tuning: Model Prior-Guided Rank Allocation for Effective Transfer in Low-Data and Large-Gap Regimes. ICCV 2025

Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation ICCV 2025

RED-CT: A Systems Design Methodology for Using LLM-labeled Data to Train and Deploy Edge Linguistic Classifiers COLING 2025

Best Practices for Distilling Large Language Models into BERT for Web Search Ranking COLING 2025

Lightweight Safety Guardrails Using Fine-tuned BERT Embeddings COLING 2025

Efficient Vocabulary Reduction for Small Language Models COLING 2025

DadmaTools V2: an Adapter-Based Natural Language Processing Toolkit for the Persian Language COLING 2025

AAIG at GenAI Detection Task 1: Exploring Syntactically-Aware, Resource-Efficient Small Autoregressive Decoders for AI Content Detection COLING 2025

Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation COLING 2025

VRCP: Vocabulary Replacement Continued Pretraining for Efficient Multilingual Language Models COLING 2025

Variance-Based Pruning for Accelerating and Compressing Trained Networks ICCV 2025

Outlier-Aware Post-Training Quantization for Image Super-Resolution ICCV 2025

Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation NAACL 2025

LangCompress: Language-Aware Compression of Large Language Models IJCNLP 2025

General Compression Framework for Efficient Transformer Object Tracking ICCV 2025

Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM ICCV 2025

Interpreting the Effects of Quantization on LLMs IJCNLP 2025

AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model ICCV 2025

Aligning Sizes of Intermediate Layers by LoRA Adapter for Knowledge Distillation NAACL 2025

As easy as PIE: understanding when pruning causes language models to disagree NAACL 2025

Verifiable Format Control for Large Language Model Generations NAACL 2025

SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models NAACL 2025

LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models NAACL 2025