← Application Areas

Machine Learning › Application Areas ›

Model Compression

1503 directly classified papers

Papers per year

Papers

Single-step Diffusion for Image Compression at Ultra-Low Bitrates WACV 2026

Efficient Text-Guided Convolutional Adapter for the Diffusion Model WACV 2026

FIRM-MoE:Fine-GrainedExpert Decomposition for Resource-Adaptive MoE Inference AAAI 2026

OTARo: Once Tuning for All Precisions Toward Robust On-Device LLMs AAAI 2026

GFT: Graph Feature Tuning for Efficient Point Cloud Analysis WACV 2026

One-Cycle Structured Pruning via Stability-Driven Subnetwork Search WACV 2026

Beyond Real Weights: Hypercomplex Representations for Stable Quantization WACV 2026

TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression WACV 2026

DiffBench Meets DiffAgent: End-to-End LLM-Driven Diffusion Acceleration Code Generation AAAI 2026

Prune&Comp: Free Lunch for Layer-Pruned LLMs via Iterative Pruning with Magnitude Compensation AAAI 2026

Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors IJCAI 2025

EAC-MoE: Expert-Selection Aware Compressor for Mixture-of-Experts Large Language Models ACL 2025

500xCompressor: Generalized Prompt Compression for Large Language Models ACL 2025

“Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization ACL 2025

SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models NAACL 2025

QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models NAACL 2025

LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models NAACL 2025

LangCompress: Language-Aware Compression of Large Language Models IJCNLP 2025

Interpreting the Effects of Quantization on LLMs IJCNLP 2025

AAIG at GenAI Detection Task 1: Exploring Syntactically-Aware, Resource-Efficient Small Autoregressive Decoders for AI Content Detection COLING 2025

GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs ACL 2025

Multilingual Iterative Model Pruning: What Matters? IJCNLP 2025

Quaff: Quantized Parameter-Efficient Fine-Tuning under Outlier Spatial Stability Hypothesis ACL 2025

Boost Embodied AI Models with Robust Compression Boundary IJCAI 2025

Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation COLING 2025