Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Deep Learning
›
Learning Types
›
Model Compression
124 directly classified papers
Papers per year
2016: 4
2017: 2
2018: 3
2019: 8
2020: 11
2021: 17
2022: 12
2023: 16
2024: 23
2025: 28
Papers
Separate the Wheat from the Chaff: A Post-Hoc Approach to Safety Re-Alignment for Fine-Tuned Language Models
ACL 2025
MobiLoRA: Accelerating LoRA-based LLM Inference on Mobile Devices via Context-aware KV Cache Optimization
ACL 2025
LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint
ACL 2025
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
ACL 2024
The Mamba in the Llama: Distilling and Accelerating Hybrid Models
NIPS 2024
Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent
EMNLP 2024
Hear You Say You: An Efficient Framework for Marine Mammal Sounds’ Classification
AAAI 2024
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
NIPS 2024
Training Binary Neural Networks via Gaussian Variational Inference and Low-Rank Semidefinite Programming
NIPS 2024
Simple and Fast Distillation of Diffusion Models
NIPS 2024
Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion
NIPS 2024
Privacy-Preserving Face Recognition Using Trainable Feature Subtraction
CVPR 2024
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping
EMNLP 2024
SlimSAM: 0.1% Data Makes Segment Anything Slim
NIPS 2024
SparseLLM: Towards Global Pruning of Pre-trained Language Models
NIPS 2024
SpikedAttention: Training-Free and Fully Spike-Driven Transformer-to-SNN Conversion with Winner-Oriented Spike Shift for Softmax Operation
NIPS 2024
GOVERN: Gradient Orientation Vote Ensemble for Multi-Teacher Reinforced Distillation
EMNLP 2024
PromptKD: Distilling Student-Friendly Knowledge for Generative Language Models via Prompt Tuning
EMNLP 2024
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
NIPS 2024
Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement
NIPS 2024
Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL
EMNLP 2024
3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability
NIPS 2024
Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models
EMNLP 2024
Multistep Distillation of Diffusion Models via Moment Matching
NIPS 2024
SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization
NIPS 2024
<
1
2
3
4
5
>