Research Explorer
Model Compression
1928 directly classified papers
Papers per year
2013: 2
2014: 1
2015: 6
2016: 4
2017: 13
2018: 47
2019: 81
2020: 114
2021: 172
2022: 191
2023: 272
2024: 370
2025: 489
2026: 166
Papers
LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones
WACV 2025
ELMGS: Enhancing Memory and Computation Scalability through Compression for 3D Gaussian Splatting
WACV 2025
Q-TempFusion: Quantization-Aware Temporal Multi-Sensor Fusion on Bird's-Eye View Representation
WACV 2025
Ego-VPA: Egocentric Video Understanding with Parameter-Efficient Adaptation
WACV 2025
ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers
ICCV 2025
Memory-Efficient Backpropagation for Fine-Tuning LLMs on Resource-Constrained Mobile Devices
EMNLP 2025
TRNAS: A Training-Free Robust Neural Architecture Search
ICCV 2025
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
ICCV 2025
SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting
ICCV 2025
MULTIGUARD: An Efficient Approach for AI Safety Moderation Across Languages and Modalities
EMNLP 2025
ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba
ICCV 2025
Recover-LoRA: Data-Free Accuracy Recovery of Degraded Language Models via Low-Rank Adaptation
EMNLP 2025
ECHO-LLaMA: Efficient Caching for High-Performance LLaMA Training
EMNLP 2025
LLMs on a Budget? Say HOLA
EMNLP 2025
Controllable Memorization in LLMs via Weight Pruning
EMNLP 2025
When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning
EMNLP 2025
Knockoff Branch: Model Stealing Attack via Adding Neurons in the Pre-Trained Model
WACV 2025
Data Generation for Hardware-Friendly Post-Training Quantization
WACV 2025
Beyond Dynamic Quantization: An Efficient Static Hierarchical Mix-precision Framework for Near-Lossless LLM Compression
EMNLP 2025
On-device System of Compositional Multi-tasking in Large Language Models
EMNLP 2025
Pre-Trained Multiple Latent Variable Generative Models are Good Defenders Against Adversarial Attacks
WACV 2025
GenPTQ: Green Post-Training Quantization for Large-Scale ASR Models with Mixed-Precision Bit Allocation
EMNLP 2025
KOEnsAttack: Towards Efficient Data-Free Black-Box Adversarial Attacks via Knowledge-Orthogonalized Substitute Ensembles
ICCV 2025
MDP: Multidimensional Vision Model Pruning with Latency Constraint
CVPR 2025
Comparative Knowledge Distillation
WACV 2025