Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
Crash2DocAI: Automated Integration of Post-Crash Car Part Images into Technical Reports
WACV 2026
PVeRA: Probabilistic Vector-Based Random Matrix Adaptation
WACV 2026
One-Cycle Structured Pruning via Stability-Driven Subnetwork Search
WACV 2026
GFT: Graph Feature Tuning for Efficient Point Cloud Analysis
WACV 2026
Beyond Real Weights: Hypercomplex Representations for Stable Quantization
WACV 2026
SODA: Spectral Orthogonal Decomposition Adaptation for Diffusion Models
WACV 2025
LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones
WACV 2025
Advancing Weight and Channel Sparsification with Enhanced Saliency
WACV 2025
DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing
WACV 2025
Model Unlearning via Sparse Autoencoder Subspace Guided Projections
EMNLP 2025
Data Generation for Hardware-Friendly Post-Training Quantization
WACV 2025
FT-MDT: Extracting Decision Trees from Medical Texts via a Novel Low-rank Adaptation Method
EMNLP 2025
TORE: Token Recycling in Vision Transformers for Efficient Active Visual Exploration
WACV 2025
eLIR-Net: An Efficient AI Solution for Image Retouching
WACV 2025
AMP-ViT: Optimizing Vision Transformer Efficiency with Adaptive Mixed-Precision Post-Training Quantization
WACV 2025
Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration
CVPR 2025
PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting
CVPR 2025
Team HUMANE at AVeriTeC 2025: HerO 2 for Efficient Fact Verification
ACL 2025
Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales
CVPR 2025
ICP: Immediate Compensation Pruning for Mid-to-high Sparsity
CVPR 2025
ELMGS: Enhancing Memory and Computation Scalability through Compression for 3D Gaussian Splatting
WACV 2025
Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation
WACV 2025
BitNet: 1-bit Pre-training for Large Language Models
JMLR 2025
MedQwen-PE: Medical Qwen for Parameter-Efficient Multilingual Patient-Centric Summarization, Question Answering and Information Extraction
IJCNLP 2025
DyRoNet: Dynamic Routing and Low-Rank Adapters for Autonomous Driving Streaming Perception
WACV 2025
<
1
2
3
4
5
…
67
>