← Optimization & Theory

Deep Learning › Optimization & Theory ›

Optimization

1638 directly classified papers

Papers per year

Papers

Joint Optimization of Camera Model and Deep Neural Network for Image Recognition WACV 2026

SkelSplat: Robust Multi-view 3D Human Pose Estimation with Differentiable Gaussian Rendering WACV 2026

ODEt(ODEl): Shortcutting the Time and the Length in Diffusion and Flow Models for Faster Sampling WACV 2026

LegoMT2: Selective Asynchronous Sharded Data Parallel Training for Massive Neural Machine Translation ACL 2025

Assigning Distinct Roles to Quantized and Low-Rank Matrices Toward Optimal Weight Decomposition ACL 2025

Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory ACL 2025

QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models EMNLP 2025

ECoRAG: Evidentiality-guided Compression for Long Context RAG ACL 2025

LAMB: A Training-Free Method to Enhance the Long-Context Understanding of SSMs via Attention-Guided Token Filtering ACL 2025

CLaSp: In-Context Layer Skip for Self-Speculative Decoding ACL 2025

Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport ACL 2025

Q-Mamba: Towards more efficient Mamba models via post-training quantization ACL 2025

TKG-DM: Training-free Chroma Key Content Generation Diffusion Model CVPR 2025

Efficient Ensemble for Fine-tuning Language Models on Multiple Datasets ACL 2025

Unifying Uniform and Binary-coding Quantization for Accurate Compression of Large Language Models ACL 2025

Understanding Silent Data Corruption in LLM Training ACL 2025

SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers ACL 2025

Fuzzy Speculative Decoding for a Tunable Accuracy-Runtime Tradeoff ACL 2025

AutoMixer: Checkpoint Artifacts as Automatic Data Mixers ACL 2025

VLMInferSlow: Evaluating the Efficiency Robustness of Large Vision-Language Models as a Service ACL 2025

How to Mitigate Overfitting in Weak-to-strong Generalization? ACL 2025

DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization ACL 2025

TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models ICCV 2025

MobiLoRA: Accelerating LoRA-based LLM Inference on Mobile Devices via Context-aware KV Cache Optimization ACL 2025

Why Are Positional Encodings Nonessential for Deep Autoregressive Transformers? A Petroglyph Revisited ACL 2025