Research Explorer
Optimization
1638 directly classified papers
Papers per year
2006: 5
2007: 2
2008: 4
2009: 2
2010: 2
2011: 3
2012: 8
2013: 25
2014: 19
2015: 22
2016: 31
2017: 42
2018: 68
2019: 104
2020: 148
2021: 174
2022: 178
2023: 209
2024: 345
2025: 244
2026: 3
Papers
Joint Optimization of Camera Model and Deep Neural Network for Image Recognition
WACV 2026
SkelSplat: Robust Multi-view 3D Human Pose Estimation with Differentiable Gaussian Rendering
WACV 2026
ODEt(ODEl): Shortcutting the Time and the Length in Diffusion and Flow Models for Faster Sampling
WACV 2026
LegoMT2: Selective Asynchronous Sharded Data Parallel Training for Massive Neural Machine Translation
ACL 2025
Assigning Distinct Roles to Quantized and Low-Rank Matrices Toward Optimal Weight Decomposition
ACL 2025
Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory
ACL 2025
QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models
EMNLP 2025
ECoRAG: Evidentiality-guided Compression for Long Context RAG
ACL 2025
LAMB: A Training-Free Method to Enhance the Long-Context Understanding of SSMs via Attention-Guided Token Filtering
ACL 2025
CLaSp: In-Context Layer Skip for Self-Speculative Decoding
ACL 2025
Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport
ACL 2025
Q-Mamba: Towards more efficient Mamba models via post-training quantization
ACL 2025
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
CVPR 2025
Efficient Ensemble for Fine-tuning Language Models on Multiple Datasets
ACL 2025
Unifying Uniform and Binary-coding Quantization for Accurate Compression of Large Language Models
ACL 2025
Understanding Silent Data Corruption in LLM Training
ACL 2025
SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers
ACL 2025
Fuzzy Speculative Decoding for a Tunable Accuracy-Runtime Tradeoff
ACL 2025
AutoMixer: Checkpoint Artifacts as Automatic Data Mixers
ACL 2025
VLMInferSlow: Evaluating the Efficiency Robustness of Large Vision-Language Models as a Service
ACL 2025
How to Mitigate Overfitting in Weak-to-strong Generalization?
ACL 2025
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
ACL 2025
TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models
ICCV 2025
MobiLoRA: Accelerating LoRA-based LLM Inference on Mobile Devices via Context-aware KV Cache Optimization
ACL 2025
Why Are Positional Encodings Nonessential for Deep Autoregressive Transformers? A Petroglyph Revisited
ACL 2025