← Optimization & Theory

Machine Learning › Optimization & Theory ›

Neural Network Optimization

3648 directly classified papers

Papers per year

Papers

PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model AAAI 2025

Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent AAAI 2025

Optimized Gradient Clipping for Noisy Label Learning AAAI 2025

Mamba YOLO: A Simple Baseline for Object Detection with State Space Model AAAI 2025

Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer AAAI 2025

Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network AAAI 2025

Robust and Adaptive AI Models for Medication Usage Forecasting Using ICD-9/10 Code (Student Abstract) AAAI 2025

Reducing Divergence in Batch Normalization for Domain Adaptation AAAI 2025

AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks AAAI 2025

LLaVA Steering: Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering ACL 2025

SYSTRAN @ IWSLT 2025 Low-resource track ACL 2025

Multi-View 3D Human Pose Estimation with Weakly Synchronized Images AAAI 2025

DMPT: Decoupled Modality-Aware Prompt Tuning for Multi-Modal Object Re-Identification WACV 2025

Covariance-Based Space Regularization for Few-Shot Class Incremental Learning WACV 2025

Self-Aligning Depth-Regularized Radiance Fields for Asynchronous RGB-D Sequences WACV 2025

The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning JMLR 2025

Instability, Computational Efficiency and Statistical Accuracy JMLR 2025

Stabilizing Sharpness-Aware Minimization Through A Simple Renormalization Strategy JMLR 2025

Last-iterate Convergence of Shuffling Momentum Gradient Method under the Kurdyka-Lojasiewicz Inequality JMLR 2025

Asymmetric Learning for Spectral Graph Neural Networks AAAI 2025

Losing Momentum in Continuous-time Stochastic Optimisation JMLR 2025

TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge ACL 2025

ReGLA: Refining Gated Linear Attention NAACL 2025

ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention AAAI 2025

Taming LLMs with Gradient Grouping ACL 2025