Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3648 directly classified papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model
AAAI 2025
Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent
AAAI 2025
Optimized Gradient Clipping for Noisy Label Learning
AAAI 2025
Mamba YOLO: A Simple Baseline for Object Detection with State Space Model
AAAI 2025
Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer
AAAI 2025
Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network
AAAI 2025
Robust and Adaptive AI Models for Medication Usage Forecasting Using ICD-9/10 Code (Student Abstract)
AAAI 2025
Reducing Divergence in Batch Normalization for Domain Adaptation
AAAI 2025
AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
AAAI 2025
LLaVA Steering: Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering
ACL 2025
SYSTRAN @ IWSLT 2025 Low-resource track
ACL 2025
Multi-View 3D Human Pose Estimation with Weakly Synchronized Images
AAAI 2025
DMPT: Decoupled Modality-Aware Prompt Tuning for Multi-Modal Object Re-Identification
WACV 2025
Covariance-Based Space Regularization for Few-Shot Class Incremental Learning
WACV 2025
Self-Aligning Depth-Regularized Radiance Fields for Asynchronous RGB-D Sequences
WACV 2025
The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning
JMLR 2025
Instability, Computational Efficiency and Statistical Accuracy
JMLR 2025
Stabilizing Sharpness-Aware Minimization Through A Simple Renormalization Strategy
JMLR 2025
Last-iterate Convergence of Shuffling Momentum Gradient Method under the Kurdyka-Lojasiewicz Inequality
JMLR 2025
Asymmetric Learning for Spectral Graph Neural Networks
AAAI 2025
Losing Momentum in Continuous-time Stochastic Optimisation
JMLR 2025
TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge
ACL 2025
ReGLA: Refining Gated Linear Attention
NAACL 2025
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
AAAI 2025
Taming LLMs with Gradient Grouping
ACL 2025
<
1
…
13
14
15
…
146
>