Research Explorer
Papers Conferences Authors Topics Keywords Trends Achievements Explore
← Back to papers
2025 ICML ICML 2025

ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

👥 Mega-Team — 40 authors

Authors

Saurabh Jha , Rohan R. Arora , Yuji Watanabe , Takumi Yanagawa , Yinfang Chen , Jackson Clark , Bhavya Bhavya , Mudit Verma , Harshit Kumar , Hirokuni Kitahara , Noah Zheutlin , Saki Takano , Divya Pathak , Felix George , Xinbo Wu , Bekir O Turkkan , Gerard Vanloo , Michael Nidd , Ting Dai , Oishik Chatterjee , Pranjal Gupta , Suranjana Samanta , Pooja Aggarwal , Rong Lee , Jae-Wook Ahn , Debanjana Kar , Amit Paradkar , Yu Deng , Pratibha Moogi , Prateeti Mohapatra , Naoki Abe , Chandrasekhar Narayanaswami , Tianyin Xu , Lav R. Varshney , Ruchi Mahindru , Anca Sailer , Laura Shwartz , Daby Sow , Nicholas C. M. Fuller , Ruchir Puri
Download PDF

Related papers

Scaling Sparse Feature Circuits For Studying In-Context Learning 2025
Incremental Gradient Descent with Small Epoch Counts is Surprisingly Slow on Ill-Conditioned Problems 2025
SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics 2025
Batch List-Decodable Linear Regression via Higher Moments 2025
GS-Bias: Global-Spatial Bias Learner for Single-Image Test-Time Adaptation of Vision-Language Models 2025