Papers
4,184 papers found
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
Zhiyuan Zeng, Qinyuan Cheng, Zhangyue Yin et al.
Learning to Generate Gradients for Test-Time Adaptation via Test-Time Training Layers
Qi Deng, Shuaicheng Niu, Ronghao Zhang et al.
MT3: Meta Test-Time Training for Self-Supervised Test-Time Adaption
Alexander Bartler, Andre Bühler, Felix Wiewel et al.
Scaling LLM Test-Time Compute Optimally Can be More Effective than Scaling Parameters for Reasoning
Charlie Victor Snell, Jaehoon Lee, Kelvin Xu et al.
On the Adversarial Risk of Test Time Adaptation: An Investigation into Realistic Test-Time Data Poisoning
Yongyi Su, Yushu Li, Nanqing Liu et al.
Efficient Latent Semantic Clustering for Scaling Test-Time Computation of LLMs
Sungjae Lee, Hoyoung Kim, Jeongyeon Hwang et al.
MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains
Kaiwen Wei, Rui Shan, Dongsheng Zou et al.
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
Jian Zhao, Runze Liu, Kaiyan Zhang et al.
Learning a Continue-Thinking Token for Enhanced Test-Time Scaling
Liran Ringel, Elad Tolochinsky, Yaniv Romano
Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning
Guijin Son, Jiwoo Hong, Hyunwoo Ko et al.
Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory
Yexiang Liu, Zekun Li, Zhi Fang et al.
METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling
Bingxuan Li, Yiwei Wang, Jiuxiang Gu et al.
Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering
William Jurayj, Jeffrey Cheng, Benjamin Van Durme
LAILab at ArchEHR-QA 2025: Test-time scaling for evidence selection in grounded question answering from electronic health records
Tuan Dung Le, Thanh Duong, Shohreh Haddadan et al.
EQA-RM: A Generative Embodied Reward Model with Test-time Scaling
Yuhang Chen, Zhen Tan, Tianlong Chen
T2: An Adaptive Test-Time Scaling Strategy for Contextual Question Answering
Zhengyi Zhao, Shubo Zhang, Zezhong Wang et al.
Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework
Jie Chen, Jinhao Jiang, Yingqian Min et al.
Thought calibration: Efficient and confident test-time scaling
Menghua Wu, Cai Zhou, Stephen Bates et al.
Step-level Verifier-guided Hybrid Test-Time Scaling for Large Language Models
Kaiyan Chang, Yonghao Shi, Chenglong Wang et al.
s1: Simple test-time scaling
Niklas Muennighoff, Zitong Yang, Weijia Shi et al.
Logical Reasoning with Outcome Reward Models for Test-Time Scaling
Ramya Keerthy Thatikonda, Wray Buntine, Ehsan Shareghi
DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning
Hang Wu, Hongkai Chen, Yujun Cai et al.
Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning
Ziyang Wang, Jaehong Yoon, Shoubin Yu et al.
Accelerated Test-Time Scaling with Model-Free Speculative Sampling
Woomin Song, Saket Dingliwal, Sai Muralidhar Jayanthi et al.
Z1: Efficient Test-time Scaling with Code
Zhaojian Yu, Yinghao Wu, Yilun Zhao et al.