Research Explorer

Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?

Zhiyuan Zeng, Qinyuan Cheng, Zhangyue Yin et al.

2025 ACL

Learning to Generate Gradients for Test-Time Adaptation via Test-Time Training Layers

Qi Deng, Shuaicheng Niu, Ronghao Zhang et al.

2025 AAAI

MT3: Meta Test-Time Training for Self-Supervised Test-Time Adaption

Alexander Bartler, Andre Bühler, Felix Wiewel et al.

2022 AISTATS

Scaling LLM Test-Time Compute Optimally Can be More Effective than Scaling Parameters for Reasoning

Charlie Victor Snell, Jaehoon Lee, Kelvin Xu et al.

2025 ICLR

On the Adversarial Risk of Test Time Adaptation: An Investigation into Realistic Test-Time Data Poisoning

Yongyi Su, Yushu Li, Nanqing Liu et al.

2025 ICLR

Efficient Latent Semantic Clustering for Scaling Test-Time Computation of LLMs

Sungjae Lee, Hoyoung Kim, Jeongyeon Hwang et al.

2025 EMNLP

MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains

Kaiwen Wei, Rui Shan, Dongsheng Zou et al.

2026 AAAI

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Jian Zhao, Runze Liu, Kaiyan Zhang et al.

2026 AAAI

Learning a Continue-Thinking Token for Enhanced Test-Time Scaling

Liran Ringel, Elad Tolochinsky, Yaniv Romano

2025 AACL

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Guijin Son, Jiwoo Hong, Hyunwoo Ko et al.

2025 ACL

Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory

Yexiang Liu, Zekun Li, Zhi Fang et al.

2025 ACL

METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling

Bingxuan Li, Yiwei Wang, Jiuxiang Gu et al.

2025 ACL

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

William Jurayj, Jeffrey Cheng, Benjamin Van Durme

2025 ACL

LAILab at ArchEHR-QA 2025: Test-time scaling for evidence selection in grounded question answering from electronic health records

Tuan Dung Le, Thanh Duong, Shohreh Haddadan et al.

2025 ACL

EQA-RM: A Generative Embodied Reward Model with Test-time Scaling

Yuhang Chen, Zhen Tan, Tianlong Chen

2025 EMNLP

T2: An Adaptive Test-Time Scaling Strategy for Contextual Question Answering

Zhengyi Zhao, Shubo Zhang, Zezhong Wang et al.

2025 EMNLP

Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework

Jie Chen, Jinhao Jiang, Yingqian Min et al.

2025 EMNLP

Thought calibration: Efficient and confident test-time scaling

Menghua Wu, Cai Zhou, Stephen Bates et al.

2025 EMNLP

Step-level Verifier-guided Hybrid Test-Time Scaling for Large Language Models

Kaiyan Chang, Yonghao Shi, Chenglong Wang et al.

2025 EMNLP

s1: Simple test-time scaling

Niklas Muennighoff, Zitong Yang, Weijia Shi et al.

2025 EMNLP

Logical Reasoning with Outcome Reward Models for Test-Time Scaling

Ramya Keerthy Thatikonda, Wray Buntine, Ehsan Shareghi

2025 EMNLP

DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning

Hang Wu, Hongkai Chen, Yujun Cai et al.

2025 EMNLP

Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning

Ziyang Wang, Jaehong Yoon, Shoubin Yu et al.

2025 EMNLP

Accelerated Test-Time Scaling with Model-Free Speculative Sampling

Woomin Song, Saket Dingliwal, Sai Muralidhar Jayanthi et al.

2025 EMNLP

Z1: Efficient Test-time Scaling with Code

Zhaojian Yu, Yinghao Wu, Yilun Zhao et al.

2025 EMNLP

Papers