Data Augmentation for Instruction Following Policies via Trajectory Segmentation

Niklas Hoepner; Ilaria Tiddi; Herke van Hoof

AAAI 2025

Abstract

The scalability of instructable agents in robotics or gaming is often hindered by limited data that pairs instructions with agent trajectories. However, large datasets of unannotated trajectories containing sequences of varied agent behaviour (play trajectories) are often available. In a semi-supervised setup, we explore methods to extract labelled segments from play trajectories. The goal is to augment a small annotated dataset of instruction-trajectory pairs to improve the performance of an instruction-following policy trained downstream via imitation learning. Assuming little variation in segment length, recent video segmentation methods can effectively extract labelled segments. To lift this constraint on segment length, we propose Play Segmentation (PS), a probabilistic model that finds maximum-likelihood segmentations of extended subsegments while being trained only on individual instruction segments. Our results in a game environment and a simulated robotic gripper setting underscore the importance of segmentation: randomly sampled segments diminish performance, while incorporating labelled segments from PS improves policy performance to the level of a policy trained on twice the amount of labelled data.
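The idea of searching for a most likely segmentation can be illustrated with a generic dynamic program over segment boundaries. The sketch below is not the PS model from the paper, whose details are not given in this abstract. It assumes a hypothetical scoring function `score(start, end)` that returns a label and a log-likelihood for the segment covering steps `start` to `end` (end exclusive). The names `segment_trajectory` and `toy_score`, the per-step probabilities, and the per-segment penalty are all illustrative assumptions.

```python
import math
from typing import Callable, List, Optional, Tuple

Segment = Tuple[int, int, int]  # (start, end_exclusive, label)


def segment_trajectory(
    n_steps: int,
    score: Callable[[int, int], Tuple[int, float]],
    min_len: int = 1,
    max_len: Optional[int] = None,
) -> Tuple[float, List[Segment]]:
    """Find the segmentation of [0, n_steps) with the highest summed log-score."""
    if max_len is None:
        max_len = n_steps
    # best[t] is the best total log-score of any segmentation of the first t steps.
    best = [-math.inf] * (n_steps + 1)
    back: List[Optional[Tuple[int, int]]] = [None] * (n_steps + 1)
    best[0] = 0.0
    for end in range(1, n_steps + 1):
        for length in range(min_len, min(max_len, end) + 1):
            start = end - length
            if best[start] == -math.inf:
                continue  # no valid segmentation reaches this start point
            label, log_p = score(start, end)
            candidate = best[start] + log_p
            if candidate > best[end]:
                best[end] = candidate
                back[end] = (start, label)
    if n_steps > 0 and back[n_steps] is None:
        raise ValueError("no segmentation satisfies the length constraints")
    # Walk the back-pointers from the end to recover the segments.
    segments: List[Segment] = []
    end = n_steps
    while end > 0:
        start, label = back[end]
        segments.append((start, end, label))
        end = start
    segments.reverse()
    return best[n_steps], segments


def toy_score(traj: List[int]) -> Callable[[int, int], Tuple[int, float]]:
    """Hypothetical stand-in for a learned segment model (assumption, not from the paper)."""
    def score(start: int, end: int) -> Tuple[int, float]:
        window = traj[start:end]
        label = max(set(window), key=window.count)  # majority label in the window
        hits = window.count(label)
        # Assumed per-step match/mismatch probabilities plus a per-segment penalty
        # that discourages splitting a trajectory into many tiny segments.
        log_p = (hits * math.log(0.9)
                 + (len(window) - hits) * math.log(0.1)
                 + math.log(0.2))
        return label, log_p
    return score


if __name__ == "__main__":
    play = [0, 0, 0, 1, 1, 2, 2, 2, 2]
    total, segments = segment_trajectory(len(play), toy_score(play))
    print(total, segments)
```

In a real pipeline the toy scorer would be replaced by a model trained on annotated instruction segments, and the recovered `(start, end, label)` triples would be added to the labelled dataset. The search runs in time quadratic in the trajectory length, or linear when `max_len` is fixed.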

Topics

Artificial Intelligence > Core AI > Planning
Machine Learning > Learning Types > Semi-Supervised Learning
Machine Learning > Application Areas > Data Augmentation
Reinforcement Learning > Methods > Policy Learning
Reinforcement Learning > Applications > Robotics
Robotics > Capabilities > Manipulation
Machine Learning > Learning Types > Imitation Learning
Machine Learning > Learning Types > Data Augmentation

Keywords

imitation learning, data augmentation, policy learning, instruction following, trajectory segmentation, play trajectory
