Major agents robotics

roto 2.0: The Robot Tactile Olympiad

Elle Miller, Jayaram Reddy, Ayush Deshmukh, Trevor McInroe, David Abel, Oisin Mac Aodha

Published: May 20, 2026 — 17:22 UTC

Problem
The paper addresses the limitations in tactile-based reinforcement learning (RL), particularly the fragmentation in research and the predominance of orientation tasks that do not generalize well to real-world applications. The authors present a preprint of the Robot Tactile Olympiad version 2.0 (roto 2.0), which aims to standardize tactile-based RL across various robotic morphologies. The existing benchmarks often rely on state information or distillation, which can obscure the fundamental challenges of tactile manipulation. Roto 2.0 seeks to fill this gap by focusing on end-to-end “blind” manipulation, where agents rely solely on proprioception and tactile feedback.

Method
Roto 2.0 introduces a GPU-parallelized benchmark that encompasses four distinct robotic morphologies, ranging from 16 to 24 degrees of freedom (DOF). The core technical contribution lies in its design, which emphasizes blind manipulation tasks that do not utilize any state information. The benchmark is structured to facilitate the training of agents in a manner that prioritizes algorithmic development over extensive RL tuning. The authors provide robustly tuned baselines and open-source the environments, allowing for reproducibility and accessibility. The training compute details are not explicitly disclosed, but the GPU-parallelization suggests significant computational efficiency.

Results
The performance of the blind agents in roto 2.0 is notably superior, achieving 13 Baoding ball rotations in 10 seconds. This performance represents an order of magnitude improvement over the current state-of-the-art benchmarks in tactile manipulation. The authors compare their results against existing benchmarks, demonstrating that their approach significantly accelerates the speed of manipulation tasks, thereby setting a new standard for future research in tactile-based RL.

Limitations
The authors acknowledge that the benchmark is still in its early stages and may not encompass all possible manipulation scenarios. They also note that while the focus on blind manipulation is a strength, it may limit the exploration of hybrid approaches that combine tactile sensing with other modalities. Additionally, the reliance on specific robotic morphologies may restrict generalizability to other types of robots. The paper does not address potential scalability issues or the computational costs associated with training agents in more complex environments.

Why it matters
Roto 2.0 has significant implications for the field of tactile-based RL, as it provides a standardized framework that can accelerate research and development in this area. By reducing the barriers to entry and enabling researchers to focus on fundamental algorithmic challenges, roto 2.0 could lead to breakthroughs in robotic manipulation tasks that are more aligned with real-world applications. The open-source nature of the benchmark fosters collaboration and innovation, potentially catalyzing advancements in tactile sensing and manipulation strategies across various robotic platforms.

Authors: Elle Miller, Jayaram Reddy, Ayush Deshmukh, Trevor McInroe, David Abel, Oisin Mac Aodha, Sethu Vijayakumar
Source: arXiv cs.LG
URL: https://arxiv.org/abs/2605.21429v1

By Callan Zhang · May 20, 2026 · Editorial standards →

Summarised from the primary source with AI assistance under human editorial oversight. Turing Wire is not a primary source — read the original for the authoritative account.

Source: arXiv cs.LG