Piper All In One

Piper All In One is an imitation-learning stack for AgileX Piper robotic arms and Cobot Magic systems. It covers the full robot-learning loop: hardware bring-up, teleoperated data collection, data replay, LeRobot dataset conversion, and policy inference.

‼️ Safety first: this repository controls real robot arms. Keep the emergency stop reachable, start from a clear workspace, verify CAN/camera mappings before motion, and test every new policy with slow, supervised motions before normal operation.

📰 News

Apr 29 2026: Repository released.

📋 Contents

📰 News
📋 Contents
🗂️ Repository Layout
⚙️ Setup
- Prerequisites
  - Hardware
  - Software
- Installation
🚀 Usage
🧩 Configuration Reference
- Task Configs
- Policy Clients
📊 Dataset Format
🙏 Acknowledgements

🗂️ Repository Layout

.
├── camera_ws/                         # ROS workspace for RealSense cameras
├── piper_ws/                          # ROS workspace for Piper messages and launch files
├── collect_data/                      # Data collection, replay, and visualization scripts
├── convert_data/                      # HDF5 filtering and LeRobot conversion scripts
├── inference/                         # Policy clients and sync/async inference runners
├── lerobot/                           # LeRobot submodule used for dataset conversion

⚙️ Setup

Prerequisites

Hardware

Four AgileX Piper or PiperX arms: two leader arms for teleoperation, and two follower arms for execution.
Two USB-to-CAN modules: one for each leader-follower pair.
Three RealSense cameras: one front camera and two wrist cameras, with each wrist camera mounted on a follower arm.
A control machine with enough USB bandwidth for all cameras and CAN devices.

Software

Ubuntu 20.04: recommended because this release targets ROS Noetic.
ROS Noetic: follow the installation guide and install ros-noetic-desktop-full.
Conda: used to manage Python environments.
Terminator: recommended when running multiple ROS terminals.

Installation

Clone the Repository

git clone --recurse-submodules https://github.com/innovator-zero/piper-aio.git
cd piper-aio

If you already cloned without submodules, initialize them with:

git submodule update --init --recursive

Install System Packages

sudo apt update
sudo apt install -y \
  libgflags-dev \
  libgoogle-glog-dev \
  libusb-1.0-0-dev \
  libeigen3-dev \
  libkdl-parser-dev \
  can-utils \
  ethtool \
  net-tools

sudo apt install -y \
  ros-$ROS_DISTRO-image-geometry \
  ros-$ROS_DISTRO-camera-info-manager \
  ros-$ROS_DISTRO-image-transport \
  ros-$ROS_DISTRO-image-publisher \
  ros-$ROS_DISTRO-libuvc-ros \
  ros-$ROS_DISTRO-ddynamic-reconfigure

Configure CAN Interfaces

Connect the CAN modules, then list their USB bus information:

bash piper_ws/scripts/find_all_can_port.sh

Edit piper_ws/scripts/can_config.sh:

Set EXPECTED_CAN_COUNT to the number of connected CAN modules.
Update the USB_PORTS mapping so each bus-info value maps to the intended CAN interface name and bitrate.
The default two-module mapping uses can_left:1000000 and can_right:1000000.

After each machine startup, run the CAN configuration script once to rename and bring up the CAN interfaces:

bash piper_ws/scripts/can_config.sh

Create Python Environments

Create the main environment used by ROS control, collection, replay, and inference:

conda create -y -n piperaio python=3.10
conda activate piperaio

pip install python-can piper_sdk
pip install empy==3.3.4 rospkg catkin_pkg numpy==1.26.4 h5py opencv-python matplotlib
conda install -y libffi==3.3

If you plan to convert collected data to the LeRobot dataset format, create a separate conda environment for LeRobot (dataset v2.1, commit 2b71789):

conda create -y -n lerobot python=3.10
conda activate lerobot

conda install -y ffmpeg -c conda-forge

cd lerobot
pip install -e .
pip install datasets==3.6.0
cd ..

Note: Keep the datasets package version consistent between conversion and training. Version mismatches can make converted datasets unreadable.

Build the Camera Workspace

The camera workspace is based on realsense-ros and should be built from source. Install RealSense SDK 2.0 using the official guide, then build the workspace:

sudo apt install -y ros-$ROS_DISTRO-realsense2-camera
exec $SHELL

conda activate piperaio

cd camera_ws/src
catkin_init_workspace
cd ..
catkin_make \
  -DPYTHON_EXECUTABLE=$HOME/miniconda3/envs/piperaio/bin/python \
  -DCATKIN_ENABLE_TESTING=False \
  -DCMAKE_BUILD_TYPE=Release
cd ..

Set PYTHON_EXECUTABLE to the Python executable in the piperaio conda environment if your conda path differs.

List connected camera serial numbers:

bash piper_ws/scripts/rs_camera_serial.sh

Add the serial numbers to lines 2-4 of camera_ws/src/realsense-ros/realsense2_camera/launch/multi_camera.launch according to their physical positions.

Build the Piper Workspace

conda activate piperaio

cd piper_ws/src
catkin_init_workspace
cd ..
catkin_make -DPYTHON_EXECUTABLE=$HOME/miniconda3/envs/piperaio/bin/python
cd ..

Source the Piper workspace in your shell startup file:

# Bash
echo "source $(pwd)/piper_ws/devel/setup.bash" >> ~/.bashrc

# Zsh
echo "source $(pwd)/piper_ws/devel/setup.zsh" >> ~/.zshrc

🚀 Usage

Run each long-running command in a separate terminal, and make sure the piperaio environment is active where needed.

Start Cameras

./start_cam.sh

Use rqt_image_view to inspect the streams from the three cameras.

Start Arms

Use mode 0 for leader-follower teleoperation during data collection:

./start_mode0.sh

Use mode 1 when controlling the follower arms during replay or inference, and wait until both follower arms report Enable Status: True:

./start_mode1.sh

Collect Data

Power on all four arms.
Start cameras with ./start_cam.sh.
Start arms in mode 0 with ./start_mode0.sh.
Start the collection script:

conda activate piperaio

python collect_data/collect_data.py --dataset_dir ~/data --task_name test

Keyboard controls:

ENTER: start recording an episode.
SPACE: stop the current recording.
s: save the current episode.
q: discard the current episode.
Ctrl+C: exit the collection script.

Episodes are saved to {dataset_dir}/{task_name}/episode_{episode_idx}.hdf5.

When --episode_idx is omitted or set to 0, the script automatically uses the next available episode index in the save directory.

Note: the system may occasionally drop frames and report syn fail, so avoid collecting data too quickly, especially during complex motions. Also check USB bandwidth and verify all camera and arm topics are publishing at the expected rate.

Visualize Data

conda activate piperaio

python collect_data/visualize_episodes.py --dataset_dir ~/data --task_name test --episode_idx 0

The visualization script plays and saves camera videos, joint plots, and end-effector pose plots for the selected episode.

Replay Data

Power on only the two follower arms.
Start cameras with ./start_cam.sh.
Start arms in mode 1 with ./start_mode1.sh.
Replay an episode:

conda activate piperaio

python collect_data/replay_data.py --dataset_dir ~/data --task_name test --episode_idx 0 --replay_mode joint

replay_mode can be either joint or eef. The script reads the recorded data and sends control commands to the control topics.

Convert Data to LeRobot

First inspect collected episodes for outliers:

conda activate lerobot

python convert_data/dataset_filter.py --root_dir ~/data/test --std_threshold 5.0

--root_dir should point to {dataset_dir}/{task_name}. The script analyzes observations/qpos and action, then reports files with values outside the standard-deviation threshold configured by --std_threshold.

Before conversion, edit convert_data/convert_to_lerobot.py:

Set INSTRUCTION in line 16 to the language instruction used for training.
Add any rejected HDF5 episode paths to exclude_files in line 175.

Convert one or more task directories to a LeRobot v2.1 dataset using the feature schema in convert_data/features.py:

conda activate lerobot

python convert_data/convert_to_lerobot.py \
  --src_path ~/data \
  --tgt_path ~/lerobot \
  --repo_ids test \
  --save_repoid test \
  --cut_head \
  --cut_tail

Useful options:

--src_path: root directory containing collected task folders, which corresponds to {dataset_dir}.
--tgt_path: output root for the converted LeRobot dataset.
--repo_ids: one or more task names to convert, which correspond to {task_name}.
--save_repoid: output dataset name. Defaults to the first --repo_ids value.
--cut_head / --no-cut_head: trim or keep still frames at the start.
--cut_tail / --no-cut_tail: trim or keep still frames at the end.

Run Policy Inference

Policy inference uses task settings from inference/task_configs.yaml and client code from inference/clients.py. Refer to Configuration Reference on how to add new tasks and clients.

We provide two inference runners:

Synchronous inference in inference/infer_sync.py: standard practice for action-chunking policy. The robot executes one action chunk and requests the next chunk only after the current chunk has been consumed. During policy inference, the robot controller pauses and resumes only when the new actions arrive.
Asynchronous inference in inference/infer_async.py: initiates inference for the next chunk before the current chunk is fully executed. Once the next inference request is triggered, the robot continues executing the remaining actions in the ongoing chunk. Before the final action is completed, the newly predicted chunk is expected to be available, enabling seamless execution without halting. Useful for real-time methods such as Training-time RTC and FASTER.

Refer to our FASTER paper for a clear understanding of the sync and async inference pipelines.

Start the robot in follower-control mode:

./start_cam.sh
./start_mode1.sh

Start your policy server on the same machine or on a LAN-connected workstation. For remote inference, prefer wired LAN to reduce latency and packet loss.

Synchronous Inference

conda activate piperaio

python inference/infer_sync.py \
  --task towel \
  --model openpi \
  --host 127.0.0.1 \
  --port 8000 \
  --ctrl_type joint

Useful options:

--task: task key from inference/task_configs.yaml.
--model: policy client type. Currently openpi is supported.
--host: policy server host.
--port: policy server port.
--ctrl_type: joint or eef.
--chunk_size: number of actions expected per inference call, default 50. If it is set to be smaller than the action chunk size generated from policy, then only the first chunk_size actions will be executed.
--save_rollout: save rollout observations and executed actions to HDF5.
--save_dir: directory for rollout HDF5 files when --save_rollout is set.

Asynchronous Inference

conda activate piperaio

python inference/infer_async.py \
  --task towel \
  --model openpi \
  --host 127.0.0.1 \
  --port 8000 \
  --ctrl_type joint \
  --mode rtc \
  --delay 4 \
  --exec_horizon 25

Additional async options:

--mode: async inference mode. Use rtc for RTC-style inference (using action prefix), or naive for the naive asynchronous mode (such as SmolVLA).
--delay: inference delay $d:= \lfloor \Delta t_\text{infer}/\Delta t_{\text{ctrl}}\rfloor$, determined by the inference latency $\Delta t_\text{infer}$ and control period $\Delta t_{\text{ctrl}}$. We recommend setting it one step larger to account for variation in inference time and transmission latency.
--exec_horizon: execution horizon $s$ for the action chunk. The client executes only the first $s$ valid actions (excluding delayed ones), then triggers a new inference request. This value should be larger than or equal to $d$.
--streaming: supports the Streaming Client-Server Interface proposed in FASTER.

During inference, press SPACE to enter interactive mode:

c: continue.
r: reset to the initial arm positions and restart.
q: save the current rollout if enabled, then quit.

🧩 Configuration Reference

Task Configs

Each top-level task in inference/task_configs.yaml must define:

towel:
  language_instruction: "Fold the towel."
  left0: [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.1] # or *default_left0
  right0: [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.1] # or *default_right0
  action_postprocess:
    left_gripper:
      - when: below
        threshold: 0.02
        set: 0.0
    right_gripper:
      - when: below
        threshold: 0.02
        set: 0.0

Required fields:

language_instruction: instruction sent to the policy. Keep this consistent with the instruction used during dataset conversion and training.
left0 and right0: initial joint positions for the follower arms.

Optional field:

action_postprocess: gripper post-processing rules for model outputs.
- when: below or above.
- threshold: condition threshold.
- set: replacement value when the condition is true.
- add: offset added when the condition is true.

Policy Clients

The default OpenpiClient in inference/clients.py uses the openpi WebSocket client. To add a new client:

Implement a client class in inference/clients.py.
Match the public methods used by the inference scripts:

predict_action(payload)
warmup()

Import and select the new client in inference/infer_sync.py or inference/infer_async.py. Also add the new client name to the --model choices.
Install any dependencies in the piperaio environment.

📊 Dataset Format

Collected episodes are HDF5 files with the following default keys:

/observations/images/cam_high
/observations/images/cam_left_wrist
/observations/images/cam_right_wrist
/observations/qpos
/observations/qvel
/observations/effort
/observations/eef_pose
/action
/collect

Shape conventions:

RGB images: (T, 480, 640, 3), uint8.
Joint state qpos (6-DoF + gripper), velocity qvel, effort effort, end-effector pose eef_pose (xyz + rpy + gripper), and action arrays: (T, 14), representing left arm 7 dimensions followed by right arm 7 dimensions.
/collect: per-frame string label, for example teleop or rollout.

🙏 Acknowledgements

We thank the following repositories for their references and prior work:

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
camera_ws/src/realsense-ros		camera_ws/src/realsense-ros
collect_data		collect_data
convert_data		convert_data
inference		inference
lerobot @ 2b71789		lerobot @ 2b71789
piper_ws		piper_ws
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
ros_operator.py		ros_operator.py
start_cam.sh		start_cam.sh
start_mode0.sh		start_mode0.sh
start_mode1.sh		start_mode1.sh

Folders and files

Latest commit

History

Repository files navigation

Piper All In One

📰 News

📋 Contents

🗂️ Repository Layout

⚙️ Setup

Prerequisites

Hardware

Software

Installation

Clone the Repository

Install System Packages

Configure CAN Interfaces

Create Python Environments

Build the Camera Workspace

Build the Piper Workspace

🚀 Usage

Start Cameras

Start Arms

Collect Data

Visualize Data

Replay Data

Convert Data to LeRobot

Run Policy Inference

Synchronous Inference

Asynchronous Inference

🧩 Configuration Reference

Task Configs

Policy Clients

📊 Dataset Format

🙏 Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages