跳转到内容

RoboCasa 评测

RoboCasa 是一个大规模家庭环境仿真基准。这里我们使用的是 GR1 Tabletop Tasks 子集,包含 24 个桌面 Pick-and-Place 任务,使用 Fourier GR1 人形机器人的上半身(双臂)进行操作。

本文档提供复现我们实验结果的操作指南。

评测流程主要包含两部分:

  1. 配置 robocasa 环境与依赖。
  2. 分别在 starVLArobocasa 环境中启动服务并运行评测。

我们已在 NVIDIA A100 上验证该流程可稳定运行。


任务GR00T-N1.6StarVLA-GR00T-Qwen3StarVLA-π-Qwen3StarVLA-OFT-Qwen3StarVLA-FAST-Qwen3
PnP Bottle To Cabinet Close51.546.026.030.038.0
PnP Can To Drawer Close13.080.062.076.044.0
PnP Cup To Drawer Close8.554.042.044.056.0
PnP Milk To Microwave Close14.048.050.044.044.0
PnP Potato To Microwave Close41.528.042.032.014.0
PnP Wine To Cabinet Close16.546.032.036.014.0
PnP Novel From Cuttingboard To Basket58.048.040.050.054.0
PnP Novel From Cuttingboard To Cardboardbox46.540.046.040.042.0
PnP Novel From Cuttingboard To Pan68.568.060.070.058.0
PnP Novel From Cuttingboard To Pot65.052.040.054.058.0
PnP Novel From Cuttingboard To Tieredbasket46.556.044.038.040.0
PnP Novel From Placemat To Basket58.542.044.032.036.0
PnP Novel From Placemat To Bowl57.544.052.058.038.0
PnP Novel From Placemat To Plate63.048.050.052.042.0
PnP Novel From Placemat To Tieredshelf28.518.028.024.018.0
PnP Novel From Plate To Bowl57.060.052.060.052.0
PnP Novel From Plate To Cardboardbox43.550.040.050.030.0
PnP Novel From Plate To Pan51.054.036.066.048.0
PnP Novel From Plate To Plate78.770.048.068.050.0
PnP Novel From Tray To Cardboardbox51.538.034.044.028.0
PnP Novel From Tray To Plate71.056.064.056.034.0
PnP Novel From Tray To Pot64.550.044.062.046.0
PnP Novel From Tray To Tieredbasket57.036.050.054.036.0
PnP Novel From Tray To Tieredshelf31.516.028.030.016.0
平均47.647.843.948.839.0

注:所有数值均为成功率百分比。所有 24 个任务使用单一模型训练。结果基于每个任务 50 次 rollout。


首先下载检查点:

请先参考 官方 RoboCasa 安装指南 安装基础 robocasa-gr1-tabletop-tasks 环境。

然后安装 socket 支持:

Terminal window
pip install tyro

Step 1. 启动服务端(starVLA 环境)

Section titled “Step 1. 启动服务端(starVLA 环境)”

在第一个终端中,激活 starVLA conda 环境并运行:

Terminal window
python deployment/model_server/server_policy.py \
--ckpt_path ${your_ckpt} \
--port 5678 \
--use_bf16

在第二个终端中,激活 robocasa conda 环境并运行:

Terminal window
export PYTHONPATH=$(pwd):${PYTHONPATH}
your_ckpt=StarVLA/Qwen3-VL-OFT-Robocasa/checkpoints/steps_90000_pytorch_model.pt
python examples/Robocasa_tabletop/eval_files/simulation_env.py\
--args.env_name ${env_name} \
--args.port 5678 \
--args.n_episodes 50 \
--args.n_envs 1 \
--args.max_episode_steps 720 \
--args.n_action_steps 12 \
--args.video_out_path ${video_out_path} \
--args.pretrained_path ${your_ckpt}

如果你有更多 GPU,可以使用批量评测脚本:

Terminal window
bash examples/Robocasa_tabletop/batch_eval_args.sh

注意:请确保在 batch_eval_args.sh 中填写了正确的 checkpoint 路径。


HuggingFace 下载 PhysicalAI-Robotics-GR00T-X-Embodiment-Sim 目录数据集到 playground/Datasets/nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim 目录。

如需仅下载相关的微调文件夹,可参考 GR00T-N1.5 仓库的说明。

或使用脚本下载 *_1000 文件夹:

Terminal window
python examples/Robocasa_tabletop/download_gr00t_ft_data.py

可通过修改参数 data_mix 选择不同的数据集,使用以下脚本对 *_1000 数据集进行微调:

Terminal window
bash examples/Robocasa_tabletop/train_files/run_robocasa.sh