Skip to content

Latest commit

 

History

History
53 lines (44 loc) · 2.8 KB

File metadata and controls

53 lines (44 loc) · 2.8 KB

Evaluation

When/After adapting to the target robot action space through post-training, you can keep tracking the performance of the checkpoints at a specified interval.

bash eval.sh

Important arguments:

  • --checkpoints-dir: Path to the saved checkpoints.
  • --experiment: Configuration name.
  • --save-dir: Path to save the generated videos.
  • --num-frames: Length of the ground truth video, including the first condition frame.
  • --num-samples: Number of the samples to evaluate on.
  • --dataset-path: Path to the evaluation datasets. You can also concatenate multiple evaluation sets here by spliting them with commas.
  • --data-split: Which subset for evaluation. (full: sample from 100%; train: the first 95% of each concatenated eval set; test: the last 5% of each concatenated eval set).
  • --deterministic-uniform-sampling: If enabled, sample from the concatenated datasets uniformly instead of by their lengths.
  • --checkpoint-interval: Interval of the checkpoints to evaluate.
  • --infinite: Continuously check for new checkpoints at the desired interval to track performance during training.

To reproduce our evaluation samples:

  • In-lab Eval:
    --num-frames 49 \
    --num-samples 100 \
    --dataset-path "datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/In-lab_Eval/gr1_unified.pnp_handover_plate_robot,datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/In-lab_Eval/gr1_unified.pnp_cucumber_robot,datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/In-lab_Eval/gr1_unified.pnp_corn_robot,datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/In-lab_Eval/gr1_unified.pnp_dragonfruit_robot,datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/In-lab_Eval/gr1_unified.pour_items_into_basket_robot,datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/In-lab_Eval/gr1_unified.right_to_left_handover_corn_robot,datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/In-lab_Eval/gr1_unified.mug_robot,datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/In-lab_Eval/gr1_unified.fold_cloth_long_robot,datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/In-lab_Eval/gr1_unified.fold_cloth_robot,datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/In-lab_Eval/gr1_unified.color_puzzles_robot" \
    --data-split full \
    --deterministic-uniform-sampling \
    
  • EgoDex Eval:
    --num-frames 49 \
    --num-samples 100 \
    --dataset-path "datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/EgoDex_Eval" \
    --data-split full \
    --deterministic-uniform-sampling \
    
  • DreamDojo-HV Eval:
    --num-frames 49 \
    --num-samples 100 \
    --dataset-path "datasets/PhysicalAI-Robotics-GR00T-Teleop-GR1/DreamDojo-HV_Eval" \
    --data-split full \
    --deterministic-uniform-sampling \
    

<= Previous: [DreamDojo Post-Training]

=> Next: [Trouble Shooting]