Policy Evaluation Guide

We provide both LeRobot policy formats and custom policy formats.

Using LeRobot Policies

Evaluate trained LeRobot models (ACT, Diffusion Policy, SmolVLA):

python -m scripts.eval \
    --policy_type lerobot \
    --policy_path outputs/train/act \
    --garment_type "top_long" \
    --dataset_root Datasets/example/top_long \
    --num_episodes 5 \
    --enable_cameras \
    --device cpu

💡 Tip: Using --device cpu ensures the simulator runs on CPU to avoid GUI conflicts in some environments, while the actual policy (LeRobot or Custom) will still be loaded to CUDA for fast inference. --enable_cameras is required to see camera views in the GUI or record videos.

Requirements:

pretrained_model directory with config files
Training dataset metadata (for --dataset_root)
For VLA models: add --task_description "fold the garment on the table"

Creating Custom Policies (Without using lerobot)

Three Simple Steps

1. Create Policy Class

# my_policy.py
import numpy as np
from typing import Dict
from scripts.eval_policy.base_policy import BasePolicy
from scripts.eval_policy.registry import PolicyRegistry

@PolicyRegistry.register("my_policy")
class MyPolicy(BasePolicy):
    def __init__(self, model_path=None, device="cuda", **kwargs):
        super().__init__(**kwargs)
        self.device = device
        # Load your model here
        
    def reset(self):
        """Called at the start of each episode."""
        pass
    
    def select_action(self, observation: Dict[str, np.ndarray]) -> np.ndarray:
        """
        Args:
            observation: Dict with keys like:
                - "observation.state": (N,) float32 - joint angles
                - "observation.images.top": (H,W,3) uint8 - top camera
                - "observation.images.wrist_left/right": (H,W,3) uint8
        
        Returns:
            action: (action_dim,) float32 - joint angle commands
                Single-arm: (6,), Dual-arm: (12,)
                Dim order: [shoulder_pan, shoulder_lift, elbow_flex, wrist_flex, wrist_roll, gripper]
        """
        # Your inference logic
        action = self.model.predict(observation)
        return action.astype(np.float32)

2. Register Policy

Just add @PolicyRegistry.register("my_policy") decorator (already shown above).

3. Import in init.py

# scripts/eval_policy/__init__.py
from .my_policy import MyPolicy

Evaluate Your Policy

python -m scripts.eval \
    --policy_type custom \
    --policy_path "path/to/your/model" \
    --garment_type "top_long" \
    --num_episodes 5 \
    --enable_cameras \
    --device cpu

Testing Single Garments: To evaluate a specific garment instead of a whole category, add the garment name to Assets/objects/Challenge_Garment/Release/Release_test_list.txt and run the script with --garment_type custom.

Policy Requirements

Your custom policy must:

✅ Inherit from BasePolicy
✅ Implement select_action(observation: Dict) -> np.ndarray
✅ Return actions as float32 numpy array
✅ Handle action dimensions: (12) dual-arm

Optional:

Implement reset() to clear temporal buffers

Tips

Processing images:

import torch
image = observation["observation.images.top"]  # (H,W,3) uint8
image_tensor = torch.from_numpy(image).permute(2,0,1) / 255.0  # (C,H,W) [0,1]

Handling temporal information:

def __init__(self, **kwargs):
    super().__init__(**kwargs)
    self.obs_buffer = []

def reset(self):
    self.obs_buffer.clear()

def select_action(self, observation):
    self.obs_buffer.append(observation)
    # Use buffer for RNN/LSTM inference

Troubleshooting

Issue	Solution
Policy not found	Check decorator `@PolicyRegistry.register("name")` and import in `__init__.py`
Action dimension error	Verify action shape: (6) for single-arm, (12) for dual-arm
LeRobot fails to load	Ensure `--policy_path` points to `pretrained_model` dir and `--dataset_root` is correct

Reference Files

example_participant_policy.py - Complete implementation examples
base_policy.py - Interface definition
registry.py - Registration system

For more help, contact us LeHome Challenge.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Policy Evaluation Guide

Using LeRobot Policies

Creating Custom Policies (Without using lerobot)

Three Simple Steps

Evaluate Your Policy

Policy Requirements

Tips

Troubleshooting

Reference Files

FilesExpand file tree

policy_eval.md

Latest commit

History

policy_eval.md

File metadata and controls

Policy Evaluation Guide

Using LeRobot Policies

Creating Custom Policies (Without using lerobot)

Three Simple Steps

Evaluate Your Policy

Policy Requirements

Tips

Troubleshooting

Reference Files