The success rate varies significantly when using different seeds during training.

I trained on the metaworld_bin-picking task with two different seeds, seed0 and seed1, using 10 sample datasets, and observed a large gap in success rate. seed1 reached a maximum success rate of 0.7, while seed0 stayed well below that throughout training, peaking at 0.4. What could be causing this discrepancy?
<img width="458" height="300" alt="Image" src="https://github.com/user-attachments/assets/1f993b75-b34f-4e90-8ab1-45dda3c12812" />
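
To put a number on the gap, here is a minimal sketch (not part of the training code) that summarizes the peak success rate per seed. The dictionary layout and the `summarize_peaks` name are hypothetical; only the 0.4 and 0.7 values come from the runs described above.

```python
from statistics import mean, stdev

# Peak success rates reported above; the dict layout is an assumption.
peak_success = {"seed0": 0.4, "seed1": 0.7}

def summarize_peaks(peaks):
    """Return (mean, sample standard deviation) of per-seed peak success rates."""
    values = list(peaks.values())
    if len(values) < 2:
        raise ValueError("need at least two seeds to estimate spread")
    return mean(values), stdev(values)

avg, spread = summarize_peaks(peak_success)
print(f"mean peak success: {avg:.2f}, std across seeds: {spread:.2f}")
```

With only two seeds the spread estimate is very rough; adding more seeds to the dictionary would make it more informative.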

The success rate varies significantly when using different seeds during training. #166
