
Data auto-generation part using a simple state machine #127

Open
Papaercold wants to merge 47 commits into LightwheelAI:main from Papaercold:auto-data-generation

Conversation


@Papaercold Papaercold commented Jan 28, 2026

Summary

This PR introduces a state-machine–based data generation pipeline for the SO101 pick-orange task in LeIsaac.

The main implementation lives under scripts/environments/state_machine, where a scripted finite state machine is used to generate deterministic pick-and-place demonstrations. To integrate this pipeline with the existing teleoperation and replay infrastructure, a new teleoperation device type named so101_state_machine is added, with corresponding changes under source/leisaac.

The implementation intentionally follows the coding style and structure of
scripts/environments/teleoperation/teleop_se3_agent.py, and includes detailed comments for readability and maintainability.


Key Features

  • State-machine–based data generation

    • Implemented under scripts/environments/state_machine
    • Designed to be deterministic, readable, and easy to extend
  • New teleoperation device: so101_state_machine

    • Integrated into the existing teleoperation interface
    • Allows scripted policies to drive the environment using the same action pipeline as keyboard/gamepad devices
  • Replay compatibility

    • Recorded datasets can be replayed using the existing
      scripts/environments/teleoperation/replay.py
    • No custom replay logic is required

Usage Examples

Generate data

```shell
python scripts/environments/state_machine/pick_orange.py \
  --dataset_file=./datasets/dataset_test.hdf5 \
  --task=LeIsaac-SO101-PickOrange-v0 \
  --num_envs=1 \
  --device=cuda \
  --enable_cameras \
  --record \
  --num_demos=1
```

Replay recorded data

```shell
python scripts/environments/teleoperation/replay.py \
  --dataset_file=./datasets/dataset_test.hdf5 \
  --task=LeIsaac-SO101-PickOrange-v0 \
  --num_envs=1 \
  --device=cuda \
  --enable_cameras \
  --select_episodes 1 \
  --replay_mode=action \
  --task_type=so101_state_machine
```

Observed Issue / Open Question

When running the state-machine–based data generation script, the robot gripper exhibits a brief downward drop at the beginning of each episode before stabilizing.

At first glance, this behavior appears to be gravity-related. However, gravity is explicitly disabled at spawn time for all relevant teleoperation devices, including the newly introduced state-machine device:

```python
def use_teleop_device(self, teleop_device) -> None:
    self.task_type = teleop_device
    if teleop_device in ["keyboard", "gamepad", "so101_state_machine"]:
        self.scene.robot.spawn.rigid_props.disable_gravity = True
```

Given this configuration, it is unclear whether the observed initial drop is truly caused by gravity. Other potential factors may include controller or drive initialization behavior, insufficient drive stiffness during the first few simulation steps, or the absence of an explicit action warm-start when the episode begins.
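If the missing warm-start turns out to be the cause, one common mitigation is to command the measured spawn configuration for the first few steps before handing control to the state machine. Below is a minimal sketch of the idea; the function name and the use of joint-position actions are assumptions for illustration, not the LeIsaac API:

```python
import numpy as np

def warm_start_actions(initial_joint_pos, num_warmup_steps=10):
    """Build a short action queue that repeats the measured spawn configuration,
    so the position drives hold the initial pose instead of sagging toward a
    default target during the first few simulation steps."""
    pose = np.asarray(initial_joint_pos, dtype=np.float32)
    return np.tile(pose, (num_warmup_steps, 1))

# Example: hold a 6-DoF arm at its spawn configuration for 10 steps.
queue = warm_start_actions([0.0, -1.57, 1.57, 0.0, 0.0, 0.0])
```

Feeding these actions through the normal action pipeline before the first scripted state would make the transient observable (or not) independently of gravity settings.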

I would appreciate feedback on the following questions:

  • Is this type of initial transient expected when introducing a scripted state machine as a teleoperation device?
  • Could this behavior be related to controller initialization, drive parameter settings, or action preprocessing rather than gravity itself?
  • Are there recommended best practices in Isaac Lab / LeIsaac for avoiding such initial transients when using scripted or non-interactive teleoperation devices?

Any insights, suggestions, or references to similar patterns in existing tasks would be greatly appreciated.


Notes

  • This PR focuses on clarity, determinism, and minimal intrusion into existing teleoperation and replay logic.
  • The issue described above does not affect replay correctness once the episode is running, but it does impact the visual behavior at the very beginning of an episode.

@EverNorif
Collaborator

@Papaercold Thank you for your PR. Due to a busy schedule recently, I may not be able to review the code right away; I'll check it later.

Regarding the brief drop of the gripper that you mentioned, I haven’t observed this behavior so far. I’ll check for it again using the latest code.

@EverNorif EverNorif linked an issue Jan 28, 2026 that may be closed by this pull request
@EverNorif EverNorif self-requested a review January 28, 2026 07:54
@Papaercold
Author

@EverNorif
I have refactored the state machine implementation.

The state machine code has been moved to:

source/leisaac/leisaac/state_machine

The scripts for automatic data generation using the state machine are now located in:

scripts/environments/state_machine

In addition, I have fixed the issue where the robot arm would drop at the beginning due to gravity (i.e., it was not properly applying control effort during initialization).

Going forward, all task-specific state machines will be rewritten based on the StateMachineBase abstract class to ensure a consistent and extensible structure.

To make the review process easier, I can also provide a brief documentation file explaining the automatic data generation workflow and the structure of the state machine.

Please let me know if you would like me to add that documentation as part of this PR.

@Papaercold
Author

In addition, I would appreciate it if you could let me know your next plans.

Since the current structure has been refactored to be more modular and extensible, beyond the existing automatic pick-orange task, I can also add more tasks if needed, or integrate an RL module based on new requirements.

Thank you very much for your time and review.

Zihan Gao

@Papaercold
Author

I have now added STATE_MACHINE_README.md to the repository, including both English and Simplified Chinese versions.
Please feel free to refer to it during the review process.
Thank you very much for your help.

@Papaercold
Author

The fold_cloth state machine is still under development.

At the moment, it is very difficult to complete the cloth-folding task using the current state machine.

I believe there are still some issues with the collision configuration. In some cases, the gripper penetrates the cloth mesh, which prevents it from being properly grasped and lifted.

@EverNorif
Collaborator

@Papaercold Thank you! I will review it next week.

@Papaercold
Author

I’ve started working on the reinforcement learning module based on the rsl_rl library, but it is still under development.

For now, please ignore the files under the rl folder as well as the new rl_config settings during your review.

The directory structure and corresponding file descriptions are also documented in the Markdown file.

If you prefer, I can submit a separate PR for the RL part after you finish reviewing the state machine implementation.

@EverNorif
Collaborator

@Papaercold Yes, I think it would be better to keep the PR focused on a single feature. You can save your progress locally or in another branch first. This PR should only track the state machine–related functionality.

@Papaercold
Author

> @Papaercold Yes, I think it would be better to keep the PR focused on a single feature. You can save your progress locally or in another branch first. This PR should only track the state machine–related functionality.

Sure, I’ll make the changes now. I’ll keep this PR focused on the state machine functionality.

@Papaercold
Author

Papaercold commented Mar 3, 2026

@EverNorif The modifications have been completed. In this commit, the RL module has been fully removed. I have also removed fold_cloth.py, as its current performance is not yet satisfactory.

The remaining code consists only of the components that I have tested and verified to be functioning correctly. You may refer to the Markdown documentation for guidance during your review.


@EverNorif EverNorif left a comment


I tested this PR locally and it runs successfully.

There are still some issues with the structure and implementation that need to be adjusted.

In terms of structure, I think this organization would be better:

```
scripts/datagen/state_machine/
-- generate.py   # Unified runner script (or any other name you think is appropriate):
                 # runs the state machine for the given task, rather than being limited to a specific one
-- replay.py     # Replay script for state-machine demonstrations

source/leisaac/leisaac/datagen/state_machine/
-- base.py          # StateMachineBase abstract class
-- pick_orange.py   # PickOrangeStateMachine
-- ...              # other state machine implementations
```
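Under that layout, base.py could define an interface along these lines; the method names and the toy subclass are illustrative assumptions, not the actual LeIsaac contract:

```python
from abc import ABC, abstractmethod

class StateMachineBase(ABC):
    """Minimal interface a task-specific state machine might implement."""

    @abstractmethod
    def reset(self) -> None:
        """Reset internal state at the start of an episode."""

    @abstractmethod
    def compute_action(self, obs):
        """Return the next action given the current observation."""

    @property
    @abstractmethod
    def done(self) -> bool:
        """Whether the scripted demonstration has finished."""

class CountdownStateMachine(StateMachineBase):
    """Toy subclass for illustration: emits a zero action for a fixed number of steps."""

    def __init__(self, steps: int = 3):
        self._steps = steps
        self._t = 0

    def reset(self) -> None:
        self._t = 0

    def compute_action(self, obs):
        self._t += 1
        return [0.0]

    @property
    def done(self) -> bool:
        return self._t >= self._steps
```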

Collaborator

It is recommended to use a fixed version of IsaacLab rather than the latest one.

Collaborator

At the moment, it seems that each file corresponds to a single task. I think it would be better to refactor this into a more general entry point, generate.py, which takes different tasks as input and generates the corresponding actions.

In generate.py, we would only call the relevant interfaces from StateMachineBase, while the specific implementations would be placed in separate task-specific state machine modules.
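A minimal sketch of how such a unified entry point could dispatch to task-specific state machines; the registry, task IDs, and names here are assumptions for illustration:

```python
class PickOrangeStateMachine:
    """Stand-in for the real task-specific implementation."""

# Hypothetical registry in generate.py: task ID -> state machine class.
STATE_MACHINES = {
    "LeIsaac-SO101-PickOrange-v0": PickOrangeStateMachine,
}

def make_state_machine(task_name: str):
    """Instantiate the state machine registered for a task, with a clear error
    when the task has no scripted policy."""
    try:
        cls = STATE_MACHINES[task_name]
    except KeyError:
        raise ValueError(
            f"No state machine registered for task '{task_name}'. "
            f"Available tasks: {sorted(STATE_MACHINES)}"
        ) from None
    return cls()
```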

"""State machine implementations for LeIsaac tasks."""

from .base import StateMachineBase
from .fold_cloth import FoldClothStateMachine
Collaborator

This line should also be removed.

```python
# body_pos_w is always valid after env.reset() and does not suffer from stale sensor data.
if self._orange_now == 1 and step == 0:
    self._initial_ee_pos = env.scene["robot"].data.body_pos_w[:, -1, :].clone()
    # Or set the drive params directly (depending on your robot API)
```
Collaborator

Translate it to English.

Collaborator

There’s no need to provide the launch script here, it can simply be documented instead.

Collaborator

There’s no need to provide the launch script here, it can simply be documented instead.

Collaborator

Later, this document can be moved to the docs folder and included as part of the website content.


**Important:** `demo_0` is always empty. The **K-th recorded demonstration** is stored as `demo_K`.
Collaborator

Why is demo_0 always empty here? The teleop script doesn’t seem to have this issue. As far as I remember, if no data is recorded, reset() does not log any data.


| `--select_episodes N` | Episode loaded | Content |
|---|---|---|
| 0 | `demo_0` | Empty (no actions) — causes `TypeError` |
Collaborator

You can provide more detailed error messages in the code.
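For example, the replay path could validate the selected episode up front and raise an actionable error instead of the bare `TypeError`; the function and parameter names below are illustrative, not the existing replay.py API:

```python
def validate_episode_actions(actions, episode_name: str):
    """Fail fast with an actionable message when a selected episode is empty."""
    if actions is None or len(actions) == 0:
        raise ValueError(
            f"Episode '{episode_name}' contains no actions. "
            "Note that demo_0 is always empty; the K-th recorded demonstration "
            "is stored as demo_K, so pass --select_episodes with a value >= 1."
        )
    return actions
```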

@Papaercold
Author

@EverNorif Thanks for the review. I’ll make the requested changes shortly and follow up here if anything comes up.



Development

Successfully merging this pull request may close these issues.

[Feature Request] Automatic Data Generation Pipeline

2 participants