You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
RL-Boy Reinforcement Kid is a small-sized open-source robot project and a full-stack open-source scaled-down humanoid robot project. Centered around running end-to-end reinforcement learning algorithms, it has 22 degrees of freedom, including waist, hands, and head. Among small-sized robots, it fully possesses the ability to run end-to-end reinforcement learning algorithms and imitation learning algorithms. Meanwhile, due to having upper limbs, it can also run open-source software packages related to VLA operations. Reinforcement Kid has a 1-degree-of-freedom head and an OLED screen, so it can display expressions well, and combined with AI models, it can implement dialogue and interactive actions.
Features of the Reinforcement Kid:
Ultra-small size and high degree of freedom: The enhanced small robot has 22 degrees of freedom, with a rotatable waist and head, but is only 55 cm tall.
End-to-end network drive: The reinforcement agent can implement the RL deep reinforcement learning algorithm, while the upper limb can implement operations by running the VLA model, thus realizing the development of the cerebellum-cerebrum fusion algorithm.
Open source of whole-link power builder: In addition to open-sourcing robot mechanical design and BOM diagram materials, the robot's upper and lower computer software, as well as the onnx model deployment framework, have been open-sourced. The model deployment framework is fully synchronized with isacc and sim2sim scripts.
Provide remote control equipment: Provide servo-based dual-arm remote control software that can meet the data acquisition requirements for controlling and supporting the upper limbs and waist of the robot.
Basic Performance of the Reinforcement Kid:
Degree of Freedom
Control System
Reinforcement Learning
Embodied Operation
Facial Interaction
Remote Control Operation
1 head,1 at the waist,Single Arm 4 + 1,Gripper5 reps per leg
Compatible with Raspberry Pi, Odroid, Coolpi, Chips such as RDK Sweet Potato
Open-source reinforcement learning algorithms support standing, stable walking, and accurate URDF, enabling Zero-Shot transfer
Can run upper limb operation algorithms and open-source typical small models such as ACT and DP
The robot's face uses ESP32, which can run the AI Xiaozhi software to implement facial expressions and voice Agent conversations
The robot's face uses ESP32, which can run the AI Xiaozhi software to implement facial expressions and voice Agent conversations
The BOM of the Reinforcement Kid is mainly composed of 3D printing materials and carbon fiber sheets, which facilitates DIY and processing:
HMI Host Computer Configuration for STM32 Board: Connect the Core and Extcan circuit boards via USB, complete the motor reverse checkbox and motor type selection according to the configuration diagram, and each motor completes CAN_ID configuration through its own Dami host computer:
Left Leg (Type: STM32-Core CAN1)
Brushless Motor
Type
CAN ID Number
Deflection
6006
1
Side Exhibition
8006
2
Thigh
8006
3
Calf
8006
4
Sole
6006
5
Waist
6006
6
Head Heading
3507
7
Right Leg (Type: STM32-Core CAN2)
Brushless Motor
Type
CAN ID Number
Deflection
6006
1
Side Exhibition
8006
2
Thigh
8006
3
Calf
8006
4
Sole
6006
5
Left Arm (Type: STM32-Core CAN1)
Brushless Motor
Type
CAN ID Number
Motor 1
3507
1
Motor 2
3507
2
Motor 3
3507
3
Motor 4
3507
4
Right Arm (Type: STM32-Core CAN2)
Brushless Motor
Type
CAN ID Number
Motor 1
3507
1
Motor 2
3507
2
Motor 3
3507
3
Motor 4
3507
4
Configure the core STM32 of the HMI interface as follows:
Configure the core ExtCan of the HMI interface as follows:
3.Update STM32 firmware:
2025/7/14
[extcan.hex]
4.Update the firmware and parameters of the ODroid control software package: Update the software of the Odroid internal system via WinSCP
5.Modify rc.local to ensure the correct operation of the self-starting script: