Training Schematic

The primary objective of this study is to demonstrate the value of tactile sensing during the soft-capture phase of grasping. To isolate its effect, we train two otherwise identical agents that differ in a single respect and compare their training outcomes. One agent is equipped with tactile sensors, so the normal contact force applied to the robotic gripper is part of its state information; the other agent has no access to this feature.

Training Schematic

Results

Reward and SuccessRate Comparison

TACTILE Sensor Feedback Random Episode

WITH_TACTILE

Here is a sample episode: Sample Episode With Tactile

NO TACTILE Sensor Feedback Random Episode

WITHOUT_TACTILE

Here is a sample episode: Sample Episode Without Tactile

Observation Space

The state representation for each agent combines several parameters: the pose and velocity of the gripper, the pose and velocity of the target, their respective differences, and the minimum distance between them in each direction, all defined in the inertial frame. This yields a 39-dimensional state space for the agent without access to contact force data. The other agent's state space is 40-dimensional: it adds one dimension for the cumulative normal contact force exerted on the gripper.

The observation space of the robotic arm environment is therefore Box(-inf, inf, (39,), float32) without tactile feedback or Box(-inf, inf, (40,), float32) with tactile feedback. Each variable describes a distinct attribute of the position, orientation, or velocity of the robotic gripper and its target.

The table below lists each variable in the observation space together with its limits and unit of measurement. Every variable is continuous and unbounded, ranging from negative infinity to positive infinity.

Observation Space
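As a rough illustration of how the two observation sizes can arise, the sketch below concatenates the components listed above into a flat vector. The layout is an assumption, not taken from the repository: each pose and velocity is treated as 6-D (three linear and three angular components), the minimum distance as 3-D, and the contact force as one scalar, which gives 12 + 12 + 12 + 3 = 39 entries, or 40 with the force. The function name `build_observation` is hypothetical.

```python
import numpy as np


def build_observation(gripper_pose, gripper_vel, target_pose, target_vel,
                      min_dist, contact_force=None):
    """Concatenate state components into one flat float32 vector.

    Assumed layout (not confirmed by the source): poses and velocities
    are 6-D, min_dist is 3-D, contact_force is a scalar or None.
    """
    gripper_pose = np.asarray(gripper_pose, dtype=np.float32)
    gripper_vel = np.asarray(gripper_vel, dtype=np.float32)
    target_pose = np.asarray(target_pose, dtype=np.float32)
    target_vel = np.asarray(target_vel, dtype=np.float32)
    min_dist = np.asarray(min_dist, dtype=np.float32)

    parts = [
        gripper_pose, gripper_vel,      # 6 + 6
        target_pose, target_vel,        # 6 + 6
        target_pose - gripper_pose,     # 6, naive element-wise pose difference
        target_vel - gripper_vel,       # 6, velocity difference
        min_dist,                       # 3
    ]
    if contact_force is not None:
        # Tactile agent only: append the cumulative normal contact force.
        parts.append(np.array([contact_force], dtype=np.float32))
    return np.concatenate(parts).astype(np.float32)


zeros6 = np.zeros(6)
obs_without = build_observation(zeros6, zeros6, zeros6, zeros6, np.zeros(3))
obs_with = build_observation(zeros6, zeros6, zeros6, zeros6, np.zeros(3),
                             contact_force=2.5)
print(obs_without.shape, obs_with.shape)
```

The element-wise pose difference is a simplification; a real implementation would likely treat the orientation error more carefully.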

Action Space

The action space is a Box(-1.0, 1.0, (6,), float32) that encodes the position and orientation command for the Robotiq 3-finger (3F) gripper acting as the end-effector. Control actions move the gripper's base in six degrees of freedom (6-DoF): three translational (linear) and three rotational (angular) motions, which the robotic manipulator executes through inverse kinematics. For compatibility, control action inputs are scaled to the range -1 to 1. The elements of the action array are as follows:

Action Space
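To make the scaling concrete, here is a minimal sketch of how a normalized action in [-1, 1] might be mapped to a physical command for the gripper base. The limits `LINEAR_LIMIT` and `ANGULAR_LIMIT` and the function `scale_action` are hypothetical placeholders; the values actually used by the environment are not stated here.

```python
import numpy as np

# Hypothetical physical limits, chosen only for illustration.
LINEAR_LIMIT = 0.01   # metres, applied to the 3 translational components
ANGULAR_LIMIT = 0.05  # radians, applied to the 3 rotational components


def scale_action(action):
    """Map a normalized 6-D action in [-1, 1] to physical units."""
    action = np.asarray(action, dtype=np.float32)
    if action.shape != (6,):
        raise ValueError("expected a 6-D action: 3 linear + 3 angular")
    # Clip first so out-of-range policy outputs cannot exceed the limits.
    action = np.clip(action, -1.0, 1.0)
    scale = np.array([LINEAR_LIMIT] * 3 + [ANGULAR_LIMIT] * 3,
                     dtype=np.float32)
    return action * scale


command = scale_action([1.0, -1.0, 0.0, 2.0, 0.0, -2.0])
print(command)
```

Keeping the policy output normalized and applying the physical limits inside the environment is a common choice, since most continuous-control algorithms assume a symmetric, unit-scale action range.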

Rewards

In this work, we introduce a novel reward function that combines dense and sparse rewards to address the challenge of precisely approaching a moving, floating object in order to grasp it. The dense component teaches the agent how to approach the target while maintaining its position and orientation. The sparse component then guides the agent to hold an optimal posture, keep a safe distance between the gripper and the target, and ultimately avoid contact with the target.
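The exact reward terms are not given in this section, so the following is only a sketch of the general dense-plus-sparse pattern described above. Every name, threshold, and weight (`combined_reward`, `pos_tol`, `ori_tol`, `bonus`, `contact_penalty`) is a hypothetical choice for illustration and should not be read as the reward used in this work.

```python
def combined_reward(pos_error, ori_error, contact_force,
                    pos_tol=0.05, ori_tol=0.1,
                    bonus=1.0, contact_penalty=1.0):
    """Illustrative dense + sparse reward (all constants hypothetical).

    pos_error:     distance between gripper and desired approach pose
    ori_error:     orientation misalignment, as a non-negative scalar
    contact_force: normal contact force measured on the gripper
    """
    # Dense part: a smooth signal that grows as the errors shrink.
    dense = -(pos_error + ori_error)

    # Sparse part: discrete events only.
    sparse = 0.0
    if pos_error < pos_tol and ori_error < ori_tol:
        sparse += bonus            # posture and distance within tolerance
    if contact_force > 0.0:
        sparse -= contact_penalty  # discourage touching the target

    return dense + sparse


aligned = combined_reward(0.01, 0.02, 0.0)
colliding = combined_reward(1.0, 0.5, 2.0)
print(aligned, colliding)
```

The dense term gives a learning signal at every step, while the sparse term only fires when a condition of interest is met, which is the usual motivation for mixing the two.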

Episode End

The episode is truncated when its duration reaches max_episode_steps, which defaults to 500 timesteps. The episode is never terminated, because the task is a continuing one with an infinite horizon.
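The distinction between truncation and termination can be sketched with a plain loop. This is not the environment's code: `run_episode` and `step_fn` are hypothetical stand-ins, and only the default of 500 steps comes from the text above.

```python
def run_episode(step_fn, max_episode_steps=500):
    """Run until the step limit; the task itself never terminates."""
    steps = 0
    terminated = False  # stays False: continuing task, infinite horizon
    truncated = False
    while not (terminated or truncated):
        step_fn(steps)  # advance the (hypothetical) environment one step
        steps += 1
        truncated = steps >= max_episode_steps
    return steps, terminated, truncated


result = run_episode(lambda t: None)
print(result)
```

Because the episode ends by truncation rather than termination, a learning algorithm should typically still bootstrap from the value of the final state instead of treating it as terminal.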

baha2r/Soft_Capture_Tactile
