Cloned#

Description#

Data obtained by training an imitation policy on the demonstrations from expert and human, then running the policy, and mixing data at a 50-50 ratio with the demonstrations. This dataset is provided by D4RL. The environment used to collect the dataset is AdroitHandRelocate-v1.

Dataset Specs#


Total Timesteps	`1000000`
Total Episodes	`3758`
Dataset Observation Space	`Box(-inf, inf, (39,), float64)`
Dataset Action Space	`Box(-1.0, 1.0, (30,), float32)`
Algorithm	`None`
Author	`Rodrigo de Lazcano`
Email	`rperezvicente@farama.org`
Code Permalink	`https://github.com/rodrigodelazcano/d4rl-minari-dataset-generation`
Minari Version	`~=0.4`
download	`minari.download_dataset("relocate-cloned-v1")`

Environment Specs#


ID	`AdroitHandRelocate-v1`
Action Space	`Box(-1.0, 1.0, (30,), float32)`
Observation Space	`Box(-inf, inf, (39,), float64)`