Cloned

Description

This dataset was generated by training an imitation policy on the expert and human demonstrations, running that policy, and mixing the resulting data with the demonstrations at a 50-50 ratio. The dataset is provided by D4RL. The environment used to collect it is AdroitHandHammer-v1.
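The 50-50 mixing step can be pictured with a small sketch. This is only an illustration under stated assumptions, not the D4RL generation code: the description does not say whether the ratio applies to episodes or timesteps, or how items are sampled, so the hypothetical helper `mix_half_and_half` simply draws an equal number of items from each source with replacement.

```python
import random


def mix_half_and_half(rollouts, demonstrations, total, seed=0):
    """Draw `total` items, half from each source, and shuffle them.

    Assumption: item-level sampling with replacement. The actual
    D4RL procedure may differ.
    """
    rng = random.Random(seed)
    half = total // 2
    mixed = [("policy", item) for item in rng.choices(rollouts, k=half)]
    mixed += [("demo", item) for item in rng.choices(demonstrations, k=total - half)]
    rng.shuffle(mixed)  # interleave the two sources
    return mixed
```

Each returned item is tagged with its source, which makes it easy to check the ratio after mixing.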

Dataset Specs


Total Timesteps	`1000000`
Total Episodes	`3606`
Flatten Observations	`False`
Flatten Actions	`False`
Algorithm	`None`
Author	`Rodrigo de Lazcano`
Email	`rperezvicente@farama.org`
Code Permalink	`https://github.com/rodrigodelazcano/d4rl-minari-dataset-generation`
Download	`minari.download_dataset("hammer-cloned-v0")`
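After downloading, the dataset can be loaded and inspected. The following is a minimal sketch, assuming a Minari version in which `minari.load_dataset` and `dataset.iterate_episodes()` are available and each episode exposes a `rewards` array. The helper names `summarize` and `main` are hypothetical and not part of Minari.

```python
from statistics import mean


def summarize(dataset):
    """Return episode count, step count, and mean episode return."""
    lengths, returns = [], []
    for episode in dataset.iterate_episodes():
        lengths.append(len(episode.rewards))
        returns.append(float(sum(episode.rewards)))
    return {
        "episodes": len(lengths),
        "steps": sum(lengths),
        "mean_return": mean(returns),
    }


def main():
    # Imported here so `summarize` can be used without Minari installed.
    import minari

    # Dataset ID taken from the table above.
    minari.download_dataset("hammer-cloned-v0")
    dataset = minari.load_dataset("hammer-cloned-v0")
    print(summarize(dataset))
```

Calling `main()` requires network access for the download. The reported episode and step counts should be comparable to the totals listed in the table.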

Environment Specs


ID	`AdroitHandHammer-v1`
Action Space	`Box(-1.0, 1.0, (26,), float32)`
Observation Space	`Box(-inf, inf, (46,), float64)`
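The action bounds above matter when feeding a learned policy's outputs back into the environment. As a minimal sketch in plain Python, the hypothetical helper `clip_action` below checks that an action has 26 components and clips each one to [-1.0, 1.0]. The constants are copied from the table, and the helper itself is not part of Gymnasium or Minari.

```python
ACTION_DIM = 26                       # from Box(-1.0, 1.0, (26,), float32)
ACTION_LOW, ACTION_HIGH = -1.0, 1.0
OBSERVATION_DIM = 46                  # from Box(-inf, inf, (46,), float64)


def clip_action(action):
    """Validate the action length and clip each component to the bounds."""
    if len(action) != ACTION_DIM:
        raise ValueError(f"expected {ACTION_DIM} components, got {len(action)}")
    return [min(ACTION_HIGH, max(ACTION_LOW, float(a))) for a in action]
```

The observation space is unbounded, so only its length (46) can be checked in the same way.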