Cloned

Description

This dataset was collected by training an imitation policy on the expert and human demonstrations, running that policy, and mixing the resulting data with the demonstrations at a 50-50 ratio. The dataset is provided by D4RL. The environment used to collect it is AdroitHandHammer-v1.
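The 50-50 mixing step can be illustrated with a minimal sketch. The description does not say whether the ratio is counted in episodes or in timesteps, nor how a size mismatch between the two sources is handled, so this sketch assumes episode-level mixing and truncates to the smaller source. The function name `mix_half_and_half` and its parameters are hypothetical and are not part of D4RL or Minari.

```python
import random


def mix_half_and_half(policy_episodes, demo_episodes, seed=0):
    """Return a shuffled list holding equal numbers of episodes from each source.

    Assumption: the 50-50 ratio is counted in episodes, and the larger
    source is subsampled to the size of the smaller one.
    """
    rng = random.Random(seed)
    n = min(len(policy_episodes), len(demo_episodes))
    # Draw n episodes from each source without replacement.
    mixed = rng.sample(policy_episodes, n) + rng.sample(demo_episodes, n)
    # Shuffle so the two sources are interleaved.
    rng.shuffle(mixed)
    return mixed
```

Subsampling is only one option; duplicating the smaller source to match the larger one would also give a 50-50 split while keeping more data.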

Dataset Specs

Total Timesteps: 1000000

Total Episodes: 3606

Flatten Observations: False

Flatten Actions: False

Algorithm: None

Author: Rodrigo de Lazcano

Email: rperezvicente@farama.org

Code Permalink: https://github.com/rodrigodelazcano/d4rl-minari-dataset-generation

Download: minari.download_dataset("hammer-cloned-v0")
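As a rough sketch of how the download call above might be used, the following downloads the dataset, loads it, and summarizes its size. It assumes that `minari` is installed and that the installed release still serves the ID `hammer-cloned-v0`. The helper names `load_hammer_cloned` and `summarize` are hypothetical. `minari` is imported lazily so that `summarize` can be used without it.

```python
def load_hammer_cloned(dataset_id="hammer-cloned-v0"):
    """Download the dataset (network access required) and load it from local storage."""
    import minari  # third-party package, imported only when this function runs

    minari.download_dataset(dataset_id)
    return minari.load_dataset(dataset_id)


def summarize(dataset):
    """Collect basic size statistics from an object exposing
    `total_episodes` and `total_steps`, as a Minari dataset does."""
    return {
        "episodes": dataset.total_episodes,
        "steps": dataset.total_steps,
        "mean_episode_length": dataset.total_steps / dataset.total_episodes,
    }
```

Calling `summarize(load_hammer_cloned())` should report the totals listed above. From those figures, 1000000 timesteps over 3606 episodes works out to roughly 277 steps per episode.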

Environment Specs

ID: AdroitHandHammer-v1

Action Space: Box(-1.0, 1.0, (26,), float32)

Observation Space: Box(-inf, inf, (46,), float64)
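The two spaces above can be read as shape and bound constraints: an action is a 26-element vector with every entry in [-1, 1], and an observation is a 46-element vector with unbounded entries. The sketch below checks those constraints in plain Python. The helper names are hypothetical, the dtype (float32 or float64) is not checked, and rejecting NaN values is an assumption about what counts as a valid observation.

```python
import math

ACTION_DIM = 26       # from Box(-1.0, 1.0, (26,), float32)
OBSERVATION_DIM = 46  # from Box(-inf, inf, (46,), float64)


def is_valid_action(action):
    """True if `action` has 26 entries, each within [-1.0, 1.0]."""
    return len(action) == ACTION_DIM and all(-1.0 <= a <= 1.0 for a in action)


def is_valid_observation(observation):
    """True if `observation` has 46 entries and none of them is NaN."""
    return len(observation) == OBSERVATION_DIM and not any(
        math.isnan(x) for x in observation
    )
```

A check like this can be useful when feeding stored transitions into a learner that expects fixed-size inputs, since Flatten Observations and Flatten Actions are both False for this dataset.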