Cloned#

Description#
Data obtained by training an imitation policy on the demonstrations from expert
and human
, then running the policy, and mixing data at a 50-50 ratio with the demonstrations. This dataset is provided by D4RL. The environment used to collect the dataset is AdroitHandRelocate-v1
.
Dataset Specs#
Total Timesteps |
|
Total Episodes |
|
Dataset Observation Space |
|
Dataset Action Space |
|
Algorithm |
|
Author |
|
|
|
Code Permalink |
|
Minari Version |
|
download |
|
Environment Specs#
ID |
|
Action Space |
|
Observation Space |
|