I find that a better perspective to reinforcement learning is to treat it as an experiment. For instance, the Mujoco physical environment doesn’t have a detailed descriptions to go with it. I need to figure out exactly what does every observation mean. The clues are the gym file and mujoco xml file.
[Read More]