UnderspecifiedEnv
The UnderspecifiedEnv class defines a UPOMDP, and acts similarly to (but not identically to) a Gymnax environment.
The UnderspecifiedEnv class has the following interface
params = env.default_paramsaction_space = env.action_space(params)obs, state = env.reset_to_level(rng, level, params)obs, state, reward, done, info = env.step(rng, state, action, params)
Every environment must implement only the following methods
step_env: Perform a step of the environmentreset_env_to_level: Reset the environment to a particular levelaction_space: Return the action space of the environment
The environment also does not automatically reset to a new level once the environment has restarted.
Look at the AutoReplay wrapper if this is desired.