Generic

Generic gym environment specifically tailored to work with the Jiminy Simulator as backend physics engine, and the Jiminy Viewer as 3D visualizer. It implements the official OpenAI Gym API and extends it with additional functionality.

class gym_jiminy.common.envs.generic.BaseJiminyEnv(simulator, step_dt, enforce_bounded_spaces=False, debug=False, render_mode=None, **kwargs)[source]

Bases: InterfaceJiminyEnv[ObsT, ActT], Generic[ObsT, ActT]

Base class for training an agent in OpenAI Gym, using the Jiminy simulator for physics computations.

It creates a Gym environment wrapping an already instantiated Jiminy simulator and behaves like any standard Gym environment.

The observation space is a dictionary gathering the current simulation time, the real robot state, and the sensor data. The action is a vector gathering the torques of the actuators of the robot.

There is no reward by default. It is up to the user to overload this class to implement one. It has been designed to be highly flexible and easy to customize by overloading it to fit the vast majority of users’ needs.

Parameters:
  • simulator (Simulator) – Jiminy Python simulator used for physics computations. It must be fully initialized.

  • step_dt (float) – Simulation timestep for learning. Note that it is independent of the controller and observation update periods. The latter are configured via engine.set_options.

  • enforce_bounded_spaces (bool) – Whether to enforce finite bounds for the observation and action spaces. If so, ‘*_MAX’ placeholder bounds are used whenever necessary. Note that these bounds are deliberately wide to make sure they are suitable for the vast majority of systems.

  • debug (bool) – Whether debug mode must be enabled. Doing so enables telemetry recording.

  • render_mode (str | None) – Desired rendering mode, i.e. “human” or “rgb_array”. If “human” is specified, calling render will open a graphical window for visualization; otherwise an RGB image is returned as a 3D numpy array whose first dimension corresponds to the three red, green and blue channels, and whose last two dimensions are the pixel height and width respectively. None to automatically select the most appropriate mode based on the user-specified rendering backend if any, or the machine environment. Note that “rgb_array” does not require a graphical window manager. Optional: None by default.

  • kwargs (Any) – Extra keyword arguments that may be useful for derived environments with multiple inheritance, and to allow automatic pipeline wrapper generation.

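As an illustration, here is a minimal sketch of wrapping a fully initialized simulator in this environment. The model and hardware file paths are placeholders, and the exact signature of Simulator.build should be checked against the installed jiminy_py version:

    from jiminy_py.simulator import Simulator
    from gym_jiminy.common.envs.generic import BaseJiminyEnv

    # Build a fully initialized Jiminy simulator beforehand (placeholder paths).
    simulator = Simulator.build(
        "my_robot.urdf", hardware_path="my_robot_hardware.toml", has_freeflyer=False)

    # Wrap it in a generic Gym environment stepping the dynamics every 10ms.
    env = BaseJiminyEnv(simulator, step_dt=0.01)
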
simulator: Simulator
render_mode: str | None = None
stepper_state: jiminy.StepperState
is_simulation_running: npt.NDArray[np.bool_]
robot: jiminy.Robot
robot_state: jiminy.RobotState
sensor_measurements: SensorMeasurementStackMap
property np_random: Generator

Returns the environment’s internal _np_random generator, initialising it with a random seed if it is not already set.

Returns:

Instance of np.random.Generator

observation: ObsT
action: ActT
quantities: QuantityManager
_get_time_space()[source]

Get time space.

Return type:

Box

_get_agent_state_space(use_theoretical_model=False)[source]

Get state space.

This method is not meant to be overloaded in general since the definition of the state space is mostly consensual. One must rather overload _initialize_observation_space to customize the observation space as a whole.

Parameters:

use_theoretical_model (bool)

Return type:

Dict

_get_measurements_space()[source]

Get sensor space.

It gathers the sensor data in a dictionary, mapping each available sensor type to the associated data matrix. Rows correspond to the sensor type’s fields, and columns correspond to each individual sensor.

Return type:

Dict

_initialize_action_space()[source]

Configure the action space of the environment.

The action is a vector gathering the torques of the actuators of the robot.

Warning

This method is called internally by reset method. It is not meant to be overloaded since the actual action space of the robot is uniquely defined.

Return type:

None

_initialize_seed(seed=None)[source]

Specify the seed of the environment.

Note

This method is not meant to be called manually.

Warning

It also resets the low-level jiminy Engine. Therefore one must call the reset method afterward.

Parameters:

seed (int | None) – Random seed, as a positive integer. Optional: A strongly random seed will be generated by gym if omitted.

Returns:

Updated seed of the environment

Return type:

None

register_variable(name, value, fieldnames=None, namespace=None)[source]

Register variable to the telemetry.

Warning

Variables are registered by reference. Consequently, the user is responsible for managing the lifetime of the data to prevent it from being garbage collected.

See also

See gym_jiminy.common.utils.register_variables for details.

Parameters:
  • name (str) – Base name of the variable. It will be used to prepend fields, using ‘.’ delimiter.

  • value (Mapping[str, StructNested[ValueT]] | Iterable[StructNested[ValueT]] | ndarray) – Variable to register. It supports any nested data structure whose leaves have type np.ndarray and either dtype np.float64 or np.int64.

  • fieldnames (str | Mapping[str, StructNested[ValueT]] | Iterable[StructNested[ValueT]] | None) – Nested fieldnames with the exact same data structure as the variable to register ‘value’. Each individual element of a leaf array must have its own fieldname, all gathered in a nested tuple with the same shape as the array. Optional: Generic fieldnames will be generated automatically.

  • namespace (str | None) – Namespace used to prepend the base name ‘name’, using ‘.’ delimiter. Empty string to disable. Optional: Disabled by default.

Return type:

None
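
For instance, a user-managed buffer can be registered as follows. This is a hedged sketch: the variable name, fieldnames and namespace are arbitrary, and registration would typically be done in _setup, before the simulation starts:

    import numpy as np

    # Pre-allocated buffer that must be kept alive by the caller, since it is
    # registered by reference rather than copied.
    foot_forces = np.zeros(2)
    env.register_variable(
        "foot_forces", foot_forces, fieldnames=("left", "right"),
        namespace="monitoring")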

property step_dt: float

Get timestep of a single ‘step’.

property is_training: bool

Check whether the environment is in ‘train’ or ‘eval’ mode.

train()[source]

Sets the environment in training mode.

Return type:

None

eval()[source]

Sets the environment in evaluation mode.

This only has an effect on certain environments. It can be used for instance to enable clipping or filtering of the action at evaluation time specifically. See documentations of a given environment for details about their behaviors in training and evaluation modes.

Return type:

None

reset(*, seed=None, options=None)[source]

Reset the environment.

In practice, it resets the backend simulator and sets the initial state of the robot. The initial state is obtained by calling ‘_sample_state’. This method is also in charge of setting the initial action (at the beginning) and observation (at the end).

Warning

It starts the simulation immediately. As a result, it is no longer possible to change the robot (including its options), nor to register log variables.

Parameters:
  • seed (int | None) – Random seed, as a positive integer. Optional: A strongly random seed will be generated by gym if omitted.

  • options (Dict[str, Any] | None) – Additional information to specify how the environment is reset. The field ‘reset_hook’ is reserved for chaining multiple BasePipelineWrapper. It is not meant to be defined manually. Optional: None by default.

Returns:

Initial observation of the episode and some auxiliary information for debugging or monitoring purpose.

Return type:

Tuple[Mapping[str, StructNested[ValueT]] | Iterable[StructNested[ValueT]] | ndarray, Dict[str, Any]]

close()[source]

Clean up the environment after the user has finished using it.

It terminates the Python Jiminy engine.

Warning

Calling reset or step afterward is an undefined behavior.

Return type:

None

step(action)[source]

Integrate the environment’s dynamics under the prescribed agent’s action over a single timestep, i.e. collect one transition step of the underlying Markov Decision Process of the learning problem.

Warning

When reaching the end of an episode (terminated or truncated), it is necessary to reset the environment to start a new episode before being able to step once again.

Parameters:

action (ActT) – Action performed by the agent during the ongoing step.

Returns:

  • observation (ObsType) - The next observation of the environment as a result of taking the agent’s action.

  • reward (float): the reward associated with the transition step.

  • terminated (bool): Whether the agent has reached a terminal state, which means success or failure depending on the MDP of the task.

  • truncated (bool): Whether some truncation condition outside the scope of the MDP is satisfied. This can be used to end an episode prematurely before a terminal state is reached, for instance if the agent’s state is going out of bounds.

  • info (dict): Contains auxiliary information that may be helpful for debugging and monitoring. This might, for instance, contain: metrics that describe the agent’s performance state, variables that are hidden from observations, or individual reward terms from which the total reward is derived.

Return type:

Tuple[Mapping[str, StructNested[ValueT]] | Iterable[StructNested[ValueT]] | ndarray, SupportsFloat, bool, bool, Dict[str, Any]]
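
For reference, a standard Gym rollout over this API looks as follows, assuming env is a BaseJiminyEnv instance and using random actions as a stand-in policy:

    # Collect a single episode with randomly sampled actions.
    obs, info = env.reset(seed=0)
    while True:
        action = env.action_space.sample()
        obs, reward, terminated, truncated, info = env.step(action)
        if terminated or truncated:
            break
    env.close()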

render()[source]

Render the agent in its environment.

Changed in version: This method does not take input arguments anymore due to changes of the official gym.Wrapper API. A workaround is to set simulator.viewer_kwargs beforehand. Alternatively, it is possible to call the low-level implementation simulator.render directly to avoid API restrictions.

Returns:

RGB array if ‘render_mode’ is ‘rgb_array’, None otherwise.

Return type:

RenderFrame | List[RenderFrame] | None

plot(enable_block_states=False, **kwargs)[source]

Display common simulation data and action over time.

Parameters:
  • enable_block_states (bool) – Whether to display the internal state of all blocks.

  • kwargs (Any) – Extra keyword arguments to forward to simulator.plot.

Return type:

TabbedFigure

replay(**kwargs)[source]

Replay the current episode until now.

Parameters:

kwargs (Any) – Extra keyword arguments for delegation to replay.play_trajectories method.

Return type:

None

play_interactive(enable_travelling=None, start_paused=True, enable_is_done=True, verbose=True, **kwargs)[source]

Activate interactive mode, enabling the user to control the robot using the keyboard.

It stops automatically as soon as terminated or truncated is True. One has to press a key to start the interaction. If no key is pressed, the action is not updated and the previous one keeps being sent to the robot.

Warning

It ignores any external gym.Wrapper that may be used for training but is not considered part of the environment pipeline.

Warning

This method requires the _key_to_action method to be implemented by the user by overloading it; otherwise it raises an exception.

Parameters:
  • enable_travelling (bool | None) – Whether to enable travelling, following the motion of the root frame of the model. This parameter is ignored if the model has no freeflyer. Optional: Enabled by default iff the ‘panda3d’ viewer backend is used.

  • start_paused (bool) – Whether to start in pause. Optional: Enabled by default.

  • verbose (bool) – Whether to display status messages.

  • kwargs (Any) – Extra keyword arguments to forward to _key_to_action method.

  • enable_is_done (bool)

Return type:

None

evaluate(policy_fn, seed=None, horizon=None, enable_stats=True, enable_replay=None, **kwargs)[source]

Evaluate a policy on the environment over a complete episode.

Warning

It ignores any external gym.Wrapper that may be used for training but is not considered part of the environment pipeline.

Parameters:
  • policy_fn (Callable[[Mapping[str, StructNested[ValueT]] | Iterable[StructNested[ValueT]] | ndarray, float | None, bool, Dict[str, Any]], ActT]) – Policy to evaluate as a callback function. It must have the following signature (reward = None at reset):

    policy_fn(obs: DataNested,
              reward: Optional[float],
              done_or_truncated: bool,
              info: InfoType
              ) -> ActT  # action

  • seed (int | None) – Seed of the environment to be used for the evaluation of the policy. Optional: Random seed if not provided.

  • horizon (int | None) – Horizon of the simulation, namely maximum number of steps before termination. None to disable. Optional: Disabled by default.

  • enable_stats (bool) – Whether to print high-level statistics after the simulation. Optional: Enabled by default.

  • enable_replay (bool | None) – Whether to enable replay of the simulation, and possibly video recording if the extra keyword argument record_video_path is provided. Optional: Enabled by default if a display is available, disabled otherwise.

  • kwargs (Any) – Extra keyword arguments to forward to the replay method if replay is requested.

Return type:

List[Dict[str, Any]]
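
As a hedged sketch, a trivial policy callback matching this signature and its evaluation could look as follows (a Box action space is assumed so that a zero action is well-defined):

    import numpy as np

    def zero_policy(obs, reward, done_or_truncated, info):
        # Ignore the observation and always return a zero action.
        return np.zeros(env.action_space.shape)

    log_vars = env.evaluate(
        zero_policy, seed=0, horizon=1000, enable_stats=True, enable_replay=False)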

_setup()[source]

Configure the environment. It must guarantee that its internal state is valid after calling this method.

By default, it enforces some options of the engine.

Warning

Beware this method is called BEFORE observe_dt and controller_dt are properly set, so one cannot rely on them at this point. Yet, step_dt is available and always will be. One can still access the low-level controller update period through engine_options[‘stepper’][‘controllerUpdatePeriod’].

Note

The user must overload this method to enforce a custom observer update period, otherwise it will be the same as the controller’s.

Note

This method is called internally by reset methods.

Return type:

None
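
For instance, a derived environment may overload _setup to enforce a custom observer update period. This is only a sketch: the accessors for the engine options (here self.simulator.engine.get_options / set_options) and the ‘sensorsUpdatePeriod’ option name are assumptions to be checked against the installed jiminy version:

    from gym_jiminy.common.envs.generic import BaseJiminyEnv

    class MyEnv(BaseJiminyEnv):
        def _setup(self) -> None:
            super()._setup()
            # Assumption: the engine options are reachable through the simulator.
            engine_options = self.simulator.engine.get_options()
            # Refresh the observation every 10ms instead of once per controller update.
            engine_options["stepper"]["sensorsUpdatePeriod"] = 0.01
            self.simulator.engine.set_options(engine_options)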

_initialize_observation_space()[source]

Configure the observation of the environment.

By default, the observation is a dictionary gathering the current simulation time, the real agent state, and the sensor data.

Note

This method is called internally by the reset method at the very end, just before computing and returning the initial observation. This method, alongside refresh_observation, must be overloaded in order to define a custom observation space.

Return type:

None
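
As a hedged sketch, a custom observation space restricted to a flat vector could be defined as follows. The dimension and bounds are arbitrary, and gymnasium is assumed to be the installed Gym implementation (adapt the import if the legacy gym package is used instead):

    import numpy as np
    import gymnasium as gym
    from gym_jiminy.common.envs.generic import BaseJiminyEnv

    class MyEnv(BaseJiminyEnv):
        def _initialize_observation_space(self) -> None:
            # Hypothetical 6D observation: base position and linear velocity.
            self.observation_space = gym.spaces.Box(
                low=-np.inf, high=np.inf, shape=(6,), dtype=np.float64)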

_neutral()[source]

Returns a neutral valid configuration for the agent.

The default implementation returns the neutral configuration if valid, the “mean” configuration otherwise (right in the middle of the position lower and upper bounds).

Warning

Beware there is no guarantee for this configuration to be statically stable.

Note

This method is called internally by ‘_sample_state’ to generate the initial state. It can be overloaded to ensure static stability of the configuration.

Return type:

ndarray

_sample_state()[source]

Returns a randomized yet valid configuration and velocity for the robot.

The default implementation returns the neutral configuration and zero velocity.

Offsets are applied to the freeflyer to ensure that no contact point goes through the ground and that up to three of them are in contact.

Note

This method is called internally by reset to generate the initial state. It can be overloaded to act as a random state generator.

Return type:

Tuple[ndarray, ndarray]
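
A derived environment could overload it as a random state generator, for instance as sketched below. This sketch assumes the configuration vector has no unit-norm constraints such as a freeflyer quaternion, in which case normalization would be required:

    import numpy as np
    from gym_jiminy.common.envs.generic import BaseJiminyEnv

    class MyEnv(BaseJiminyEnv):
        def _sample_state(self):
            # Perturb the neutral configuration using the seeded generator.
            qpos = self._neutral()
            qpos += 0.1 * self.np_random.standard_normal(qpos.shape)
            qvel = np.zeros(self.robot.pinocchio_model.nv)
            return qpos, qvel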

_initialize_buffers()[source]

Initialize internal buffers for fast access to shared memory or to avoid redundant computations.

Note

This method is called at every reset, right after self.simulator.start. At this point, the simulation is running but refresh_observation has never been called, so that it can be used to initialize buffers involving the engine state but required to refresh the observation.

Note

Buffers requiring manual update must be refreshed using _refresh_buffers method.

Warning

This method is not appropriate for initializing buffers involved in compute_command. For now, there is no better way than taking advantage of the flag self.is_simulation_running in the compute_command method itself.

Return type:

None

_refresh_buffers()[source]

Refresh internal buffers that must be updated manually.

Note

This method is called after every internal engine.step and before refreshing the observation one last time. As such, it is the right place to update shared data between has_terminated and compute_reward. However, it is not appropriate for quantities involved in refresh_observation nor compute_command, which may be called more than once per step.

Note

The _initialize_buffers method can be used to initialize buffers that may require special care.

Warning

Be careful when using this method to update buffers involved in refresh_observation. The latter is called at self.observe_dt update period, while this method is called at self.step_dt update period. self.observe_dt is likely to be different from self.step_dt, unless configured manually when overloading _setup method.

Return type:

None

refresh_observation(measurement)[source]

Compute the observation based on the current state of the robot.

In practice, it updates the internal buffer directly for the sake of efficiency.

By default, it sets the observation to the value of the measurement, which would not work unless ObsT corresponds to EngineObsType.

Note

This method is called at the end of every low-level Engine.step.

Warning

This method may be called without any simulation running, either to perform basic consistency checks or to allocate and initialize buffers. There is currently no way to distinguish the initialization stage specifically. A workaround consists in checking whether the simulation has already started. It is not exactly the same, but it does the job while preserving efficiency.

Warning

One must only rely on measurement to get the state of the robot, as anything else is not reliable for this purpose. More specifically, self.robot_state would not be valid if an adaptive stepper is being used for physics integration.

Parameters:

measurement (EngineObsType)

Return type:

None
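
As a hedged sketch, a custom observation matching the 6D Box space sketched for _initialize_observation_space could be refreshed as follows. The nested keys (‘states’, ‘agent’, ‘q’, ‘v’) are assumptions about the EngineObsType layout, and self.observation is assumed to be a pre-allocated (6,) array:

    from gym_jiminy.common.envs.generic import BaseJiminyEnv

    class MyEnv(BaseJiminyEnv):
        def refresh_observation(self, measurement) -> None:
            # Copy the base position and linear velocity into the pre-allocated
            # observation buffer, relying only on 'measurement' as advised above.
            agent_state = measurement["states"]["agent"]
            self.observation[:3] = agent_state["q"][:3]
            self.observation[3:] = agent_state["v"][:3]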

compute_command(action, command)[source]

Compute the motors efforts to apply on the robot.

By default, all it does is forward the input action as-is, without any further processing. One is responsible for overloading this method if the action space has been customized, or simply to clip the action to make sure it never goes out of bounds if necessary.

Warning

There is no good place to initialize buffers that are necessary to compute the command. The only solution for now is to perform the initialization inside this method itself, using the safeguard if not self.is_simulation_running:.

Parameters:
  • action (ActT) – High-level target to achieve by means of the command.

  • command (ndarray)

Return type:

None
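
As a sketch, a derived environment could clip the action to the action space bounds before forwarding it, using the safeguard mentioned above for one-time buffer initialization. A Box action space is assumed, and the extra buffer is purely hypothetical:

    import numpy as np
    from gym_jiminy.common.envs.generic import BaseJiminyEnv

    class MyEnv(BaseJiminyEnv):
        def compute_command(self, action, command) -> None:
            # One-time allocation of command-related buffers (hypothetical).
            if not self.is_simulation_running:
                self._last_command = np.zeros_like(command)
            # Clip the action to the action space bounds and write the result
            # in place into the output 'command' buffer.
            np.clip(action, self.action_space.low,
                    self.action_space.high, out=command)
            self._last_command[:] = command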

has_terminated()[source]

Determine whether the episode is over, because a terminal state of the underlying MDP has been reached or an aborting condition outside the scope of the MDP has been triggered.

By default, it always returns terminated=False, and truncated=True iff the observation is out-of-bounds. One can overload this method to implement custom termination conditions for the environment at hand.

Warning

No matter what, truncation will happen when reaching the maximum simulation duration, i.e. ‘self.simulation_duration_max’. Its default value is extremely large, but it can be overwritten by the user to terminate the simulation earlier.

Note

This method is called after refresh_observation, so that the internal buffer ‘observation’ is up-to-date.

Returns:

terminated and truncated flags.

Return type:

Tuple[bool, bool]
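
A derived environment could add a custom termination condition on top of the default out-of-bounds truncation, for instance as sketched below (the sketch assumes a robot with a freeflyer whose base height is the third configuration coordinate, and an arbitrary threshold):

    from gym_jiminy.common.envs.generic import BaseJiminyEnv

    class MyEnv(BaseJiminyEnv):
        def has_terminated(self):
            terminated, truncated = super().has_terminated()
            # Hypothetical fall detection: terminate if the base drops too low.
            if self.robot_state.q[2] < 0.3:
                terminated = True
            return terminated, truncated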

_key_to_action(key, obs, reward, **kwargs)[source]

Mapping from input keyboard keys to actions.

Note

This method is systematically called before the step method, even if no key has been pressed or the reward is not defined. In such cases, the value is None.

Note

The mapping can be state-dependent, and the key can be used for something other than computing the action directly. For instance, one can provide as an extra argument to this method a custom policy taking user parameters mapped to the keyboard as input.

Warning

Overloading this method is required for calling play_interactive method.

Parameters:
  • key (str) – Key pressed by the user as a string. None if no key has been pressed since the last step of the environment.

  • obs (ObsT) – Previous observation from the last step of the environment. It is always available, including right after reset.

  • reward (float | None) – Previous reward from last step of the environment. Not available before first step right after reset.

  • kwargs (Any) – Extra keyword argument provided by the user when calling play_interactive method.

Returns:

Action to forward to the environment.

Return type:

ActT | None
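
As a hedged sketch, a keyboard mapping for a robot with a single motor could look as follows. The key strings and torque values are assumptions; returning None keeps the previous action unchanged:

    import numpy as np
    from gym_jiminy.common.envs.generic import BaseJiminyEnv

    class MyEnv(BaseJiminyEnv):
        def _key_to_action(self, key, obs, reward, **kwargs):
            # Map two hypothetical keys to a positive or negative torque.
            if key == "ArrowUp":
                return np.array([1.0])
            if key == "ArrowDown":
                return np.array([-1.0])
            return None

    # Assuming 'env' is an instance of MyEnv:
    env.play_interactive(start_paused=True)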

_controller_handle(t, q, v, sensor_measurements, command)

Thin wrapper around user-specified refresh_observation and compute_command methods.

Note

The internal cache of managed quantities is cleared right away systematically, before anything else.

Warning

This method is not supposed to be called manually nor overloaded. It is used by the base environment to instantiate a jiminy.FunctionalController responsible for both refreshing observations and computing commands all the way through a given pipeline, in the correct order of the blocks, so as to finally send the motor torque commands directly to the robot.

Parameters:
  • t (float) – Current simulation time.

  • q (ndarray) – Current extended configuration vector of the robot.

  • v (ndarray) – Current actual velocity vector of the robot.

  • sensor_measurements (SensorMeasurementTree) – Current sensor measurements.

  • command (ndarray) – Output argument corresponding to the motor torques to apply to the robot. It must be updated by reference using [:] or np.copyto.

Returns:

Motors torques to apply on the robot.

Return type:

None

_np_random: np.random.Generator | None = None
_observer_handle(t, q, v, sensor_measurements)

Thin wrapper around user-specified refresh_observation method.

Warning

This method is not supposed to be called manually nor overloaded.

Parameters:
  • t (float) – Current simulation time.

  • q (ndarray) – Current extended configuration vector of the robot.

  • v (ndarray) – Current extended velocity vector of the robot.

  • sensor_measurements (SensorMeasurementTree) – Current sensor data.

Return type:

None

compute_reward(terminated, truncated, info)

Compute the reward related to a specific control block.

For the corresponding MDP to be stationary, the computation of the reward is supposed to involve only the transition from previous to current state of the simulation (possibly comprising multiple agents) under the ongoing action.

By default, it returns 0.0 no matter what. It is up to the user to provide a dedicated reward function whenever appropriate.

Warning

Only returning an aggregated scalar reward is supported. Yet, it is possible to update ‘info’ by reference if one wants to keep track of individual reward components or any kind of extra info that may be helpful for monitoring or debugging purposes.

Parameters:
  • terminated (bool) – Whether the episode has reached the terminal state of the MDP at the current step. This flag can be used to compute a specific terminal reward.

  • truncated (bool) – Whether a truncation condition outside the scope of the MDP has been satisfied at the current step. This flag can be used to adapt the reward.

  • info (Dict[str, Any]) – Dictionary of extra information for monitoring.

Returns:

Aggregated reward for the current step.

Return type:

float
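
As a hedged sketch, a simple quadratic action penalty with its individual term logged in ‘info’ could be implemented as follows (a vector-valued action is assumed):

    import numpy as np
    from gym_jiminy.common.envs.generic import BaseJiminyEnv

    class MyEnv(BaseJiminyEnv):
        def compute_reward(self, terminated, truncated, info):
            # Penalize large actions and expose the term for monitoring.
            action_penalty = -0.01 * float(np.sum(np.square(self.action)))
            info["reward_action_penalty"] = action_penalty
            return action_penalty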

control_dt: float = -1
metadata: Dict[str, Any] = {'render_modes': ['rgb_array']}
observe_dt: float = -1
reward_range = (-inf, inf)
spec: EnvSpec | None = None
property unwrapped: InterfaceJiminyEnv

Base environment of the pipeline.

observation_space: gym.Space
action_space: gym.Space