CoreSpace
CoreSpace(
shape = None, dtype = None
)
Deep Deterministic Policy Gradient
Arguments
actor_model (keras.nn.Model instance): See Model for details.
critic_model (keras.nn.Model instance): See Model for details.
optimizer (keras.optimizers.Optimizer instance):
See Optimizer for details.
action_inp (keras.layers.Input / keras.layers.InputLayer instance):
See Input for details.
tau (float): tau.
gamma (float): gamma.
Methods:
.is_np_flattenable
.is_np_flattenable()
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
.sample
.sample(
mask = None
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
.contains
.contains(
x
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
CoreEnv
The abstract game class that is used by all agents. This class has the exact same API that OpenAI Gym uses so that integrating with it is trivial. In contrast to the OpenAI Gym implementation, this class only defines the abstract methods without any actual implementation.
To implement your own game, you need to define the following methods:
seedresetsteprenderclose
Refer to the Gym documentation.
Methods:
.close
.close()
Override in your subclass to perform any necessary cleanup.
Environments will automatically close() themselves when garbage collected or when the program exits.
.render
.render(
mode: str = 'human'
)
Renders the game.
The set of supported modes varies per game. (And some game do not support rendering at all.)
Arguments
mode (str): The mode to render with. (default is 'human')
.reset
.reset(
*args
)
Resets the state of the game and returns an initial observation.
Returns
observation (abstract): The initial observation of the space. Initial reward is assumed to be 0.
.seed
.seed(
seed = None
)
set the seed
.step
.step(
action
)
Run one timestep of the game's dynamics.
Accepts an action and returns a tuple (observation, reward, done, info).
Arguments
action (abstract): An action provided by the game.
Returns
info (dict): Contains auxiliary diagnostic information (helpful for debugging, and sometimes learning).
.goal
.goal()
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.